Bapat-Beg theorem
From Wikipedia, the free encyclopedia
The Bapat-Beg theorem [1] gives the joint cumulative distribution function of order statistics of independent but not necessarily identically distributed random variables in terms of the cumulative distribution functions of the random variables. Simpler proof of this can be found in [2]
This result describes the order statistics when each element of the sample is obtained from a possibly different population with a different probability distribution. Ordinarily, all elements of the sample are obtained from the same population and thus have the same probability distribution.
Contents |
[edit] The theorem
Let
,
be independent real valued random variables with cumulative distribution functions
. Denote the order statistics by
, with
. Further let
, and
for all
For
and
, the joint cumulative distribution function of the subset
of the order statistics satisfies
where
is the permanent of a block matrix with the subscript
denoting the dimension of a block obtained by repeating the same entry, and the block row index
and block column index
.
[edit] The independent identically distributed case
In the case when the variables
,
are independent and identically distributed with cumulative probability distribution function
for all
, the Bapat-Beg theorem reduces to [3]
where again
[edit] Remarks
- No assumption of continuity of the cumulative distribution functions is needed.
- The theorem is formulated for the joint cumulative probability distribution function in terms of a subset of the order statistics and ordered arguments. If the inequalities
are not imposed, some of the inequalities
may be redundant (because always
and the argument list needs to be first reduced by dropping all
such that
for some
.
[edit] Complexity
The Bapat-Beg formula involves exponentially many permanents, and the complexity of the computation of the permanent itself is at least exponential. Thus, in the general case, the formula is not practical. However, when the random variables have only two possible distributions, the complexity can be reduced to O(m2k) [3]. Thus, in the case of two populations, the complexity is polynomial in m for any fixed number of statistics k.
[edit] See also
- For standard results for the distribution of order statistics for independent and identically-distributed random variables see, for example, [4]
- permanent
- independent and identically-distributed random variables
[edit] References
- ^ R. B. Bapat and M. I. Beg. Order statistics for nonidentically distributed variables and permanents. Sankhyā Ser. A, 51(1):79-93, 1989. MR1065561
- ^ Sayaji Hande. A note on order statistics for nonidentically distributed variables Sankhyā Ser. A, 56(2):365-368, 1994. MR1664921
- ^ a b Deborah H. Glueck, Anis Karimpour-Fard, Jan Mandel, Larry Hunter, Keith E. Muller. Fast computation by block permanents of cumulative distribution functions of order statistics from several populations, arXiv:0705.3851, 2007
- ^ N. Balakrishnan and C. R. Rao. Order statistics: an introduction. In Order statistics: theory & methods, volume 16 of Handbook of Statist., pages 3--24. North-Holland, Amsterdam, 1998.


![\begin{align} & P_{i_{1},\ldots,i_{k}}\left( y_{1},\ldots,y_{k}\right) \\ & \quad=\text{per}\begin{bmatrix} \left[ F_{i}(y_{j})-F_{i}(y_{j-1})\right] _{\left( i_{j}-i_{j-1}\right) \times1}\end{bmatrix} _{j=1,i=1}^{j=k,i=m}\end{align}](../../../../math/f/8/6/f868806ceb01b212cf8c71a1a6e340d2.png)
![\begin{align} & F_{Y_{n_{1}},\ldots Y_{n_{k}}}\left( y_{1},\ldots,y_{k}\right) \\
& =\Pr\left\{ Y_{n_{1}}\leq y_{1}\wedge Y_{n_{2}}\leq y_{2}\wedge \cdots\wedge Y_{n_{k}}\leq y_{k}\right\} \\ & =\sum_{i_{k}=n_{k}}^{m}\cdots\sum_{i_{2}=n_{2}}^{i_{3}}\,\sum_{i_{1}=n_{1}}^{i_{2}}m!\prod\limits_{j=1}^{k+1}\frac{\left[ F\left( y_{j}\right) -F\left( y_{j-1}\right) \right] ^{i_{j}-i_{j-1}}}{\left( i_{j}-i_{j-1}\right) !}, \end{align}](../../../../math/5/4/6/546cfff2e4fc656a35a18a6bffc9f858.png)


