Empirical distribution function


In statistics, an empirical distribution function is the cumulative distribution function that assigns probability 1/n to each of the n observations in a sample.

Let X_1,\ldots,X_n be iid random variables in \mathbb{R} with the cdf F(x).

The empirical distribution function F_n(x) based on the sample X_1,\ldots,X_n is a step function defined by

F_n(x) = \frac{\mbox{number of elements in the sample} \leq x}{n} = \frac{1}{n} \sum_{i=1}^n I(X_i \le x),

where I(A) is the indicator of event A.

For fixed x, I(X_i\leq x) is a Bernoulli random variable with parameter p = F(x), hence nF_n(x) is a binomial random variable with mean nF(x) and variance nF(x)(1 − F(x)).
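As a concrete illustration, the following Python sketch evaluates F_n(x) directly from the defining sum of indicators; the standard normal sample, the sample size, and the evaluation point x = 0.5 are illustrative choices, not part of the definition.

    import numpy as np

    # Minimal sketch of the definition: F_n(x) is the fraction of sample
    # points that are <= x.  Sample and evaluation point are illustrative.
    rng = np.random.default_rng(0)
    n = 100
    sample = rng.normal(size=n)          # X_1, ..., X_n, here iid N(0, 1)

    def ecdf(sample, x):
        """Empirical distribution function F_n(x) = (1/n) * sum_i I(X_i <= x)."""
        return np.mean(sample <= x)

    x = 0.5
    print("F_n(x) =", ecdf(sample, x))
    # n * F_n(x) counts successes in n Bernoulli(F(x)) trials, so it is
    # Binomial(n, F(x)) with mean n*F(x) and variance n*F(x)*(1 - F(x)).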

Asymptotic properties

F_n(x)\to F(x) almost surely for fixed x.
In particular, F_n(x) is a strongly consistent estimator of the cumulative distribution function F(x); since E[F_n(x)] = F(x) by the binomial distribution above, it is also unbiased.
\sqrt{n}(F_n(x)-F(x)) converges in distribution to the normal distribution N(0, F(x)(1 − F(x))) for fixed x.

The Berry–Esséen theorem provides the rate of this convergence.
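A minimal simulation sketch of the pointwise central limit statement above; the choice of a standard normal F, the fixed point x, the sample size, and the number of replications are all illustrative assumptions.

    import numpy as np
    from scipy.stats import norm

    # Illustrative check (assumptions: F is the standard normal cdf, x = 0.3,
    # n = 500, 2000 replications): the simulated values of
    # sqrt(n) * (F_n(x) - F(x)) should have variance close to F(x)*(1 - F(x)).
    rng = np.random.default_rng(1)
    n, reps, x = 500, 2000, 0.3
    Fx = norm.cdf(x)

    Fn_x = np.array([np.mean(rng.normal(size=n) <= x) for _ in range(reps)])
    z = np.sqrt(n) * (Fn_x - Fx)

    print("simulated variance of sqrt(n)(F_n(x) - F(x)):", z.var())
    print("limit variance F(x)(1 - F(x))               :", Fx * (1 - Fx))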
\|F_n-F\|_\infty = \sup_x|F_n(x)-F(x)| \to 0 with probability 1; this is the Glivenko–Cantelli theorem.
The Dvoretzky–Kiefer–Wolfowitz inequality provides the rate of this convergence.
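A hedged illustration of this uniform convergence: the sketch below computes the sup-norm distance at the jump points of the step function F_n for a few sample sizes, with a standard normal F as an assumed example.

    import numpy as np
    from scipy.stats import norm

    # Illustrative sketch: sup_x |F_n(x) - F(x)| for a continuous cdf can be
    # computed at the jump points of F_n; it shrinks as n grows.
    # (The Dvoretzky-Kiefer-Wolfowitz inequality bounds
    #  P(sup |F_n - F| > eps) by 2 * exp(-2 * n * eps**2).)
    rng = np.random.default_rng(2)

    def sup_distance(sample, cdf):
        """Return sup_x |F_n(x) - F(x)| for a continuous cdf."""
        xs = np.sort(sample)
        n = len(xs)
        F = cdf(xs)
        i = np.arange(1, n + 1)
        return max(np.max(i / n - F), np.max(F - (i - 1) / n))

    for n in (100, 1000, 10000):
        d = sup_distance(rng.normal(size=n), norm.cdf)
        print(f"n={n:6d}  sup|F_n - F| = {d:.4f}")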
\sqrt{n}\,\|F_n-F\|_\infty converges in distribution to the Kolmogorov distribution, provided that F(x) is continuous.
The Kolmogorov–Smirnov test for goodness-of-fit is based on this fact.
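As a minimal practical sketch of such a test, scipy.stats.kstest compares a sample's empirical distribution function with a hypothesized continuous cdf; the standard normal sample below is an arbitrary choice.

    import numpy as np
    from scipy.stats import kstest

    # Minimal Kolmogorov-Smirnov goodness-of-fit sketch: arbitrary standard
    # normal data tested against the hypothesized cdf "norm".
    rng = np.random.default_rng(3)
    data = rng.normal(size=200)

    result = kstest(data, "norm")      # sup-distance statistic and p-value
    print("KS statistic:", result.statistic)
    print("p-value     :", result.pvalue)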
\sqrt{n}(F_n-F), as a process indexed by x, converges weakly in \ell^\infty(\mathbb{R}) to the process B(F(x)), where B is a standard Brownian bridge (Donsker's theorem).

See also