User:Orimosenzon/notes


Expectation

Expectation according to the joint distribution equals expectation according to the single (marginal) distribution


E_{p(x_1,x_2)}(X_1) = \sum_{x_1,x_2}{p(x_1,x_2)\,x_1} = \sum_{x_1} \sum_{x_2} p(x_1)p(x_2|x_1)\,x_1 = \sum_{x_1}p(x_1)\,x_1 \sum_{x_2}p(x_2|x_1) = \sum_{x_1}p(x_1)\,x_1 \cdot 1 = E_{p(x_1)}(X_1)

Hence:

E_{p(x_1,x_2)}(X_1) = E_{p(x_1)}(X_1)

Also:

E_{p(x_1,x_2)}(f(X_1)) = E_{p(x_1)}(f(X_1))
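A quick numeric check (a worked example of my own, not part of the original derivation): let p(x_1,x_2) on \{0,1\}^2 be p(0,0)=0.1, p(0,1)=0.3, p(1,0)=0.2, p(1,1)=0.4. Then

E_{p(x_1,x_2)}(X_1) = 0.1 \cdot 0 + 0.3 \cdot 0 + 0.2 \cdot 1 + 0.4 \cdot 1 = 0.6

while the marginal gives p(x_1 = 1) = 0.2 + 0.4 = 0.6, so E_{p(x_1)}(X_1) = 0.6 as well.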

Linearity

E(X_1+X_2) = \sum_{x_1,x_2}{p(x_1,x_2)(x_1+x_2)} = \sum_{x_1,x_2}{p(x_1,x_2)x_1}+\sum_{x_1,x_2}{p(x_1,x_2)x_2} = \sum_{x_1}{p(x_1)x_1}+\sum_{x_2}{p(x_2)x_2} = E(X_1)+E(X_2)

(the third equality marginalizes over the other variable, as in the previous section)


hence:

E(X_1 + X_2) = E(X_1) + E(X_2)

E(λX) = \sum_{x}{p(x)λx} = λ\sum_{x}{p(x)x} = λE(X)

hence:

E(λX) = λE(X)
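A worked example of both rules (my own numbers): for a fair six-sided die X, E(X) = (1+2+3+4+5+6)/6 = 3.5. For two dice, linearity gives E(X_1 + X_2) = 3.5 + 3.5 = 7 whether or not the dice are independent (independence was never used above), and E(2X) = 2 \cdot 3.5 = 7.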

Variance and standard deviation

Definitions

V(X) =def= E([X − E(X)]^2)

 \sigma(X) =def= \sqrt{V(X)}

The meaning of standard deviation

One way to look at the standard deviation is as an approximation of the "expected drift" from the expectation. The "expected drift" could be defined as:

ED(X) =def= E(|X − E(X)|)

This quantity is not easy to manipulate algebraically (the absolute value does not expand the way a square does), which is why the squared deviation is used instead.

Suppose that X can take only the two values k and −k (k > 0) and that E(X) = 0. Then:

V(X) =def= E([X − E(X)]^2) = E(X^2) = k^2

and

 \sigma(X) = \sqrt{V(X)} = k

and

ED(X) = E(|X − E(X)|) = E(|X|) = E(k) = k = σ(X)

V, σ and ED do not change when a constant is added, so any random variable X whose drifts all have the same absolute value k satisfies σ(X) = ED(X).

Whenever the drift values are not all the same, σ gives larger weight to the larger deviations (because of the squaring), while ED takes a plain average, so ED(X) ≤ σ(X). *todo*: show why (see the sketch below).
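A sketch of the *todo* (my own argument): applying Jensen's inequality to the convex function t \mapsto t^2 with Y = |X − E(X)| gives

(E(|X − E(X)|))^2 \le E(|X − E(X)|^2) = V(X)

so ED(X) \le σ(X), with equality exactly when |X − E(X)| is constant. A numeric example where they differ: let X take the values ±1 with probability 3/8 each and ±3 with probability 1/8 each, so E(X) = 0. Then ED(X) = \frac{3}{4} \cdot 1 + \frac{1}{4} \cdot 3 = 1.5, while V(X) = \frac{3}{4} \cdot 1 + \frac{1}{4} \cdot 9 = 3 and σ(X) = \sqrt{3} \approx 1.73 > 1.5.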

Example: Suppose you perform the following experiment: you flip a coin; if it lands heads you go 5 meters to the left, if it lands tails you go 5 meters to the right. The variance in this case is 25 and the standard deviation is 5. The expected drift is also 5 (all the drift values are equal). More on this example in the "More on the last result" section below.

Alternative definition of variance

V(X) =def= E((X − E(X))^2) = E(X^2 + E^2(X) − 2XE(X)) = E(X^2) + E^2(X) − 2E^2(X) = E(X^2) − E^2(X)

hence:

V(X) = E(X^2) − E^2(X)
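A quick check on a fair die (my own numbers): E(X) = 3.5 and E(X^2) = (1+4+9+16+25+36)/6 = \frac{91}{6}, so V(X) = \frac{91}{6} − 3.5^2 = \frac{35}{12} \approx 2.92.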

Variance (and SD) do not change when adding a constant

V(X + c) = E([X + c − E(X + c)]^2) = E([X + c − E(X) − E(c)]^2) = E([X − E(X)]^2) = V(X) (using E(c) = c)

Variance of multiplication

V(λX) = E((λX)^2) − E^2(λX) = λ^2E(X^2) − λ^2E^2(X) = λ^2(E(X^2) − E^2(X)) = λ^2V(X)

hence:

V(λX) = λ^2V(X)

SD of multiplication

 \sigma(\lambda X) = \sqrt{V(\lambda X)} = \sqrt{\lambda^2V(X)} = |\lambda| \sqrt{V(X)} = |\lambda| \sigma(X)

(Note that \sqrt{\lambda^2} = |\lambda|, so a negative λ still scales σ by its magnitude; e.g. σ(−2X) = 2σ(X).)

hence:

σ(λX) = |λ|σ(X)

Variance of a sum of random variables

V(X_1 + X_2) = E((X_1 + X_2)^2) - E^2(X_1 + X_2) = E(X_1^2)+E(X_2^2)+2E(X_1 X_2) - (E^2(X_1) + E^2(X_2) + 2E(X_1)E(X_2)) = V(X_1)+V(X_2)+2Cov(X_1,X_2)

hence:

V(X_1 + X_2) = V(X_1) + V(X_2) + 2Cov(X_1,X_2)
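A quick check with a fully dependent pair (my own example): let X_1 be a fair coin taking the values 0 and 1, so V(X_1) = \frac{1}{4}, and let X_2 = X_1. Then Cov(X_1,X_2) = V(X_1) = \frac{1}{4} (see the covariance section below), and the formula gives V(X_1+X_2) = \frac{1}{4} + \frac{1}{4} + 2 \cdot \frac{1}{4} = 1, matching V(2X_1) = 4V(X_1) = 1.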

When X_1 and X_2 are independent, Cov(X_1,X_2) = 0 and hence:

X_1, X_2 independent  \Rightarrow V(X_1+X_2) = V(X_1)+V(X_2)

When X_1 and X_2 are i.i.d. (independent and identically distributed), then:

X_1, X_2 i.i.d.  \Rightarrow V(X_1+X_2) = V(X_1)+V(X_2) = 2V(X_1)

Or more generally:

 X_1, X_2, \ldots, X_n i.i.d  \Rightarrow V(\sum_{i=1}^n{X_i}) = \sum_{i=1}^n V(X_i) = n V(X_1)

hence:

 X_1, X_2, \ldots, X_n i.i.d  \Rightarrow \sigma (\sum_{i=1}^n{X_i}) = \sqrt{V(\sum_{i=1}^n{X_i})} = \sqrt{ n V(X_1) } = \sqrt{n} \cdot \sigma (X_1)


Note the difference from summing the variable with itself (identically distributed but not independent):

V(X_1 + X_1) = V(2X_1) = 4V(X_1)

and

σ(X_1 + X_1) = σ(2X_1) = 2σ(X_1)
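A minimal simulation sketch of these two cases (my own illustration; the seed and trial count are arbitrary), using a fair ±1 variable so that V(X_1) = 1:

// Checks V(X1+X2) = 2V(X1) for independent copies versus V(X1+X1) = 4V(X1).
// X1 is a fair +-1 coin, so V(X1) = 1.
#include <iostream>
#include <random>

int main() {
    std::mt19937 gen(42);                      // fixed seed for reproducibility
    std::bernoulli_distribution coin(0.5);
    const int trials = 1000000;

    double sum_indep = 0, sumsq_indep = 0;     // accumulators for X1 + X2
    double sum_self = 0, sumsq_self = 0;       // accumulators for X1 + X1

    for (int i = 0; i < trials; ++i) {
        int x1 = coin(gen) ? 1 : -1;           // first fair +-1 variable
        int x2 = coin(gen) ? 1 : -1;           // independent copy
        double s_indep = x1 + x2;
        double s_self = x1 + x1;
        sum_indep += s_indep;  sumsq_indep += s_indep * s_indep;
        sum_self  += s_self;   sumsq_self  += s_self * s_self;
    }

    // sample variance via V(S) = E(S^2) - E^2(S)
    double m1 = sum_indep / trials, v_indep = sumsq_indep / trials - m1 * m1;
    double m2 = sum_self / trials,  v_self  = sumsq_self / trials - m2 * m2;

    std::cout << "V(X1+X2) ~ " << v_indep << "  (expected 2)\n";
    std::cout << "V(X1+X1) ~ " << v_self  << "  (expected 4)\n";
}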

More on the last result

We've shown that:

 X_1, X_2, \ldots, X_n i.i.d  \Rightarrow \sigma (\sum_{i=1}^n{X_i}) =  \sqrt{n} \cdot \sigma (X_1)

Why is this important?

σ is a measure of the expected drift. The last result shows that the expected drift grows only as the square root of the number of experiments (less than linearly), which means the mean drift per experiment tends to zero:

 \lim_{n\to\infty}\frac{\sigma(\sum_{i=1}^n X_i)}{n}  = 
\lim_{n\to\infty}\frac{\sqrt{n} \cdot \sigma (X_1)}{n} = 0

Recall the ±5 random walk example. Now suppose you repeat the process n times. What is the expected drift?

The standard deviation, which can be considered a measure of that drift, is:  \sqrt{n} \cdot 5

The mean drift per step is:  5 \frac{\sqrt{n}}{n} = \frac{5}{\sqrt{n}}

For example, for 10000 iterations, the mean drift is  5 \frac{\sqrt{10000}}{10000} = 0.05  meters: instead of 5 meters per step it is 5 centimeters. The total drift is only 500 meters instead of the maximal 50,000.
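A simulation sketch of this calculation (my own illustration; the seed and walk count are arbitrary), estimating the standard deviation of the final position and the mean drift per step:

// Simulates many +-5 random walks of n steps and compares the standard
// deviation of the final position with the predicted 5*sqrt(n).
#include <cmath>
#include <iostream>
#include <random>

int main() {
    std::mt19937 gen(7);
    std::bernoulli_distribution coin(0.5);
    const int n = 10000;       // steps per walk
    const int walks = 2000;    // number of repeated walks

    double sum = 0, sumsq = 0;
    for (int w = 0; w < walks; ++w) {
        long long pos = 0;
        for (int i = 0; i < n; ++i)
            pos += coin(gen) ? 5 : -5;         // one +-5 step
        sum += pos;
        sumsq += static_cast<double>(pos) * pos;
    }

    double mean = sum / walks;
    double sd = std::sqrt(sumsq / walks - mean * mean);
    std::cout << "estimated sigma: " << sd
              << "  predicted 5*sqrt(n) = " << 5 * std::sqrt(n) << "\n";
    std::cout << "mean drift per step: " << sd / n << "  (predicted 0.05)\n";
}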

  • *todo*: add a gnuplot picture of the ±5 random walk example; the relation to the law of large numbers; is the fact that the frequency ratio converges an assumption of probability theory or a result?

Misc

 V(X) = 0 \Leftrightarrow E([X-E(X)]^2) = 0 \Leftrightarrow \forall x with p(x) > 0: x-E(X) = 0 \Leftrightarrow X is constant.

hence:

 V(X) = 0 \Leftrightarrow X is constant.

Covariance

Alternative definition

Cov(X_1,X_2) = E((X_1 − E(X_1))(X_2 − E(X_2))) = E(X_1X_2) + E(X_1)E(X_2) − 2E(X_1)E(X_2) = E(X_1X_2) − E(X_1)E(X_2)

hence:

Cov(X_1,X_2) = E(X_1X_2) − E(X_1)E(X_2)

A special case is the covariance of a random variable with itself: Cov(X,X) = E(X \cdot X) − E(X)E(X) = E(X^2) − E^2(X) = V(X)

Covariance of independent variables

Assume that X_1 and X_2 are independent:

 E(X_1 X_2) = \sum_{x_1,x_2}{p(x_1,x_2)x_1 x_2} = \sum_{x_1,x_2}{p(x_1) p(x_2)x_1 x_2} = \sum_{x_1} p(x_1) x_1 \sum_{x_2} p(x_2)x_2 = E(X_1) E(X_2)

And hence:

X1,X2 independent  \implies Cov(X_1,X_2) = 0

The converse is not true, however. For example, let X be uniform on \{−1, 0, 1\} and Y = X^2. Then E(XY) = E(X^3) = 0 and E(X) = 0, so

Cov(X,Y) = E(XY) − E(X)E(Y) = 0

But of course, X and Y are very much dependent: Y is completely determined by X.

Wiener processes

(also known as "Brownian motion")

Let Z be a stochastic process with the following properties:

1. The change δZ in a small period of time δt is

 \delta Z = \epsilon \cdot \sqrt{\delta t}

where ε \sim φ(0,1), i.e. ε is drawn from a standardized normal distribution.

2. The values of δZ for any two different short intervals of time δt are independent.
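A minimal sketch of simulating such a process by discretization (my own illustration; the step size dt = 0.01 and horizon T = 1 are arbitrary choices):

// Discretized Wiener process: repeatedly apply dZ = eps * sqrt(dt)
// with eps drawn from a standard normal distribution.
#include <cmath>
#include <iostream>
#include <random>

int main() {
    std::mt19937 gen(123);
    std::normal_distribution<double> eps(0.0, 1.0);  // standard normal draws

    double dt = 0.01;          // small time step
    double z = 0.0;            // Z starts at 0
    for (double t = 0.0; t < 1.0; t += dt)
        z += eps(gen) * std::sqrt(dt);               // dZ = eps * sqrt(dt)

    // After time T = 1, Z is approximately N(0, T), so sd of Z(1) is 1.
    std::cout << "Z(1) = " << z << "\n";
}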

Summary

Expectation

  • E_{p(x_1,x_2)}(X_1) = E_{p(x_1)}(X_1)
  • E(X_1 + X_2) = E(X_1) + E(X_2)
  • E(λX) = λE(X)

Variance and standard deviation

  • V(X) = E(X^2) − E^2(X)
  • V(λX) = λ^2V(X)
  • σ(λX) = |λ|σ(X)
  • V(X_1 + X_2) = V(X_1) + V(X_2) + 2Cov(X_1,X_2)
  • X_1, X_2 independent  \Rightarrow V(X_1+X_2) = V(X_1)+V(X_2)
  •  X_1, X_2, \ldots, X_n i.i.d  \Rightarrow V(\sum_{i=1}^n{X_i}) = n V(X_1)
  •  X_1, X_2, \ldots, X_n i.i.d  \Rightarrow  \sigma (\sum_{i=1}^n{X_i}) = \sqrt{n} \cdot \sigma (X_1)

Covariance

  • Cov(X_1,X_2) = E(X_1X_2) − E(X_1)E(X_2)


Misc

#include <iostream>

int main() {
  std::cout << "hello lord\n";
}