¿Cuál es la intuición detrás de la independencia de

18

Esperaba que alguien pudiera proponer un argumento que explicara por qué las variables aleatorias $Y_1=X_2-X_1$ e $Y_2=X_1+X_2$ , $X_i$ tiene la distribución normal estándar, son estadísticamente independientes. La prueba de este hecho se deduce fácilmente de la técnica de MGF, pero me parece extremadamente contra-intuitiva.

Por lo tanto, agradecería la intuición aquí, si la hay.

Gracias de antemano.

EDITAR : los subíndices no indican estadísticas de pedidos, sino observaciones IID de la distribución normal estándar.

probability self-study mathematical-statistics

— JohnK
fuente

¿Qué es la "técnica MGF"?

— ameba dice Reinstate Monica

@amoeba Es el uso de funciones generadoras de momento para determinar la distribución de una variable aleatoria. En mi caso, me refiero al teorema de que

e

son independientes si y solo si

,

es igual a

Y_{1}

$Y_1$

Y_{2}

$Y_2$

M (t_{1}, t_{2}) = M (t_{1}, 0) \times M (0, t_{2})

$M(t_1,t_2)=M(t_1,0) \times M(0,t_2)$

M (t_{1}, t_{2})

$M(t_1,t_2)$

. Elija cualquier otra técnica y estoy seguro de que obtendrá el mismo resultado.

E (e^{t_{1} Y_{1} + t_{2} Y_{2}})

$E(e^{t_1Y_1+t_2Y_2})$

— JohnK

1

Puede encontrar información sobre el hilo relacionado en stats.stackexchange.com/questions/71260 .

— whuber

Es posible obtener un intuición, considerando lo que sucede con cada uno de estos si se agrega una constante, digamos

, a cada

. Y qué sucede si multiplica cada

por una constante, digamos

μ

$\mu$

X

$X$

X

$X$

σ

$\sigma$

— rvl

1

Muy relacionado: la referencia para la suma y la diferencia de variables altamente correlacionadas es casi no correlacionada .

— gung - Restablece a Monica

22

Estos son datos distribuidos normales estándar: diagrama de dispersión en el primer sistema de coordenadas tenga en cuenta que la distribución es simétrica circulatoria.

Cuando cambia a e , efectivamente gira y escala el eje, así: Este nuevo sistema de coordenadas tiene el mismo origen que el original, y los ejes son ortogonal. Debido a la simetría circulatoria, las variables aún son independientes en el nuevo sistema de coordenadas. $Y_1 = X_2 - X_1$ $Y_2 = X_1 + X_2$ diagrama de dispersión con sistema de coordenadas rotado

— dobiwan
fuente

44

El resultado se aplica incluso cuando

y

están correlacionados con los márgenes normales de la unidad. Por lo tanto, su explicación solo cubre un subcase del resultado original. Sin embargo, la idea básica aquí es sólida.

X_{1}

$X_1$

X_{2}

$X_2$

— Glen_b -Reinstate Monica

1

@Glen_b, sí, tienes razón. Quería centrarme en un caso simple, ya que JohnK ya parece saber cómo probar el caso general, pero carece de la intuición intuitiva.

— dobiwan

7

El resultado funciona para conjuntamente normal (es decir, con correlación, $(X_1,X_2)$ $-1<\rho<1$ ), con común . $\sigma$

Si conoce un par de resultados básicos, esto es todo lo que necesita:

$\quad\quad\quad$ ingrese la descripción de la imagen aquí

El enfoque de dobiwan es esencialmente bueno, es solo que el resultado es más general que el caso tratado allí.

— Glen_b -Reinstate a Monica
fuente

3

+1 para quitar el resultado deseado a lo esencial. Agregaré que para el caso más general de normalidad articular con variaciones desiguales, una rotación de ejes por

lugar de

θ = \frac{1}{2} \arctan (\frac{2 ρ \cdot σ_{1} σ_{2}}{σ_{1}^{2} - σ_{2}^{2}})

$\theta = \frac{1}{2}\arctan\left ( \frac{2\rho\cdot\sigma_1\sigma_2}{\sigma_1^2 - \sigma_2^2}\right )$

en implícita

produce variables aleatorias normales independientes.

\pm \frac{π}{4}

$\pm \frac{\pi}{4}$

(X_{1}, X_{2}) \to (X_{1} + X_{2} . X_{1} - X_{2})

$(X_1,X_2) \to (X_1+X_2. X_1-X_2)$

— Dilip Sarwate

6

El resultado que usted afirma que es verdadero no es cierto en general, ni siquiera para el caso en que todo lo que se sabe es que y son variables aleatorias normales con una varianza idéntica, pero el resultado se cumple para el $X_1$ $X_2$ interpretación habitual de la condición usted dijo más tarde:

Los subíndices no indican estadísticas de pedidos, sino observaciones de la distribución normal estándar.

La interpretación habitual de las últimas palabras en esta declaración es, por supuesto, que y son variables aleatorias independientes (normales) y, por lo tanto, variables aleatorias conjuntas normales. $X_1$ $X_2$

Para variables aleatorias normales conjuntas con varianza idéntica, es cierto que y son variables aleatorias independientes (normales) (con, en general, variaciones desiguales), y la explicación intuitiva para esto es mejor dada en la respuesta de Glen_b. Para su caso especial de que y sean independientes, la respuesta de dobiwan, que usted ha aceptado, es más simple, y de hecho revela que cualquier rotación de los ejes, no solo por el $X_1+X_2$ $X_1-X_2$ $X_1$ $X_2$ implícito en la transformación $\pm \frac{\pi}{4}$ $(X_1,X_2)\to (X_1+X_2, X_1-X_2)$

$X$ $Y$ , sin importar qué otras propiedades se les puedan atribuir.

$X$ $Y$ $X+Y$ $X-Y$

\begin{aligned} cov (X + Y, X - Y) & = cov (X, X) - cov (X, Y) + cov (Y, X) - cov (Y, Y) \\ = var (X) - cov (X, Y) + cov (X, Y) - var (Y) \\ = 0. \end{aligned}

$\begin{align} \operatorname{cov}(X+Y, X-Y) &= \operatorname{cov}(X,X) - \operatorname{cov}(X,Y) + \operatorname{cov}(Y,X) - \operatorname{cov}(Y,Y)\\ &= \operatorname{var}(X) - \operatorname{cov}(X,Y) + \operatorname{cov}(X,Y) - \operatorname{var}(Y)\\ &= 0. \end{align}$ Here we have used the fact that

cov (X, X)

$\operatorname{cov}(X,X)$ is just the variance

var (X)

$\operatorname{var}(X)$ of

X

$X$ (and similarly for

Y

$Y$ ) and, of course,

cov (Y, X) = cov (X, Y)

$\operatorname{cov}(Y,X) = \operatorname{cov}(X,Y)$ . Note that this result holds when

X

$X$ and

Y

$Y$ are (marginally) normal random variables but not necessarily jointly normal random variables. (If you are not familiar with this notion of marginal normality not being the same as joint normality, see this great answer by cardinal). In the special case when

X

$X$ and

Y

$Y$ are jointly normal (but not necessarily independent) normal random variables, so are

X + Y

$X+Y$ and

X - Y

$X-Y$ jointly normal, and since their covariance is

0

$0$ ,

X + Y

$X+Y$ and

X - Y

$X-Y$ are independent random variables.

— Dilip Sarwate
fuente

2

I first argue for general identically distributed $X_1,X_2$ that the conditional mean of $Y_1$ conditional on $Y_2$ is constant $0$ . Based on this, I argue that the covariance of $Y_1,Y_2$ is 0. Then, under normality, zero covariance implies independence.

The conditional mean

Intuition: $X_1+X_2=y$ does not imply anything about which component contributed more to the sum (e.g., $X_1=x, X_2 = y-x$ is as likely as $X_1 = y-x, X_2=x$ ). Thus, the expected difference must be 0.

Proof: $X_1$ and $X_2$ have identical distribution and $X_1+X_2$ is symmetric with respect to the indexing. Thus, for symmetry reasons, the conditional distribution $X_1 \mid Y_2 = y$ must be equal to the conditional distribution $X_2 \mid Y_2 = y$ . Hence, the conditional distributions also have the same mean, and

E (Y_{1} ∣ Y_{2} = y) = E (X_{1} - X_{2} ∣ X_{1} + X_{2} = y) = E (X_{1} ∣ X_{1} + X_{2} = y) - E (X_{2} ∣ X_{1} + X_{2} = y) = 0.

$\begin{equation} \mathbb{E}(Y_1 \mid Y_2 = y) = \mathbb{E}(X_1 - X_2 \mid X_1+X_2 = y) \\ = \mathbb{E}(X_1 \mid X_1+X_2 = y) - \mathbb{E}(X_2 \mid X_1+X_2 = y)= 0. \end{equation}$

(Caveat: I did not consider the possibility that the conditional mean might not exist.)

Constant conditional mean implies zero correlation/covariance

Intuition: correlation measures how much $Y_1$ tends to increase when $Y_2$ increases. If observing $Y_2$ never changes our mean of $Y_1$ , $Y_1$ and $Y_2$ are uncorrelated.

Proof: By definition, covariance is

C o v (Y_{1}, Y_{2}) = E [(Y_{1} - E (Y_{1})) (Y_{2} - E (Y_{2}))]

$\begin{equation} Cov(Y_1,Y_2) = \mathbb{E}\left[\left(Y_1 - \mathbb{E}(Y_1)\right)\left(Y_2 -\mathbb{E}(Y_2) \right)\right] \end{equation}$ to this expectation, we apply the law of iterated expectations: take the expectation of the conditional expectation conditional on

Y_{2}

$Y_2$ :

= E [E [(Y_{1} - E (Y_{1})) (Y_{2} - E (Y_{2})) ∣ Y_{2}]] = E [(Y_{2} - E (Y_{2})) E [Y_{1} - E (Y_{1}) ∣ Y_{2}]] .

$\begin{equation} = \mathbb{E}\left[\mathbb{E}\left[\left(Y_1 - \mathbb{E}(Y_1)\right)\left(Y_2 -\mathbb{E}(Y_2) \right) \mid Y_2\right]\right] = \mathbb{E}\left[(Y_2 - \mathbb{E}(Y_2))\mathbb{E}\left[Y_1 - \mathbb{E}(Y_1) \mid Y_2\right] \right]. \end{equation}$ Recall that the conditional mean was shown to be independent of

Y_{2}

$Y_2$ and thus the expression simplifies as

= E [(Y_{2} - E (Y_{2})) E [Y_{1} - E (Y_{1})]]

$\begin{equation} = \mathbb{E}\left[(Y_2 - \mathbb{E}(Y_2))\mathbb{E}\left[Y_1-\mathbb{E}(Y_1)\right]\right] \end{equation}$ but the inner expectation is

0

$0$ and we get

= E [(Y_{2} - E (Y_{2})) \times 0] = 0.

$\begin{equation} = \mathbb{E}\left[(Y_2 - \mathbb{E}(Y_2))\times0\right] = 0. \end{equation}$

Independence

Just by assuming identical distributions for $X_1,X_2$ , it was shown that $Y_1$ and $Y_2$ are uncorrelated. When $X_1,X_2$ are jointly normal (for example, iid. normal as in the question), their linear combinations $Y_1,Y_2$ are also jointly normal and thus uncorrelatedness implies independence.

— Juho Kokkala
fuente