What is the difference between $E(X \mid Y)$ and $E(X \mid Y = y)$?


18

In general, what is the difference between $E(X \mid Y)$ and $E(X \mid Y = y)$?

Is the former a function of $y$ and the latter a function of $x$? It is very confusing...


Hmmm... the latter should not be a function of $x$ but a number! Am I wrong?
David

Answers:


23

Roughly speaking, the difference between $E(X \mid Y)$ and $E(X \mid Y = y)$ is that the former is a random variable, whereas the latter is (in a sense) a realization of $E(X \mid Y)$. For example, if
$$(X, Y) \sim \mathcal{N}\!\left(0, \begin{pmatrix} 1 & \rho \\ \rho & 1 \end{pmatrix}\right)$$
then $E(X \mid Y)$ is the random variable
$$E(X \mid Y) = \rho Y.$$
By contrast, once $Y = y$ is observed, we are more likely to be interested in the quantity $E(X \mid Y = y) = \rho y$, which is a scalar.
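As a quick sanity check, here is a minimal simulation sketch (added for concreteness, not part of the original argument); the correlation $\rho = 0.6$, the window width, and the sample size are arbitrary choices.

```python
# Sketch: for the bivariate normal above, the average of X over draws whose Y
# falls near a fixed y should be close to rho * y, while rho * Y is itself a
# random variable (one value per draw).
import numpy as np

rng = np.random.default_rng(0)
rho, n = 0.6, 1_000_000

cov = [[1.0, rho], [rho, 1.0]]
X, Y = rng.multivariate_normal(mean=[0.0, 0.0], cov=cov, size=n).T

y = 1.0
near_y = np.abs(Y - y) < 0.05    # draws with Y approximately equal to y
print(X[near_y].mean())          # roughly rho * y = 0.6: a single number, E(X | Y = y)
print((rho * Y)[:5])             # first few realizations of the random variable E(X | Y) = rho * Y
```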

Perhaps this seems like needless complication, but regarding $E(X \mid Y)$ as a random variable in its own right is what makes things like the tower law $E(X) = E\left[E(X \mid Y)\right]$ make sense: the thing inside the brackets is random, so we can ask what its expectation is, whereas there is nothing random about $E(X \mid Y = y)$. In most cases we might hope to compute
$$E(X \mid Y = y) = \int x \, f_{X \mid Y}(x \mid y) \, dx$$
and then obtain $E(X \mid Y)$ by "plugging in" the random variable $Y$ in place of $y$ in the resulting expression. As hinted in an earlier comment, there is a bit of subtlety that can creep in regarding how these things are rigorously defined and linked up in the appropriate way. This tends to happen with conditional probability, due to some technical problems with the underlying theory.
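For the bivariate normal example, this "compute for fixed $y$, then plug in $Y$" recipe can also be checked numerically. The sketch below (my own addition) uses the standard fact that $X \mid Y = y \sim \mathcal{N}(\rho y,\, 1 - \rho^2)$; the particular values of $\rho$ and $y$ are arbitrary.

```python
# Sketch: E(X | Y = y) as the integral of x * f_{X|Y}(x | y) dx for one fixed y.
import numpy as np
from scipy.integrate import quad
from scipy.stats import norm

rho, y = 0.6, 1.5
f_cond = lambda x: x * norm.pdf(x, loc=rho * y, scale=np.sqrt(1 - rho**2))

value, _ = quad(f_cond, -np.inf, np.inf)
print(value)   # approximately rho * y = 0.9; "plugging in" Y for y gives the random variable rho * Y
```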


8

Suppose that $X$ and $Y$ are random variables.

Let $y_0$ be a fixed real number, say $y_0 = 1$. Then $E[X \mid Y = y_0] = E[X \mid Y = 1]$ is a number: it is the conditional expected value of $X$ given that $Y$ has the value $1$. Now, for some other fixed real number $y_1$, say $y_1 = 1.5$, $E[X \mid Y = y_1] = E[X \mid Y = 1.5]$ would be the conditional expected value of $X$ given $Y = 1.5$ (a real number). There is no reason to suppose that $E[X \mid Y = 1.5]$ and $E[X \mid Y = 1]$ have the same value. Thus we can also regard $E[X \mid Y = y]$ as a real-valued function $g(y)$ that maps real numbers $y$ to real numbers $E[X \mid Y = y]$. Note that the statement in the OP's question that $E[X \mid Y = y]$ is a function of $x$ is incorrect: $E[X \mid Y = y]$ is a real-valued function of $y$.

On the other hand, $E[X \mid Y]$ is a random variable $Z$ which happens to be a function of the random variable $Y$. Now, whenever we write $Z = h(Y)$, what we mean is that whenever the random variable $Y$ happens to have value $y$, the random variable $Z$ has value $h(y)$. Whenever $Y$ takes on value $y$, the random variable $Z = E[X \mid Y]$ takes on value $E[X \mid Y = y] = g(y)$. Thus, $E[X \mid Y]$ is just another name for the random variable $Z = g(Y)$. Note that $E[X \mid Y]$ is a function of $Y$ (not $y$ as in the statement of the OP's question).

As a simple illustrative example, suppose that $X$ and $Y$ are discrete random variables with joint distribution
$$P(X=0, Y=0) = 0.1, \quad P(X=0, Y=1) = 0.2,$$
$$P(X=1, Y=0) = 0.3, \quad P(X=1, Y=1) = 0.4.$$
Note that $X$ and $Y$ are (dependent) Bernoulli random variables with parameters $0.7$ and $0.6$ respectively, and so $E[X] = 0.7$ and $E[Y] = 0.6$. Now, note that conditioned on $Y = 0$, $X$ is a Bernoulli random variable with parameter $\frac{3}{4}$, while conditioned on $Y = 1$, $X$ is a Bernoulli random variable with parameter $\frac{2}{3}$. If you cannot see why this is so immediately, just work out the details: for example,
$$P(X=1 \mid Y=0) = \frac{P(X=1, Y=0)}{P(Y=0)} = \frac{0.3}{0.4} = \frac{3}{4}, \qquad P(X=0 \mid Y=0) = \frac{P(X=0, Y=0)}{P(Y=0)} = \frac{0.1}{0.4} = \frac{1}{4},$$
and similarly for $P(X=1 \mid Y=1)$ and $P(X=0 \mid Y=1)$. Hence, we have that
$$E[X \mid Y=0] = \frac{3}{4}, \qquad E[X \mid Y=1] = \frac{2}{3}.$$
Thus, $E[X \mid Y=y] = g(y)$ where $g(y)$ is a real-valued function enjoying the properties
$$g(0) = \frac{3}{4}, \qquad g(1) = \frac{2}{3}.$$

On the other hand, $E[X \mid Y] = g(Y)$ is a random variable that takes on the values $\frac{3}{4}$ and $\frac{2}{3}$ with probabilities $0.4 = P(Y=0)$ and $0.6 = P(Y=1)$ respectively. Note that $E[X \mid Y]$ is a discrete random variable but is not a Bernoulli random variable.

As a final touch, note that
$$E[Z] = E\left[E[X \mid Y]\right] = E[g(Y)] = 0.4 \times \frac{3}{4} + 0.6 \times \frac{2}{3} = 0.7 = E[X].$$
That is, the expected value of this function of $Y$, which we computed using only the marginal distribution of $Y$, happens to have the same numerical value as $E[X]$!! This is an illustration of a more general result that many people believe is a LIE:
$$E\left[E[X \mid Y]\right] = E[X].$$
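
The whole computation in this example can be reproduced in a few lines; the dictionary encoding of the joint pmf below is just one convenient representation, not anything canonical.

```python
# Sketch: conditional expectations g(y) = E[X | Y = y] and the iterated-expectation
# check, computed directly from the joint pmf P(X = x, Y = y) given above.
joint = {(0, 0): 0.1, (0, 1): 0.2, (1, 0): 0.3, (1, 1): 0.4}

p_y = {y: sum(p for (_, yy), p in joint.items() if yy == y) for y in (0, 1)}   # marginal of Y
g = {y: sum(x * p for (x, yy), p in joint.items() if yy == y) / p_y[y] for y in (0, 1)}

print(g)                                            # approximately {0: 0.75, 1: 0.667}
print(sum(p_y[y] * g[y] for y in (0, 1)))           # E[E[X | Y]] = E[g(Y)], approximately 0.7
print(sum(x * p for (x, _), p in joint.items()))    # E[X], approximately 0.7, agreeing with the above
```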

Sorry, that's just a small joke. LIE is an acronym for Law of Iterated Expectation, which is a perfectly valid result that everyone believes is the truth.


3

$E(X \mid Y)$ is the expectation of a random variable: the expectation of $X$ conditional on $Y$. $E(X \mid Y = y)$, on the other hand, is a particular value: the expected value of $X$ when $Y = y$.

Think of it this way: let $X$ represent the caloric intake and $Y$ represent height. $E(X \mid Y)$ is then the caloric intake, conditional on height - and in this case, $E(X \mid Y = y)$ represents our best guess at the caloric intake ($X$) when a person has a certain height $Y = y$, say, 180 centimeters.


4
I believe your first sentence should replace "distribution" with "expectation" (twice).
Glen_b -Reinstate Monica

4
$E(X \mid Y)$ isn't the distribution of $X$ given $Y$; that would more commonly be denoted by the conditional density $f_{X \mid Y}(x \mid y)$ or the conditional distribution function. $E(X \mid Y)$ is the conditional expectation of $X$ given $Y$, which is a $Y$-measurable random variable. $E(X \mid Y = y)$ might be thought of as the realization of the random variable $E(X \mid Y)$ when $Y = y$ is observed (but there is the possibility for measure-theoretic subtlety to creep in).
guy

1
@guy Your explanation is the first accurate answer yet provided (out of three offered so far). Would you consider posting it as an answer?
whuber

@whuber I would but I'm not sure how to strike the balance between accuracy and making the answer suitably useful to OP and I'm paranoid about getting tripped up on technicalities :)
guy

@Guy I think you have already done a good job with the technicalities. Since you are sensitive about communicating well with the OP (which is great!), consider offering a simple example to illustrate--maybe just a joint distribution with binary marginals.
whuber

1

$E(X \mid Y)$ is the expected value of $X$ given the values of $Y$; $E(X \mid Y = y)$ is the expected value of $X$ given that the value of $Y$ is $y$.

Generally, $P(X \mid Y)$ is the probability of the values of $X$ given the values of $Y$, but you can be more precise and write $P(X = x \mid Y = y)$, i.e. the probability of the value $x$ among all the $X$'s given the value $y$ of the $Y$'s. The difference is that the first case is about "values of", while in the second you consider one certain value.
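
To make the "values of" versus "a certain value" distinction concrete, here is a small sketch (added for illustration, reusing the kind of joint table from the earlier answer):

```python
# Sketch: P(X = x | Y = y) is a single number for particular x and y, while
# "P(X | Y)" refers to the whole family of such numbers, one per pair (x, y).
joint = {(0, 0): 0.1, (0, 1): 0.2, (1, 0): 0.3, (1, 1): 0.4}   # P(X = x, Y = y)

def p_x_given_y(x, y):
    p_y = sum(p for (_, yy), p in joint.items() if yy == y)
    return joint[(x, y)] / p_y

print(p_x_given_y(1, 1))                                                      # one certain value, approximately 2/3
print({(x, y): round(p_x_given_y(x, y), 3) for x in (0, 1) for y in (0, 1)})  # all the values at once
```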

You may find the diagram below helpful.

[Bayes theorem diagram from Wikipedia]


This answer discusses probability, while the question asks about expectation. What is the connection?
whuber