Distribución de muestreo de dos poblaciones independientes de Bernoulli

Supongamos que tenemos muestras de dos variables aleatorias independientes de Bernoulli, y . $\mathrm{Ber}(\theta_1)$ $\mathrm{Ber}(\theta_2)$

¿Cómo demostramos que ?

\frac{({\bar{X}}_{1} - {\bar{X}}_{2}) - (θ_{1} - θ_{2})}{\sqrt{\frac{θ_{1} (1 - θ_{1})}{{norte}_{1}} + \frac{θ_{2} (1 - θ_{2})}{{norte}_{2}}}} \overset{re}{\to} norte (0 0, 1)

$\frac{(\bar X_1-\bar X_2)-(\theta_1-\theta_2)}{\sqrt{\frac{\theta_1(1-\theta_1)}{n_1}+\frac{\theta_2(1-\theta_2)}{n_2}}}\xrightarrow{d} \mathcal N(0,1)$

Suponga que . $n_1\neq n_2$

distributions sampling bernoulli-distribution

— Un anciano en el mar.
fuente

Z_i = X_1i - X_2i es una secuencia de iid rv de media finita y varianza. Por lo tanto, satisface el teorema del límite central de Levy-Linderberg del cual se derivan sus resultados. ¿O estás pidiendo una prueba del clt en sí?

— Tres Diag

@ThreeDiag ¿Cómo aplica la versión LL del CLT? No creo que sea correcto. Escribe una respuesta para que revise los detalles.

— Un viejo en el mar.

Todos los detalles ya están ahí. Para que se aplique LL, necesita una secuencia de iid rv con media finita y varianza. La variable Z_i = X_i1 y X_i2 satisface los tres requisitos. La independencia se desprende de la independencia de los dos vars bernoulli originales y se puede ver que E (Z_i) y V (Z_i) son finitos aplicando propiedades estándar de E y V

— Three Diag

"muestras de dos variables aleatorias de Bernoulli independientes" - expresión incorrecta. Debe ser: "dos muestras independientes de distribuciones de Bernoulli".

— Viktor

Agregue "como

n_{1}, n_{2} \to \infty

$n_1,n_2\to \infty$

— Viktor

Respuestas:

Poner , $a=\frac{\sqrt{\theta_1(1-\theta_1)}}{\sqrt{n_1}}$ , , . Tenemos . En términos de funciones características, significa $b=\frac{\sqrt{\theta_2(1-\theta_2)}}{\sqrt{n_2}}$ $A=(\bar{X}_1-\theta_1)/a$ $B=(\bar{X}_2-\theta_2)/b$ $A\to_d N(0,1),\ B\to_d N(0,1)$ Queremos demostrar que

ϕ_{A} (t) \equiv E e^{i t A} \to e^{- t^{2} / 2}, ϕ_{B} (t) \to e^{- t^{2} / 2} .

$\phi_A(t)\equiv {\bf E}e^{itA}\to e^{-t^2/2},\ \phi_B(t)\to e^{-t^2/2}.$

D := \frac{a}{\sqrt{a^{2} + b^{2}}} A - \frac{b}{\sqrt{a^{2} + b^{2}}} B \to_{d} N (0, 1)

$D:=\frac{a}{\sqrt{a^2+b^2}}A-\frac{b}{\sqrt{a^2+b^2}}B\to_d N(0,1)$

Como y son independientes, $A$ $B$ como deseamos que sea.

ϕ_{D} (t) = ϕ_{A} (\frac{a}{\sqrt{a^{2} + b^{2}}} t) ϕ_{B} (- \frac{b}{\sqrt{a^{2} + b^{2}}} t) \to e^{- t^{2} / 2},

$\phi_D(t)=\phi_A\left(\frac{a}{\sqrt{a^2+b^2}}t\right)\phi_B\left(-\frac{b}{\sqrt{a^2+b^2}}t\right)\to e^{-t^2/2},$

Esta prueba está incompleta. Aquí necesitamos algunas estimaciones para la convergencia uniforme de funciones características. Sin embargo, en el caso bajo consideración podemos hacer cálculos explícitos. Poner . $p=\theta_1,\ m=n_1$ como. Por lo tanto, para unafija,

\begin{aligned} ϕ_{X_{1, 1}} (t) & = 1 + p (e^{i t} - 1), \\ ϕ_{{\bar{X}}_{1}} (t) & = (1 + p (e^{i t / m} - 1))^{m}, \\ ϕ_{{\bar{X}}_{1} - θ_{1}} (t) & = (1 + p (e^{i t / m} - 1))^{m} e^{- i p t}, \\ ϕ_{A} (t) & = (1 + p (e^{i t / \sqrt{m p (1 - p)}} - 1))^{m} e^{- i p t \sqrt{m} / \sqrt{p (1 - p)}} \\ = {((1 + p (e^{i t / \sqrt{m p (1 - p)}} - 1)) e^{- i p t / \sqrt{m p (1 - p)}})}^{m} \\ = {(1 - \frac{t^{2}}{2 m} + O (t^{3} m^{- 3 / 2}))}^{m} \end{aligned}

$\begin{align} \phi_{X_{1,1}}(t) &= 1+p(e^{it}-1), \\ \phi_{\bar X_{1}}(t) &= (1+p(e^{it/m}-1))^m, \\ \phi_{\bar X_{1}-\theta_1}(t) &= (1+p(e^{it/m}-1))^m e^{-ipt}, \\ \phi_{A}(t) &= (1+p(e^{it/\sqrt{mp(1-p)}}-1))^m e^{-ipt\sqrt{m}/\sqrt{p(1-p)}} \\[5pt] &= \left( \left(1+p(e^{it/\sqrt{mp(1-p)}}-1)\right)e^{-ipt/\sqrt{mp(1-p)}}\right)^m \\[5pt] &=\left( 1-\frac{t^2}{2m}+O(t^3m^{-3/2}) \right)^m \end{align}$

t^{3} m^{- 3 / 2} \to 0

$t^3m^{-3/2}\to 0$

t

$t$

ϕ_{D} (t) = {(1 - \frac{a^{2} t^{2}}{2 (a^{2} + b^{2}) n_{1}} + O (n_{1}^{- 3 / 2}))}^{n_{1}} {(1 - \frac{b^{2} t^{2}}{2 (a^{2} + b^{2}) n_{2}} + O (n_{2}^{- 3 / 2}))}^{n_{2}} \to e^{- t^{2} / 2}

$\phi_D(t)=\left( 1-\frac{a^2t^2}{2(a^2+b^2)n_1}+O(n_1^{-3/2}) \right)^{n_1} \left( 1-\frac{b^2t^2}{2(a^2+b^2)n_2}+O(n_2^{-3/2}) \right)^{n_2} \to e^{-t^2/2}$ (even if

a \to 0

$a\to 0$ or

b \to 0

$b\to 0$ ), since

| e^{- y} - (1 - y / m)^{m} | \leq y^{2} / 2 m

$\left|e^{-y}-(1-y/m)^m\right|\le {y^2}/{2m}\$ when

y / m < 1 / 2

$\ y/m<1/2$ (see /math/2566469/uniform-bounds-for-1-y-nn-exp-y/ ).

Note that similar calculations may be done for arbitrary (not necessarily Bernoulli) distributions with finite second moments, using the expansion of characteristic function in terms of the first two moments.

— Viktor
fuente

This seems correct. I'll get back to you later on, when I have time to check everything. ;)

— Un viejo en el mar.

-1

Proving your statement is equivalent to proving the (Levy-Lindenberg) Central Limit Theorem which states

If $\{Z_i\}_{i=1}^n$ is a sequence of i.i.d random variable with finite mean $\mathbb{E}(Z_i) = \mu$ and finite variance $\mathbb{V}(Z_i) = \sigma^2$ then

\sqrt{n} (\bar{Z} - μ) \to^{d} N (0, σ^{2})

$\sqrt{n}(\bar{Z} - \mu) \to^d N(0,\sigma^2)$

Here $\bar{Z} = \sum_i Z_i/n$ that is the sample variance.

Then it is easy to see that if we put

Z_{i} = X_{1} i - X_{2} i

$Z_i = X_1i - X_2i$ with

X_{1 i}, X_{2 i}

$X_{1i}, X_{2i}$ following a

B e r (θ_{1})

$Ber(\theta_1)$ and

B e r (θ_{2})

$Ber(\theta_2)$ respectively the conditions for the theorem are satisfied, in particular

E (Z_{i}) = θ_{1} - θ_{2} = μ

$\mathbb{E}(Z_i) = \theta_1 - \theta_2 = \mu$

and

V (Z_{i}) = θ_{1} (1 - θ_{1}) + θ_{2} (1 - θ_{2}) = σ^{2}

$\mathbb{V}(Z_i)= \theta_1(1-\theta_1) +\theta_2(1-\theta_2)= \sigma^2$

(There's a last passage, and you have to adjust this a bit for the general case where $n_1 \neq n_2$ but I have to go now, will finish tomorrow or you can edit the question with the final passage as an exercise )

— Three Diag
fuente

I could not obtain what I wanted exactly because of the possibility of

n_{1} \neq n_{2}

$n_1\neq n_2$

— An old man in the sea.

I will show later if you can't get it. Hint: compute the variance of the sample mean of Z and use that as the variable in the theorem

— Three Diag

Three, could you please add the details for when

n_{1} \neq n_{2}

$n_1 \neq n_2$ ? Gracias

— Un viejo en el mar.

Lo haré tan pronto como encuentre un pequeño timr. De hecho, había una sutileza que evita el uso de LL clt sin ajuste. Hay tres formas de hacerlo, la más simple de las cuales es invocar el hecho de que para grandes n1 y n2, X1 y X2 se distribuyen a normales, entonces una combinación lineal de normal también es normal. Esta es una propiedad de las normales que puede tomar como dada, de lo contrario puede probarlo mediante funciones características.

— Tres Diag

Los otros dos requieren un clt diferente (posiblemente Lyapunov) o, alternativamente, tratar n1 = i y n2 = i + k. Luego, para grandes i, esencialmente puede ignorar k y puede volver a aplicar LL (pero aún requerirá un poco de cuidado para lograr la varianza correcta)

— Three Diag