Cómo interpretar las estimaciones de parámetros en los resultados de Poisson GLM [cerrado]

Cerrado. Esta pregunta está fuera de tema . Actualmente no está aceptando respuestas.

¿Quieres mejorar esta pregunta? Actualice la pregunta para que esté en el tema de Cross Validated.

Cerrado hace 5 años .

Call:
glm(formula = darters ~ river + pH + temp, family = poisson, data = darterData)

Deviance Residuals:
    Min      1Q   Median     3Q    Max
-3.7422 -1.0257   0.0027 0.7169 3.5347

Coefficients:
              Estimate Std.Error z value Pr(>|z|)
(Intercept)   3.144257  0.218646  14.381  < 2e-16 ***
riverWatauga -0.049016  0.051548  -0.951  0.34166
pH            0.086460  0.029821   2.899  0.00374 **
temp         -0.059667  0.009149  -6.522  6.95e-11 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

(Dispersion parameter for poisson family taken to be 1)
Null deviance: 233.68 on 99 degrees of freedom
Residual deviance: 187.74 on 96 degrees of freedom
AIC: 648.21

Quiero saber cómo interpretar cada estimación de parámetro en la tabla anterior.

— tomjerry001
fuente

La interpretación es idéntica: stats.stackexchange.com/a/126225/7071

— Dimitriy V. Masterov

Esta pregunta parece estar fuera de tema porque se trata de explicar una salida R sin ninguna forma de pregunta inteligente detrás. Esta es la categoría "Vuelco la salida de mi computadora allí y usted ejecuta el análisis de estadísticas para mí" ...

— Xi'an

Su parámetro de dispersión parece indicar que hay algunos problemas con su modelo. Quizás debería considerar usar una distribución de cuasipoisson en su lugar. Apuesto a que sus estimaciones de parámetros cambiarán drásticamente y también lo hará la interpretación. Si ejecuta "plot (model)" obtendrá algunos gráficos de sus residuos, eche un vistazo a estos gráficos para patrones no deseados antes de comenzar a interpretar su modelo real. Para trazar rápidamente el ajuste de su modelo, también puede usar "visreg (modelfit)" del paquete visreg

— Robbie

@ Xi'an, aunque la pregunta es escasa y requiere edición, no creo que esté fuera de tema. Considere estas preguntas que no se consideran fuera de tema: Interpretación de la salida lm () de R , e Interpretación de la salida de R para la regresión binomial . Sin embargo, parece ser un duplicado .

— gung - Restablece a Monica

Este es un duplicado de ¿Cómo interpretar los coeficientes en una regresión de Poisson? Por favor, lea el hilo vinculado. Si aún tiene una pregunta después de leer eso, regrese aquí y edite su pregunta para indicar lo que ha aprendido y lo que aún necesita saber, entonces podemos proporcionar la información que necesita sin simplemente duplicar material en otro lugar que ya no ayudó. tú.

— gung - Restablece a Monica

No creo que el título de su pregunta capture con precisión lo que está pidiendo.

La cuestión de cómo interpretar los parámetros en un GLM es muy amplia porque el GLM es una clase muy amplia de modelos. Recuerde que un GLM modela una variable de respuesta que se supone que sigue una distribución conocida de la familia exponencial, y que hemos elegido una función invertible tal que $y$ $g$ paralas variables predictoras . En este modelo, la interpretación de cualquier parámetro particular es la tasa de cambio de con respecto a . Definir

mi [y El | X] = {sol}^{- 1} (X_{0 0} + X_{1} β_{1} + \dots + X_{J} β_{J})

$\mathrm{E}\left[y\,|\,x\right] = g^{-1}{\left(x_0 + x_1\beta_1 + \dots + x_J\beta_J\right)}$

J

$J$

x

$x$

β_{j}

$\beta_j$

g (y)

$g(y)$

x_{j}

$x_j$

para mantener limpia la notación. Entonces, para cualquier

μ \equiv E [y | x] = g^{- 1} (x)

$\mu \equiv \mathrm{E}{\left[y\,|\,x\right]} = g^{-1}{\left(x\right)}$

η \equiv x \cdot β

$\eta \equiv x \cdot \beta$

j \in {1, \dots, J}

$j \in \{1,\dots,J\}$

Ahora defina

como un vector deceros

y un solo

en laposición

, de modo que, por ejemplo, si

entonces

. Entonces

β_{j} = \frac{\partial η}{\partial x_{j}} = \frac{\partial g (μ)}{\partial x_{j}} .

$\beta_j = \frac{\partial\,\eta}{\partial\,x_j} = \frac{\partial\,g(\mu)}{\partial\,x_j} \text{.}$

e_{j}

$\mathfrak{e}_j$

J - 1

$J-1$

1

$1$

j

$j$

J = 5

$J=5$

e_{3} = (0, 0, 1, 0, 0)

$\mathfrak{e}_3 = \left(0,0,1,0,0\right)$

β_{j} = g (E [y | x + e_{j}]) - g (E [y | x])

$\beta_j = g{\left(\mathrm{E}{\left[y\,|\,x + \mathfrak{e}_j \right]}\right)} - g{\left(\mathrm{E}{\left[y\,|\,x\right]}\right)}$

Lo que simplemente significa que es el efecto sobre de un aumento unitario en . $\beta_j$ $\eta$ $x_j$

También puede establecer la relación de esta manera: y

\frac{\partial E [y | x]}{\partial x_{j}} = \frac{\partial μ}{\partial x_{j}} = \frac{d μ}{d η} \frac{\partial η}{\partial x_{j}} = \frac{\partial μ}{\partial η} β_{j} = \frac{d g^{- 1}}{d η} β_{j}

$\frac{\operatorname{\partial}\mathrm{E}{\left[y\,|\,x\right]}}{\operatorname{\partial}x_j} = \frac{\operatorname{\partial}\mu}{\operatorname{\partial}x_j} = \frac{\operatorname{d}\mu}{\operatorname{d}\eta}\frac{\operatorname{\partial}\eta}{\operatorname{\partial}x_j} = \frac{\operatorname{\partial}\mu}{\operatorname{\partial}\eta} \beta_j = \frac{\operatorname{d}g^{-1}}{\operatorname{d}\eta} \beta_j$

E [y | x + e_{j}] - E [y | x] \equiv Δ_{j} \hat{y} = g^{- 1} ((x + e_{j}) β) - g^{- 1} (x β)

$\mathrm{E}{\left[y\,|\,x + \mathfrak{e}_j \right]} - \mathrm{E}{\left[y\,|\,x\right]} \equiv \operatorname{\Delta_j} \hat y = g^{-1}{\left( \left(x + \mathfrak{e}_j\right)\beta \right)} - g^{-1}{\left( x\,\beta \right)}$

Sin saber nada sobre , eso es lo más lejos que podemos llegar. es el efecto sobre , en la media condicional transformada de , de una unidad de aumento en , y el efecto sobre la media condicional de de un aumento de una unidad en es . $g$ $\beta_j$ $\eta$ $y$ $x_j$ $y$ $x_j$ $g^{-1}{\left(\beta\right)}$

Pero parece estar preguntando específicamente sobre la regresión de Poisson usando la función de enlace predeterminada de R, que en este caso es el logaritmo natural. Si ese es el caso, está preguntando acerca de un tipo específico de GLM en el que y . Entonces podemos obtener algo de tracción con respecto a una interpretación específica. $y \sim \mathrm{Poisson}{\left(\lambda\right)}$ $g = \ln$

Por lo que dije anteriormente, sabemos que $\frac{\operatorname{\partial}\mu}{\operatorname{\partial}x_j} = \frac{\operatorname{d}g^{-1}}{\operatorname{d}\eta} \beta_j$ . And since we know $g(\mu) = \ln(\mu)$ , we also know that $g^{-1}(\eta) = e^\eta$ . We also happen to know that $\frac{\operatorname{d}e^\eta}{\operatorname{d}\eta} = e^\eta$ , so we can say that

\frac{\partial μ}{\partial x_{j}} = \frac{\partial E [y | x]}{\partial x_{j}} = e^{x_{0} + x_{1} β_{1} + \dots + x_{J} β_{J}} β_{j}

$\frac{\operatorname{\partial}\mu}{\operatorname{\partial}x_j} = \frac{\operatorname{\partial}\mathrm{E}{\left[y\,|\,x\right]}}{\operatorname{\partial}x_j} = e^{x_0 + x_1\beta_1 + \dots + x_J\beta_J}\beta_j$

which finally means something tangible:

Given a very small change in $x_j$ , the fitted $\hat y$ changes by $\hat y\,\beta_j$ .

Note: this approximation can actually work for changes as large as 0.2, depending on how much precision you need.

And using the more familiar unit change interpretation, we have:

\begin{aligned} Δ_{j} \hat{y} & = e^{x_{0} + x_{1} β_{1} + \dots + (x_{j} + 1) β_{j} + \dots + x_{J} β_{J}} - e^{x_{0} + x_{1} β_{1} + \dots + x_{J} β_{J}} \\ = e^{x_{0} + x_{1} β_{1} + \dots + x_{J} β_{J} + β_{j}} - e^{x_{0} + x_{1} β_{1} + \dots + x_{J} β_{J}} \\ = e^{x_{0} + x_{1} β_{1} + \dots + x_{J} β_{J}} e_{j}^{β} - e^{x_{0} + x_{1} β_{1} + \dots + x_{J} β_{J}} \\ = e^{x_{0} + x_{1} β_{1} + \dots + x_{J} β_{J}} (e_{j}^{β} - 1) \end{aligned}

$\begin{align} \operatorname{\Delta_j} \hat y &= e^{ x_0 + x_1\beta_1 + \dots + \left(x_j + 1\right)\,\beta_j + \dots + x_J\beta_J } - e^{x_0 + x_1\beta_1 + \dots + x_J\beta_J} \\ &= e^{ x_0 + x_1\beta_1 + \dots + x_J\beta_J + \beta_j} - e^{x_0 + x_1\beta_1 + \dots + x_J\beta_J} \\ &= e^{ x_0 + x_1\beta_1 + \dots + x_J\beta_J}e^\beta_j - e^{x_0 + x_1\beta_1 + \dots + x_J\beta_J} \\ &= e^{ x_0 + x_1\beta_1 + \dots + x_J\beta_J} \left( e^\beta_j - 1 \right) \end{align}$ which means

Given a unit change in $x_j$ , the fitted $\hat y$ changes by $\hat y \left( e^\beta_j - 1 \right)$ .

There are three important pieces to note here:

The effect of a change in the predictors depends on the level of the response.
An additive change in the predictors has a multiplicative effect on the response.
You can't interpret the coefficients just by reading them (unless you can compute arbitrary exponentials in your head).

So in your example, the effect of increasing pH by 1 is to increase $\ln \hat y$ by $\hat y \left( e^{0.09} - 1 \right)$ ; that is, to multiply $\hat y$ by $e^{0.09} \approx 1.09$ . It looks like your outcome is the number of darters you observe in some fixed unit of time (say, a week). So if you're observing 100 darters a week at a pH of 6.7, raising the pH of the river to 7.7 means you can now expect to see 109 darters a week.

— shadowtalker
fuente

I made a couple tweaks here, @ssdecontrol. I think they'll make your post a little easier to follow, but if you don't like them, roll them back with my apologies.

— gung - Reinstate Monica

I you can't figure that out from my answer then clearly I need to revise the answer. What are you still confused about?

— shadowtalker

Plug those numbers into the equation just like in linear regression

— shadowtalker

@skan no, I mean

E [y | x]

$E[y|x]$ .

x

$x$ and

y

$y$ are random variables representing to a single observation.

x

$x$ is a vector indexed by

j

$j$ ;

x_{j}

$x_j$ is the random variable representing a specific feature/regressor/input/predictor for that observation.

— shadowtalker

And don't overthink it. Once you understand all the pieces in a GLM, the manipulations here are just a direct application of calculus principles. It really is as simple as taking the derivative with respect to the variable you're interested in.

— shadowtalker

My suggestion would be to create a small grid consisting of combinations of the two rivers and two or three values of each of the covariates, then use the predict function with your grid as newdata. Then graph the results. It is much clearer to look at the values that the model actually predicts. You may or may not want to back-transform the predictions to the original scale of measurement (type = "response").

— Russ Lenth
fuente

As much as I like this approach (I do it all the time) I think it's counterproductive for building understanding.

— shadowtalker