Interpretation of parameter estimates in Poisson GLM results [closed]

Closed. This question is off-topic. It is not currently accepting answers.

Closed 5 years ago.

Call:
glm(formula = darters ~ river + pH + temp, family = poisson, data = darterData)

Deviance Residuals:
    Min       1Q   Median       3Q      Max
-3.7422  -1.0257   0.0027   0.7169   3.5347

Coefficients:
              Estimate Std. Error z value Pr(>|z|)
(Intercept)   3.144257   0.218646  14.381  < 2e-16 ***
riverWatauga -0.049016   0.051548  -0.951  0.34166
pH            0.086460   0.029821   2.899  0.00374 **
temp         -0.059667   0.009149  -6.522 6.95e-11 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

(Dispersion parameter for poisson family taken to be 1)
Null deviance: 233.68 on 99 degrees of freedom
Residual deviance: 187.74 on 96 degrees of freedom
AIC: 648.21

I would like to know how to interpret each parameter estimate in the table above.

— tomjerry001

The interpretation is the same: stats.stackexchange.com/a/126225/7071

— Dimitriy V. Masterov

This question appears to be off-topic because it asks for an explanation of R output without any form of intelligent question behind it. It falls into the category "I dump my computer output here and you run the statistical analysis for me" ...

— Xi'an

Your dispersion parameter seems to indicate that there are some problems with your model. Perhaps you should consider using a quasipoisson distribution instead. I bet your parameter estimates will change drastically, and so will the interpretation. If you run "plot(model)" you will get some plots of your residuals. Check these plots for unwanted patterns before you start interpreting your actual model. For a quick plot of your model's fit you can also use "visreg(modelfit)" from the visreg package.

— Robbie
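A rough way to look at the overdispersion concern raised in the comment above, assuming the common rule of thumb that residual deviance divided by residual degrees of freedom should be close to 1 for a well-specified Poisson model. The two numbers are copied from the printed summary; the Python is only arithmetic and the variable names are illustrative.

```python
# Values copied from the glm summary in the question.
residual_deviance = 187.74
residual_df = 96

# Rule-of-thumb dispersion estimate. Assumption: this is the deviance-based
# ratio; a quasi-Poisson fit estimates dispersion from the Pearson statistic,
# which cannot be recomputed here without the raw data.
dispersion_ratio = residual_deviance / residual_df
print(f"residual deviance / df = {dispersion_ratio:.3f}")
```

A ratio near 2 hints at some overdispersion. Note that a quasi-Poisson refit keeps the same coefficient estimates as the Poisson fit and only rescales the standard errors, so the reading of the coefficients themselves would not change.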

@Xi'an, although the question is sparse and needs editing, I don't think it is off-topic. Consider these questions, which are not considered off-topic: Interpretation of R's lm() output, and Interpretation of R's output for binomial regression. It does appear to be a duplicate, however.

— gung - Reinstate Monica

This is a duplicate of How to interpret coefficients in a Poisson regression? Please read the linked thread. If you still have a question after reading it, come back here and edit your question to state what you have learned and what you still need to know. Then we can provide the information you need without simply duplicating material elsewhere that already didn't help you.

— gung - Reinstate Monica

I don't think the title of your question accurately captures what you're asking for.

The question of how to interpret the parameters in a GLM is very broad, because the GLM is a very broad class of models. Recall that a GLM models a response variable $y$ that is assumed to follow a known distribution from the exponential family, and that we have chosen an invertible function $g$ such that

$$\mathrm{E}\left[y\,|\,x\right] = g^{-1}{\left(x_0 + x_1\beta_1 + \dots + x_J\beta_J\right)}$$

for $J$ predictor variables $x$. In this model, the interpretation of any particular parameter $\beta_j$ is the rate of change of the transformed conditional mean $g{\left(\mathrm{E}\left[y\,|\,x\right]\right)}$ with respect to $x_j$. Define $\eta \equiv x \cdot \beta$ (shorthand for the linear predictor $x_0 + x_1\beta_1 + \dots + x_J\beta_J$) and $\mu \equiv \mathrm{E}{\left[y\,|\,x\right]} = g^{-1}{\left(\eta\right)}$ to keep the notation clean. Then, for any $j \in \{1,\dots,J\}$,

$$\beta_j = \frac{\partial\,\eta}{\partial\,x_j} = \frac{\partial\,g(\mu)}{\partial\,x_j} \text{.}$$

Now define $\mathfrak{e}_j$ as a vector of $J-1$ zeros and a single $1$ in the $j$th position, so that, for example, if $J=5$ then $\mathfrak{e}_3 = \left(0,0,1,0,0\right)$. Then

$$\beta_j = g{\left(\mathrm{E}{\left[y\,|\,x + \mathfrak{e}_j \right]}\right)} - g{\left(\mathrm{E}{\left[y\,|\,x\right]}\right)}$$

which just means that $\beta_j$ is the effect on $\eta$ of a unit increase in $x_j$.
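As a small numerical illustration of the last identity, here is a sketch in Python. Everything in it is an assumption made for the example: the logit link stands in for a generic invertible $g$, the coefficients and the observation are made up, and the first entry of `x` is fixed at 1 so that `beta[0]` acts as the intercept.

```python
import math

def logit(p):
    """Example link function g."""
    return math.log(p / (1.0 - p))

def inv_logit(eta):
    """Inverse link g^{-1}."""
    return 1.0 / (1.0 + math.exp(-eta))

# Hypothetical coefficients and one hypothetical observation (not from the question).
beta = [0.5, -1.2, 0.8]
x = [1.0, 0.3, 2.0]  # x[0] = 1 multiplies the intercept beta[0]

def mean_response(x, beta):
    """E[y | x] = g^{-1}(x . beta)."""
    eta = sum(xi * bi for xi, bi in zip(x, beta))
    return inv_logit(eta)

j = 2
x_plus = list(x)
x_plus[j] += 1.0  # this is x + e_j

# Difference of the transformed conditional means on the link scale.
link_difference = logit(mean_response(x_plus, beta)) - logit(mean_response(x, beta))
print(link_difference, beta[j])
```

Up to floating-point error the two printed numbers agree, whatever starting `x` is chosen: on the link scale the effect of a unit increase is just the coefficient.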

You can also state the relationship this way:

$$\frac{\partial\,\mathrm{E}{\left[y\,|\,x\right]}}{\partial\,x_j} = \frac{\partial\,\mu}{\partial\,x_j} = \frac{d\mu}{d\eta}\frac{\partial\,\eta}{\partial\,x_j} = \frac{d\mu}{d\eta}\,\beta_j = \frac{d g^{-1}}{d\eta}\,\beta_j$$

and

$$\mathrm{E}{\left[y\,|\,x + \mathfrak{e}_j \right]} - \mathrm{E}{\left[y\,|\,x\right]} \equiv \Delta_j \hat y = g^{-1}{\left( \left(x + \mathfrak{e}_j\right)\beta \right)} - g^{-1}{\left( x\,\beta \right)}$$

Without knowing anything about $g$, that is as far as we can go. $\beta_j$ is the effect on $\eta$, the transformed conditional mean of $y$, of a unit increase in $x_j$; the effect on the conditional mean of $y$ itself of a unit increase in $x_j$ is $g^{-1}{\left( \left(x + \mathfrak{e}_j\right)\beta \right)} - g^{-1}{\left( x\,\beta \right)}$.
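The two mean-scale statements, the chain-rule derivative and the unit-change difference, can be checked numerically. The sketch below again assumes a logit link and made-up coefficients purely for illustration; for that link $\frac{d g^{-1}}{d\eta} = \mu(1-\mu)$.

```python
import math

def inv_logit(eta):
    """Inverse link g^{-1} for the (assumed) logit link."""
    return 1.0 / (1.0 + math.exp(-eta))

# Hypothetical coefficients and observation; x[0] = 1 carries the intercept.
beta = [0.5, -1.2, 0.8]
x = [1.0, 0.3, 2.0]
j = 1

def mu(x):
    """Conditional mean E[y | x] = g^{-1}(x . beta)."""
    return inv_logit(sum(xi * bi for xi, bi in zip(x, beta)))

# Central finite difference for d mu / d x_j.
h = 1e-6
x_hi = list(x); x_hi[j] += h
x_lo = list(x); x_lo[j] -= h
numeric_derivative = (mu(x_hi) - mu(x_lo)) / (2.0 * h)

# Chain rule: (d g^{-1} / d eta) * beta_j, with d g^{-1}/d eta = mu * (1 - mu).
m = mu(x)
analytic_derivative = m * (1.0 - m) * beta[j]

# Exact effect of a full unit increase in x_j on the mean.
x_unit = list(x); x_unit[j] += 1.0
unit_change = mu(x_unit) - m

print(numeric_derivative, analytic_derivative, unit_change)
```

The first two numbers agree, while the unit-change effect differs noticeably from the derivative because $g^{-1}$ is nonlinear.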

However, you seem to be asking specifically about Poisson regression with R's default link function, which in this case is the natural logarithm. If so, you are asking about a specific kind of GLM in which $y \sim \mathrm{Poisson}{\left(\lambda\right)}$ and $g = \ln$. Then we can get some traction on a specific interpretation.

From what I said above, we know that $\frac{\partial\,\mu}{\partial\,x_j} = \frac{d g^{-1}}{d\eta}\,\beta_j$. And since we know that $g(\mu) = \ln(\mu)$, we also know that $g^{-1}(\eta) = e^\eta$. We also happen to know that $\frac{d e^\eta}{d\eta} = e^\eta$, so we can say that

$$\frac{\partial\,\mu}{\partial\,x_j} = \frac{\partial\,\mathrm{E}{\left[y\,|\,x\right]}}{\partial\,x_j} = e^{x_0 + x_1\beta_1 + \dots + x_J\beta_J}\,\beta_j$$

which finally means something tangible:

Given a very small change in $x_j$, the fitted $\hat y$ changes by $\hat y\,\beta_j$ times that change.

Note: this approximation can actually work for changes as large as 0.2, depending on how much precision you need.
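To get a feel for how good the small-change approximation is, the sketch below compares it with the exact change under the log link, $\hat y\left(e^{\beta_j \delta} - 1\right)$, for a few step sizes $\delta$. The coefficient is the pH estimate from the question; the starting fitted value of 100 and the step sizes are assumptions chosen for illustration.

```python
import math

beta_pH = 0.086460  # pH coefficient from the summary in the question
y_hat = 100.0       # hypothetical fitted count before the change

relative_error = {}
for delta in (0.01, 0.2, 1.0):
    exact = y_hat * (math.exp(beta_pH * delta) - 1.0)  # exact change under a log link
    approx = y_hat * beta_pH * delta                   # derivative-based approximation
    relative_error[delta] = (exact - approx) / exact
    print(f"delta={delta:<4}  exact={exact:.4f}  approx={approx:.4f}  "
          f"relative error={relative_error[delta]:.2%}")
```

With this coefficient the approximation is off by under 1% for a change of 0.2 and by a few percent for a full unit change; a larger $|\beta_j|$ would make it degrade faster.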

For the more familiar unit change in $x_j$, we have

$$\begin{aligned} \Delta_j \hat y &= e^{ x_0 + x_1\beta_1 + \dots + \left(x_j + 1\right)\,\beta_j + \dots + x_J\beta_J } - e^{x_0 + x_1\beta_1 + \dots + x_J\beta_J} \\ &= e^{ x_0 + x_1\beta_1 + \dots + x_J\beta_J + \beta_j} - e^{x_0 + x_1\beta_1 + \dots + x_J\beta_J} \\ &= e^{ x_0 + x_1\beta_1 + \dots + x_J\beta_J}\,e^{\beta_j} - e^{x_0 + x_1\beta_1 + \dots + x_J\beta_J} \\ &= e^{ x_0 + x_1\beta_1 + \dots + x_J\beta_J} \left( e^{\beta_j} - 1 \right) \end{aligned}$$

which means

Given a unit change in $x_j$, the fitted $\hat y$ changes by $\hat y \left( e^{\beta_j} - 1 \right)$.

There are three important pieces to note here:

The effect of a change in the predictors depends on the level of the response.
An additive change in the predictors has a multiplicative effect on the response.
You can't interpret the coefficients just by reading them (unless you can compute arbitrary exponentials in your head).

So in your example, the effect of increasing pH by 1 is to increase $\ln \hat y$ by about $0.09$, and therefore to increase $\hat y$ by $\hat y \left( e^{0.09} - 1 \right)$; that is, to multiply $\hat y$ by $e^{0.09} \approx 1.09$. It looks like your outcome is the number of darters you observe in some fixed unit of time (say, a week). So if you're observing 100 darters a week at a pH of 6.7, raising the pH of the river to 7.7 means you can now expect to see about 109 darters a week, with river and temperature held fixed.
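The same arithmetic can be applied to every slope in the table. The sketch below exponentiates the coefficients reported in the question to get the multiplicative effect of a one-unit increase in each predictor, holding the others fixed. The baseline counts of 10 and 100 are hypothetical; only the coefficients come from the output.

```python
import math

# Slope estimates copied from the summary in the question.
coefficients = {
    "riverWatauga": -0.049016,
    "pH": 0.086460,
    "temp": -0.059667,
}

# exp(beta_j) is the factor by which the expected count is multiplied
# for a one-unit increase in predictor j.
rate_ratios = {name: math.exp(b) for name, b in coefficients.items()}
for name, ratio in rate_ratios.items():
    print(f"{name}: expected count x {ratio:.3f} ({(ratio - 1.0) * 100:+.1f}%)")

# The additive change depends on the starting level (hypothetical baselines).
change_from_10 = 10.0 * (rate_ratios["pH"] - 1.0)
change_from_100 = 100.0 * (rate_ratios["pH"] - 1.0)
after_ph_increase = 100.0 * rate_ratios["pH"]
print(f"pH +1 from 10 darters: {change_from_10:+.2f}; from 100 darters: {change_from_100:+.2f}")
```

The multiplier is the same at every level, while the change in the count itself scales with the starting value, which is the first two of the three points listed above.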

— shadowtalker

I made a couple tweaks here, @ssdecontrol. I think they'll make your post a little easier to follow, but if you don't like them, roll them back with my apologies.

— gung - Reinstate Monica

If you can't figure that out from my answer then clearly I need to revise the answer. What are you still confused about?

— shadowtalker

Plug those numbers into the equation just like in linear regression

— shadowtalker

@skan no, I mean $E[y|x]$. $x$ and $y$ are random variables corresponding to a single observation. $x$ is a vector indexed by $j$; $x_j$ is the random variable representing a specific feature/regressor/input/predictor for that observation.

— shadowtalker

And don't overthink it. Once you understand all the pieces in a GLM, the manipulations here are just a direct application of calculus principles. It really is as simple as taking the derivative with respect to the variable you're interested in.

— shadowtalker

My suggestion would be to create a small grid consisting of combinations of the two rivers and two or three values of each of the covariates, then use the predict function with your grid as newdata. Then graph the results. It is much clearer to look at the values that the model actually predicts. You may or may not want to back-transform the predictions to the original scale of measurement (type = "response").

— rvl
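A minimal sketch of the grid idea from the comment above. It is written in Python and computes the predictions by hand from the rounded coefficients printed in the question, so it is a stand-in for calling predict on the fitted model, not a replacement for it. The grid values for pH and temperature are hypothetical, the name of the reference river is not shown in the output, and `predict_count` is a helper invented for this example.

```python
import math
from itertools import product

# Coefficients copied (rounded) from the summary in the question.
COEF = {
    "(Intercept)": 3.144257,
    "riverWatauga": -0.049016,
    "pH": 0.086460,
    "temp": -0.059667,
}

def predict_count(is_watauga, ph, temp, scale="response"):
    """Hand-computed prediction: linear predictor, optionally back-transformed."""
    eta = (COEF["(Intercept)"] + COEF["riverWatauga"] * is_watauga
           + COEF["pH"] * ph + COEF["temp"] * temp)
    return math.exp(eta) if scale == "response" else eta

# Hypothetical grid: both rivers and three values of each covariate.
rivers = {"reference river": 0, "Watauga": 1}
ph_values = [6.5, 7.0, 7.5]
temp_values = [10.0, 15.0, 20.0]

grid = []
for (river, is_watauga), ph, temp in product(rivers.items(), ph_values, temp_values):
    grid.append({
        "river": river,
        "pH": ph,
        "temp": temp,
        "expected_count": predict_count(is_watauga, ph, temp),
    })

for row in grid:
    print(f"{row['river']:>15}  pH={row['pH']:.1f}  temp={row['temp']:4.1f}  "
          f"expected darters={row['expected_count']:6.2f}")
```

Passing `scale="link"` returns the linear predictor instead of the count, mirroring the choice between the link scale and the response scale mentioned in the comment.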

As much as I like this approach (I do it all the time) I think it's counterproductive for building understanding.

— shadowtalker