Randeffekt des Probit- und Logit-Modells

12

Kann jemand erklären, wie man den Randeffekt des Probit- und Logit-Modells in Laienbegriffen berechnet?

Ich bin neu in der Statistik und bin verwirrt über diese beiden Modelle.

— Kennzeichen
quelle

Beachten Sie, dass die Zahlen, die aus Probit- und Logit-Modellen stammen, so aussehen, als würden sie ungefähr dasselbe messen, sich jedoch häufig numerisch unterscheiden. Wenn Sie sie wieder in das wirkliche Leben übersetzen, wird der Unterschied zwischen den beiden normalerweise viel kleiner.

— Henry

13

Ich denke , eine bessere Möglichkeit , den marginalen Effekt einer bestimmten Variablen, sagen zu sehen , ist ein Streudiagramm der vorhergesagten Wahrscheinlichkeit auf der vertikalen Achse zu erzeugen, und haben $X_j$ $X_j$ auf der horizontalen Achse. Dies ist die "Laien" -Methode, die ich mir vorstellen kann, um anzuzeigen, wie einflussreich eine bestimmte Variable ist. Keine Mathematik, nur Bilder. Wenn Sie viele Datenpunkte haben, kann ein Boxplot oder ein Streudiagramm glatter helfen, um zu sehen, wo sich die meisten Daten befinden (im Gegensatz zu nur einer Punktwolke).

Ich bin mir nicht sicher, wie "Laie" der nächste Abschnitt ist, aber Sie finden ihn möglicherweise nützlich.

Wenn wir den Randeffekt betrachten, nennen wir ihn und stellen fest, dass $m_j$ $g(p)=\sum_kX_k\beta_k$ ist

m_{j} = \frac{\partial p}{\partial X_{j}} = \frac{β_{j}}{g^{'} [g^{- 1} (X^{T} β)]} = \frac{β_{j}}{g^{'} (p)}

$m_j=\frac{\partial p}{\partial X_j}=\frac{\beta_j}{g'\left[g^{-1}(X^T\beta)\right]}=\frac{\beta_j}{g'(p)}$

Der marginale Effekt hängt also zusätzlich zum Beta von der geschätzten Wahrscheinlichkeit und dem Gradienten der Link-Funktion ab. Die Division durch ergibt sich aus der Kettenregel zur Differenzierung und der Tatsache, dass $g'(p)$ . Dies kann gezeigt werden, indem beide Seiten der offensichtlich wahren Gleichung. Wir haben auch, dassper Definition ist. Für ein Logit-Modell gilt $\frac{\partial g^{-1}(z)}{\partial z}=\frac{1}{g'\left[g^{-1}(z)\right]}$ $z=g\left[g^{-1}(z)\right]$ $g^{-1}(X^T\beta)=p$ , und der marginale Effekt ist: $g(p)=\log(p)-\log(1-p)\implies g'(p)=\frac{1}{p}+\frac{1}{1-p}=\frac{1}{p(1-p)}$

m_{j}^{l o g i t} = β_{j} p (1 - p)

$m_j^{logit}=\beta_jp(1-p)$

Was bedeutet das? Die Vertiefung ist bei und bei Null und erreicht bei ihren Maximalwert von . Der marginale Effekt ist also am größten, wenn die Wahrscheinlichkeit nahe , und am kleinsten, wenn nahe oder nahe . Allerdings hängt noch von , so dass die marginalen Effekten kompliziert sind. In der Tat, weil es darauf ankommt $p(1-p)$ $p=0$ $p=1$ $0.25$ $p=0.5$ $0.5$ $p$ $0$ $1$ $p(1-p)$ $X_j$ , werden Sie einen anderen marginalen Effekt für verschiedene bekommen $p$ $X_k,\;k\neq j$ Werte. Möglicherweise ein guter Grund, nur dieses einfache Streudiagramm zu erstellen - Sie müssen nicht auswählen, welche Werte der Kovariaten verwendet werden sollen.

Für ein Probit-Modell gilt wobeiStandard normales CDF undStandard normales PDF ist. So bekommen wir: $g(p)=\Phi^{-1}(p)\implies g'(p)=\frac{1}{\phi\left[\Phi^{-1}(p)\right]}$ $\Phi(.)$ $\phi(.)$

m_{j}^{p r o b i t} = β_{j} ϕ [Φ^{- 1} (p)]

$m_j^{probit}=\beta_j\phi\left[\Phi^{-1}(p)\right]$

Beachten Sie, dass dies hat die meisten Eigenschaften , dass der marginaler Effekt I früher diskutiert, und gilt auch für jede Link - Funktion , die etwa symmetrisch ist (und gesund, natürlich, zum Beispiel $m_j^{logit}$ $0.5$ $g(p)=tan(\frac{\pi}{2}[2p-1])$ ). The dependence on $p$ is more complicated, but still has the general "hump" shape (highest point at $0.5$ , lowest at $0$ and $1$ ). The link function will change the size of the maximum height (e.g. probit maximum is $\frac{1}{\sqrt{2\pi}}\approx 0.4$ , logit is $0.25$ ), and how quickly the marginal effect is tapered towards zero.

— probabilityislogic
quelle

The effects package in R can easily produce such plots of predicted probability on the vertical axis vs X on the horizontal axis. See socserv.socsci.mcmaster.ca/jfox/Misc/effects/index.html

— landroni

See also: stats.stackexchange.com/questions/18814/…

— landroni

5

The logit and probit models are typically used to figure out a probability that the dependent variable y is 0 or 1 based on a number of input variables.

In English: Suppose you're trying to predict a binary value, such as whether or not somebody will develop heart disease during their life. You have a number of input variables such as blood pressure, age, whether or not they are a smoker, their BMI, where they live, etc. etc. All those variables may contribute in some way to the chances of somebody developing heart disease.

The marginal effect of a single input variable is if you raise that variable by a bit, how does that affect the probability of having heart disease? Suppose blood pressure increases by a slight amount, how does that change the chances of having heart disease? Or if you raise the age by a year?

Some of these effects could also be non-linear: increasing BMI by a slight amount may have a very different effect for somebody who has a very healthy BMI than for somebody who does not.

— robbrit
quelle

1

You'd still want your layman to know the calculus, as marginal effect is the derivative of a fitted probability with respect to the variable of interest. As fitted probability is the link function (logit, probit or whatever) applied to the fitted values, you need the chain rule to compute it. So, in linear index models (where parameters enter as something like X'b) it is equal to the parameter estimate times the derivative of the link function. As the derivative is different at different values of the regressors (unlike the case of a linear model), you have to decide, where to evaluate the marginal effect. A natural choice would be mean values of all the regressors. Another approach would be to evaluate the effect for the each observation and then average over them. The interpretation differs accordingly.

— Alex
quelle