Stichprobenverteilung des Radius der 2D-Normalverteilung

Die bivariate Normalverteilung mit Mittelwert $\mu$ und Kovarianzmatrix $\Sigma$ kann in Polarkoordinaten mit Radius $r$ und Winkel umgeschrieben werden $\theta$ . Meine Frage lautet: Was ist die Stichprobenverteilung von , dass der Abstand von einem Punkt zu der geschätzten Mitte gegeben , um die Probe Kovarianzmatrix ? $\hat{r}$ $x$ $\bar{x}$ $S$

Hintergrund: Der wahre Abstand $r$ von einem Punkt $x$ zum Mittelwert $\mu$ folgt einer Hoyt-Verteilung . Mit den Eigenwerten $\lambda_{1}, \lambda_{2}$ von $\Sigma$ und $\lambda_{1} > \lambda_{2}$ ist sein Formparameter $q=\frac{1}{\sqrt{(\lambda_{1}+\lambda_{2})/\lambda_{2})-1}}$ , und sein Skalierungsparameter ist $\omega = \lambda_{1} + \lambda_{2}$ . Es ist bekannt, dass die kumulative Verteilungsfunktion die symmetrische Differenz zwischen zwei Marcum-Q-Funktionen ist.

Die Simulation legt nahe, dass das Einfügen von Schätzungen und für und in das echte cdf für große Stichproben funktioniert, nicht jedoch für kleine Stichproben. Das folgende Diagramm zeigt die Ergebnisse von 200 Mal $\bar{x}$ $S$ $\mu$ $\Sigma$

simulating 20 2D normal vectors for each combination of given $q$ ( $x$ -axis), $\omega$ (rows), and quantile (columns)
for each sample, calculating the given quantile of the observed radius $\hat{r}$ to $\bar{x}$
for each sample, calculating the quantile from the theoretical Hoyt (2D normal) cdf, and from the theoretical Rayleigh cdf after plugging in the sample estimates $\bar{x}$ and $S$ .

enter image description here

Wenn sich 1 nähert (die Verteilung wird kreisförmig), nähern sich die geschätzten Hoyt-Quantile den geschätzten Rayleigh-Quantilen, die von . Wenn wächst, nimmt die Differenz zwischen den empirischen und den geschätzten Quantilen zu, insbesondere im Ende der Verteilung. $q$ $q$ $\omega$

— Karakal
quelle

Was ist die Frage?

— John

@ John Ich habe die Frage hervorgehoben: "Wie ist die Stichprobenverteilung von [Radius]

, dh der Abstand von einem Punkt

zum geschätzten Zentrum

gegebener Stichprobenkonvarianzmatrix

r

$r$

x

$x$

\bar{x}

$\bar{x}$

S

$S$

— Caracal

Warum

im Gegensatz zu

\hat{r}

$\hat{r}$

\hat{r^{2}}

$\hat{r^2}$

— SomeEE

@MathEE

, nur weil die Literatur , die ich kenne mit der Verteilung von (true) betreffen

, nicht (true)

. Beachten Sie, dass dies anders ist als bei der in dieser Frage diskutierten Mahalanobis-Distanz . Natürlich Ergebnisse für die Verteilung von

wären sehr willkommen.

\hat{r}

$\hat{r}$

r

$r$

r^{2}

$r^{2}$

{\hat{r}}^{2}

$\hat{r}^{2}$

— Caracal

Wie Sie in Ihrem Beitrag erwähnt haben, kennen wir die Verteilung der Schätzung von wenn wir damit wir die Verteilung der Schätzung von des wahren . $\widehat{r_{true}}$ $\mu$ $\widehat{r^2_{true}}$ $r^2$

Wir wollen die Verteilung von wobei

\hat{r^{2}} = \frac{1}{N} \sum_{i = 1}^{N} (x_{i} - \bar{x})^{T} (x_{i} - \bar{x})

$\widehat{r^2} = \frac{1}{N}\sum_{i=1}^N (x_i-\overline{x})^T(x_i-\overline{x})$

x_{i}

$x_i$ are expressed as column vectors.

Wir machen jetzt den Standardtrick

\begin{array}{rcl} \hat{r_{t r u e}^{2}} & = & \frac{1}{N} \sum_{i = 1}^{N} (x_{i} - μ)^{T} (x_{i} - μ) \\ = & \frac{1}{N} \sum_{i = 1}^{N} (x_{i} - \bar{x} + \bar{x} - μ)^{T} (x_{i} - \bar{x} + \bar{x} - μ) \\ = & [\frac{1}{N} \sum_{i = 1}^{N} (x_{i} - \bar{x})^{T} (x_{i} - \bar{x})] + (\bar{x} - μ)^{T} (\bar{x} - μ) (1) \\ = & \hat{r^{2}} + (\bar{x} - μ)^{T} (\bar{x} - μ) \end{array}

$\begin{eqnarray*} \widehat{r^2_{true}} &=& \frac{1}{N}\sum_{i=1}^N(x_i - \mu)^T(x_i-\mu)\\ &=& \frac{1}{N}\sum_{i=1}^N(x_i-\overline{x} + \overline{x} -\mu)^T(x_i-\overline{x} + \overline{x}-\mu)\\ &=&\left[\frac{1}{N}\sum_{i=1}^N(x_i - \overline{x})^T(x_i-\overline{x})\right] + (\overline{x} - \mu)^T(\overline{x}-\mu) \hspace{20pt}(1)\\ &=& \widehat{r^2} + (\overline{x}-\mu)^T(\overline{x}-\mu) \end{eqnarray*}$ where

(1)

$(1)$ arises from the equation

\frac{1}{N} \sum_{i = 1}^{N} (x_{i} - \bar{x})^{T} (\bar{x} - μ) = (\bar{x} - \bar{x})^{T} (\bar{x} - μ) = 0

$\frac{1}{N}\sum_{i=1}^N(x_i-\overline{x})^T(\overline{x}-\mu) = (\overline{x} - \overline{x})^T(\overline{x} - \mu) = 0$ and its transpose.

$\widehat{r^2}$ $S$ $(\overline{x}-\mu)^T(\overline{x}-\mu)$ $\overline{x}$

\hat{r_{t r u e}^{2}} = \hat{r^{2}} + (\bar{x} - μ)^{T} (\bar{x} - μ)

$\widehat{r_{true}^2} = \widehat{r^2} + (\overline{x}-\mu)^T(\overline{x}-\mu)$ as the sum of two independent random variables. We know the distributions of the

\hat{r_{t r u e}^{2}}

$\widehat{r^2_{true}}$ and

(\bar{x} - μ)^{T} (\bar{x} - μ)

$(\overline{x} - \mu)^T(\overline{x}-\mu)$ and so we are done via the standard trick using that characteristic functions are multiplicative.

Edited to add:

$||x_i-\mu||$ is Hoyt so it has pdf

f (ρ) = \frac{1 + q^{2}}{q ω} ρ e^{- \frac{(1 + q^{2})^{2}}{4 q^{2} ω} ρ^{2}} I_{O} (\frac{1 - q^{4}}{4 q^{2} ω} ρ^{2})

$f(\rho) = \frac{1+q^2}{q\omega}\rho e^{-\frac{(1+q^2)^2}{4q^2\omega} \rho^2}I_O\left(\frac{1-q^4}{4q^2\omega} \rho^2\right)$ where

I_{0}

$I_0$ is the

0^{t h}

$0^{th}$ modified Bessel function of the first kind.

This means that the pdf of $||x_i-\mu||^2$ is

f (ρ) = \frac{1}{2} \frac{1 + q^{2}}{q ω} e^{- \frac{(1 + q^{2})^{2}}{4 q^{2} ω} ρ} I_{0} (\frac{1 - q^{4}}{4 q^{2} ω} ρ) .

$f(\rho) = \frac{1}{2}\frac{1+q^2}{q\omega}e^{-\frac{(1+q^2)^2}{4q^2\omega}\rho}I_0\left(\frac{1-q^4}{4q^2\omega}\rho\right).$

To ease notation set $a = \frac{1-q^4}{4q^2\omega}$ , $b=-\frac{(1+q^2)^2}{4q^2\omega}$ and $c=\frac{1}{2}\frac{1+q^2}{q\omega}$ .

The moment generating function of $||x_i-\mu||^2$ is

{\begin{cases} \frac{c}{\sqrt{(s - b)^{2} - a^{2}}} & (s - b) > a \\ 0 & else \end{cases}

$\begin{cases} \frac{c}{\sqrt{(s-b)^2-a^2}} & (s-b) > a\\ 0 & \text{ else}\\ \end{cases}$

Thus the moment generating function of $\widehat{r^2_{true}}$ is

{\begin{cases} \frac{c^{N}}{((s / N - b)^{2} - a^{2})^{N / 2}} & (s / N - b) > a \\ 0 & else \end{cases}

$\begin{cases} \frac{c^N}{((s/N-b)^2-a^2)^{N/2}} & (s/N-b) > a\\ 0 & \text{else} \end{cases}$ and the moment generating function of

| | \bar{x} - μ | |^{2}

$||\overline{x} - \mu||^2$ is

{\begin{cases} \frac{N c}{\sqrt{(s - N b)^{2} - (N a)^{2}}} = \frac{c}{\sqrt{(s / N - b)^{2} - a^{2}}} & (s / N - b) > a \\ 0 & else \end{cases}

$\begin{cases} \frac{Nc}{\sqrt{(s-Nb)^2-(Na)^2}} = \frac{c}{\sqrt{(s/N-b)^2-a^2}} & (s/N-b) > a\\ 0 & \text{ else} \end{cases}$

This implies that the moment generating function of $\widehat{r^2}$ is

{\begin{cases} \frac{c^{N - 1}}{((s / N - b)^{2} - a^{2})^{(N - 1) / 2}} & (s / N - b) > a \\ 0 & else . \end{cases}

$\begin{cases} \frac{c^{N-1}}{((s/N-b)^2-a^2)^{(N-1)/2}} & (s/N-b) > a\\ 0 & \text{ else}. \end{cases}$

Applying the inverse Laplace transform gives that $\widehat{r^2}$ has pdf

g (ρ) = \frac{\sqrt{π} N c^{N - 1}}{Γ (\frac{N - 1}{2})} {(\frac{2 i a}{N ρ})}^{(2 - N) / 2} e^{b N ρ} J_{N / 2 - 1} (i a N ρ) .

$g(\rho) = \frac{\sqrt{\pi}Nc^{N-1}}{\Gamma(\frac{N-1}{2})}\left(\frac{2\mathrm{i} a}{N\rho}\right)^{(2 - N)/2} e^{b N \rho} J_{N/2-1}( \mathrm{i} a N \rho).$

— SomeEE
quelle

Thank you! I'll have to work out the details before accepting.

— caracal

\hat{r_{true}^{2}} \sim Hoyt

$\widehat{r^{2}_{\text{true}}} \sim \text{Hoyt}$ , and

| | \bar{x} - μ | |^{2} \sim N (0, \frac{1}{N} Σ)

$||\bar{x}-\mu||^{2} \sim \mathcal{N}(0, \frac{1}{N}\Sigma)$ ? Then the characteristic function of

\hat{r^{2}}

$\widehat{r^{2}}$ is the product of the two characteristic functions as explained here. That indeed answers my question. Do you know how we might suitably transform

\hat{r^{2}}

$\widehat{r^{2}}$ such that its distribution is known without access to

Σ

$\Sigma$ ? Like the Mahalanobis distance, or the univariate

t

$t$ statistic?

— caracal

I've edited my response to a full answer. Please let me know if you agree.

— SomeEE

I am not sure about unknown

Σ

$\Sigma$ . The obvious thing to do would be to try to "divide"

\hat{r^{2}}

$\widehat{r^2}$ by the sample covariance

S

$S$ which would look like a sum of Mahalanobis distances, i.e. consider

\frac{1}{N} \sum_{i = 1}^{N} (x_{i} - \bar{x})^{T} S^{- 1} (x_{i} - \bar{x})

$\frac{1}{N} \sum_{i=1}^N(x_i - \overline{x})^T S^{-1}(x_i-\overline{x})$ . Unfortunately this sum is always

1

$1$ .

— SomeEE

Thanks for continuing to work on the answer! I'm not sure about the distribution of

| | x_{i} - μ | |^{2}

$||x_{i}-\mu||^{2}$ . I'm not able to do deal with this analytically, but a quick simulation of

r^{2}

$r^{2}$ gives a different distribution than

Γ (q, \frac{ω}{q})

$\Gamma(q, \frac{\omega}{q})$ : R simulation code. Although it could well be that I don't correctly understand the

Γ

$\Gamma$ parametrization.

— caracal