Why does the number of continuous uniform variables on (0,1) needed for their sum to exceed one have mean $e$?

Let us sum a stream of random variables, $X_i \overset{iid}\sim \mathcal{U}(0,1)$; let $Y$ be the number of terms we need for the total to exceed one, i.e. $Y$ is the smallest number such that

$X_1 + X_2 + \dots + X_Y > 1.$

Why does the mean of $Y$ equal Euler's constant $e$?

$\mathbb{E}(Y) = e = \frac{1}{0!} + \frac{1}{1!} + \frac{1}{2!} + \frac{1}{3!} + \dots$
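The claim can be checked empirically before looking for a proof. The following is only a minimal Monte Carlo sketch; the function names `draws_to_exceed` and `estimate_mean`, the trial count, and the seed are illustrative assumptions, not part of the original question.

```python
import math
import random


def draws_to_exceed(threshold, rng):
    """Count uniform(0,1) draws until the running sum exceeds `threshold`."""
    total = 0.0
    count = 0
    while total <= threshold:
        total += rng.random()
        count += 1
    return count


def estimate_mean(trials=200_000, seed=0):
    """Monte Carlo estimate of E(Y) for threshold 1 (hypothetical helper)."""
    rng = random.Random(seed)
    return sum(draws_to_exceed(1.0, rng) for _ in range(trials)) / trials


if __name__ == "__main__":
    estimate = estimate_mean()
    print(f"estimate = {estimate:.4f}, e = {math.e:.4f}")
```

With a couple of hundred thousand trials the estimate should agree with $e$ to roughly two decimal places, since the standard error shrinks like one over the square root of the number of trials.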

— Silverfish

I'm posting this in the spirit of a self-study question, although I think I first saw this question over a decade ago. I can't recall how I answered it back then, although I'm sure it wasn't what sprang to mind when I saw this property mentioned in the thread Approximate $e$ using Monte Carlo Simulation. Since I suspect this is a fairly common exercise question, I've opted to present a sketch rather than a complete solution, though I suppose the main "spoiler warning" belongs in the question itself!

— Silverfish

I remain very interested in alternative approaches. I know this was included as a question in Gnedenko's Theory of Probability (originally in Russian, but widely translated), but I don't know what solution was expected there, or posed elsewhere.

— Silverfish

I wrote a simulation solution in MATLAB using your simplex method. I didn't know about the link to simplexes; it's so unexpected.

— Aksakal

Answers:

First observation: $Y$ has a more pleasing CDF than PMF

The probability mass function $p_Y(n)$ is the probability that $n$ is "only just enough" for the sum to exceed unity, i.e. $X_1 + X_2 + \dots + X_n$ exceeds one while $X_1 + \dots + X_{n-1}$ does not.

The cumulative distribution $F_Y(n) = \Pr(Y \leq n)$ simply requires that $n$ is "enough", i.e. $\sum_{i=1}^{n}X_i > 1$, with no restriction on by how much. This looks like a much simpler event whose probability to deal with.

Second observation: $Y$ takes non-negative integer values, so $\mathbb{E}(Y)$ can be written in terms of the CDF

Clearly $Y$ can only take values in $\{0, 1, 2, \dots\}$, so we can write its mean in terms of the complementary CDF, $\bar F_Y$.

$\mathbb{E}(Y) = \sum_{n=0}^\infty \bar F_Y(n) = \sum_{n=0}^\infty \left(1 - F_Y(n) \right)$

In fact $\Pr(Y=0)$ and $\Pr(Y=1)$ are both zero, so the first two terms of the sum are both one: $\mathbb{E}(Y) = 1 + 1 + \dots$

As for the later terms, if $F_Y(n)$ is the probability that $\sum_{i=1}^{n}X_i > 1$, what event is $\bar F_Y(n)$ the probability of?

Third observation: the (hyper)volume of an $n$-simplex is $\frac{1}{n!}$

The $n$-simplex I have in mind occupies the volume under a standard unit $(n-1)$-simplex in the all-positive orthant of $\mathbb{R}^n$: it is the convex hull of $(n+1)$ vertices, in particular the origin plus the vertices of the unit $(n-1)$-simplex at $(1, 0, 0, \dots)$, $(0, 1, 0, \dots)$, etc.

For example, the 2-simplex with $x_1 + x_2 \leq 1$ has area $\frac{1}{2}$ and the 3-simplex with $x_1 + x_2 + x_3 \leq 1$ has volume $\frac{1}{6}$.

For a proof that proceeds by directly evaluating an integral for the probability of the event described by $\bar F_Y(n)$, and links to two other arguments, see this Math SE thread. The related thread might also be of interest: Is there a relationship between $e$ and the sum of $n$-simplex volumes?
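The volume claim can be sanity-checked numerically: the fraction of uniformly random points in the unit cube $[0,1]^n$ whose coordinates sum to at most one should be close to $\frac{1}{n!}$. This is a rough Monte Carlo sketch rather than a proof; `simplex_fraction`, the trial count, and the seed are assumptions made for illustration.

```python
import math
import random


def simplex_fraction(n, trials=200_000, seed=0):
    """Estimate the volume of {x in [0,1]^n : x_1 + ... + x_n <= 1}."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        # A point lands in the simplex when its coordinates sum to at most 1.
        if sum(rng.random() for _ in range(n)) <= 1.0:
            hits += 1
    return hits / trials


if __name__ == "__main__":
    for n in range(1, 6):
        print(n, simplex_fraction(n), 1 / math.factorial(n))
```

For small $n$ the two printed columns should agree to about two decimal places; for large $n$ the simplex becomes so small that far more trials would be needed.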

— Silverfish

This is an interesting geometric approach, and easy to solve this way. Beautiful. Here's the equation for a volume of a simplex. I don't think there could be a more elegant solution, frankly

— Aksakal

+1 You can also obtain the full distribution of $Y$ from any of the approaches in my post at stats.stackexchange.com/questions/41467/….

— whuber

If I stumbled on this solution, there's no way they could force me do it other way in a school :)

— Aksakal

Fix $n \ge 1$. Let

$U_i = X_1 + X_2 + \cdots + X_i \mod 1$

be the fractional parts of the partial sums for $i=1,2,\ldots, n$. The independent uniformity of $X_1$ and $X_{i+1}$ guarantees that $U_{i+1}$ is just as likely to exceed $U_i$ as it is to be less than it. This implies that all $n!$ orderings of the sequence $(U_i)$ are equally likely.

Given the sequence $U_1, U_2, \ldots, U_n$ , we can recover the sequence $X_1, X_2, \ldots, X_n$ . To see how, notice that

$U_1 = X_1$ because both are between $0$ and $1$ .
If $U_{i+1} \ge U_i$ , then $X_{i+1} = U_{i+1} - U_i$ .
Otherwise, $U_i + X_{i+1} \gt 1$ , whence $X_{i+1} = U_{i+1} - U_i + 1$ .

There is exactly one sequence in which the $U_i$ are already in increasing order, in which case $1 \gt U_n = X_1 + X_2 + \cdots + X_n$ . Being one of $n!$ equally likely sequences, this has a chance $1/n!$ of occurring. In all the other sequences at least one step from $U_i$ to $U_{i+1}$ is out of order. This implies the sum of the $X_i$ had to equal or exceed $1$ . Thus we see that

$\Pr(Y \gt n) = \Pr(X_1 + X_2 + \cdots + X_n \le 1) = \Pr(X_1 + X_2 + \cdots + X_n \lt 1) = \frac{1}{n!}.$
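The two facts used in this argument, that the $X_i$ can be recovered from the $U_i$ and that the $U_i$ are in increasing order exactly when the sum stays below one, can be illustrated with a short sketch. The helper names `fractional_partial_sums` and `recover_increments` are hypothetical, and floating-point arithmetic means the recovery is only accurate up to rounding.

```python
import random


def fractional_partial_sums(xs):
    """Return U_i = (X_1 + ... + X_i) mod 1 for each i."""
    us, total = [], 0.0
    for x in xs:
        total += x
        us.append(total % 1.0)
    return us


def recover_increments(us):
    """Invert the map: rebuild X_1..X_n from U_1..U_n (up to rounding)."""
    xs = [us[0]]  # U_1 = X_1 because both lie in [0, 1)
    for prev, cur in zip(us, us[1:]):
        step = cur - prev
        # A drop from U_i to U_{i+1} means the partial sum wrapped past an integer.
        xs.append(step if step >= 0 else step + 1.0)
    return xs


if __name__ == "__main__":
    rng = random.Random(0)
    xs = [rng.random() for _ in range(4)]
    us = fractional_partial_sums(xs)
    print(xs, recover_increments(us))
    print("increasing:", us == sorted(us), "sum < 1:", sum(xs) < 1.0)
```

Running the check over many random sequences and counting how often the $U_i$ come out sorted gives an empirical frequency that can be compared with $\frac{1}{n!}$.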

This yields the probabilities for the entire distribution of $Y$ , since for integral $n\ge 1$

$\Pr(Y = n) = \Pr(Y \gt n-1) - \Pr(Y \gt n) = \frac{1}{(n-1)!} - \frac{1}{n!} = \frac{n-1}{n!}.$

Moreover,

$\mathbb{E}(Y) = \sum_{n=0}^\infty \Pr(Y \gt n) = \sum_{n=0}^\infty \frac{1}{n!} = e,$

QED.
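As a quick arithmetic check of the distribution derived above, the probabilities $\frac{n-1}{n!}$ can be summed exactly with rational arithmetic. This is only an illustrative sketch; the names `pmf` and `partial_totals` and the cutoff `n_max` are assumptions, not part of the answer.

```python
import math
from fractions import Fraction


def pmf(n):
    """Pr(Y = n) = (n - 1) / n! for integer n >= 1."""
    return Fraction(n - 1, math.factorial(n))


def partial_totals(n_max):
    """Exact partial sums of the probabilities and of n * Pr(Y = n)."""
    total_prob = sum(pmf(n) for n in range(1, n_max + 1))
    mean = sum(n * pmf(n) for n in range(1, n_max + 1))
    return total_prob, mean


if __name__ == "__main__":
    total_prob, mean = partial_totals(20)
    print(float(total_prob), float(mean), math.e)
```

The partial sum of the probabilities telescopes to $1 - \frac{1}{N!}$ for cutoff $N$, and the partial mean equals $\sum_{m=0}^{N-2} \frac{1}{m!}$, so both converge very quickly.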

— whuber

I have read it a couple of times, and I almost get it... I posted a couple of questions in the Mathematics SE as a result of the $e$ constant computer simulation. I don't know if you saw them. One of them came back before your kind explanation on Tenfold about the ceiling function of the $1/U(0,1)$ and the Taylor series. The second one was exactly about this topic, never got a response, until now...

— Antoni Parellada

here and here.

— Antoni Parellada

And could you add the proof with the uniform spacings as well?

— Xi'an

@Xi'an Could you indicate more specifically what you mean by "uniform spacings" in this context?

— whuber

I am referring to your Poisson process simulation via the uniform spacing, in the thread Approximate e using Monte Carlo Simulation for which I cannot get a full derivation.

— Xi'an

In Sheldon Ross' A First Course in Probability there is an easy to follow proof:

Modifying the notation in the OP a bit, let $U_i \overset{iid}\sim \mathcal{U}(0,1)$ and let $Y$ be the minimum number of terms for which $U_1 + U_2 + \dots + U_Y > 1$, or expressed differently:

$Y = \min\Big\{n: \sum_{i=1}^n U_i>1\Big\}$

If instead we looked for:

$Y(u) = \min\Big\{n: \sum_{i=1}^n U_i>u\Big\}$

for $u\in[0,1]$, we define $f(u)=\mathbb E[Y(u)]$, the expected number of uniform draws needed for their sum to exceed $u$.

We can apply the following general properties for continuous variables:

$E[X] = E[E[X|Y]]=\displaystyle\int_{-\infty}^{\infty}E[X|Y=y]\,f_Y(y)\,dy$

to express $f(u)$ conditionally on the outcome of the first uniform, which gives a manageable equation because the pdf of $U_1 \sim \mathcal{U}(0,1)$ equals $1$ on $[0,1]$. This would be it:

$f(u)=\displaystyle\int_0^1 \mathbb E[Y(u)|U_1=x]\,dx \tag 1$

If the value $U_1=x$ we are conditioning on is greater than $u$, i.e. $x>u$, then $\mathbb E[Y(u)|U_1=x] = 1$. If, on the other hand, $x < u$, then $\mathbb E[Y(u)|U_1=x] = 1 + f(u - x)$, because we have already drawn one uniform and still have the difference $u - x$ to cover. Going back to equation (1):

$f(u) = 1 + \displaystyle\int_0^u f(u - x) \,dx$

since the constant $1$ integrates to $1$ over $[0,1]$ and only values $x < u$ contribute the extra term $f(u-x)$. Substituting $w = u - x$ we would have

$f(u) = 1 + \displaystyle\int_0^u f(w) \,dw$

If we differentiate both sides of this equation, we can see that:

$f'(u) = f(u)\implies \frac{f'(u)}{f(u)}=1$

with one last integration we get:

$\log[f(u)] = u + c \implies f(u) = k \,e^u$

We know that the expected number of draws from the uniform distribution needed to surpass $0$ is $1$, i.e. $f(0) = 1$. Hence, $k = 1$, and $f(u)=e^u$. Therefore $f(1) = e.$
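The generalised result $f(u) = e^u$ can be compared against simulation for a few thresholds in $[0,1]$. The sketch below is an illustration under assumed names (`mean_draws_to_exceed`) and arbitrary trial counts and seed; it is not taken from Ross's text.

```python
import math
import random


def mean_draws_to_exceed(u, trials=100_000, seed=0):
    """Monte Carlo estimate of f(u) = E[Y(u)] for a threshold u in [0, 1]."""
    rng = random.Random(seed)
    total_draws = 0
    for _ in range(trials):
        running, count = 0.0, 0
        # Keep drawing uniforms until the running sum exceeds u.
        while running <= u:
            running += rng.random()
            count += 1
        total_draws += count
    return total_draws / trials


if __name__ == "__main__":
    for u in (0.0, 0.25, 0.5, 0.75, 1.0):
        print(u, mean_draws_to_exceed(u), math.exp(u))
```

The simulated means should track $e^u$ across the whole interval, with $u = 1$ recovering the original result.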

— Antoni Parellada

I do like the manner in which this generalises the result.

— Silverfish