In this section we study some graphs that are closely related to the standard discrete graph \((\N, \le)\) studied in Section 1: the strict order graph \((\N, \lt)\), the covering graph \((\N, \upa)\) of \((\N, \le)\), and the reflexive closure \((\N, \rta)\) of \((\N, \upa)\). Since the spaces are discrete, the reference measure space is \((\N, \ms P(\N), \#)\), where as usual \(\ms P(\N)\) is the power set of \(\N\) and \(\#\) is counting measure. We also assume that we have a probability space \((\Omega, \ms F, \P)\) in the background so that all random variables in \(\N\) are defined on this space.
First we consider the strict order graph \((\N, \lt)\). Note that \((\N, \lt)\) is the irreflexive reduction of \((\N, \le)\) as discussed in Section 1.6, and is isomorphic to the graph \((\N_+, \lt)\), with isomorphism \(x \mapsto x + 1\). In turn, \((\N_+, \lt)\) is the graph associated with the strict positive semigroup \((\N_+, +)\).
For the graph \((\N, \lt)\),
The (right) \(\sigma\)-algebra associated with \((\N, \lt)\) is the reference \(\sigma\)-algebra \(\ms P(\N)\).
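The right neighbor set of \(x \in \N\) for the graph is \(A_x = \{y \in \N: y \gt x\} = \{x + 1, x + 2, \ldots\}\), and the \(\sigma\)-algebra associated with \((\N, \lt)\) is \(\ms A = \sigma(\{A_x: x \in \N\})\). Since \(A_0^c = \{0\}\) and \(A_{x - 1} \setminus A_x = \{x\}\) for \(x \in \N_+\), every singleton is in \(\ms A\), and hence \(\ms A = \ms P(\N)\).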
Suppose now that \(X\) is a random variable in \(\N\) with density function \(f\).
For the graph \((\N, \lt)\),
The results follow directly from the definitions. Again, the right neighbor set of \(x \in \N\) is \(\{x + 1, x + 2, \ldots\}\).
The graph \((\N, \lt)\) is stochastic: the reliability function of \(X\) uniquely determines the distribution of \(X\).
The density function \(f\) can be recovered from the reliability function \(F\) by \(f(0) = 1 - F(0)\) and \(f(x) = F(x - 1) - F(x)\) for \(x \in \N_+\).
For the graph \((\N, \lt)\),
Note that the generating function of \(X\) is the ordinary probability generating function evaluated at \(1 + t\).
The recursive moment formula of \(X\) for the graph \((\N, \lt)\) is \[\sum_{x = 0}^\infty \binom{x}{n} \P(X \gt x) = \E\left[\binom{X}{n + 1}\right], \quad n \in \N\]
This follows from and and the basic result in Section 1.3.
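The recursive moment formula can be checked numerically. The following is a minimal sketch in Python with exact rational arithmetic, assuming a hypothetical density on \(\{0, 1, 2, 3\}\) chosen only for illustration; the function names are our own. The sum on the left is taken over all \(x \in \N\), and the terms beyond the support vanish.

```python
from fractions import Fraction
from math import comb

# Hypothetical density on {0, 1, 2, 3}, chosen only for illustration.
f = [Fraction(1, 10), Fraction(2, 10), Fraction(3, 10), Fraction(4, 10)]

def tail(x):
    """P(X > x), the reliability function for the strict order graph."""
    return sum(f[x + 1:], Fraction(0))

def lhs(n):
    """Sum over x of C(x, n) P(X > x); terms beyond the support are zero."""
    return sum(comb(x, n) * tail(x) for x in range(len(f)))

def rhs(n):
    """E[C(X, n + 1)]."""
    return sum(comb(x, n + 1) * f[x] for x in range(len(f)))

for n in range(4):
    print(n, lhs(n), rhs(n))
```

When \(n = 0\) both sides reduce to \(\E(X)\), which is the familiar tail-sum formula for the mean.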
Random variable \(X\) has constant rate for \((\N, \lt)\) if and only if \(X + 1\) is exponential for \((\N_+, +)\) if and only if \(X + 1\) is memoryless for \((\N_+, +)\). The distribution with constant rate \(\alpha \in (0, \infty)\) for \((\N, \lt)\) has density function \(f\) given by \[f(x) = \frac{\alpha}{1 + \alpha} \left(\frac{1}{1 + \alpha}\right)^x, \quad x \in \N\]
Let \(F\) denote the reliability function of \(X\) for \((\N, \lt)\) and recall that \(f(0) = 1 - F(0)\) and \(f(x) = F(x - 1) - F(x)\) for \(x \in \N_+\). For \(\alpha \in (0, \infty)\), the constant rate property \(f = \alpha F\) has the unique solution \begin{align*} F(x) &= \left(\frac{1}{1 + \alpha}\right)^{x + 1}, \quad x \in \N \\ f(x) &= \frac{\alpha}{1 + \alpha} \left(\frac{1}{1 + \alpha}\right)^x, \quad x \in \N \end{align*} Let \(Y = X + 1\), so that \(Y\) has values in \(\N_+\). Then \(Y\) has density function \(g\) given by \(g(y) = f(y - 1)\) for \(y \in \N_+\), and the reliability function \(G\) of \(Y\) for \((\N_+, \lt)\) is given by \(G(y) = F(y - 1)\) for \(y \in \N_+\). If \(X\) has the distribution with constant rate \(\alpha \in (0, \infty)\) for \((\N, \lt)\) above, then \(Y\) is memoryless for \((\N_+, +)\) and has constant rate \(\alpha\) for the associated graph \((\N_+, \lt)\). Hence \(Y\) is exponential for \((\N_+, +)\). Conversely, if \(Y\) is memoryless for \((\N_+, +)\) then \(G(y) = (1 - \beta)^y\) for \(y \in \N_+\) and hence \(g(y) = \beta (1 - \beta)^{y - 1}\) for \(y \in \N_+\) where \(\beta = 1 - G(1) \in (0, 1)\). Hence \(Y\) has constant rate \(\alpha = \beta / (1 - \beta) \in (0, \infty)\) for \((\N_+, \lt)\) and therefore \(Y\) is exponential for \((\N_+, +)\). Then also \(X = Y - 1\) has constant rate \(\alpha\) for \((\N, \lt)\).
Of course, the constant rate distribution in is the geometric distribution on \(\N\) with success parameter \(\alpha / (1 + \alpha)\). From the general theory, if \(X\) has constant rate \(\alpha \in (0, \infty)\) then the graph moment of \(X\) of order \(n \in \N\) in is \(1 / \alpha^n\). The following result gives the mean, variance, and entropy as functions of the rate parameter.
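As a quick check of the solution in the proof, the sketch below (Python, exact rational arithmetic, with an illustrative rate \(\alpha = 3/2\) and function names of our own choosing) verifies that \(f / F\) is constant and that the inversion \(f(x) = F(x - 1) - F(x)\) reproduces the density.

```python
from fractions import Fraction

def density(alpha, x):
    """f(x) for the constant rate distribution on the strict order graph."""
    return alpha / (1 + alpha) * (1 / (1 + alpha)) ** x

def reliability(alpha, x):
    """F(x) = P(X > x) in closed form."""
    return (1 / (1 + alpha)) ** (x + 1)

alpha = Fraction(3, 2)  # illustrative rate; any positive value works

# The rate function f / F should equal alpha at every point.
rates = [density(alpha, x) / reliability(alpha, x) for x in range(10)]

# The inversion f(x) = F(x - 1) - F(x) for x >= 1.
diffs = [reliability(alpha, x - 1) - reliability(alpha, x) for x in range(1, 10)]
```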
Suppose that \(X\) has constant rate \(\alpha \in (0, \infty)\) for \((\N, \lt)\). Then
These are standard results since \(X\) has the geometric distribution on \(\N\) with success parameter \(\alpha / (1 + \alpha)\).
A random walk \(\bs{Y} = (Y_1, Y_2, \ldots)\) on the graph \((\N, \lt)\) has the property that \(Y_n \lt Y_{n + 1}\) for \(n \in \N_+\). If \(\bs Y\) is associated with a random variable \(X\) in \(\N\) with density \(f\), then the transition density \(P\) of \(\bs Y\) is given by \[P(x, y) = \frac{f(y)}{\sum_{z = x + 1}^\infty f(z)}, \quad x, \, y \in \N, \; x \lt y \] Here are the standard results when \(X\) has a constant rate distribution:
Suppose that \(X\) has constant rate \(\alpha \in (0, \infty)\) for the graph \((\N, \lt)\) and that \(\bs{Y} = (Y_1, Y_2, \ldots)\) is the random walk associated with \(X\).
Recall also that the point process associated with the random walk \(\bs Y\) is \(\bs N = \{N_A: A \subseteq \N\}\) where \(N_A = \#\{n \in \N_+: Y_n \in A\}\).
Suppose again that \(X\) has constant rate \(\alpha \in (0, \infty)\) for the graph \((\N, \lt)\) and that \(\bs{Y} = (Y_1, Y_2, \ldots)\) is the random walk associated with \(X\). Then \[\E(N_A) = \frac {\alpha}{1 + \alpha} \#(A), \quad A \subseteq \N\]
From the general theory in Section 1.5, \(\E(N_A) = \E[U(X, \alpha); X \in A]\) for \(A \subseteq \N\) where \(U\) is the generating function. So from , \[\E(N_A) = \E[(1 + \alpha)^X; X \in A] = \sum_{x \in A} (1 + \alpha)^x \frac{\alpha}{1 + \alpha} \left(\frac{1}{1 + \alpha}\right)^x = \frac{\alpha}{1 + \alpha} \#(A)\]
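Since the walk on \((\N, \lt)\) is strictly increasing, \(N_{\{x\}}\) is an indicator variable and \(\E(N_{\{x\}})\) is the probability that the walk visits \(x\). The sketch below is one way to check the formula, assuming Python with exact rational arithmetic and an illustrative rate \(\alpha = 2\). It computes the visit probabilities from the initial density and the transition density \(P(y, x) = f(x) / F(y)\); the function names are our own.

```python
from fractions import Fraction

alpha = Fraction(2)       # illustrative rate
q = 1 / (1 + alpha)       # failure probability of the geometric law

def f(x):
    """Density of X."""
    return (1 - q) * q ** x

def F(x):
    """Reliability function P(X > x)."""
    return q ** (x + 1)

def visit_probabilities(m):
    """v[x] = P(Y_n = x for some n), for x = 0, ..., m - 1."""
    v = []
    for x in range(m):
        # Either the walk starts at x, or it jumps to x from a visited y < x.
        v.append(f(x) + sum(v[y] * f(x) / F(y) for y in range(x)))
    return v

v = visit_probabilities(8)
```

Each entry of `v` equals \(\alpha / (1 + \alpha)\), so summing over \(x \in A\) gives the stated value of \(\E(N_A)\).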
Finally, recall the process of thinning the random walk \(\bs Y\). Specifically, suppose that \(N\) is independent of \(\bs Y\) and has the geometric distribution on \(\N_+\) with success probability \(p \in (0, 1)\), so that \(\P(N = n) = p (1 - p)^{n - 1}\) for \(n \in \N_+\). We accept or reject each point in \(\bs Y\), independently, with probabilities \(p\) and \(1 - p\), respectively. So \(Y_N\) is the first accepted point.
Suppose again that \(X\) has constant rate \(\alpha \in (0, \infty)\) for the graph \((\N, \lt)\) and that \(\bs{Y} = (Y_1, Y_2, \ldots)\) is the random walk associated with \(X\). For the thinned process with parameter \(p \in (0, 1)\), the density function \(h\) of \(Y_N\) is given by \[h(x) = \frac{\alpha p}{1 + \alpha} \left(1 - \frac{\alpha p}{1 + \alpha}\right)^x, \quad x \in \N\]
Recall that \(h\) is given by \(h(x) = p \alpha U[x, (1 - p) \alpha] F(x)\) for \(x \in \N\) where again \(U\) is the generating function and where \(F\) is the reliability function of \(X\). Hence \[h(x) = p \alpha [1 + (1 - p) \alpha]^x \frac{1}{(1 + \alpha)^{x + 1}} = \frac{\alpha p}{1 + \alpha} \left(1 - \frac{\alpha p}{1 + \alpha}\right)^x, \quad x \in \N\]
So \(Y_N\) has the geometric distribution on \(\N\) with success parameter \(\alpha p / (1 + \alpha)\).
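The thinning result can also be checked without the generating function. The sketch below (Python, exact rational arithmetic, illustrative values \(\alpha = 2\) and \(p = 1/3\), names our own) tracks the probability \(w(x)\) that the walk visits \(x\) with every earlier point rejected, so that \(h(x) = p \, w(x)\).

```python
from fractions import Fraction

alpha, p = Fraction(2), Fraction(1, 3)   # illustrative values
q = 1 / (1 + alpha)

def f(x):
    return (1 - q) * q ** x

def F(x):
    return q ** (x + 1)

def first_accepted_density(m):
    """h(x) = P(Y_N = x) for x = 0, ..., m - 1."""
    w = []  # w[x] = P(walk visits x and all earlier points were rejected)
    for x in range(m):
        w.append(f(x) + sum(w[y] * (1 - p) * f(x) / F(y) for y in range(x)))
    return [p * wx for wx in w]

h = first_accepted_density(8)
s = alpha * p / (1 + alpha)   # claimed success parameter of Y_N
```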
Next we consider the covering graph \((\N, \upa)\) of the standard graph \((\N, \le)\), so that \(x \upa x + 1\) for \(x \in \N\).
For the graph \((\N, \upa)\),
The (right) \(\sigma\)-algebra associated with \((\N, \upa)\) is the reference \(\sigma\)-algebra \(\ms P(\N)\).
From the general theory in Section 1.2, the right \(\sigma\)-algebra associated with \((\N, \upa)\) is the same as that for \((\N, \lt)\), so the result follows from . A direct proof is also trivial since the right neighbor set of \(x \in \N\) for \((\N, \upa)\) is \(\{x + 1\}\).
Suppose again that \(X\) is a random variable in \(\N\) with density function \(f\).
For the graph \((\N, \upa)\),
The results follow from the definitions.
The graph \((\N, \upa)\) is stochastic: the reliability function of \(X\) uniquely determines the distribution of \(X\).
The density function \(f\) can be recovered from the reliability function \(F\) by \(f(x) = F(x - 1)\) for \(x \in \N_+\) and \(f(0) = 1 - \sum_{x = 0}^\infty F(x)\).
For the graph \((\N, \upa)\),
Once again, the graph generating function of \(X\) is closely related to the ordinary probability generating function.
The recursive moment formula of \(X\) for the graph \((\N, \upa)\) is \[\sum_{x = 0}^\infty \bs 1(x \ge n) \P(X = x + 1) = \E[\bs 1(X \ge n + 1)], \quad n \in \N\]
This follows from and and the basic result in Section 1.3.
Of course, the formula is trivial, since it's equivalent to \(\sum_{x = n}^\infty \P(X = x + 1) = \P(X \ge n + 1)\).
Random variable \(X\) has constant rate \(\alpha \in (1, \infty)\) for \((\N, \upa)\) if and only if \(X\) has density function \(f\) given by \[f(x) = \left(1 - \frac{1}{\alpha}\right) \left(\frac{1}{\alpha}\right)^x, \quad x \in \N\]
Let \(f\) denote the density function of \(X\) and let \(F\) denote the reliability function of \(X\) for \((\N, \upa)\). Then \(F(x) = f(x + 1)\) for \(x \in \N\), so the constant rate property is \(f(x) = \alpha f(x + 1)\) for \(x \in \N\). Hence \(f(x) = (1 / \alpha)^x f(0)\) for \(x \in \N\). In order for \(f\) to be a proper density function, we must have \(\alpha \gt 1\), in which case \(f(x) = (1 - 1 / \alpha) (1 / \alpha)^x\) for \(x \in \N\).
Of course, the distribution in is the geometric distribution on \(\N\) with success parameter \(1 - 1 / \alpha\). A bit more generally, the total order graph \((\N, \le)\) is uniform, so from results in Section 1.5, \(X\) has constant rate \(\alpha^n\) for \((\N, \upa^n)\) for each \(n \in \N\), where \(\upa^n\) is the composition power of \(\upa\) of order \(n\).
Suppose again that \(X\) is a random variable in \(\N\). For the graph \((\N, \upa)\),
Suppose that \(X\) has density function \(f\).
Once again, the graph moment of \(X\) of order \(n \in \N\) is \(1 / \alpha^n\). Below are the mean, variance and entropy in terms of the rate parameter.
Suppose that \(X\) has constant rate \(\alpha \in (1, \infty)\) for \((\N, \upa)\). Then
These are standard results since \(X\) has the geometric distribution on \(\N\) with success parameter \(1 - 1 / \alpha\).
The random walk on \((\N, \upa)\) associated with a random variable \(X \in \N\) is trivial: \((X, X + 1, X + 2, \ldots)\). The next result gives a summary when \(X\) has a constant rate distribution.
Suppose that \(X\) has constant rate \(\alpha \in (1, \infty)\) for the graph \((\N, \upa)\) and that \(\bs Y = (Y_1, Y_2, \ldots)\) is the random walk associated with \(X\).
Recall again that the point process associated with the random walk \(\bs Y\) is \(\bs N = \{N_A: A \subseteq \N\}\) where \(N_A = \#\{n \in \N_+: Y_n \in A\}\).
Suppose again that \(X\) has constant rate \(\alpha \in (1, \infty)\) for the graph \((\N, \upa)\) and that \(\bs{Y} = (Y_1, Y_2, \ldots)\) is the random walk associated with \(X\). Then \[\E(N_A) = \#(A) - \sum_{x \in A} \left(\frac 1 \alpha\right)^{x + 1}, \quad A \subseteq \N\]
From the general theory in Section 1.5, \(\E(N_A) = \E[U(X, \alpha); X \in A]\) for \(A \subseteq \N\) where \(U\) is the generating function. So from and , \[\E(N_A) = \E\left[\frac{\alpha^{X + 1} - 1}{\alpha - 1}; X \in A\right] = \sum_{x \in A} \left(1 - \frac 1 \alpha\right)\left(\frac 1 \alpha\right)^x \frac{\alpha^{x + 1} - 1}{\alpha - 1} = \#(A) - \sum_{x \in A} \left(\frac 1 \alpha\right)^{x + 1}, \quad A \subseteq \N\] Note that the series on the right is always finite, so we don't have to worry about the dreaded indeterminate form \(\infty - \infty\).
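The formula also has a direct interpretation: the walk \((X, X + 1, \ldots)\) visits \(x\) exactly when \(X \le x\), so \(\E(N_A) = \sum_{x \in A} \P(X \le x)\). The sketch below compares the two expressions, assuming Python with exact rational arithmetic, an illustrative rate \(\alpha = 3\) and a hypothetical finite set \(A\); the names are our own.

```python
from fractions import Fraction

alpha = Fraction(3)   # illustrative rate in (1, infinity)

def cdf(x):
    """P(X <= x) for the geometric law with success parameter 1 - 1/alpha."""
    return sum((1 - 1 / alpha) * (1 / alpha) ** k for k in range(x + 1))

def expected_visits(A):
    """E(N_A) as the sum of P(X <= x) over x in A."""
    return sum(cdf(x) for x in A)

def closed_form(A):
    """#(A) minus the sum of (1/alpha)^(x + 1) over x in A."""
    return len(A) - sum((1 / alpha) ** (x + 1) for x in A)

A = {0, 2, 5}   # hypothetical finite set
```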
Recall again the process of thinning the random walk \(\bs Y\) with parameter \(p \in (0, 1)\). So \(N\) is independent of \(\bs Y\) and has the geometric distribution on \(\N_+\) with success probability \(p\), and hence \(Y_N\) is the first accepted point.
Suppose again that \(X\) has constant rate \(\alpha \in (1, \infty)\) for the graph \((\N, \upa)\) and that \(\bs{Y} = (Y_1, Y_2, \ldots)\) is the random walk associated with \(X\), thinned with parameter \(p \in (0, 1)\). Then \(Y_N\) has the distribution of the sum of two independent geometrically distributed variables in \(\N\), with success parameters \(1 - 1 / \alpha\) and \(p\). The density function \(h\) of \(Y_N\) is given as follows:
Recall that \(h\) is given by \(h(x) = p \alpha U[x, (1 - p) \alpha] F(x)\) for \(x \in \N\) where again \(U\) is the generating function and where \(F\) is the reliability function of \(X\). So the result follows from and and some algebra.
There is a simple direct proof of this result: Since the random walk is \(\bs{Y} = (X, X + 1, X + 2, \ldots)\), the first accepted point is \(Y_N = X + (N - 1)\). But \(X\) has the geometric distribution on \(\N\) with success parameter \(1 - 1 / \alpha\) and \(N - 1\) has the geometric distribution on \(\N\) with success parameter \(p\), and \(X\) and \(N\) are independent.
In part (a), the success parameters are the same, so \(Y_N\) has the negative binomial distribution on \(\N\) with stopping parameter \(2\) and success parameter \(1 - 1 / \alpha\).
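The two proofs can be compared numerically. The sketch below (Python, exact rational arithmetic, illustrative values \(\alpha = 3\) and \(p = 1/2\), names our own) evaluates \(h(x) = p \alpha U[x, (1 - p) \alpha] F(x)\) with \(U(x, t) = \sum_{n = 0}^x t^n\) and \(F(x) = f(x + 1)\), and compares it with the convolution of the two geometric densities.

```python
from fractions import Fraction

alpha, p = Fraction(3), Fraction(1, 2)   # illustrative values

def f(x):
    """Density of X: geometric on N with success parameter 1 - 1/alpha."""
    return (1 - 1 / alpha) * (1 / alpha) ** x

def U(x, t):
    """Generating function of the covering graph: 1 + t + ... + t^x."""
    return sum(t ** n for n in range(x + 1))

def h_formula(x):
    """h(x) = p * alpha * U[x, (1 - p) alpha] * F(x), with F(x) = f(x + 1)."""
    return p * alpha * U(x, (1 - p) * alpha) * f(x + 1)

def h_convolution(x):
    """Density of X + (N - 1), where N - 1 is geometric on N with parameter p."""
    return sum(f(k) * p * (1 - p) ** (x - k) for k in range(x + 1))
```

Writing \(U\) as a finite sum rather than as \((t^{x + 1} - 1) / (t - 1)\) avoids a special case when \((1 - p) \alpha = 1\).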
Finally, we study the reflexive closure \((\N, \rta)\) of the covering graph \((\N, \upa)\), so that \(x \rta x\) and \(x \rta x + 1\) for \(x \in \N\).
For the graph \((\N, \rta)\),
The results follow from and general results on reflexive closure in Section 1.6. Let \(v_k\) denote the walk function of order \(k \in \N\) for \((\N, \upa)\), and \(V\) the generating function for \((\N, \upa)\). Then
In part (a), we have the usual convention that \(\binom{n}{k} = 0\) if \(k \gt n\), and hence \(u_n(x) = 2^n\) if \(x \ge n\).
The (right) \(\sigma\)-algebra associated with \((\N, \rta)\) is the reference \(\sigma\)-algebra \(\ms P(\N)\).
This follows from the general theory in Section 1.6 since \(\upa\) is asymmetric. A direct proof is also trivial: The right neighbor set of \(x \in \N\) for \((\N, \rta)\) is \(\{x, x + 1\}\). Hence \(A_x \setminus A_{x + 1} = \{x\}\) for \(x \in \N\).
Suppose again that \(X\) is a random variable in \(\N\) with density function \(f\).
For the graph \((\N, \rta)\),
The results follow from the definitions.
The graph \((\N, \rta)\) is stochastic: the reliability function of \(X\) uniquely determines the distribution of \(X\).
For \(x, \, n \in \N\), note that \[\sum_{i = 0}^n (-1)^i F(x + i) = f(x) + (-1)^n f(x + n + 1)\] since the sum on the left collapses. But \(f(x + n + 1) \to 0\) as \(n \to \infty\) so \[f(x) = \sum_{i = 0}^\infty (-1)^i F(x + i), \quad x \in \N\]
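The alternating series is easy to test on a density with finite support, where it terminates. The following is a minimal sketch in Python with exact rational arithmetic, using a hypothetical density on \(\{0, 1, 2, 3\}\) and names of our own choosing.

```python
from fractions import Fraction

# Hypothetical density on {0, 1, 2, 3}, chosen only for illustration.
f = [Fraction(1, 10), Fraction(2, 10), Fraction(3, 10), Fraction(4, 10)]

def dens(x):
    """f(x), extended by zero outside the support."""
    return f[x] if 0 <= x < len(f) else Fraction(0)

def F(x):
    """Reliability for the reflexive closure: P(X = x) + P(X = x + 1)."""
    return dens(x) + dens(x + 1)

def recover(x, terms=len(f)):
    """Partial alternating sum; exact once x + terms is beyond the support."""
    return sum((-1) ** i * F(x + i) for i in range(terms))

recovered = [recover(x) for x in range(len(f))]
```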
For the graph \((\N, \rta)\),
The graph generating function of \(X\) is closely related to the ordinary probability generating function of \(X\), evaluated at \(t / (1 - t)\).
The recursive moment formula of \(X\) for the graph \((\N, \rta)\) is \[\sum_{x = 0}^\infty \sum_{k = 0}^x \binom{n}{k}[\P(X = x) + \P(X = x + 1)] = \sum_{k = 0}^{n + 1} \binom{n + 1}{k} \P(X \ge k), \quad n \in \N\]
This follows from and and the basic result in Section 1.3. Once again, this result is \[\sum_{x = 0}^\infty u_n(x) F(x) = \E[u_{n + 1}(X)], \quad n \in \N\] A direct proof involves a sum interchange and the basic binomial identity on the left.
When \(n = 0\), the formula in reduces to the obvious result \(\sum_{x = 0}^\infty [\P(X = x) + \P(X = x + 1)] = 1 + \P(X \ge 1)\).
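The recursive moment formula for \((\N, \rta)\) can be checked in the same way. The sketch below assumes Python with exact rational arithmetic and a hypothetical density on \(\{0, 1, 2, 3\}\); it uses the walk function \(u_n(x) = \sum_{k = 0}^x \binom{n}{k}\) that appears on the left side of the formula, and the names are our own.

```python
from fractions import Fraction
from math import comb

# Hypothetical density on {0, 1, 2, 3}, chosen only for illustration.
f = [Fraction(1, 10), Fraction(2, 10), Fraction(3, 10), Fraction(4, 10)]

def dens(x):
    return f[x] if 0 <= x < len(f) else Fraction(0)

def F(x):
    """Reliability for the reflexive closure: P(X = x) + P(X = x + 1)."""
    return dens(x) + dens(x + 1)

def u(n, x):
    """Walk function: sum of C(n, k) for k = 0, ..., x (C(n, k) = 0 if k > n)."""
    return sum(comb(n, k) for k in range(x + 1))

def lhs(n):
    return sum(u(n, x) * F(x) for x in range(len(f)))

def rhs(n):
    return sum(u(n + 1, x) * f[x] for x in range(len(f)))
```

The identity behind the check is Pascal's rule, which gives \(u_n(x) + u_n(x - 1) = u_{n + 1}(x)\).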
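Random variable \(X\) has constant rate \(\alpha \in (1 / 2, 1)\) for \((\N, \rta)\) if and only if \(X\) has density function \(f\) given by \[f(x) =\left(2 - \frac 1 \alpha\right) \left(\frac 1 \alpha - 1\right)^x, \quad x \in \N\]
The proof follows from Section 1.6 on reflexive closure: \(X\) has constant rate \(\alpha\) for \((\N, \upa)\) if and only if \(X\) has constant rate \(\alpha / (1 + \alpha)\) for \((\N, \rta)\). Note that \(\alpha \mapsto \alpha / (1 + \alpha)\) maps \((1, \infty)\) one-to-one onto \((1 / 2, 1)\). Equivalently, \(X\) has constant rate \(\alpha \in (1 / 2, 1)\) for \((\N, \rta)\) if and only if \(X\) has constant rate \(\alpha / (1 - \alpha) \in (1, \infty)\) for \((\N, \upa)\), and substituting this rate into the density function for \((\N, \upa)\) gives the result.
Of course, the distribution in is the geometric distribution on \(\N\) with success parameter \(2 - 1 / \alpha\). Once again, the graph moment of \(X\) of order \(n\) is \(1 / \alpha^n\). The following result gives the mean, variance and entropy in terms of the rate parameter.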
Suppose that \(X\) has constant rate \(\alpha \in (1 / 2, 1)\) for \((\N, \rta)\). Then
These are standard results since \(X\) has the geometric distribution on \(\N\) with success parameter \(2 - 1 / \alpha\).
A random walk \(\bs Y = (Y_1, Y_2, \ldots)\) on the graph \((\N, \rta)\) has the property that \(Y_{n + 1} \in \{Y_n, Y_n + 1\}\) for \(n \in \N_+\). If the walk is associated with a random variable \(X\) in \(\N\) with density \(f\), then the transition density \(P\) of \(\bs Y\) is given by \[P(x, x) = \frac{f(x)}{f(x) + f(x + 1)}, \; P(x, x + 1) = \frac{f(x + 1)}{f(x) + f(x + 1)}, \quad x \in \N\] Here are the standard results when \(X\) has a constant rate distribution.
Suppose that \(X\) has constant rate \(\alpha \in (1 / 2, 1)\) for the graph \((\N, \rta)\) and that \(\bs Y = (Y_1, Y_2, \ldots)\) is the random walk associated with \(X\).
Recall again that the point process associated with the random walk \(\bs Y\) is \(\bs N = \{N_A: A \subseteq \N\}\) where \(N_A = \#\{n \in \N_+: Y_n \in A\}\).
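Suppose again that \(X\) has constant rate \(\alpha \in (1 / 2, 1)\) for the graph \((\N, \rta)\) and that \(\bs{Y} = (Y_1, Y_2, \ldots)\) is the random walk associated with \(X\). Then \[\E(N_A) = \frac{1}{1 - \alpha} \left[\#(A) - \sum_{x \in A} \left(\frac 1 \alpha - 1\right)^{x + 1}\right], \quad A \subseteq \N\]
From the general theory in Section 1.5, \(\E(N_A) = \E[U(X, \alpha); X \in A]\) for \(A \subseteq \N\) where \(U\) is the generating function. So the result follows from and . As a check, the walk visits \(x \in \N\) if and only if \(X \le x\), an event with probability \(1 - (1 / \alpha - 1)^{x + 1}\). Since \(P(x, x) = \alpha\), the number of visits to \(x\), given that \(x\) is visited, has the geometric distribution on \(\N_+\) with success parameter \(1 - \alpha\) and hence has mean \(1 / (1 - \alpha)\). Note that the series on the right is always finite since \(1 / \alpha - 1 \in (0, 1)\).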
Recall again the process of thinning the random walk \(\bs Y\) with parameter \(p \in (0, 1)\). So \(N\) is independent of \(\bs Y\) and has the geometric distribution on \(\N_+\) with success probability \(p\), and hence \(Y_N\) is the first accepted point.
Suppose again that \(X\) has constant rate \(\alpha \in (1 / 2, 1)\) for the graph \((\N, \rta)\) and that \(\bs{Y} = (Y_1, Y_2, \ldots)\) is the random walk associated with \(X\), thinned with parameter \(p \in (0, 1)\). Then \(Y_N\) has the distribution of the sum of two independent, geometrically distributed random variables in \(\N\), with success parameters \(2 - 1 / \alpha\) and \(p / [1 - (1 - p)\alpha]\). The density function \(h\) of \(Y_N\) is given as follows:
Recall that \(h\) is given by \(h(x) = p \alpha U[x, (1 - p) \alpha] F(x)\) for \(x \in \N\) where again \(U\) is the generating function and where \(F\) is the reliability function of \(X\). So the result follows from and and some algebra.
As with the covering graph, there is a direct probabilistic proof, although one that is a bit more complicated. Recall that we have a sequence of Bernoulli trials that indicate whether each point in the random walk is accepted or rejected. The success probability is \(p\) and \(N\) is the index number of the first accepted point. We now define another sequence of Bernoulli trials: Let \(J_n = 1\) if \(Y_{n + 1} = Y_n + 1\) and let \(J_n = 0\) if \(Y_{n + 1} = Y_n\). Then \(\bs{J} = (J_1, J_2, \ldots)\) is a sequence of independent variables and \(\P(J_n = 1) = P(x, x + 1) = 1 - \alpha\) (independent of \(x \in \N\)). Note that \(Y_N = X + M\) where \(M = \sum_{n = 1}^{N - 1} J_n\). But \(N - 1\) has the geometric distribution on \(\N\) with success parameter \(p\) and is independent of \(\bs{J}\). By a basic result in probability theory, \(M\) has the geometric distribution on \(\N\) with success parameter \[\frac{p}{(1 - \alpha) + p - p(1 - \alpha)} = \frac{p}{1 - (1 - p) \alpha}\] and is independent of \(X\). Of course \(X\) has the geometric distribution on \(\N\) with success parameter \(2 - 1 / \alpha\).
In part (a), the success parameters are the same, so \(Y_N\) has the negative binomial distribution on \(\N\) with stopping parameter 2 and success parameter \(2 - 1 / \alpha\).
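The basic result used in the direct proof, that a binomially thinned geometric count is again geometric, can be checked numerically. The sketch below is an illustration only, assuming Python with floating point arithmetic, the hypothetical values \(\alpha = 0.8\) and \(p = 0.3\), and a truncated series; the function name is our own.

```python
from math import comb

def thinned_count_pmf(m, p, q, terms=2000):
    """P(M = m), where M counts successes (probability q each) among K
    independent trials and K is geometric on N with success parameter p.
    The series over K is truncated after `terms` values."""
    return sum(
        p * (1 - p) ** k * comb(k, m) * q ** m * (1 - q) ** (k - m)
        for k in range(m, terms)
    )

alpha, p = 0.8, 0.3              # illustrative values, alpha in (1/2, 1)
q = 1 - alpha                    # P(J_n = 1)
s = p / (1 - (1 - p) * alpha)    # claimed success parameter of M

errors = [abs(thinned_count_pmf(m, p, q) - s * (1 - s) ** m) for m in range(6)]
```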
As noted above, the constant rate distributions for the graphs \((\N, \lt)\), \((\N, \upa)\), and \((\N, \rta)\) in propositions , , and , as well as the constant rate distribution for \((\N, \le)\) studied in Section 1 are all geometric distributions. In a sequence of Bernoulli trials, the geometric distribution on \(\N\) governs the number of failures before the first success. To summarize, the distribution with constant rate \(\alpha \in (0, 1)\) for \((\N, \le)\) is geometric with success parameter \(\alpha\). The distribution with constant rate \(\alpha \in (0, \infty)\) for \((\N, \lt)\) is geometric with success parameter \(\alpha / (1 + \alpha)\). The distribution with constant rate \(\alpha \in (1, \infty)\) for \((\N, \upa)\) is geometric with success parameter \((\alpha - 1) / \alpha\). The distribution with constant rate \(\alpha \in (1 / 2, 1)\) for \((\N, \rta)\) is geometric with success parameter \((2 \alpha - 1) / \alpha\). Here is another way of looking at the results.
Suppose that \(X\) has the geometric distribution on \(\N\) with success parameter \(p \in (0, 1)\), so that \(X\) has density function \(f\) given by \(f(x) = p (1 - p)^x\) for \(x \in \N\). Then
The results follow by solving for \(\alpha\) in terms of \(p\). In terms of part (c), note that the standard graph \((\N, \le)\) is completely uniform in the terminology of Section 1.2, since there is exactly one path in the covering graph from \(x\) to \(y\) for each \(x, \, y \in \N\) with \(x \lt y\). So by a result in Section 1.5, the fact that \(X\) has constant rate \(1 / (1 - p)\) for \((\N, \upa)\) implies that \(X\) has constant rate \(1 / (1 - p)^n\) for \((\N, \upa^n)\) for each \(n \in \N\), and constant rate \(p\) for \((\N, \le)\).
The app below is a simulation of the geometric distribution with success parameter \(p\). The parameter \(p\) can be varied with the scrollbar and in addition, the app shows the rate constants \(\alpha_1\) for \((\N, \le)\), \(\alpha_2\) for \((\N, \lt)\), \(\alpha_3\) for \((\N, \upa)\), and \(\alpha_4\) for \((\N, \rta)\).
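Solving each relation in the summary for \(\alpha\) gives the rate constants as functions of \(p\). The sketch below assumes Python with exact rational arithmetic; the function name and dictionary keys are our own labels for the four graphs.

```python
from fractions import Fraction

def rate_constants(p):
    """Rate constants of the geometric law on N with success parameter p."""
    return {
        "le": p,                    # standard graph (N, <=)
        "lt": p / (1 - p),          # strict order graph (N, <)
        "cover": 1 / (1 - p),       # covering graph
        "reflexive": 1 / (2 - p),   # reflexive closure of the covering graph
    }

p = Fraction(1, 4)   # illustrative success parameter
r = rate_constants(p)
```

For example, with \(p = 1/4\) the rates are \(1/4\), \(1/3\), \(4/3\), and \(4/7\), respectively, and each lies in the interval required for its graph.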
The geometric distribution also plays a prominent role for the random variable \(Y_N\) in the thinned process for each of the graphs: \(Y_N\) has a geometric distribution for \((\N, \lt)\) and is the sum of two independent geometric variables for the graphs \((\N, \upa)\) and \((\N, \rta)\). Finally, we already know from Section 1 that if \(X\) has the geometric distribution on \(\N\) with success parameter \(p \in (0, 1)\), then \(X\) has entropy \[H(X) = -\ln p - \frac{1 - p}{p} \ln (1 - p)\] and that \(X\) maximizes entropy over all random variables \(Y\) in \(\N\) with \(\E(Y) = \E(X) = (1 - p) / p\). The entropy is expressed in terms of the rate constants for the various graphs in , , and .
A useful variation of the geometric distribution arises in a sequence of independent trials, where the probability of success on the first trial may be different from the common probability of success on the remaining (Bernoulli) trials. This family of distributions will be important in Chapter 8 on subset spaces.
Consider a sequence of independent trials where trial 1 has probability of success \(p_0 \in (0, 1)\) and the remaining trials have probability of success \(p \in (0, 1)\). Let \(N\) denote the number of failures before the first success. Then \(N\) has the modified geometric distribution on \(\N\) with success parameters \(p_0, \, p\). The probability density function \(f\) of \(N\) is given by \[f(0) = p_0; \quad f(x) = (1 - p_0) p (1 - p)^{x - 1}, \; x \in \N_+\]
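The entropy formula follows from \(-\ln f(x) = -\ln p - x \ln(1 - p)\) and the mean of the geometric distribution on \(\N\). A minimal numerical check, assuming Python with floating point arithmetic, a truncated series, and an illustrative value of \(p\); the function names are our own.

```python
from math import log

def entropy_closed_form(p):
    """H(X) = -ln p - ((1 - p) / p) ln(1 - p)."""
    return -log(p) - (1 - p) / p * log(1 - p)

def entropy_series(p, terms=5000):
    """Truncated sum of -f(x) ln f(x) with f(x) = p (1 - p)^x."""
    total = 0.0
    for x in range(terms):
        fx = p * (1 - p) ** x
        if fx > 0.0:            # skip terms that underflow to zero
            total -= fx * log(fx)
    return total

def mean_series(p, terms=5000):
    """Truncated sum of x f(x)."""
    return sum(x * p * (1 - p) ** x for x in range(terms))

p = 0.3   # illustrative success parameter
```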
Let \((I_1, I_2, \ldots)\) denote the sequence of independent trials as described. Then \(\P(N = 0) = \P(I_1 = 1) = p_0\). For \(x \in \N_+\), \[\P(N = x) = \P(I_1 = 0, I_2 = 0, \ldots, I_x = 0, I_{x + 1} = 1) = (1 - p_0)(1 - p)^{x - 1} p\]
Of course, if \(p_0 = p\), the modified geometric distribution reduces to the ordinary geometric distribution on \(\N\) with success parameter \(p\). In the following propositions, \(N\) has the modified geometric distribution on \(\N\) with success parameters \(p_0, \, p \in (0, 1)\).
The conditional distribution of \(N\) given \(N \gt 0\) is the geometric distribution on \(\N_+\) with success parameter \(p\).
\[\P(N = x \mid N \gt 0) = \frac{f(x)}{1 - f(0)} = (1 - p)^{x - 1} p, \quad x \in \N_+\]
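The density and the conditional distribution can be checked directly. The sketch below assumes Python with exact rational arithmetic and illustrative parameters \(p_0 = 1/5\), \(p = 1/2\); the function name is our own.

```python
from fractions import Fraction

def modified_geometric_pmf(x, p0, p):
    """f(0) = p0 and f(x) = (1 - p0) p (1 - p)^(x - 1) for x >= 1."""
    return p0 if x == 0 else (1 - p0) * p * (1 - p) ** (x - 1)

p0, p = Fraction(1, 5), Fraction(1, 2)   # illustrative parameters

# Conditional density of N given N > 0, on {1, ..., 9}.
conditional = [
    modified_geometric_pmf(x, p0, p) / (1 - modified_geometric_pmf(0, p0, p))
    for x in range(1, 10)
]

# Partial sums of the density: 1 - (1 - p0) (1 - p)^n after the terms 0, ..., n.
partial = sum(modified_geometric_pmf(x, p0, p) for x in range(11))
```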
The mean and variance of \(N\) are
Direct computations are simple, but we can also use and elementary facts about the geometric distribution on \(\N_+\).
The probability generating function \(P\) of \(N\) is given by \[P(t) = \frac{p_0 + (p - p_0) t}{1 - (1 - p) t}, \quad \left|t\right| \lt \frac{1}{1 - p}\]
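The closed form can be compared with partial sums of \(\E(t^N)\) at a point where the series converges, that is, where \((1 - p) \left|t\right| \lt 1\). A minimal sketch, assuming Python with floating point arithmetic and illustrative parameter values; the function names are our own.

```python
def pgf_closed_form(t, p0, p):
    """P(t) = [p0 + (p - p0) t] / [1 - (1 - p) t]."""
    return (p0 + (p - p0) * t) / (1 - (1 - p) * t)

def pgf_series(t, p0, p, terms=400):
    """Partial sum of E(t^N) computed from the density of N."""
    total = p0
    for x in range(1, terms):
        total += (1 - p0) * p * (1 - p) ** (x - 1) * t ** x
    return total

p0, p, t = 0.2, 0.5, 1.5   # illustrative values with (1 - p) t = 0.75 < 1
```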
The reliability function \(F\) and the rate function \(r\) of \(N\) for the graph \((\N, \le)\) are as follows:
So \(N\) has constant rate \(p\) on \(\N_+\). On the other hand, if \(p_0 \ne p\), the memoryless property \(F(x + y) = F(x) F(y)\) for the semigroup \((\N, +)\) never holds for \(x, \, y \in \N_+\).
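Both claims can be illustrated numerically. The sketch below assumes Python with exact rational arithmetic and illustrative parameters \(p_0 = 1/5\), \(p = 1/2\). It computes the reliability function \(F(x) = \P(N \ge x)\) for \((\N, \le)\) directly from the density rather than from a closed form; the names are our own.

```python
from fractions import Fraction

p0, p = Fraction(1, 5), Fraction(1, 2)   # illustrative parameters, p0 != p

def f(x):
    """Density of the modified geometric distribution."""
    return p0 if x == 0 else (1 - p0) * p * (1 - p) ** (x - 1)

def F(x):
    """P(N >= x), computed as 1 minus the mass strictly below x."""
    return 1 - sum(f(k) for k in range(x))

def rate(x):
    """Rate function f / F for the graph (N, <=)."""
    return f(x) / F(x)

# Pairs (x, y) of positive integers at which the memoryless property holds.
memoryless_pairs = [
    (x, y) for x in range(1, 6) for y in range(1, 6)
    if F(x + y) == F(x) * F(y)
]
```

With these parameters the rate is \(p_0\) at \(0\) and \(p\) at every positive integer, and `memoryless_pairs` is empty.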
The app below is a simulation of the modified geometric distribution with success parameters \(p_0, \, p\). The parameters can be varied with the scrollbars.