The arcsine distribution is important in the study of Brownian motion and prime numbers, among other applications.
The standard arcsine distribution is a continuous distribution on the interval \((0, 1)\) with probability density function \(g\) given by \[g(x) = \frac{1}{\pi \sqrt{x (1 - x)}}, \quad x \in (0, 1)\]
There are a couple of ways to see that \( g \) is a valid PDF. First, it's the beta PDF with parameters \( a = b = \frac{1}{2} \): \[ g(x) = \frac{1}{B(1/2, 1/2)} x^{-1/2} (1 - x)^{-1/2}, \quad x \in (0, 1) \] since we recall that \( B\left(\frac 1 2, \frac 1 2\right) = \pi \). A direct proof is also easy: The substitution \(u = \sqrt{x}\), \(x = u^2\), \(dx = 2 u \, du\) gives \[\int_0^1 \frac{1}{\pi \sqrt{x (1 - x)}} dx = \int_0^1 \frac{2}{\pi \sqrt{1 - u^2}} du = \frac{2}{\pi} \arcsin u \biggm\vert_0^1 = \frac{2}{\pi} \left(\frac{\pi}{2} - 0\right) = 1\]
The occurrence of the arcsine function in the details of explains the name.
The standard arcsine probability density function \(g\) satisfies the following properties:
In particular, the standard arcsine distribution is U-shaped and has no mode.
Open the special distribution simulator and select the arcsine distribution. Keep the default parameter values for the standard arcsine distribution and note the shape of the probability density function. Run the simulation 1000 times and compare the emprical density function to the probability density function.
The distribution function has a simple expression in terms of the arcsine function, again justifying the name of the distribution.
The standard arcsine distribution function \(G\) is given by \(G(x) = \frac{2}{\pi} \arcsin\left(\sqrt{x}\right)\) for \(x \in [0, 1]\).
Again, using the substitution \(u = \sqrt{t}\), \(t = u^2\), \(dt = 2 u \, du\): \[G(x) = \int_0^x \frac{1}{\pi \sqrt{t (1 - t)}} dt = \int_0^{\sqrt{x}} \frac{2}{\pi \sqrt{1 - u^2}} du = \frac{2}{\pi}\arcsin(t) \biggm\vert_0^{\sqrt{x}} = \frac{2}{\pi} \arcsin\left(\sqrt{x}\right) \]
Not surprisingly, the quantile function has a simple expression in terms of the sine function.
The standard arcinse quantile function \(G^{-1}\) is given by \(G^{-1}(p) = \sin^2\left(\frac{\pi}{2} p\right)\) for \(p \in [0, 1]\). In particular, the quartiles are
Open the quantile app and select the arcsine distribution. Keep the default parameter values for the standard arcsine distribution and note the shape of the distribution function. Compute the quantiles of order 0.1 and 0.9.
Suppose that random variable \( Z \) has the standard arcsine distribution. First we give the mean and variance.
The mean and variance of \(Z\) are
Open the special distribution simulator and select the arcsine distribution. Keep the default parameter values for the standard arcsine distribution. Run the simulation 1000 times and compare the empirical mean and stadard deviation to the true mean and standard deviation.
The gemeral moments about 0 can be expressed as products.
For \( n \in \N \), \[\E\left(Z^n\right) = \prod_{j=0}^{n-1} \frac{2 j + 1}{2 j + 2}\]
Of course, the moments can be used to give a formula for the moment generating function, but this formula is not particularly helpful since it is not in closed form.
\(Z\) has moment generating function \(m\) given by \[m(t) = \E\left(e^{t Z}\right) = \sum_{n=0}^\infty \left(\prod_{j=0}^{n-1} \frac{2 j + 1}{2 j + 2}\right) \frac{t^n}{n!}, \quad t \in \R\]
Finally we give the skewness and kurtosis.
The skewness and kurtosis of \(Z\) are
As noted earlier, the standard arcsine distribution is a special case of the beta distribution.
The standard arcsine distribution is the beta distribution with left parameter \(\frac{1}{2}\) and right parameter \(\frac{1}{2}\).
The beta distribution with parameters \(a = b = \frac{1}{2}\) has PDF \[x \mapsto \frac{1}{B(1/2, 1/2)} x^{-1/2}(1 - x)^{-1/2}, \quad x \in (0, 1)\] But \(B\left(\frac{1}{2}, \frac{1}{2}\right) = \pi\), so this is the standard arcsine PDF.
Since the quantile function in is in closed form, the standard arcsine distribution can be simulated by the random quantile method.
Connections with the standard uniform distribution.
Open the random quantile simulator and select the arcsine distribution. Keep the default parameters for the standard arcsine distribution. Run the experiment 1000 times and compare the empirical probability density function, mean, and standard deviation to their distributional counterparts. Note how the random quantiles simulate the distribution.
The following exercise illustrates the connection between the Brownian motion process and the standard arcsine distribution.
Open the Brownian motion simulator. Keep the default time parameter and select the last zero random variable. Note that this random variable has the standard arcsine distribution. Run the experiment 1000 time and compare the empirical probability density function, mean, and standard deviation to their distributional counterparts. Note how the last zero simulates the distribution.
The standard arcsine distribution is generalized by adding location and scale parameters.
If \(Z\) has the standard arcsine distribution, and if \(a \in \R\) and \(b \in (0, \infty)\), then \(X = a + b Z\) has the arcsine distribution with location parameter \(a\) and scale parameter \(b\).
So \(X\) has a continuous distribution on the interval \( (a, a + b) \).
Suppose that \(X\) has the arcsine distribution with location parameter \(a \in \R\) and scale parameter \(b \in (0, \infty)\).
\( X \) has probability density function \(f\) given by \[f(x) = \frac{1}{\pi \sqrt{(x - a)(a + b - x)}}, \quad x \in (a, a + b)\]
An alternate parameterization of the general arcsine distribution is by the endpoints of the support interval: the left endpoint (location parameter) \(a\) and the right endpoint \(b = a + b\).
Open the special distribution simulator and select the arcsine distribution. Vary the location and scale parameters and note the shape and location of the probability density function. For selected values of the parameters, run the simulation 1000 times and compare the emprical density function to the probability density function.
Once again, the distribution function has a simple representation in terms of the arcsine function.
\(X\) has distribution function \(F\) given by \[F(x) = \frac{2}{\pi} \arcsin\left(\sqrt{\frac{x - a}{b}}\right), \quad x \in [a, a + b]\]
As before, the quantile function has a simple representation in terms of the sine functioon
\(X\) has quantile function \(F^{-1}\) given by \(F^{-1}(p) = a + b \sin^2\left(\frac{\pi}{2} p\right)\) for \(p \in [0, 1]\) In particular, the quartiles of \(X\) are
Open the quantile app and select the arcsine distribution. Vary the parameters and note the shape and location of the distribution function. For various values of the parameters, compute the quantiles of order 0.1 and 0.9.
Again, we assume that \(X\) has the arcsine distribution with location parameter \(a \in \R\) and scale parameter \(b \in (0, \infty)\). First we give the mean and variance.
The mean and variance of \(X\) are
Open the special distribution simulator and select the arcsine distribution. Vary the parameters and note the size and location of the mean \(\pm\) standard deviation bar. For various values of the parameters, run the simulation 1000 times and compare the empirical mean and stadard deviation to the true mean and standar deviation.
The moments of \(X\) can be obtained from the moments of \(Z\), but the results are messy, except when the location parameter is 0.
Suppose the location parameter \(a = 0\). For \( n \in \N \), \[\E(X^n) = b^n \prod_{j=0}^{n-1} \frac{2 j + 1}{2 j + 2}\]
The moment generating function can be expressed as a series with product coefficients, and so is not particularly helpful.
\(X\) has moment generating function \(M\) given by \[M(t) = \E\left(e^{t X}\right) = e^{a t} \sum_{n=0}^\infty \left(\prod_{j=0}^{n-1} \frac{2 j + 1}{2 j + 2}\right) \frac{b^n t^n}{n!}, \quad t \in \R\]
Finally, the skewness and kurtosis are unchanged.
The skewness and kurtosis of \(X\) are
Recall that the skewness and kurtosis are defined in terms of the standard score of \(X\) and hence are invariant under a location-scale transformation.
By construction, the general arcsine distribution is a location-scale family, and so is closed under location-scale transformations.
If \(X\) has the arcsine distribution with location parameter \(a \in \R\) and scale parameter \(b \in (0, \infty)\) and if \(c \in \R\) and \(d \in (0, \infty)\) then \(c + d X\) has the arcsine distribution with location parameter \(c + a d\) scale parameter \(d b\).
Since the quantile function is in closed form, the arcsine distribution can be simulated by the random quantile method.
Suppose that \( a \in \R \) and \( b \in (0, \infty) \).
Open the random quantile simulator and select the arcsine distribution. Vary the parameters and note the location and shape of the probability density function. For selected parameter values, run the experiment 1000 times and compare the empirical probability density function, mean, and standard deviation to their distributional counterparts. Note how the random quantiles simulate the distribution.
The following exercise illustrates the connection between the Brownian motion process and the arcsine distribution.
Open the Brownian motion simulator and select the last zero random variable. Vary the time parameter \( t \) and note that the last zero has the arcsine distribution on the interval \( (0, t) \). Run the experiment 1000 time and compare the empirical probability density function, mean, and standard deviation to their distributional counterparts. Note how the last zero simulates the distribution.