The Arcsine Distribution

\(\newcommand{\R}{\mathbb{R}}\) \(\newcommand{\N}{\mathbb{N}}\) \(\newcommand{\E}{\mathbb{E}}\) \(\newcommand{\P}{\mathbb{P}}\) \(\newcommand{\var}{\text{var}}\) \(\newcommand{\sd}{\text{sd}}\) \(\newcommand{\cov}{\text{cov}}\) \(\newcommand{\cor}{\text{cor}}\) \(\newcommand{\skw}{\text{skew}}\) \(\newcommand{\kur}{\text{kurt}}\)

The arcsine distribution is important in the study of Brownian motion and prime numbers, among other applications.

The Standard Arcsine Distribution

Distribution Functions

The standard arcsine distribution is a continuous distribution on the interval \((0, 1)\) with probability density function \(g\) given by \[g(x) = \frac{1}{\pi \sqrt{x (1 - x)}}, \quad x \in (0, 1)\]

Details:

There are a couple of ways to see that \( g \) is a valid PDF. First, it's the beta PDF with parameters \( a = b = \frac{1}{2} \): \[ g(x) = \frac{1}{B(1/2, 1/2)} x^{-1/2} (1 - x)^{-1/2}, \quad x \in (0, 1) \] since we recall that \( B\left(\frac 1 2, \frac 1 2\right) = \pi \). A direct proof is also easy: The substitution \(u = \sqrt{x}\), \(x = u^2\), \(dx = 2 u \, du\) gives \[\int_0^1 \frac{1}{\pi \sqrt{x (1 - x)}} dx = \int_0^1 \frac{2}{\pi \sqrt{1 - u^2}} du = \frac{2}{\pi} \arcsin u \biggm\vert_0^1 = \frac{2}{\pi} \left(\frac{\pi}{2} - 0\right) = 1\]

The standard arcsine probability density function \(g\) satisfies the following properties:

\(g\) is symmetric about \(x = \frac{1}{2}\).
\(g\) decreases and then increases with minimum value at \( x = \frac{1}{2} \).
\( g \) is concave upward
\(g(x) \to \infty\) as \(x \downarrow 0\) and as \(x \uparrow 1\).

Details:

Note that \(g\) is a function of \(x\) only through \(x (1 - x)\).
This follows from standard calculus: \[g^\prime(x) = \frac{2 x - 1}{2 \pi [x (1 - x)]^{3/2}}\]
This also follows from standard calculus: \[ g^{\prime\prime}(x) = \frac{3 - 8 x + 8 x^2}{4 \pi [x (1 - x)]^{5/2}} \]
The limits are clear.

Open the special distribution simulator and select the arcsine distribution. Keep the default parameter values for the standard arcsine distribution and note the shape of the probability density function. Run the simulation 1000 times and compare the emprical density function to the probability density function.

The distribution function has a simple expression in terms of the arcsine function, again justifying the name of the distribution.

The standard arcsine distribution function \(G\) is given by \(G(x) = \frac{2}{\pi} \arcsin\left(\sqrt{x}\right)\) for \(x \in [0, 1]\).

Details:

Again, using the substitution \(u = \sqrt{t}\), \(t = u^2\), \(dt = 2 u \, du\): \[G(x) = \int_0^x \frac{1}{\pi \sqrt{t (1 - t)}} dt = \int_0^{\sqrt{x}} \frac{2}{\pi \sqrt{1 - u^2}} du = \frac{2}{\pi}\arcsin(t) \biggm\vert_0^{\sqrt{x}} = \frac{2}{\pi} \arcsin\left(\sqrt{x}\right) \]

Not surprisingly, the quantile function has a simple expression in terms of the sine function.

The standard arcinse quantile function \(G^{-1}\) is given by \(G^{-1}(p) = \sin^2\left(\frac{\pi}{2} p\right)\) for \(p \in [0, 1]\). In particular, the quartiles are

\(q_1 = \sin^2\left(\frac{\pi}{8}\right) = \frac{1}{4}(2 - \sqrt{2}) \approx 0.1464\), the first quartile
\(q_2 = \frac{1}{2}\), the median
\(q_3 = \sin^2\left(\frac{3 \pi}{8}\right) = \frac{1}{4}(2 + \sqrt{2}) \approx 0.8536\), the third quartile

Details:

The formula for the quantile function follows from by solving \(p = G(x)\) for \(x\) in terms of \(p \in [0, 1]\).

Open the quantile app and select the arcsine distribution. Keep the default parameter values for the standard arcsine distribution and note the shape of the distribution function. Compute the quantiles of order 0.1 and 0.9.

Moments

Suppose that random variable \( Z \) has the standard arcsine distribution. First we give the mean and variance.

The mean and variance of \(Z\) are

\(\E(Z) = \frac{1}{2}\)
\(\var(Z) = \frac{1}{8}\)

Details:

The mean is \(\frac{1}{2}\) by symmetry.
Using the usual substitution \(u = \sqrt{x}\), \(x = u^2\) \(dx = 2 u \, du\) and then the substitution \(u = \sin \theta\), \(du = \cos \theta \, d\theta\) gives \[\E\left(Z^2\right) = \int_0^1 \frac{1}{\pi \sqrt{x (1 - x)}} dx = \int_0^1 \frac{2 u^4}{\pi \sqrt{1 - u^2}} = \int_0^{\pi/2} \frac{2}{\pi} \sin^4(\theta) d\theta = \frac{2}{\pi} \frac{3 \pi}{16} = \frac{3}{8}\]

Open the special distribution simulator and select the arcsine distribution. Keep the default parameter values for the standard arcsine distribution. Run the simulation 1000 times and compare the empirical mean and stadard deviation to the true mean and standard deviation.

For \( n \in \N \), \[\E\left(Z^n\right) = \prod_{j=0}^{n-1} \frac{2 j + 1}{2 j + 2}\]

Details:

The same integral substitutions as in gives \[\E(Z^n) = \int_0^{\pi/2} \frac{2}{\pi} \sin^{2 n}(\theta) d\theta = \prod_{j=0}^{n-1} \frac{2 j + 1}{2 j + 2}\]

Of course, the moments can be used to give a formula for the moment generating function, but this formula is not particularly helpful since it is not in closed form.

\(Z\) has moment generating function \(m\) given by \[m(t) = \E\left(e^{t Z}\right) = \sum_{n=0}^\infty \left(\prod_{j=0}^{n-1} \frac{2 j + 1}{2 j + 2}\right) \frac{t^n}{n!}, \quad t \in \R\]

The skewness and kurtosis of \(Z\) are

\(\skw(Z) = 0\)
\(\kur(Z) = \frac{3}{2}\)

Details:

The skewness is 0 by the symmetry of the distribution.
The result for the kurtosis follows from the standard formula for kurtosis in terms of the moments in : \(\E(Z) = \frac{1}{2}\), \(\E\left(Z^2\right) = \frac{3}{8}\), \(\E\left(Z^3\right) = \frac{5}{16}\), and \(\E\left(Z^4\right) = \frac{35}{128}\).

Related Distributions

As noted earlier, the standard arcsine distribution is a special case of the beta distribution.

The standard arcsine distribution is the beta distribution with left parameter \(\frac{1}{2}\) and right parameter \(\frac{1}{2}\).

Details:

The beta distribution with parameters \(a = b = \frac{1}{2}\) has PDF \[x \mapsto \frac{1}{B(1/2, 1/2)} x^{-1/2}(1 - x)^{-1/2}, \quad x \in (0, 1)\] But \(B\left(\frac{1}{2}, \frac{1}{2}\right) = \pi\), so this is the standard arcsine PDF.

Since the quantile function in is in closed form, the standard arcsine distribution can be simulated by the random quantile method.

Connections with the standard uniform distribution.

If \(U\) has the standard uniform distribution (a random number) then \(X = \sin^2\left(\frac{\pi}{2} U\right)\) has the standard arcsine distribution.
If \(X\) has the standard arcsine distribution then \(U = \frac{2}{\pi} \arcsin\left(\sqrt{X}\right)\) has the standard uniform distribution.

Open the random quantile simulator and select the arcsine distribution. Keep the default parameters for the standard arcsine distribution. Run the experiment 1000 times and compare the empirical probability density function, mean, and standard deviation to their distributional counterparts. Note how the random quantiles simulate the distribution.

The following exercise illustrates the connection between the Brownian motion process and the standard arcsine distribution.

Open the Brownian motion simulator. Keep the default time parameter and select the last zero random variable. Note that this random variable has the standard arcsine distribution. Run the experiment 1000 time and compare the empirical probability density function, mean, and standard deviation to their distributional counterparts. Note how the last zero simulates the distribution.

The General Arcsine Distribution

Definition

If \(Z\) has the standard arcsine distribution, and if \(a \in \R\) and \(b \in (0, \infty)\), then \(X = a + b Z\) has the arcsine distribution with location parameter \(a\) and scale parameter \(b\).

Distribution Functions

Suppose that \(X\) has the arcsine distribution with location parameter \(a \in \R\) and scale parameter \(b \in (0, \infty)\).

\( X \) has probability density function \(f\) given by \[f(x) = \frac{1}{\pi \sqrt{(x - a)(a + b - x)}}, \quad x \in (a, a + b)\]

\(f\) is symmetric about \(a + \frac{1}{2} b\).
\(f\) decreases and then increases with minimum value at \( x = a + \frac{1}{2} b \).
\( f \) is concave upward.
\(f(x) \to \infty\) as \(x \downarrow a\) and as \(x \uparrow a + b\).

Details:

Recall that \(f(x) = \frac{1}{b} g\left(\frac{x - a}{b}\right)\) where \(g\) is the PDF of the standard arcsine distribution in .

An alternate parameterization of the general arcsine distribution is by the endpoints of the support interval: the left endpoint (location parameter) \(a\) and the right endpoint \(b = a + b\).

Open the special distribution simulator and select the arcsine distribution. Vary the location and scale parameters and note the shape and location of the probability density function. For selected values of the parameters, run the simulation 1000 times and compare the emprical density function to the probability density function.

Once again, the distribution function has a simple representation in terms of the arcsine function.

\(X\) has distribution function \(F\) given by \[F(x) = \frac{2}{\pi} \arcsin\left(\sqrt{\frac{x - a}{b}}\right), \quad x \in [a, a + b]\]

Details:

Recall that \(F(x) = G[(x - a) / b)\) where \(G\) is the CDF of the standard arcsine distribution in .

As before, the quantile function has a simple representation in terms of the sine functioon

\(X\) has quantile function \(F^{-1}\) given by \(F^{-1}(p) = a + b \sin^2\left(\frac{\pi}{2} p\right)\) for \(p \in [0, 1]\) In particular, the quartiles of \(X\) are

\(q_1 = a + b \sin^2\left(\frac{\pi}{8}\right) = a + \frac{1}{4}\left(2 - \sqrt{2}\right) b\), the first quartile
\(q_2 = a + \frac{1}{2} b\), the median
\(q_3 = a + b \sin^2\left(\frac{3 \pi}{8}\right) = a + \frac{1}{4}\left(2 + \sqrt{2}\right) b\), the third quartile

Details:

Recall that \(F^{-1}(p) = a + b G^{-1}(p)\) where \(G^{-1}\) is the quantile function of the standard arcsine distribution in .

Open the quantile app and select the arcsine distribution. Vary the parameters and note the shape and location of the distribution function. For various values of the parameters, compute the quantiles of order 0.1 and 0.9.

Moments

Again, we assume that \(X\) has the arcsine distribution with location parameter \(a \in \R\) and scale parameter \(b \in (0, \infty)\). First we give the mean and variance.

The mean and variance of \(X\) are

\(\E(X) = a + \frac{1}{2} b\)
\(\var(X) = \frac{1}{8} b^2\)

Details:

These results from the representation \(X = a + b Z\) and the results for the mean and variance of \( Z \) in .

Open the special distribution simulator and select the arcsine distribution. Vary the parameters and note the size and location of the mean \(\pm\) standard deviation bar. For various values of the parameters, run the simulation 1000 times and compare the empirical mean and stadard deviation to the true mean and standar deviation.

The moments of \(X\) can be obtained from the moments of \(Z\), but the results are messy, except when the location parameter is 0.

Suppose the location parameter \(a = 0\). For \( n \in \N \), \[\E(X^n) = b^n \prod_{j=0}^{n-1} \frac{2 j + 1}{2 j + 2}\]

Details:

This follows from the representation \(X = b Z\) and the results for the moments of \( Z \) in .

The moment generating function can be expressed as a series with product coefficients, and so is not particularly helpful.

\(X\) has moment generating function \(M\) given by \[M(t) = \E\left(e^{t X}\right) = e^{a t} \sum_{n=0}^\infty \left(\prod_{j=0}^{n-1} \frac{2 j + 1}{2 j + 2}\right) \frac{b^n t^n}{n!}, \quad t \in \R\]

Details:

Recall that \(M(t) = e^{a t} m(b t)\) where \(m\) is the moment generating function of \( Z \) in .

The skewness and kurtosis of \(X\) are

\(\skw(X) = 0\)
\(\kur(X) = \frac{3}{2}\)

Details:

Recall that the skewness and kurtosis are defined in terms of the standard score of \(X\) and hence are invariant under a location-scale transformation.

Related Distributions

By construction, the general arcsine distribution is a location-scale family, and so is closed under location-scale transformations.

If \(X\) has the arcsine distribution with location parameter \(a \in \R\) and scale parameter \(b \in (0, \infty)\) and if \(c \in \R\) and \(d \in (0, \infty)\) then \(c + d X\) has the arcsine distribution with location parameter \(c + a d\) scale parameter \(d b\).

Details:

By definition we can take \( X = a + b Z \) where \( Z \) has the standard arcsine distribution. Hence \( c + d X = (c + d a) + (d b) Z \).

Since the quantile function is in closed form, the arcsine distribution can be simulated by the random quantile method.

Suppose that \( a \in \R \) and \( b \in (0, \infty) \).

If \(U\) has the standard uniform distribution (a random number) then \(X = a + b \sin^2\left(\frac{\pi}{2} U\right)\) has the arcsine distribution with location parameter \(a\) and scale parameter \(b\).
If \(X\) has the arcsine distribution with location parameter \(a\) and scale parameter \(b\) then \(U = \frac{2}{\pi} \arcsin\left(\sqrt{\frac{X - a}{b}}\right)\) has the standard uniform distribution.

Open the random quantile simulator and select the arcsine distribution. Vary the parameters and note the location and shape of the probability density function. For selected parameter values, run the experiment 1000 times and compare the empirical probability density function, mean, and standard deviation to their distributional counterparts. Note how the random quantiles simulate the distribution.

The following exercise illustrates the connection between the Brownian motion process and the arcsine distribution.

Open the Brownian motion simulator and select the last zero random variable. Vary the time parameter \( t \) and note that the last zero has the arcsine distribution on the interval \( (0, t) \). Run the experiment 1000 time and compare the empirical probability density function, mean, and standard deviation to their distributional counterparts. Note how the last zero simulates the distribution.