Multivariate Discrete Distributions

The multivariate discrete distributions are over multiple integer values, which are expressed in Stan as arrays.

Multinomial distribution

Probability mass function

If $K \in N$ , $N \in N$ , and $θ \in K -simplex$ , then for $y \in N^{K}$ such that $\sum_{k = 1}^{K} y_{k} = N$ , $Multinomial (y | θ) = (\binom{N}{y_{1}, \dots, y_{K}}) \prod_{k = 1}^{K} θ_{k}^{y_{k}},$ where the multinomial coefficient is defined by $(\binom{N}{y_{1}, \dots, y_{k}}) = \frac{N!}{\prod_{k = 1}^{K} y_{k}!} .$

Distribution statement

y ~ multinomial(theta)

Increment target log probability density with multinomial_lupmf(y | theta).

Available since 2.0

Stan functions

real multinomial_lpmf(array[] int y | vector theta)
The log multinomial probability mass function with outcome array y of size $K$ given the $K$ -simplex distribution parameter theta and (implicit) total count N = sum(y)

Available since 2.12

real multinomial_lupmf(array[] int y | vector theta)
The log multinomial probability mass function with outcome array y of size $K$ given the $K$ -simplex distribution parameter theta and (implicit) total count N = sum(y) dropping constant additive terms

Available since 2.25

array[] int multinomial_rng(vector theta, int N)
Generate a multinomial variate with simplex distribution parameter theta and total count $N$ ; may only be used in transformed data and generated quantities blocks

Available since 2.8

Multinomial distribution, logit parameterization

Stan also provides a version of the multinomial probability mass function distribution with the $K -simplex$ for the event count probabilities per category given on the unconstrained logistic scale.

Probability mass function

If $K \in N$ , $N \in N$ , and $softmax (θ) \in K -simplex$ , then for $y \in N^{K}$ such that $\sum_{k = 1}^{K} y_{k} = N$ , $\begin{aligned} MultinomialLogit (y ∣ γ) & = Multinomial (y ∣ softmax (γ)) \\ = (\binom{N}{y_{1}, \dots, y_{K}}) \prod_{k = 1}^{K} [softmax (γ_{k})]^{y_{k}}, \end{aligned}$ where the multinomial coefficient is defined by $(\binom{N}{y_{1}, \dots, y_{k}}) = \frac{N!}{\prod_{k = 1}^{K} y_{k}!} .$

Distribution statement

y ~ multinomial_logit(gamma)

Increment target log probability density with multinomial_logit_lupmf(y | gamma).

Available since 2.24

Stan functions

real multinomial_logit_lpmf(array[] int y | vector gamma)
The log multinomial probability mass function with outcome array y of size $K$ given the log $K$ -simplex distribution parameter $γ$ and (implicit) total count N = sum(y)

Available since 2.24

real multinomial_logit_lupmf(array[] int y | vector gamma)
The log multinomial probability mass function with outcome array y of size $K$ given the log $K$ -simplex distribution parameter $γ$ and (implicit) total count N = sum(y) dropping constant additive terms

Available since 2.25

array[] int multinomial_logit_rng(vector gamma, int N)
Generate a variate from a multinomial distribution with probabilities softmax(gamma) and total count N; may only be used in transformed data and generated quantities blocks.

Available since 2.24

Dirichlet-multinomial distribution

Stan also provides the Dirichlet-multinomial distribution, which generalizes the Beta-binomial distribution to more than two categories. As such, it is an overdispersed version of the multinomial distribution.

Probability mass function

If $K \in N$ , $N \in N$ , and $α \in R_{+}^{K}$ , then for $y \in N^{K}$ such that $\sum_{k = 1}^{K} y_{k} = N$ , the PMF of the Dirichlet-multinomial distribution is defined as $DirMult (y | θ) = \frac{Γ (α_{0}) Γ (N + 1)}{Γ (N + α_{0})} \prod_{k = 1}^{K} \frac{Γ (y_{k} + α_{k})}{Γ (α_{k}) Γ (y_{k} + 1)},$ where $α_{0}$ is defined as $α_{0} = \sum_{k = 1}^{K} α_{k}$ .

Distribution statement

y ~ dirichlet_multinomial(alpha)

Increment target log probability density with dirichlet_multinomial_lupmf(y | alpha).

Available since 2.34

Stan functions

real dirichlet_multinomial_lpmf(array[] int y | vector alpha)
The log multinomial probability mass function with outcome array y with $K$ elements given the positive $K$ -vector distribution parameter alpha and (implicit) total count N = sum(y).

Available since 2.34

real dirichlet_multinomial_lupmf(array[] int y | vector alpha)
The log multinomial probability mass function with outcome array y with $K$ elements, given the positive $K$ -vector distribution parameter alpha and (implicit) total count N = sum(y) dropping constant additive terms.

Available since 2.34

array[] int dirichlet_multinomial_rng(vector alpha, int N)
Generate a multinomial variate with positive vector distribution parameter alpha and total count N; may only be used in transformed data and generated quantities blocks. This is equivalent to multinomial_rng(dirichlet_rng(alpha), N).

Available since 2.34