Hidden Markov Models

An elementary first-order Hidden Markov model is a probabilistic model over $N$ observations, $y_{n}$ , and $N$ hidden states, $x_{n}$ , which can be fully defined by the conditional distributions $p (y_{n} ∣ x_{n}, ϕ)$ and $p (x_{n} ∣ x_{n - 1}, ϕ)$ . Here we make the dependency on additional model parameters, $ϕ$ , explicit. When $x$ is continuous, the user can explicitly encode these distributions in Stan and use Markov chain Monte Carlo to integrate $x$ out.

When each state $x$ takes a value over a discrete and finite set, say ${1, 2, . . ., K}$ , we can take advantage of the dependency structure to marginalize $x$ and compute $p (y ∣ ϕ)$ . We start by defining the conditional observational distribution, stored in a $K \times N$ matrix $ω$ with $ω_{k n} = p (y_{n} ∣ x_{n} = k, ϕ) .$ Next, we introduce the $K \times K$ transition matrix, $Γ$ , with $Γ_{i j} = p (x_{n} = j ∣ x_{n - 1} = i, ϕ) .$ Each row defines a probability distribution and must therefore be a simplex (i.e. its components must add to 1). Currently, Stan only supports stationary transitions where a single transition matrix is used for all transitions. Finally we define the initial state $K$ -vector $ρ$ , with $ρ_{k} = p (x_{0} = k ∣ ϕ) .$

The Stan functions that support this type of model are special in that the user does not explicitly pass $y$ and $ϕ$ as arguments. Instead, the user passes $\log ω$ , $Γ$ , and $ρ$ , which in turn depend on $y$ and $ϕ$ .

Stan functions

real hmm_marginal(matrix log_omega, matrix Gamma, vector rho)
Returns the log probability density of $y$ , with $x_{n}$ integrated out at each iteration.

Available since 2.24

The arguments represent (1) the log density of each output, (2) the transition matrix, and (3) the initial state vector.

log_omega: $\log ω_{k n} = \log p (y_{n} ∣ x_{n} = k, ϕ)$ , log density of each output,
Gamma: $Γ_{i j} = p (x_{n} = j | x_{n - 1} = i, ϕ)$ , the transition matrix,
rho: $ρ_{k} = p (x_{0} = k ∣ ϕ)$ , the initial state probability.

array[] int hmm_latent_rng(matrix log_omega, matrix Gamma, vector rho)
Returns a length $N$ array of integers over ${1, . . ., K}$ , sampled from the joint posterior distribution of the hidden states, $p (x ∣ ϕ, y)$ . May be only used in transformed data and generated quantities.

Available since 2.24

matrix hmm_hidden_state_prob(matrix log_omega, matrix Gamma, vector rho)
Returns the matrix of marginal posterior probabilities of each hidden state value. This will be a $K \times N$ matrix. The $n^{th}$ column is a simplex of probabilities for the $n^{th}$ variable. Moreover, let $A$ be the output. Then $A_{i j} = p (x_{j} = i ∣ ϕ, y)$ . This function may only be used in transformed data and generated quantities.

Available since 2.24