14.4 Negative-Binomial-2-Log Generalized Linear Model (Negative Binomial Regression)
Stan also supplies a single function for a generalized linear model with negative binomial likelihood and log link function, i.e. a function for a negative binomial regression. This provides a more efficient implementation of negative binomial regression than a manually written regression in terms of a negative binomial likelihood and matrix multiplication.
14.4.1 Probability Mass Function
If \(x\in \mathbb{R}^{n\cdot m}, \alpha \in \mathbb{R}^n, \beta\in \mathbb{R}^m, \phi\in \mathbb{R}^+\), then for \(y \in \mathbb{N}^n\), \[ \text{NegBinomial2LogGLM}(y~|~x, \alpha, \beta, \phi) = \prod_{1\leq i \leq n}\text{NegBinomial2}(y_i~|~\exp(\alpha_i + x_i\cdot \beta), \phi). \]
14.4.2 Sampling Statement
y ~
neg_binomial_2_log_glm
(x, alpha, beta, phi)
Increment target log probability density with neg_binomial_2_log_glm_lpmf(y | x, alpha, beta, phi)
dropping constant additive terms.
14.4.3 Stan Functions
real
neg_binomial_2_log_glm_lpmf
(int y | matrix x, real alpha, vector beta, real phi)
The log negative binomial probability mass of y
given log-location
alpha + x * beta
and inverse overdispersion parameter phi
, where the same
intercept alpha
, inverse overdispersion parameter phi
and dependant variable y
are used for all observations.
The number of columns of x
needs to match the size of the coefficient vector beta
.
If x
and y
are data (not parameters) this function can be executed on a GPU.
real
neg_binomial_2_log_glm_lpmf
(int y | matrix x, vector alpha, vector beta, real phi)
The log negative binomial probability mass of y
given log-location
alpha + x * beta
and inverse overdispersion parameter phi
, where the
same inverse overdispersion parameter phi
and independent variable value y
are used for all observations. Intercept alpha
is used that is allowed to vary by observation.
The number of rows of x
must match the size of alpha
and
the number of columns of x
needs to match the size of the coefficient vector beta
.
If x
and y
are data (not parameters) this function can be executed on a GPU.
real
neg_binomial_2_log_glm_lpmf
(int[] y | row_vector x, real alpha, vector beta, real phi)
The log negative binomial probability mass of y
given log-location
alpha + x * beta
and inverse overdispersion parameter phi
, where the
same intercept alpha
, inverse overdispersion parameter phi
and independent variable values x
are used for all observations.
The number of columns of x
needs to match the size of the coefficient vector beta
.
real
neg_binomial_2_log_glm_lpmf
(int[] y | row_vector x, vector alpha, vector beta, real phi)
The log negative binomial probability mass of y
given log-location
alpha + x * beta
and inverse overdispersion parameter phi
, where the
same inverse overdispersion parameter phi
and independent variable values x
are used for all observations. Intercept alpha
is used that is allowed to vary by observation.
The size of y
must match the size of alpha
and
the number of columns of x
needs to match the size of the coefficient vector beta
.
real
neg_binomial_2_log_glm_lpmf
(int[] y | matrix x, real alpha, vector beta, real phi)
The log negative binomial probability mass of y
given log-location
alpha + x * beta
and inverse overdispersion parameter phi
, where the
same intercept alpha
and inverse overdispersion parameter phi
is used for all observations. The
number of rows of the independent variable matrix x
needs to match
the size of the dependent variable vector y
and the number of
columns of x
needs to match the size of the coefficient vector beta
.
If x
and y
are data (not parameters) this function can be executed on a GPU.
real
neg_binomial_2_log_glm_lpmf
(int[] y | matrix x, vector alpha, vector beta, real phi)
The log negative binomial probability mass of y
given log-location
alpha + x * beta
and inverse overdispersion parameter phi
, where the
same inverse overdispersion parameter phi
is used for all observations and an intercept alpha
is used that is allowed to vary by observation. The
number of rows of the independent variable matrix x
needs to match
the size of the dependent variable vector y
and alpha
and the
number of columns of x
needs to match the size of the coefficient
vector beta
.
If x
and y
are data (not parameters) this function can be executed on a GPU.