pomp C API

Random variables

Beta-binomial distribution

double rbetabinom(double size, double prob, double theta);
double dbetabinom(double x, double size, double prob, 
  double theta, int give_log);

$X$ is said to be Beta-binomially distributed with size $n$ , mean probability $p$ , and dispersion parameter $\theta$ if $P \sim \mathrm{Beta}\left(\theta\,p,\theta\,(1-p)\right)$ and $X|P \sim \mathrm{Binomial}\left(n,P\right).$ If $X\sim\mathrm{BetaBinomial}\left(n,p,\theta\right)$ , then $\mathbb{E}\left[X\right]=n\,p\qquad\text{and}\qquad\mathrm{Var}\left[X\right]=n\,p\,(1-p)\,\frac{\theta+n}{\theta+1}.$
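As a quick check on the moment formulas above, the following minimal sketch evaluates them directly. The helper name betabinom_moments is hypothetical and is not part of the pomp API; it only restates the closed-form mean and variance.

```c
/* Mean and variance of a BetaBinomial(size, prob, theta) variable,
   evaluated from the closed-form expressions in the text.
   betabinom_moments is a hypothetical helper, not a pomp function. */
void betabinom_moments(double size, double prob, double theta,
                       double *mean, double *var)
{
  *mean = size * prob;
  *var = size * prob * (1.0 - prob) * (theta + size) / (theta + 1.0);
}
```

As theta grows large, the factor (theta+size)/(theta+1) tends to 1 and the variance approaches that of the ordinary binomial.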

Multinomial distribution

The R C API provides a simulator for the multinomial distribution, rmultinom. See the “Rmath.h” header file for information on this facility. pomp provides an evaluator for the probability mass function (dmultinom) for this distribution.

double dmultinom(int m, const double *prob, double *x, int give_log);

Input:

m, a positive integer, is the dimension of the random variable.
prob is a pointer to an m-vector of probabilities.
x is a pointer to an m-vector containing the data.
give_log is an integer:
- give_log=1 if the log probability is desired.
- give_log=0 if probability is desired.

The return value is the probability or log probability (as requested).

Euler-multinomial distribution

The Euler multinomial approximation of a continuous-time, stochastic compartmental model is as follows. Suppose a compartment has occupancy $N_t$ at time $t$ and that there are $K$ ways of exiting the compartment, with per capita rates (hazards) $\mu_1,\dots,\mu_K$ , respectively.

Diagram: A single compartment within a compartmental model. Here, there are $K=2$ paths out of the compartment.

To make the Euler multinomial approximation, we approximate the total exit rate as constant over a small interval $[t,t+\Delta{t})$ . Let the random variable $\Delta{n_k}$ , $k=1,\dots,K$ , be the number that exit by path $k$ in this time interval and $\Delta{n_0}$ be the number that remain. Under this assumption, the vector of numbers of exits, $(\Delta{n_{0}},\Delta{n_{1}},\dots,\Delta{n_{K}})$ is multinomially distributed with size $N_t$ and probabilities $(p_k)_{k=0}^K$ , where $p_0 = \exp\left(-\sum\!\mu_i\,\Delta{t}\right),$ and $p_k = \frac{\mu_k}{\sum\!\mu_i}\,\left(1-p_0\right),\qquad k=1,\dots,K.$ By way of shorthand, we say that $\Delta{n}=(\Delta{n_k})_{k=1}^K$ is Euler-multinomially distributed with size $N_t$ , rates $\mu=(\mu_k)_{k=1}^K$ , and time-step $\Delta{t}$ and we write $\Delta{n} \sim \mathrm{Eulermultinom}\left(N_t,\mu,\Delta{t}\right).$

The pomp C API provides three functions that relate to the Euler-multinomial distribution. Their descriptions follow.

Simulate an Euler-multinomial random variable

The reulermultinom function draws a random sample from this distribution. Using the notation above, one has to pack the $K$ rates $\mu_1,\dots,\mu_K$ into contiguous memory locations and retrieve the results in (a different set of) contiguous memory locations. For example, if rate is a pointer to $K$ contiguous memory locations holding the rates and dn is a pointer to $K$ contiguous memory locations ready to hold the results, then

reulermultinom(K,N,rate,dt,dn);

will result in a random sample from the Euler multinomial distribution (with timestep dt) being stored in dn[0], …, dn[K-1]. In the foregoing, we’ve assumed that the quantities $N_t$ and $K$ are stored in the integer variables N and K, respectively, and that the double precision variable dt holds the timestep.

The prototype is:

void reulermultinom(int m, double size, const double *rate,
  double dt, double *trans);

Input:

m, a positive integer, is the number of potential transitions (“deaths”).
size, a positive integer, is the number of individuals at risk.
rate is a pointer to the vector of transition (“death”) rates.
dt, a positive real number, is the duration of the time interval.
trans is a pointer to the vector that will hold the random deviate.

Output:

On return, trans[0], …, trans[m-1] will be the numbers of individuals making each of the respective transitions.

See ?reulermultinom for more on the Euler-multinomial distributions.

NB: reulermultinom does not call GetRNGstate() or PutRNGstate() internally. This must be done by the calling program. But note that when reulermultinom is called inside a pomp rprocess, there is no need to call either GetRNGstate() or PutRNGstate(); this is handled by pomp.

Probability distribution of an Euler-multinomial random variable

If $\Delta{n} \sim \mathrm{Eulermultinom}\left(N_t,\mu,\Delta{t}\right)$ , then the probability it takes a specific value can be computed using the C function deulermultinom. Its prototype is:

double deulermultinom(int m, double size, const double *rate,
  double dt, double *trans, int give_log);

Input:

m, a positive integer, is the number of potential transitions (“deaths”).
size, a positive integer, is the number of individuals at risk.
rate is a pointer to the vector of transition (“death”) rates.
dt, a positive real number, is the duration of the time interval.
trans is a pointer to the vector containing the data, which are the numbers of individuals making the respective transitions.
give_log is an integer:
- give_log=0 requests that the probability be returned.
- give_log=1 requests that the log probability be returned.

Output:

The value returned is the probability or log probability (as requested).

Expectation of an Euler-multinomial random variable

If $\Delta{n} \sim \mathrm{Eulermultinom}\left(N_t,\mu,\Delta{t}\right)$ , then the expectation of its $k$ -th component is $\mathbb{E}\left[\Delta{n}_k\right]=p_k\,N_t,$ where $p_k$ is as defined above. The C function eeulermultinom computes this. Its prototype is:

void eeulermultinom(int m, double size, const double *rate,
  double dt, double *trans);

Input:

The parameters m, size, rate, and dt have the same meaning as above.

Output:

After a call to eeulermultinom, trans points to an array of doubles holding the expected values of the Euler-multinomial random variables.

Gamma white noise

double rgammawn(double sigma, double dt);

Corresponding to the R function rgammawn, this C function draws a single random increment of a gamma white-noise process. This will have expectation equal to dt and variance sigma^2*dt.

In particular, when dW = rgammawn(sigma,dt); is executed, mu*dW/dt is a candidate for a random rate process within an Euler-multinomial context, i.e., mu*dW will have expectation mu*dt and variance mu^2*sigma^2*dt.
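A gamma distribution with shape $a$ and scale $s$ has mean $a\,s$ and variance $a\,s^2$. Matching these to a mean of dt and a variance of sigma^2*dt gives $s$ = sigma^2 and $a$ = dt/sigma^2. The sketch below computes these two values. The helper gammawn_params is hypothetical, and whether pomp uses exactly this parameterization internally is an assumption.

```c
/* Shape and scale of a gamma distribution with mean dt and
   variance sigma^2 * dt (sigma > 0 assumed).
   gammawn_params is a hypothetical helper, not a pomp function. */
void gammawn_params(double sigma, double dt, double *shape, double *scale)
{
  *scale = sigma * sigma;          /* variance divided by mean */
  *shape = dt / (sigma * sigma);   /* mean divided by scale */
}
```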

Prototypes for basic model components

pomp provides a facility whereby model codes can be compiled into a dynamically linked library for use in pomp objects. Specifically, basic model components are coded as C functions with the following prototypes.

NB: These functions should not be used within C snippets!

Indices

Each of the following functions is supplied one or more of the stateindex, parindex, covindex, obsindex, vmatindex arguments. Each of these is an integer vector: the integers within are indices giving the positions of specific model variables, according to the user’s specification, the latter being given by means of the statenames, paramnames, covarnames, and obsnames arguments. See ?pomp for more explanation. Thus, for example, within the body of a function of prototype pomp_rinit (see below),

 x[stateindex[0]];
 x[stateindex[3]];
 p[parindex[2]];
 covars[covindex[1]];

refer to the first state variable, the fourth state variable, the third parameter, and the second covariate, respectively.

rinit

void pomp_rinit (double *x, const double *p, double t0,
  const int *stateindex, const int *parindex, const int *covindex,
  const double *covars);

Description:

p is a pointer to the parameter vector.
t0 is the zero time.
stateindex, parindex, covindex: see Indices, above.
covars is a pointer to a vector containing the (possibly interpolated) values of the covariates at the current time.
x is a vector that will, on return, contain a draw from the initial-state distribution.

NB: There is no need to call GetRNGstate() or PutRNGstate() in the body of the user-defined function. The RNG is initialized before any call to this function, and the RNG state is written afterward. Inclusion of these calls in the user-defined function may result in significant slowdown.

dinit

void pomp_dinit (double *loglik, const double *x, const double *p, double t0,
  const int *stateindex, const int *parindex, const int *covindex,
  const double *covars);

Description:

loglik is a pointer to the scalar that will, on return, contain the log probability density.
x is the state vector at time t0.
p is a pointer to the parameter vector.
t0 is the zero time.
stateindex, parindex, covindex: see Indices, above.
covars is a pointer to a vector containing the (possibly interpolated) values of the covariates at the current time.

rprocess

`step.fun` as used by `euler` and `onestep`

void pomp_onestep_sim (double *x, const double *p,
  const int *stateindex, const int *parindex, const int *covindex,
  const double *covars, double t, double dt);

Description:

p is the parameter vector.
stateindex, parindex, covindex: see Indices, above.
covars is the vector of covariates.
t is the time at the beginning of the step.
dt is the step size (duration of the interval).
x is the vector that will, on return, contain a draw from the state process at time t+dt.

`rate.fun` as used by `gillespie`

double pomp_ssa_rate_fn (int event, double t, const double *x, const double *p,
  const int *stateindex, const int *parindex, const int *covindex, 
  const double *covars);

Description:

event is an integer specifying the number of the reaction whose rate is desired (the first event is 1, not 0).
t is the current time.
x is the vector of state variables.
p is the vector of parameters.
stateindex, parindex, covindex: see Indices, above.
covars is the vector of (possibly interpolated) covariates at time t.

The function returns the rate of the requested reaction.
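As an illustration, a rate function for a simple SIR model might look like the sketch below. The function name sir_rate_fn, the model, and the assumed ordering of states (S, I, R) and parameters (beta, gamma, N) are hypothetical. Note the 1-based event numbering.

```c
/* Hypothetical SIR rate function with the pomp_ssa_rate_fn prototype.
   Assumed ordering: states (S, I, R); parameters (beta, gamma, N).
   Event 1 is infection; event 2 is recovery. */
double sir_rate_fn(int event, double t, const double *x, const double *p,
                   const int *stateindex, const int *parindex,
                   const int *covindex, const double *covars)
{
  double suscept = x[stateindex[0]];
  double infected = x[stateindex[1]];
  double beta = p[parindex[0]];
  double gam = p[parindex[1]];
  double N = p[parindex[2]];
  (void) t; (void) covindex; (void) covars;   /* unused in this sketch */
  switch (event) {
  case 1: return beta * suscept * infected / N;   /* infection */
  case 2: return gam * infected;                  /* recovery */
  default: return 0.0;                            /* unknown event */
  }
}
```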

dprocess

void pomp_dprocess (double *loglik,
  const double *x1, const double *x2, double t1, double t2, const double *p,
  const int *stateindex, const int *parindex, const int *covindex,
  const double *covars);

Description:

t1, t2 are the times at the beginning and end of the interval, respectively.
x1, x2 are the state vectors at time t1 and t2, respectively.
p is the parameter vector.
stateindex, parindex, covindex: see Indices, above.
covars is the vector of (possibly interpolated) covariates at time t1.
loglik is a pointer to the scalar that will, on return, contain the log probability density.

skeleton

void pomp_skeleton (double *f, const double *x, const double *p,
  const int *stateindex, const int *parindex, const int *covindex,
  const double *covars, double t);

Description:

t is the time.
x is the state vector at time t.
p is the parameter vector.
stateindex, parindex, covindex: see Indices, above.
covars is the vector of (possibly interpolated) covariates at time t.
f is a vector, of the same length as x, that will, on return, contain the value of the map or vectorfield.
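For example, the vectorfield of a deterministic SIR model could be coded as in the sketch below. The function name sir_skeleton and the assumed ordering of states (S, I, R) and parameters (beta, gamma, N) are hypothetical.

```c
/* Hypothetical SIR vectorfield with the pomp_skeleton prototype.
   Assumed ordering: states (S, I, R); parameters (beta, gamma, N). */
void sir_skeleton(double *f, const double *x, const double *p,
                  const int *stateindex, const int *parindex,
                  const int *covindex, const double *covars, double t)
{
  double suscept = x[stateindex[0]];
  double infected = x[stateindex[1]];
  double beta = p[parindex[0]];
  double gam = p[parindex[1]];
  double N = p[parindex[2]];
  double infection = beta * suscept * infected / N;   /* S -> I flow */
  double recovery = gam * infected;                   /* I -> R flow */
  f[stateindex[0]] = -infection;              /* dS/dt */
  f[stateindex[1]] = infection - recovery;    /* dI/dt */
  f[stateindex[2]] = recovery;                /* dR/dt */
  (void) covindex; (void) covars; (void) t;   /* unused in this sketch */
}
```

The derivative for each state is written to the same position in f that the state occupies in x.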

rmeasure

void pomp_rmeasure (double *y, const double *x, const double *p,
  const int *obsindex, const int *stateindex, const int *parindex,
  const int *covindex, const double *covars, double t);

Description:

t is the time.
x is the state vector at time t.
p is the parameter vector.
obsindex, stateindex, parindex, covindex: see Indices, above.
covars is the vector of (possibly interpolated) covariates at time t.
y is a vector that will, on return, contain the simulated observations.

dmeasure

void pomp_dmeasure (double *lik, const double *y, const double *x, const double *p,
  int give_log, const int *obsindex, const int *stateindex, const int *parindex,
  const int *covindex, const double *covars, double t);

Description:

y is the vector of observables at time t.
x is the state vector at time t.
p is the parameter vector.
give_log is an integer:
- give_log=1 if log probability is desired;
- give_log=0 if probability is desired.
obsindex, stateindex, parindex, covindex: see Indices, above.
covars is the vector of (possibly interpolated) covariates at time t.
lik is a pointer to a scalar that will, on return, contain the requested likelihood or log likelihood.

emeasure

void pomp_emeasure (double *e, const double *x, const double *p,
  const int *obsindex, const int *stateindex, const int *parindex, const int *covindex,
  const double *covars, double t);

Description:

t is the time.
x is the state vector at time t.
p is the parameter vector.
obsindex, stateindex, parindex, covindex: see Indices, above.
covars is the vector of (possibly interpolated) covariates at time t.
e is a vector, of length equal to the number of observables, that will, on return, contain the expected value of the observed variables.
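A minimal sketch of a function with this prototype, assuming a single observable whose conditional mean is rho*I, might read as follows. The function name pois_emeasure and the ordering of states (S, I, R) and parameters (rho) are hypothetical, as is the assumption that e is indexed through obsindex.

```c
/* Hypothetical measurement expectation with the pomp_emeasure prototype.
   Assumed ordering: one observable (cases); states (S, I, R);
   one parameter (rho). Assumes e is indexed through obsindex. */
void pois_emeasure(double *e, const double *x, const double *p,
                   const int *obsindex, const int *stateindex,
                   const int *parindex, const int *covindex,
                   const double *covars, double t)
{
  double rho = p[parindex[0]];
  double infected = x[stateindex[1]];
  e[obsindex[0]] = rho * infected;   /* E[cases | X] */
  (void) covindex; (void) covars; (void) t;   /* unused in this sketch */
}
```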

vmeasure

void pomp_vmeasure (double *v, const double *x, const double *p,
  const int *vmatindex, const int *stateindex, const int *parindex, const int *covindex,
  const double *covars, double t);

Description:

t is the time.
x is the state vector at time t.
p is the parameter vector.
stateindex, parindex, covindex, vmatindex: see Indices, above.
covars is the vector of (possibly interpolated) covariates at time t.
v points to a square matrix that will, on return, contain the covariance matrix of the observed variables. In particular, if $X$ is the latent state and the nobs observables are $Y_i$ , then v[vmatindex[i+nobs*j]] contains $\mathrm{Cov}[Y_i,Y_j\;\vert\;X]$ .
It is the user’s responsibility to ensure that the returned covariance matrix is symmetric: this is not checked.
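The indexing scheme v[vmatindex[i+nobs*j]] can be seen in the following sketch, which assumes two observables that are conditionally independent Poisson variables with means rho1*I and rho2*I, so that the covariance matrix is diagonal. The function name pois_vmeasure, the model, and the ordering of states (I) and parameters (rho1, rho2) are assumptions.

```c
/* Hypothetical measurement covariance with the pomp_vmeasure prototype.
   Assumed: two observables, conditionally independent Poisson with
   means rho1 * I and rho2 * I; one state (I); parameters (rho1, rho2). */
void pois_vmeasure(double *v, const double *x, const double *p,
                   const int *vmatindex, const int *stateindex,
                   const int *parindex, const int *covindex,
                   const double *covars, double t)
{
  const int nobs = 2;
  double infected = x[stateindex[0]];
  /* a Poisson variable has variance equal to its mean */
  double var[2] = { p[parindex[0]] * infected, p[parindex[1]] * infected };
  for (int i = 0; i < nobs; i++) {
    for (int j = 0; j < nobs; j++) {
      v[vmatindex[i + nobs * j]] = (i == j) ? var[i] : 0.0;
    }
  }
  (void) covindex; (void) covars; (void) t;   /* unused in this sketch */
}
```

Filling both off-diagonal entries explicitly keeps the returned matrix symmetric.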

rprior

void pomp_rprior (double *p, const int *parindex);

Description:

p is the parameter vector.
parindex: see Indices, above.

On return, p will contain a new random draw from the prior distribution.

NB: There is no need to call GetRNGstate() or PutRNGstate() in the body of the user-defined function. The RNG is initialized before any call to this function, and the RNG state is written afterward. Inclusion of these calls in the user-defined function may result in significant slowdown.

dprior

void pomp_dprior (double *lik, const double *p, int give_log, const int *parindex);

Description:

p is the parameter vector.
give_log is an integer:
- give_log=1 if log probability is desired;
- give_log=0 if probability is desired.
parindex: see Indices, above.
lik is a pointer to a scalar that will, on return, contain the requested probability density or log probability density.

partrans

void pomp_transform (double *pt, const double *p, const int *parindex);

Description:

p is the parameter vector.
parindex: see Indices, above.
pt is the vector wherein the results will be returned.
