Deterministic latent process models and their role in benchmarking POMP models
Produced in R version 4.3.2.
We’re focusing on biological dynamic systems, which are typically far from perfectly predictable.
Deterministic latent process models are perfectly predictable, given the parameter values and initial conditions. One might therefore expect that these deterministic process models are problematic as statistical descriptions of unpredictable systems.
Exception: an imperfectly measured chaotic system is both deterministic and unpredictable. However, there has been little success using this class of models for fitting biological systems.
Exception: for short time series, the role of dynamic stochasticity may be negligible compared to measurement error, parameter uncertainty and uncertainty about initial values.
Deterministic latent process models are useful as a benchmark when fitting POMP models.
Recall that it is statistically meaningful to compare likelihoods between entirely different classes of models for the same data, for example by AIC.
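As a reminder of how such a comparison works in practice, here is a minimal sketch of an AIC calculation, using AIC = -2 × (maximized log-likelihood) + 2 × (number of estimated parameters). The model names, log-likelihood values and parameter counts are hypothetical, invented only to show the arithmetic. The comparison is only meaningful when every log-likelihood is computed for the same data on the same scale.

```python
def aic(loglik, n_params):
    """Akaike information criterion: AIC = -2 * loglik + 2 * n_params."""
    return -2.0 * loglik + 2.0 * n_params


# Hypothetical maximized log-likelihoods, invented for illustration only.
candidates = {
    "iid benchmark": {"loglik": -120.0, "n_params": 2},
    "mechanistic model": {"loglik": -112.5, "n_params": 5},
}

# Lower AIC is preferred.
aic_table = {name: aic(m["loglik"], m["n_params"]) for name, m in candidates.items()}
best = min(aic_table, key=aic_table.get)

for name, value in aic_table.items():
    print(f"{name}: AIC = {value:.1f}")
print("preferred by AIC:", best)
```

With these made-up numbers, the larger model gains 7.5 log units for 3 extra parameters, so AIC prefers it. If the sophisticated model could not beat the benchmark, that would be a signal to investigate.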
A benchmark is a simple model that is fitted with the goal of providing a sanity check on a more sophisticated model. We believe that the complex model should fit the data better, but we want to check!
Some simple benchmarks can be constructed with relatively little effort:
Independent, identically distributed observations.
ARMA models are appropriate for time series which appear stationary, perhaps after differencing.
Regression models, including nonlinear regression models such as ordinary differential equation models for compartment models.
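The simplest of these benchmarks can be computed in a few lines. The sketch below assumes an independent, identically distributed Normal model, which is our choice for illustration; for count data, a negative binomial or a log-scale model may be more suitable. The data vector `y` is a toy example, not real data. At the maximum likelihood estimates, the Normal log-likelihood reduces to -(n/2)(log(2π σ̂²) + 1).

```python
import math


def iid_normal_loglik(y):
    """Maximized log-likelihood of an iid Normal(mu, sigma^2) model for data y."""
    n = len(y)
    mu_hat = sum(y) / n
    # The maximum likelihood estimate of the variance uses divisor n, not n - 1.
    sigma2_hat = sum((v - mu_hat) ** 2 for v in y) / n
    return -0.5 * n * (math.log(2.0 * math.pi * sigma2_hat) + 1.0)


y = [2.1, 1.9, 2.4, 2.0, 1.6]  # toy numbers, for illustration only
benchmark_loglik = iid_normal_loglik(y)
print("iid Normal benchmark log-likelihood:", benchmark_loglik)
```

This benchmark has two estimated parameters, so any mechanistic model with more parameters should be expected to improve noticeably on this log-likelihood.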
Suppose that the latent process is deterministic. In our POMP notation, this lets us write the latent process as $X_n = x_n(\theta)$, so that the latent process is a known and non-random function of $\theta$ for each $n$. What is the likelihood?
Since the probability of the observation, $Y_n$, depends only on $X_n$ and $\theta$, and since, in particular, $Y_m$ and $Y_n$ are independent given $X_m$ and $X_n$, we have $L(\theta) = \prod_n f_{Y_n|X_n}\big(y^*_n; x_n(\theta), \theta\big)$ or $\ell(\theta) = \log L(\theta) = \sum_n \log f_{Y_n|X_n}\big(y^*_n; x_n(\theta), \theta\big)$. The following diagram illustrates this.
In this diagram, $\hat y_n$ refers to the model prediction, $\hat y_n = E[Y_n \mid X_n = x_n(\theta)]$, and $y^*_n$ is data.
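To make the log-likelihood sum concrete, here is a minimal sketch under assumptions that are ours and not part of the notes. The deterministic latent process is exponential growth, the solution of dx/dt = r x, and the measurement model is Normal with mean equal to the latent state and standard deviation sigma. Under that measurement model, the prediction E[Y_n | X_n = x_n(θ)] is simply x_n(θ). The function names, parameter names and data are hypothetical.

```python
import math


def skeleton(theta, times):
    """Deterministic latent path x_n(theta) for exponential growth dx/dt = r * x."""
    return [theta["x0"] * math.exp(theta["r"] * t) for t in times]


def normal_logpdf(y, mean, sd):
    """Log density of Normal(mean, sd^2) evaluated at y."""
    return -0.5 * math.log(2.0 * math.pi * sd * sd) - (y - mean) ** 2 / (2.0 * sd * sd)


def loglik(theta, times, data):
    """Sum over n of the log measurement density at y*_n, given x_n(theta)."""
    xs = skeleton(theta, times)
    return sum(normal_logpdf(y, x, theta["sigma"]) for y, x in zip(data, xs))


times = [0.0, 1.0, 2.0, 3.0]
data = [1.1, 1.5, 2.4, 3.2]  # toy observations, for illustration only
theta = {"x0": 1.0, "r": 0.4, "sigma": 0.2}
print("log-likelihood at theta:", loglik(theta, times, data))
```

Because the latent path is a deterministic function of the parameters, evaluating the likelihood needs no Monte Carlo: one pass over the data suffices. Maximizing `loglik` over `theta` with any numerical optimizer then gives a fitted deterministic benchmark.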
Licensed under the Creative Commons Attribution-NonCommercial license. Please share and remix noncommercially, mentioning its origin.