Article, 2022

Flexible estimation of the state dwell-time distribution in hidden semi-Markov models

Computational Statistics & Data Analysis, ISSN 1872-7352, 0167-9473, Volume 172, Page 107479, 10.1016/j.csda.2022.107479

Contributors

Pohle, Jennifer (Corresponding author) [1] [2]; Adam, Timo [3]; Beumer, Larissa Teresa (ORCID: 0000-0002-5255-1889) [4]

Affiliations

  [1] Bielefeld University [NORA names: Germany; Europe, EU; OECD]
  [2] University of Potsdam [NORA names: Germany; Europe, EU; OECD]
  [3] University of St Andrews [NORA names: United Kingdom; Europe, Non-EU; OECD]
  [4] Aarhus University [NORA names: AU Aarhus University; University; Denmark; Europe, EU; Nordic; OECD]

Abstract

Hidden semi-Markov models generalise hidden Markov models by explicitly modelling the time spent in a given state, the so-called dwell time, using some distribution defined on the natural numbers. While the (shifted) Poisson and negative binomial distributions provide natural choices for such distributions, in practice, parametric distributions can lack the flexibility to adequately model the dwell times. To overcome this problem, a penalised maximum likelihood approach is proposed that allows for a flexible and data-driven estimation of the dwell-time distributions without the need to make any distributional assumption. This approach is suitable for direct modelling purposes or as an exploratory tool to investigate the latent state dynamics. The feasibility and potential of the suggested approach are illustrated in a simulation study and by modelling muskox movements in northeast Greenland using GPS tracking data. The proposed method is implemented in the R package PHSMM, which is available on CRAN.
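
To make the notion of a parametric dwell-time distribution concrete, the following Python sketch evaluates a shifted Poisson probability mass function on the natural numbers and uses it to simulate the hidden state sequence of a two-state hidden semi-Markov model. It is an illustration only and does not reproduce the PHSMM package or the penalised estimation approach; the function names, the rate values, and the restriction to two states are assumptions made for this example.

```python
import math
import random


def shifted_poisson_pmf(d, lam):
    """P(D = d) for D = 1 + Poisson(lam), so the support is 1, 2, 3, ..."""
    if d < 1:
        return 0.0
    k = d - 1
    # Work on the log scale so that large k does not overflow.
    return math.exp(-lam + k * math.log(lam) - math.lgamma(k + 1))


def sample_dwell_time(lam, rng, max_d=10_000):
    """Draw one dwell time by inverting the shifted Poisson CDF.

    max_d is a hypothetical safety cap against floating-point round-off.
    """
    u = rng.random()
    d = 1
    cdf = shifted_poisson_pmf(1, lam)
    while u > cdf and d < max_d:
        d += 1
        cdf += shifted_poisson_pmf(d, lam)
    return d


def simulate_states(n, lams, rng):
    """Simulate n time steps of a two-state semi-Markov state sequence.

    lams holds one (assumed) Poisson rate per state. With two states and
    no self-transitions, the process switches state after every dwell.
    """
    states = []
    state = rng.randrange(2)
    while len(states) < n:
        d = sample_dwell_time(lams[state], rng)
        states.extend([state] * d)
        state = 1 - state
    return states[:n]


if __name__ == "__main__":
    rng = random.Random(1)
    print(simulate_states(30, (2.0, 6.0), rng))
```

In this sketch every dwell time is governed by a single rate parameter per state, which fixes the shape of the distribution; this is the kind of rigidity that a data-driven estimate of the dwell-time probabilities is meant to relax.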

Keywords

CRAN, GPS, GPS tracking data, Greenland, Markov, Markov model, Northeast Greenland, Poisson, R package, approach, assumptions, binomial distribution, choice, data, data-driven estimation, distribution, distributional assumptions, dwell time, dwell-time distributions, dwellings, dynamics, estimation, exploratory tool, feasibility, flexibility, flexible estimation, hidden semi-Markov model, likelihood approach, maximum likelihood approach, method, model, modeling purposes, movement, natural choice, natural numbers, negative binomial distribution, number, parametric distribution, penalised maximum likelihood approach, potential, practice, problem, purposes, semi-Markov model, simulation, simulation study, state, state dynamics, study, time, tools, tracking data

Data Provider: Digital Science