PH distributions

All PH distributions in this package subtype AbstractPHDist <: ContinuousUnivariateDistribution, so the full Distributions.jl interface works on every subtype. Specialized subtypes use closed-form implementations where available; the general PHDist falls back to matrix-exponential formulas.

using PhaseTypeDistributions, Distributions, Random, Statistics

Constructors

General `PHDist(α, T)`

ph = PHDist([0.6, 0.4], [-3.0 1.0; 0.0 -2.0])

PHDist(α=[0.6, 0.4], T=[-3.0 1.0; 0.0 -2.0])

`HyperExponentialDist` — mixture of exponentials

A diagonal sub-generator: SCV is always ≥ 1.

he  = HyperExponentialDist([0.4, 0.6], [2.0, 5.0])
he2 = HyperExponentialDist(2.0, 3.0)        # by mean and SCV
mean(he2), scv(he2)

(2.0, 3.0)

`HypoExponentialDist` — convolution of exponentials

A bidiagonal sub-generator with α = [1, 0, …, 0]: SCV is always ≤ 1. Repeated rates fall back to the matrix-exponential form (partial fractions diverge for equal rates).

ho  = HypoExponentialDist([3.0, 5.0])
ho2 = HypoExponentialDist(2.0, 0.5)         # by mean and SCV
mean(ho2), scv(ho2)

(2.0, 0.5)

`ErlangPHDist` — `k` equal phases

er = ErlangPHDist(3, 2.0)
mean(er), var(er)

(1.5, 0.75)

`CoxianDist` — sequential phases with exit probabilities

cox = CoxianDist([3.0, 4.0, 5.0], [0.2, 0.3])
mean(cox), var(cox)

(0.6453333333333333, 0.21456711111111104)

From Distributions.jl

ph_exp = PHDist(Exponential(2.0))           # 1-phase
ph_erl = PHDist(Erlang(3, 0.5))             # 3-phase
mean(ph_exp), mean(ph_erl)

(2.0, 1.5)

Standard Distributions.jl interface

Every PH type supports the standard density / cdf / sampling API:

pdf(er, 1.0), logpdf(er, 1.0), cdf(er, 1.0), ccdf(er, 1.0)

(0.5413411329464507, -0.6137056388801095, 0.3233235838169365, 0.6766764161830635)

ccdf is the natively-computed quantity for PH distributions (α' exp(Tx) 𝟙) — cdf is derived from it — so tail precision is preserved into the deep tail:

ccdf(er, 50.0)        # ≪ 1, but still finite

1.8976107553682285e-40

Support is [0, ∞):

minimum(er), maximum(er), insupport(er, 1.0), insupport(er, -1.0)

(0.0, Inf, true, false)

Moments and shape

mean(he), var(he), std(he), scv(he)

(0.32, 0.1456, 0.38157568056677826, 1.421875)

skewness(he), kurtosis(he)      # excess kurtosis

(2.8125136580841357, 12.240550658133078)

kth_moment(he, 3), mgf(he, 0.5)

(0.3288, 1.2)

mgf extends Distributions.mgf. It is defined for t < min(-diag(T)); the linear solve diverges otherwise.

Quantile / median

quantile(er, 0.95), median(er)

(3.1478968109358902, 1.337030156861374)

quantile for the general AbstractPHDist uses bisection; specialized subtypes that are exactly Distributions.Erlang etc. could use closed forms but currently also bisect via the fallback.

Sampling

rng = Random.MersenneTwister(42)
rand(rng, er)

0.5840034701546643

xs = rand(rng, er, 1000);
length(xs), extrema(xs), Statistics.mean(xs)

(1000, (0.08957195116427663, 7.008321490323527), 1.5269136616230647)

Internally, sampling simulates the underlying CTMC until absorption. Each specialized subtype overrides rand with its natural recipe — e.g. HyperExponentialDist picks a component and draws one exponential; ErlangPHDist sums k independent exponentials.

Accessors — the underlying `(α, T)` representation

Every subtype exposes its parameters in the canonical PH form:

initial_prob(cox), subgenerator(cox), exit_rates(cox), nphases(cox)

([1.0, 0.0, 0.0], [-3.0 2.4000000000000004 0.0; 0.0 -4.0 2.8; 0.0 0.0 -5.0], [0.6000000000000001, 1.2, 5.0], 3)

params(cox)                     # natural parameters: (rates, exit_probs)
params(he)                      # (probs, rates)
params(er)                      # (shape, rate)
params(ph)                      # (α, T)

([0.6, 0.4], [-3.0 1.0; 0.0 -2.0])

Conversion to the general form

Any subtype converts to the general (α, T) form via PHDist(d):

ph_from_he = PHDist(he)
ph_from_er = PHDist(er)
nphases(ph_from_he), nphases(ph_from_er)

(2, 3)

PHDist(d::PHDist) is a no-op.

Comparison helpers for non-identifiable distributions

Two different (α, T) representations can describe the same distribution. The package provides comparison routines that work across representations:

moments_isapprox(he, ph_from_he)        # by moments
distribution_isapprox(he, ph_from_he)   # by CDF on an adaptive grid
moment_vector(he, 4)                    # [E[X], E[X²], E[X³], E[X⁴]]

4-element Vector{Float64}:
 0.32
 0.248
 0.3288
 0.62304

moments_isapprox matches a fixed number of moments (necessary but not sufficient); distribution_isapprox evaluates the CDF on a grid spanning out to several standard deviations and so is a stronger check.

Reference — types

PhaseTypeDistributions.AbstractPHDist — Type

AbstractPHDist <: ContinuousUnivariateDistribution

Supertype for every phase-type distribution in this package. Subtypes must implement initial_prob and subgenerator; generic implementations of the Distributions.jl interface (pdf, cdf, ccdf, mean, var, rand, …) are provided by fallback and can be overridden by specialized subtypes for efficiency.

HyperExponentialDist — mixture of exponentials

HypoExponentialDist — convolution of exponentials

ErlangPHDist — k equal phases

CoxianDist — sequential phases with exit probabilities

`HyperExponentialDist` — mixture of exponentials

`HypoExponentialDist` — convolution of exponentials

`ErlangPHDist` — `k` equal phases

`CoxianDist` — sequential phases with exit probabilities