Exposure

The Risk component (and its parent class CausalFactor) is the entry point for modeling risk exposure. It has two responsibilities: assigning each simulant a stable propensity and registering an exposure pipeline that converts that propensity into an exposure value using the configured distribution.

Propensity

Every simulant carries a separate propensity for each risk factor in the model. When new simulants are created, each CausalFactor instance draws a uniform random value in [0, 1] for every simulant and stores it in a {risk_name}.propensity column on the state table. For example, a model with two risk factors risk_factor.high_sbp and risk_factor.smoking would add two columns — high_sbp.propensity and smoking.propensity — each containing an independent draw per simulant.

Each propensity value is fixed for the life of the simulant and acts as that simulant’s position in the cumulative distribution of the corresponding risk factor. Because each risk’s propensities are drawn from a dedicated randomness stream (named initial_{risk_name}_propensity), they are reproducible across simulation branches that share a random seed and are statistically independent across risks.

The distribution component (CausalFactorDistribution) reads the propensity column from the state table and uses it as the quantile input to the distribution’s percent-point function. How that conversion works depends on the distribution type:

Continuous (normal, lognormal, ensemble) — given a propensity q, the PPF returns the exposure value x such that P(X ≤ x) = q.
Polytomous — the propensity is compared against the cumulative sum of category probabilities to assign each simulant to one of N categories.
Dichotomous — simulants with a propensity below the exposure probability are assigned to the “exposed” category; the rest are “unexposed”.

This mechanism ensures that the simulated exposure distribution matches the empirical distribution sourced from artifact data, while also allowing pipeline modifiers to shift the distribution over time without needing to re-draw random numbers. Because propensities are per-simulant and per-risk, a simulant can sit at a high quantile for one risk factor while sitting at a low quantile for another — the two are independent unless an external component (such as a correlated-propensity model) explicitly introduces dependence.

Exposure Pipeline

During setup, the component registers an {risk_name}.exposure attribute pipeline. The pipeline’s source is the distribution’s PPF, which itself is registered as a separate {risk_name}.exposure_distribution.ppf pipeline. This two-layer design allows other components to modify the distribution parameters (e.g., shifting the mean) independently of the final exposure value.

Configuration

Risk exposure data is loaded from the simulation artifact by default, but can be overridden with a scalar value or a covariate name in the configuration. The distribution type is similarly configurable. See the Risk class documentation for the full set of configuration keys and YAML examples.

Rebinning and Category Thresholds

Two optional configuration blocks allow transforming exposure representations:

Rebinning (rebinned_exposed) — collapses a polytomous risk into a dichotomous one by merging specified categories into a single “exposed” group. All remaining categories become “unexposed”. This is useful when a risk has many GBD categories but the model only needs an exposed/unexposed distinction.
Category thresholds (category_thresholds) — bins a continuous distribution into ordered categories defined by the given thresholds. This is designed for alternative risk factors that need categorical representations. The two options are mutually exclusive.

Exposure

Propensity

Exposure Pipeline

Configuration

Rebinning and Category Thresholds

See Also