Skip to content

Nelson-Aalen Cumulative Hazard Estimator

Survival Analysis

Estimates the cumulative hazard function from time-to-event data, providing an alternative perspective to the Kaplan-Meier survival curve by focusing on the accumulated risk over time.

When to Use

Use this estimator when you want to understand how risk accumulates over time rather than how survival probability decreases. It is also used as a foundation for smoothed hazard rate estimation and as an input to other survival models.

Assumptions

  • Censoring is non-informative.
  • Events are independent across subjects.

Required Inputs

InputTypeNotes
Time to EventNumericSurvival times
Event IndicatorBinary (0/1)1 = event, 0 = censored

Output Metrics

MetricWhat it means
TimeEach event time point.
Cumulative HazardEstimated cumulative hazard H(t) at each event time. H(t) = -log(S(t)) relates to the Kaplan-Meier survival estimate.
95% CI LowerLower confidence limit for the cumulative hazard.
95% CI UpperUpper confidence limit for the cumulative hazard.
Fixed Time EstimatesCumulative hazard values at specific user-defined time points.
Smoothed HazardKernel-smoothed instantaneous hazard rate estimate over time.

Interpretation

  • The cumulative hazard H(t) represents the total accumulated risk up to time t. It starts at 0 and increases monotonically.
  • A steep segment in the cumulative hazard curve indicates a period of high event rate. A flat segment indicates a period of low risk.
  • The Nelson-Aalen estimator and the Kaplan-Meier estimator are closely related: S(t) is approximately exp(-H(t)). For small hazards, they give nearly identical results.
  • The smoothed hazard estimate reveals how the instantaneous risk changes over time, which the cumulative hazard and survival curves obscure.

Common Pitfalls

  • The cumulative hazard can exceed 1.0 (unlike survival probability), which sometimes confuses researchers unfamiliar with hazard functions.
  • Smoothed hazard estimates depend on the choice of bandwidth. Too narrow a bandwidth produces noisy estimates; too wide smooths over real features.
  • With few events, the cumulative hazard estimate has wide confidence intervals, especially at later time points when few subjects remain at risk.

How It Works

  1. At each event time, estimate the instantaneous hazard as the number of events divided by the number at risk: d_i / n_i.
  2. Cumulate these hazard increments: H(t) = sum of d_i / n_i for all event times up to t.
  3. Compute variance using Greenwood-type formulas and construct confidence intervals.
  4. Optionally smooth the hazard increments using a kernel smoother to estimate the instantaneous hazard rate function.

Citations

References

  • Nelson, W. (1972). Theory and applications of hazard plotting for censored failure data. Technometrics, 14(4), 945-966.
  • Aalen, O. O. (1978). Nonparametric inference for a family of counting processes. Annals of Statistics, 6(4), 701-726.