Add tutorial on how to estimate a Cox Proportional Hazards Model in glum #876
Conversation
"\n", | ||
"## 1. Equivalence Between the Cox Likelihood and a Profile Poisson Likelihood<a class=\"anchor\"></a>\n", | ||
"\n", | ||
"In the Cox model, the rate of event occurrence, $\\lambda(t,x_i)$, factorizes nicely into a linear predictor $\\eta_i=\\sum_k \\beta_k x_{ik}$ that depends on individual $i$'s characteristics but not on time $t$ and a baseline hazard $\\lambda_0$ that depends only on time, $\\lambda(t,x_i)=\\lambda_0(t)\\exp(\\eta_i)$ (the proportional hazards assumption). The partial log-likelihood of $\\eta_i$ is\n", |
"In the Cox model, the rate of event occurrence, $\\lambda(t,x_i)$, factorizes nicely into a linear predictor $\\eta_i=\\sum_k \\beta_k x_{ik}$ that depends on individual $i$'s characteristics but not on time $t$ and a baseline hazard $\\lambda_0$ that depends only on time, $\\lambda(t,x_i)=\\lambda_0(t)\\exp(\\eta_i)$ (the proportional hazards assumption). The partial log-likelihood of $\\eta_i$ is\n", | |
"In the Cox model, the rate of event occurrence, $\\lambda(t,x_i)$, factorizes nicely into a linear predictor $\\eta_i=\\sum_k \\beta_k x_{ik}$ that depends on individual $i$'s characteristics but not on time $t$, and a baseline hazard $\\lambda_0$ that depends only on time: $\\lambda(t,x_i)=\\lambda_0(t)\\exp(\\eta_i)$. This is known as the proportional hazards assumption). The partial log-likelihood of $\\eta_i$ is\n", |
"\n", | ||
"## 2. Estimating a Cox Model in Glum<a class=\"anchor\"></a>\n", | ||
"\n", | ||
"We now show that a Poisson approach in `glum` yields the same parameter estimates as a Cox model. For the latter, we use the [lifelines](https://github.com/CamDavidsonPilon/lifelines) library. We also take the dataset from lifelines, which is from an RCT on recidivism for 432 convicts released from Maryland state prisons with first arrest after release as event. We first load imports and the dataset. The dataset has one row per convict, with two outcome columns, the `week` until which the observation lasts and `arrest`, which indicates whether an arrest event happened or not (censoring)." |
"We now show that a Poisson approach in `glum` yields the same parameter estimates as a Cox model. For the latter, we use the [lifelines](https://github.com/CamDavidsonPilon/lifelines) library. We also take the dataset from lifelines, which is from an RCT on recidivism for 432 convicts released from Maryland state prisons with first arrest after release as event. We first load imports and the dataset. The dataset has one row per convict, with two outcome columns, the `week` until which the observation lasts and `arrest`, which indicates whether an arrest event happened or not (censoring)." | |
"We now show that a Poisson approach in `glum` yields the same parameter estimates as a Cox model. For the latter, we use the [lifelines](https://github.com/CamDavidsonPilon/lifelines) library. We also take the dataset from lifelines, which is from an RCT on recidivism for 432 convicts released from Maryland state prisons with first arrest after release as event. We first load imports and the dataset. The dataset has one row per convict, with two outcome columns: the `week` until which the observation lasts and `arrest`, which indicates whether an arrest event happened or not (censoring)." |
"cell_type": "markdown", | ||
"metadata": {}, | ||
"source": [ | ||
"One might, therefore, wonder if the Poisson approach is competitive in terms of estimation speed. For the dataset here, the Poisson approach, including the data transformation by `survival_split`, turns out to be faster than the Cox model. This speedup is likely aided by tabmat's optimizations for the high-dimensional `week` categorical." |
"One might, therefore, wonder if the Poisson approach is competitive in terms of estimation speed. For the dataset here, the Poisson approach, including the data transformation by `survival_split`, turns out to be faster than the Cox model. This speedup is likely aided by tabmat's optimizations for the high-dimensional `week` categorical." | |
"One might, therefore, wonder if the Poisson approach is competitive in terms of estimation speed. For the dataset here, the Poisson approach, including the data transformation by `survival_split`, turns out to be faster than the Cox model. This speedup is aided by tabmat's optimizations for the high-dimensional `week` categorical." |
I'm confident this is the case :)
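For what it's worth, a rough way to check the claim, reusing `df`, `stack_data`, and the imports from the sketch above; the actual numbers will vary by machine and library versions:

```python
# Rough timing comparison; reuses the objects from the earlier sketch.
import time

t0 = time.perf_counter()
CoxPHFitter().fit(df, duration_col="week", event_col="arrest")
cox_s = time.perf_counter() - t0

t0 = time.perf_counter()
stacked = stack_data(df)  # the tutorial's `survival_split` plays this role
X = stacked.drop(columns=["week", "arrest", "y"]).astype({"interval": "category"})
GeneralizedLinearRegressor(
    family="poisson", alpha=0, fit_intercept=False
).fit(X, stacked["y"])
poisson_s = time.perf_counter() - t0

print(f"Cox: {cox_s:.3f}s  Poisson incl. split: {poisson_s:.3f}s")
```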
"source": [ | ||
"## Footnotes\n", | ||
"<span id=\"fn1\"> 1</span>:\n", | ||
"The Cox model assumes that at most one individual has an event at any time (\"no ties\").\n", |
Maybe mention here that there are various ways to deal with this? E.g., exact (and expensive) calculations, the Efron approximation (reasonably accurate), and the Breslow approximation (fastest); see the sketch below. Also, in the case of many ties, an inherently discrete survival model might be the best option.
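To make that concrete, a small sketch using statsmodels' `PHReg`, which exposes both approximations via its `ties` argument (lifelines' `CoxPHFitter`, by contrast, implements Efron):

```python
# Tie-handling choices for the Cox model in statsmodels' PHReg,
# which supports both the Breslow and the Efron approximations.
import statsmodels.api as sm
from lifelines.datasets import load_rossi

df = load_rossi()
X = df.drop(columns=["week", "arrest"])

for ties in ("breslow", "efron"):
    res = sm.PHReg(df["week"], X, status=df["arrest"], ties=ties).fit()
    print(ties, res.params)
```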
"\n", | ||
"[2] Whitehead, J., 1980. Fitting Cox’s regression model to survival data using GLIM. _Journal of the Royal Statistical Society Series C: Applied Statistics_, 29(3), pp.268-275.\n", | ||
"\n", | ||
"[3] Zhong, C. and Tibshirani, R., 2019. Survival analysis as a classification problem. arXiv preprint arXiv:1909.11171." |
They do use the same dataset-transformation idea (which they call stacking), but they rely on a binomial model instead of a Poisson one and argue that it is a good enough approximation. So it is relevant, but I would not say it is the same thing.
Thank you Matthias, I like this tutorial a lot!
Maybe we could also mention that the model we estimate is conceptually a piecewise constant hazard model, but because we have a different constant for each event time, it becomes essentially non-parametric. As a corollary, anyone who wants to reduce the number of parameters can merge some categories and estimate fewer "constants"; see the sketch after this comment.
And: Add link to Matt Mills' tutorial on fitting penalized splines in glum.
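A hypothetical illustration of the "fewer constants" idea, reusing `stacked` and `GeneralizedLinearRegressor` from the earlier sketch:

```python
# Piecewise-constant hazard with fewer parameters: merge the per-event-time
# constants into 4-week bins. The rows themselves are unchanged, so no
# exposure offset is needed; only the hazard constants are shared per bin.
import pandas as pd

X_coarse = stacked.drop(columns=["week", "arrest", "y", "interval"])
X_coarse["interval4"] = pd.cut(stacked["interval"], bins=range(0, 53, 4))

GeneralizedLinearRegressor(
    family="poisson", alpha=0, fit_intercept=False
).fit(X_coarse, stacked["y"])
```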
Checklist
- CHANGELOG.rst entry