Revamp survival analysis interface #842

aGuyLearning · 2024-12-20T10:35:53Z

Fixes #840

This pull request includes refactoring and enhancements to the survival analysis tools, particularly focusing on the Cox Proportional Hazards (CoxPH), Weibull Accelerated Failure Time (Weibull AFT), and Log-Logistic Accelerated Failure Time (Log-Logistic AFT) models. The changes improve the flexibility and usability of these models by adding new parameters and restructuring the code.

Refactoring and Enhancements:

Code Refactoring:
- Split the _regression_model function into two separate functions: _regression_model_data_frame_preparation and _regression_model_populate_adata.
Cox Proportional Hazards Model:
- Added new parameters to the cox_ph function to provide more control over the model fitting process, such as inplace, key_added_prefix, alpha, label, baseline_estimation_method, penalizer, l1_ratio, strata, n_baseline_knots, knots, breakpoints, weights_col, cluster_col, robust, formula, batch_mode, show_progress, initial_point, and fit_options.
Weibull Accelerated Failure Time Model:
- Enhanced the weibull_aft function with additional parameters similar to those added to the cox_ph function.
Log-Logistic Accelerated Failure Time Model:
- Updated the log_logistic_aft function to include new parameters.
Testing Adjustments:
- Modified the _sa_func_test method in tests/tools/test_sa.py to accommodate the updated function signatures, ensuring that tests pass with the new parameters.

aGuyLearning · 2024-12-20T10:38:45Z

This is not a finished PR, but a starting point for a discussion. The Weibull_aft and log_logistic_aft model summaries include multi index rows. The question is, if we can reduce them and how that works with the limitations of the adata.var.

We were hoping to include all the summary data into the adata, so as to have a similar style as scanpy.

aGuyLearning · 2025-01-08T10:44:14Z

Add model result dataframe to .uns
- Add key name of .uns to be added to the function arguments ( default: function name )
- PLOT: Search for function name on default
- PLOT: Take information from .uns object

Do this for:

cox_ph
weibull_aft
log_logistic_aft

eroell · 2025-01-08T15:12:49Z

Add model result dataframe to .uns

just a quick write-down of our offline discussion:

indeed, I think this is the best way to go from here if we do want to offer nice plots as request in the linked issue.

It is required that we store the survival analysis results in the adata object to produce corresponding plots with the functional API like ep.pl.<fancy_plot>(adata, ...).

However it can't be stored in adata.var, as in addition to the covariates, the models typically include an intercept term, making the model fit results to be of length len(adata.var.columns) + 1.

storing the summary results of the fitter as a very basic DataFrame in adata.uns is the most straightforward way in this case imo

tests/tools/test_sa.py

ehrapy/tools/_sa.py

…nivariates are updated )

eroell · 2025-01-08T22:07:04Z

Allows to pass keywords as requested in #744. Does not finish off this issue though, as ideally also show in a notebook tutorial later what effect regularization has.

eroell · 2025-01-08T22:20:08Z

From my side good. @Zethson, request your review here as this a quite specific choice we're making here.
See comment my comment a quick write-down of our offline discussion above about the rationale behind this choice.
If you have objections, we're interested to hear them

Zethson

Great! Just minor points.

ehrapy/tools/_sa.py

tests/tools/test_sa.py

Co-authored-by: Lukas Heumos <[email protected]>

…tions for customizable storage in AnnData object

…ata to assertion method

aGuyLearning added 5 commits December 18, 2024 14:08

cox_ph add all arguments

b1d36b8

updated test to use keywords

35dbacf

weibull_aft arguments update

22d190a

log_logistic update

742d38c

updated log logistic example

02e343d

aGuyLearning linked an issue Dec 20, 2024 that may be closed by this pull request

Update survival analysis models #840

Closed

24 tasks

aGuyLearning requested a review from eroell December 20, 2024 10:36

Merge branch 'main' into enhancement/issue-840

6038c7a

aGuyLearning marked this pull request as draft January 7, 2025 17:05

Zethson changed the title ~~Enhancement/issue 840~~ Revamp survival analysis interface Jan 8, 2025

aGuyLearning added 2 commits January 8, 2025 13:56

store summary df in adata.uns

8e1baa5

Merge branch 'main' into enhancement/issue-840

119947b

aGuyLearning marked this pull request as ready for review January 8, 2025 13:30

try moving np

eb9daba

eroell requested changes Jan 8, 2025

View reviewed changes

aGuyLearning and others added 5 commits January 8, 2025 17:48

omit inplace keyword

e340a28

added explanation, as to where the results are stored

c6a81df

corrected spelling

38f4efb

updated tests to check for .uns ( should be removed later, when the u…

501b864

…nivariates are updated )

fix argument order, doc fixes

eb0b404

slightly simpler wording

cffed4d

eroell self-requested a review January 8, 2025 22:17

eroell approved these changes Jan 8, 2025

View reviewed changes

eroell requested a review from Zethson January 8, 2025 22:17

Zethson approved these changes Jan 9, 2025

View reviewed changes

aGuyLearning and others added 12 commits January 9, 2025 15:12

fiexed spelling

3b21988

Co-authored-by: Lukas Heumos <[email protected]>

Update ehrapy/tools/_sa.py

58ce157

Co-authored-by: Lukas Heumos <[email protected]>

Update ehrapy/tools/_sa.py

ee97f31

Co-authored-by: Lukas Heumos <[email protected]>

Update ehrapy/tools/_sa.py

6dc7831

Co-authored-by: Lukas Heumos <[email protected]>

Update ehrapy/tools/_sa.py

540b79f

Co-authored-by: Lukas Heumos <[email protected]>

Update ehrapy/tools/_sa.py

09484d9

Co-authored-by: Lukas Heumos <[email protected]>

Update ehrapy/tools/_sa.py

96db288

Co-authored-by: Lukas Heumos <[email protected]>

Update ehrapy/tools/_sa.py

568b84b

Co-authored-by: Lukas Heumos <[email protected]>

renamed function to be clearer

cf00a3f

Add uns_key parameter to Kaplan-Meier, Nelson-Aalen, and Weibull func…

b574978

…tions for customizable storage in AnnData object

Update test assertions in TestSA for event_table handling and pass ad…

ace1baf

…ata to assertion method

uns to in doc

3de5678

eroell merged commit b2b8e40 into main Jan 10, 2025
11 checks passed

eroell deleted the enhancement/issue-840 branch January 10, 2025 16:18

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revamp survival analysis interface #842

Revamp survival analysis interface #842

aGuyLearning commented Dec 20, 2024 •

edited by eroell

Loading

aGuyLearning commented Dec 20, 2024

aGuyLearning commented Jan 8, 2025 •

edited

Loading

eroell commented Jan 8, 2025 •

edited

Loading

eroell commented Jan 8, 2025 •

edited

Loading

eroell commented Jan 8, 2025 •

edited

Loading

Zethson left a comment

Revamp survival analysis interface #842

Revamp survival analysis interface #842

Conversation

aGuyLearning commented Dec 20, 2024 • edited by eroell Loading

Refactoring and Enhancements:

aGuyLearning commented Dec 20, 2024

aGuyLearning commented Jan 8, 2025 • edited Loading

eroell commented Jan 8, 2025 • edited Loading

eroell commented Jan 8, 2025 • edited Loading

eroell commented Jan 8, 2025 • edited Loading

Zethson left a comment

Choose a reason for hiding this comment

aGuyLearning commented Dec 20, 2024 •

edited by eroell

Loading

aGuyLearning commented Jan 8, 2025 •

edited

Loading

eroell commented Jan 8, 2025 •

edited

Loading

eroell commented Jan 8, 2025 •

edited

Loading

eroell commented Jan 8, 2025 •

edited

Loading