New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Custom dataloader registry support #2932

Open

ori-kron-wis wants to merge 83 commits into main from ori-2907-custom-dataloader-registry

Collaborator

ori-kron-wis commented Aug 7, 2024

No description provided.

ori-kron-wis and others added 16 commits

July 28, 2024 17:04


          copying CZI custom dataloader into our repo

7088e4b


          added some fixes to the custom dataloader stuff

cc72b05


          Some suggestions

46048e3


          Changes to datamodule pipeline

14f343d


          Fixed attr_dict

17282cd


          added some fixes based on custom data loader test

a4143f5


          Changes to dataloader

69abc47


          copying CZI custom dataloader into our repo

dc21a3d


          added some fixes to the custom dataloader stuff

a1098b3


          Some suggestions

b07216b


          Changes to datamodule pipeline

a578af1


          Fixed attr_dict

42434ec


          added some fixes based on custom data loader test

3d0c890


          Changes to dataloader

eff5b1e


          Merge remote-tracking branch 'origin/ori-2907-custom-dataloader-regis…

cbdc26e

…try' into ori-2907-custom-dataloader-registry


          add changes to tests and some merging with main following custom data…

18d65a6

…module / registry big change

ori-kron-wis added this to the scvi-tools 1.2 milestone

ori-kron-wis self-assigned this

ori-kron-wis linked an issue

that may be closed by this pull request

Fix custom dataloader registry #2907

Open

pre-commit-ci bot and others added 5 commits

August 7, 2024 12:58


          [pre-commit.ci] auto fixes from pre-commit.com hooks

4fe3ee1

for more information, see https://pre-commit.ci


          just put the cutom dataloder2 test under remarks so hook tests will r…

…un, we will later adjust this file


          fixes

7972bdc


          additional external models fixes once there is a registry

2d86c43


          fixed a few failed tests

3c44d86

codecov bot commented Aug 11, 2024 •

edited

Loading

Codecov Report

Attention: Patch coverage is 50.43860% with 113 lines in your changes missing coverage. Please review.

Project coverage is 82.50%. Comparing base (835d17a) to head (31e1d44).

Files with missing lines	Patch %	Lines
src/scvi/model/base/_base_model.py	37.60%	73 Missing ⚠️
src/scvi/model/_scanvi.py	58.13%	18 Missing ⚠️
src/scvi/model/_scvi.py	47.82%	12 Missing ⚠️
src/scvi/model/base/_archesmixin.py	75.00%	8 Missing ⚠️
src/scvi/model/base/_save_load.py	75.00%	1 Missing ⚠️
src/scvi/model/base/_training_mixin.py	50.00%	1 Missing ⚠️

❗ There is a different number of reports uploaded between BASE (835d17a) and HEAD (31e1d44). Click for more details.

HEAD has 13 uploads less than BASE

Flag BASE (835d17a) HEAD (31e1d44)

16 3

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2932      +/-   ##
==========================================
- Coverage   90.14%   82.50%   -7.65%     
==========================================
  Files         181      181              
  Lines       15644    15548      -96     
==========================================
- Hits        14103    12828    -1275     
- Misses       1541     2720    +1179

Files with missing lines	Coverage Δ
src/scvi/data/_utils.py	`87.57% <100.00%> (+0.53%)`	⬆️
src/scvi/external/stereoscope/_model.py	`92.40% <ø> (ø)`
src/scvi/external/stereoscope/_module.py	`96.33% <ø> (ø)`
src/scvi/model/_amortizedlda.py	`94.11% <ø> (ø)`
src/scvi/model/_autozi.py	`95.40% <ø> (ø)`
src/scvi/model/_condscvi.py	`95.74% <ø> (ø)`
src/scvi/model/_jaxscvi.py	`92.30% <ø> (ø)`
src/scvi/model/_linear_scvi.py	`94.87% <ø> (ø)`
src/scvi/model/_multivi.py	`75.08% <ø> (ø)`
src/scvi/model/_peakvi.py	`87.09% <ø> (ø)`
... and 7 more

... and 28 files with indirect coverage changes

ori-kron-wis added 4 commits

August 11, 2024 19:03


          fix archesmixin init and added new custom dataloader test and github …

c0889d8

…action


          fix again for from __future__ import annotations

8fe043c

and fix the test for custom dataloaders


          fix for run custom dataloader in github action

d8cf0f6


          rollback

c41e8b2

ori-kron-wis added the custom_dataloader label

canergen reviewed

View reviewed changes

tests/dataloaders/test_custom_dataloader.py

+                  adata.obs["batch"] = adata.obs[batch_keys].agg("".join, axis=1).astype("category")
+                  scvi.model.SCVI.prepare_query_anndata(adata, save_path)
+                  scvi.model.SCVI.load_query_data(registry=datamodule.registry, reference_model=save_path)

Member

canergen Oct 11, 2024

We should have more tests that actually fail - using different genes without prepare_query_anndata and different batch categories. Assert that it fails.

canergen reviewed

View reviewed changes

tests/dataloaders/test_custom_dataloader.py


		scvi.model.SCVI.prepare_query_anndata(adata, model_census2)

		scvi.model.SCVI.setup_anndata(adata, batch_key="batch") # needed?

Member

canergen Oct 11, 2024

checking that an AnnData model can be trained using datamodule. Do we really want it?

canergen reviewed

View reviewed changes

tests/dataloaders/test_custom_dataloader.py

+                  user_attributes_model_census3 = model_census3._get_user_attributes()
+                  pprint(user_attributes_model_census3)
+                  _ = model_census3.get_elbo()

Member

canergen Oct 11, 2024

uses AnnData for inference?

canergen reviewed

View reviewed changes

tests/dataloaders/test_custom_dataloader.py

+                  scvi.model.SCVI.prepare_query_anndata(adata, model_census3)
+                  scvi.model.SCVI.load_query_data(adata, model_census3)
+                  datamodule_inference = CensusSCVIDataModule(

Member

canergen Oct 11, 2024

check here that using different genes and different batches fails. You can take much fewer cells here, like 1000.

canergen reviewed

View reviewed changes

tests/dataloaders/test_custom_dataloader.py

+                  # Create a dataloder of a CZI module
+                  datapipe = datamodule_inference.datapipe
+                  dataloader = experiment_dataloader(datapipe, num_workers=0, persistent_workers=False)
+                  mapped_dataloader = (

Member

canergen Oct 11, 2024

What's this?

canergen reviewed

View reviewed changes

tests/model/test_scvi.py

+                  model = SCVI(adata, n_latent=n_latent)
+                  model.train(max_epochs=1)
+                  dataloader = model._make_data_loader(adata)

Member

canergen Oct 11, 2024

Does model._make_data_loader exist for all models? We should then add the test to the other models as well?

Member

canergen Oct 11, 2024

Is the dataloader sufficient to also setup the model and does setup_datamodule work for it?

ori-kron-wis and others added 14 commits

October 13, 2024 15:06


          removed redundat functions in code base

f94f7fa


          Added scanvi support, including CZI datamodule fix for it

962f043


          Merge remote-tracking branch 'origin/main' into ori-2907-custom-datal…

5c21d71

…oader-registry


          updates from main

a8aeffe


          more updates from main


          Merge branch 'main' into ori-2907-custom-dataloader-registry

624ee72


          Merge remote-tracking branch 'origin/ori-2907-custom-dataloader-regis…

6d4f368

…try' into ori-2907-custom-dataloader-registry


          updated related to tests

8ab01a4


          updated related to tests

31e1d44


          Running DataLoader MappedCollection

93666fa


          [pre-commit.ci] auto fixes from pre-commit.com hooks

1d1d6d3

for more information, see https://pre-commit.ci


          Fixed LaminDB dataloader

7695a8a


          Merge branch 'ori-2907-custom-dataloader-registry' of https://github.…

e4d732a

…com/scverse/scvi-tools into ori-2907-custom-dataloader-registry


          LaminDB dataloader test.

a651442

review-notebook-app bot commented Dec 31, 2024

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

pre-commit-ci bot and others added 9 commits

December 31, 2024 07:29


          [pre-commit.ci] auto fixes from pre-commit.com hooks

9767b8c

for more information, see https://pre-commit.ci


          Merge branch 'main' into ori-2907-custom-dataloader-registry

719e740


          Merge remote-tracking branch 'origin/main' into ori-2907-custom-datal…

1a4c796

…oader-registry

# Conflicts:
#	docs/tutorials/notebooks


          Changes for MappedCollection.


          Merge branch 'ori-2907-custom-dataloader-registry' of https://github.…

c740dd2

…com/scverse/scvi-tools into ori-2907-custom-dataloader-registry


          [pre-commit.ci] auto fixes from pre-commit.com hooks

61f2e27

for more information, see https://pre-commit.ci


          Add other notebook for testing new dataloader

874935b


          Merge branch 'ori-2907-custom-dataloader-registry' of https://github.…

f2c63bd

…com/scverse/scvi-tools into ori-2907-custom-dataloader-registry


          [pre-commit.ci] auto fixes from pre-commit.com hooks

35d45c8

for more information, see https://pre-commit.ci

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

custom_dataloader