Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataloading Revamp #3216

Open
wants to merge 118 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from 48 commits
Commits
Show all changes
118 commits
Select commit Hold shift + click to select a range
0471543
initial debugging and testing works
AntonioMacaronio Jun 11, 2024
c6dde7d
pwais changes with RayBatchStream to alleviate training
AntonioMacaronio Jun 12, 2024
a09ea0c
Merge branch 'main' into dataloading-revamp
AntonioMacaronio Jun 12, 2024
78453cd
few bugs to iron out with multiprocessing, specifically pickled colla…
AntonioMacaronio Jun 12, 2024
f2bd96f
working version of RayBatchStream
AntonioMacaronio Jun 13, 2024
d8b7430
additional docstrings
AntonioMacaronio Jun 13, 2024
a5425d4
cleanup
AntonioMacaronio Jun 13, 2024
604f734
much more documentation
AntonioMacaronio Jun 13, 2024
0143803
successfully trained AEA-script2_seq2 closed_loop without OOM
AntonioMacaronio Jun 13, 2024
d3527e2
porting over aria dataset-size feature
AntonioMacaronio Jun 13, 2024
25f5f27
added logic to handle eviction of a worker's cached_collated_batch
AntonioMacaronio Jun 14, 2024
3a8b63b
antonio's implementation of stream batches
AntonioMacaronio Jun 15, 2024
536c6ca
training on a dataset with 4000 images works!
AntonioMacaronio Jun 15, 2024
43a0061
some configuration speedups, loops aren't actually needed!
AntonioMacaronio Jun 15, 2024
fa7cf30
quick fix adjustment to aria
AntonioMacaronio Jun 15, 2024
927cb6a
removed unnecessary looping
AntonioMacaronio Jun 16, 2024
814f2c2
much faster training when adding i variable to collate every 5 ray bu…
AntonioMacaronio Jun 25, 2024
247ac3e
cleanup unnecssary variables in Dataloader
AntonioMacaronio Jul 7, 2024
55d0803
further cleanup
AntonioMacaronio Jul 11, 2024
b6979a4
adding caching of compressed images to RAM to reduce disk bottleneck
AntonioMacaronio Jul 20, 2024
81dbf7c
added caching to RAM for masks
AntonioMacaronio Jul 22, 2024
55ca71d
found fast way to collate - many tricks applied
AntonioMacaronio Jul 26, 2024
3b4f091
quick update to aria to test on different datasets
AntonioMacaronio Jul 26, 2024
7de1922
cleaned up the accelerated pil_to_numpy function
AntonioMacaronio Jul 26, 2024
9ceaad1
cleaning up PR
AntonioMacaronio Jul 26, 2024
4147a6a
this commit was used to generate the time metrics and profiling metrics
AntonioMacaronio Jul 26, 2024
5a55b7a
REAL commit used to run tests
AntonioMacaronio Jul 26, 2024
78f02e6
testing with nerfacto-big
AntonioMacaronio Aug 15, 2024
19bc4b5
generated RayBundle collate and converting images from uint8s to floa…
AntonioMacaronio Aug 15, 2024
9245d05
updating nerfacto to support uint8 easily, will need to figure out a …
AntonioMacaronio Aug 20, 2024
3124c14
datamanager updates, both splat and nerf
AntonioMacaronio Aug 20, 2024
afb0612
must use writeable arrays because torch requires them
AntonioMacaronio Aug 20, 2024
288a740
cleaned up base_dataset, added pickle to utils, more code in full_ima…
AntonioMacaronio Aug 22, 2024
2fd0862
lots of process on a parallel FullImageDatamanger
AntonioMacaronio Aug 23, 2024
846e2f3
can train big splats with pre-assertion hack or ROI hack and 0 workers
AntonioMacaronio Aug 24, 2024
8fb0b4d
fixed all undistortion issues with ParallelImageDatamanager
AntonioMacaronio Aug 27, 2024
ce3f83f
adding some downsampling and parallel tests with splatfacto!
AntonioMacaronio Aug 31, 2024
8ab9963
deleted commented code in dataloaders.py and added bugfix to shuffling
AntonioMacaronio Aug 31, 2024
c9e16bf
testing splatfacto-big
AntonioMacaronio Sep 1, 2024
ddac38d
cleaned up base_pipeline.py
AntonioMacaronio Sep 1, 2024
443719a
cleaned up base_pipeline.py ACTUALLY THIS TIME, forgot to save last time
AntonioMacaronio Sep 1, 2024
d16e519
cleaned up a lot of code
AntonioMacaronio Sep 1, 2024
367d512
process_project_aria back to main branch and some cleanup in full_ima…
AntonioMacaronio Sep 1, 2024
d3d99b4
clarifying docstrings
AntonioMacaronio Sep 1, 2024
6f763dc
further PR cleanup
AntonioMacaronio Sep 3, 2024
a5191bd
updating models
AntonioMacaronio Sep 9, 2024
7db70dc
further cleanup
AntonioMacaronio Sep 9, 2024
5c3262b
removed caching of images into bytestrings
AntonioMacaronio Sep 9, 2024
ff2bda1
adding caching of compressed images to RAM, forgot that hardware matters
AntonioMacaronio Sep 9, 2024
f6dd7dd
removing oom methods, adding the ability to add a flag to dataloading
AntonioMacaronio Sep 15, 2024
a6602c7
removed CacheDataloader, moved RayBatchStream to dataloaders.py, new …
AntonioMacaronio Sep 15, 2024
3dc2031
fixing base_piplines, deleting a weird datamanager_configs file that …
AntonioMacaronio Sep 15, 2024
89f3d98
cleaning up next_train
AntonioMacaronio Sep 15, 2024
14e60e5
replaced parallel datamanager with new datamanager
AntonioMacaronio Sep 19, 2024
204dfb2
reverted the original base_datamanager.py, new datamanager replaced p…
AntonioMacaronio Sep 19, 2024
5864bc9
modified VanillaConfig, but VanillaDataManager is the same as before
AntonioMacaronio Sep 19, 2024
6d97de3
cleaning up, 2 datamanagers now - original and new parallel one
AntonioMacaronio Sep 19, 2024
1f34017
able to train with new nerfstudio dataloader now
AntonioMacaronio Sep 19, 2024
99cf86a
side by side datamanagers, moved tons of logic into dataloaders.py an…
AntonioMacaronio Sep 23, 2024
4ebad85
added custom ray processing API to support implementations like LERF,…
AntonioMacaronio Sep 23, 2024
87921be
adding functionality for ns-eval by adding FixedIndicesEvalDataloader…
AntonioMacaronio Sep 24, 2024
b628c7c
adding both ray API and image-view API to datamanagers for custom par…
AntonioMacaronio Sep 27, 2024
d2785d1
updating splatfacto config for 4k tests
AntonioMacaronio Sep 30, 2024
436af9d
updating docstrings to be more descriptive
AntonioMacaronio Sep 30, 2024
dd4daaa
new datamanager API breaks when setup_eval() has multiple workers, no…
AntonioMacaronio Sep 30, 2024
43c66ae
adding custom_view_processor to ImageBatchStream
AntonioMacaronio Sep 30, 2024
ba81e11
merging with main!
AntonioMacaronio Sep 30, 2024
1922566
reverting full_images_datamanager to main branch
AntonioMacaronio Oct 1, 2024
beb74be
removing nn.Module inheritance from Datamanager class
AntonioMacaronio Oct 1, 2024
087cff0
don't need to move datamanger to device anymore since Datamanager is …
AntonioMacaronio Oct 1, 2024
48e6d15
finished integration test with nerfacto
AntonioMacaronio Oct 4, 2024
3f1799b
simplified config variables, integrated the parallelism/disk-data-loa…
AntonioMacaronio Oct 25, 2024
f46aa42
updated the splatfacto config to be simpler with the dataloading and …
AntonioMacaronio Oct 25, 2024
5aa51fb
style checks and some cleanup
AntonioMacaronio Oct 25, 2024
ec3c12a
new splatfacto test, cleaning up nerfacto integration test
AntonioMacaronio Oct 25, 2024
82bc5b2
removing redundant parallel_full_images_datamaanger, as the OG full_i…
AntonioMacaronio Oct 26, 2024
377a56a
Merge branch 'main' into dataloading-revamp
AntonioMacaronio Oct 28, 2024
bbb5473
ruff linting and pyright fixing
AntonioMacaronio Oct 28, 2024
2e64120
further pyright fixing
AntonioMacaronio Oct 28, 2024
e9c2fd6
another pyright fixing
AntonioMacaronio Oct 28, 2024
e4dc9f9
fixing pyright error, camera optimization no longer part of datamanager
AntonioMacaronio Nov 1, 2024
8b0ec8e
fixing one pyright
AntonioMacaronio Nov 22, 2024
6349852
fixing dataloading error when camera is not undistorted with dataloader
AntonioMacaronio Dec 13, 2024
ad6b090
fixing comments and updating style
AntonioMacaronio Dec 21, 2024
8c678ee
undoing a style change i made
AntonioMacaronio Dec 21, 2024
64edabb
undoing another style change i made by accident
AntonioMacaronio Dec 21, 2024
cc63585
Merge branch 'main' into dataloading-revamp
AntonioMacaronio Dec 22, 2024
1e40aad
fixing slow runtime
AntonioMacaronio Dec 25, 2024
0012017
fixing a more general camera undistortion bug
AntonioMacaronio Dec 31, 2024
2fdba24
move images to device properly
kerrj Jan 2, 2025
d8ec706
minor improvements
kerrj Jan 2, 2025
51fc984
Merge branch 'main' into dataloading-revamp
kerrj Jan 3, 2025
36df9b3
add print statement about >500 images, cleanup method configs
kerrj Jan 3, 2025
a3fb46f
make method configs consistent across nerfacto models
kerrj Jan 3, 2025
3e06221
adding description comments
AntonioMacaronio Jan 4, 2025
c6f8094
Merge branch 'main' into dataloading-revamp
kerrj Jan 6, 2025
e72dd78
updating description
AntonioMacaronio Jan 6, 2025
f5024fc
Merge branch 'dataloading-revamp' of https://github.com/AntonioMacaro…
AntonioMacaronio Jan 6, 2025
d2af513
resolving some pyright issues with export.py, explained in PR desc
AntonioMacaronio Jan 6, 2025
1a02133
fixing pyright issues in base_pipeline.py
AntonioMacaronio Jan 6, 2025
b7bcb13
ran pyright on exporter and base_pipeline.py without issues
AntonioMacaronio Jan 6, 2025
603a5db
adding a git ignore to a clearly checked pyright issue
AntonioMacaronio Jan 6, 2025
eedda79
typo
kerrj Jan 7, 2025
3c5ab8e
merge
kerrj Jan 7, 2025
2f90812
fixing most ns-dev-test cases
AntonioMacaronio Jan 7, 2025
3a82351
Merge branch 'dataloading-revamp' of https://github.com/AntonioMacaro…
AntonioMacaronio Jan 7, 2025
4091694
cleanup, passing final ns-dev-test
AntonioMacaronio Jan 7, 2025
e7c99e4
oops, accidentally pushed the deletion of a docstring, undoing that
AntonioMacaronio Jan 7, 2025
bd6d1ae
another cleanup
AntonioMacaronio Jan 7, 2025
deb4d7f
some fixes to eval pipeline
kerrj Jan 7, 2025
a5f62aa
lint
kerrj Jan 7, 2025
97629a7
Merge branch 'main' into dataloading-revamp
kerrj Jan 8, 2025
e13525e
add asserts for spawn
kerrj Jan 8, 2025
94afc0b
Merge branch 'dataloading-revamp' of https://github.com/AntonioMacaro…
kerrj Jan 8, 2025
c316a7b
lint
kerrj Jan 8, 2025
b8da37d
cleaning up import statements in parallel_datamanager.py
AntonioMacaronio Jan 9, 2025
3fafbc2
Merge branch 'dataloading-revamp' of https://github.com/AntonioMacaro…
AntonioMacaronio Jan 9, 2025
e4a7661
adding new developer documentation if users would like to migrate the…
AntonioMacaronio Jan 9, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
106 changes: 103 additions & 3 deletions nerfstudio/configs/method_configs.py
Original file line number Diff line number Diff line change
Expand Up @@ -37,6 +37,7 @@
from nerfstudio.data.dataparsers.phototourism_dataparser import PhototourismDataParserConfig
from nerfstudio.data.dataparsers.sdfstudio_dataparser import SDFStudioDataParserConfig
from nerfstudio.data.dataparsers.sitcoms3d_dataparser import Sitcoms3DDataParserConfig
from nerfstudio.data.datasets.base_dataset import InputDataset
from nerfstudio.data.datasets.depth_dataset import DepthDataset
from nerfstudio.data.datasets.sdf_dataset import SDFDataset
from nerfstudio.data.datasets.semantic_dataset import SemanticDataset
Expand Down Expand Up @@ -90,7 +91,7 @@
max_num_iterations=30000,
mixed_precision=True,
pipeline=VanillaPipelineConfig(
datamanager=ParallelDataManagerConfig(
datamanager=VanillaDataManagerConfig(
dataparser=NerfstudioDataParserConfig(),
train_num_rays_per_batch=4096,
eval_num_rays_per_batch=4096,
Expand Down Expand Up @@ -126,7 +127,7 @@
max_num_iterations=100000,
mixed_precision=True,
pipeline=VanillaPipelineConfig(
datamanager=ParallelDataManagerConfig(
datamanager=VanillaDataManagerConfig(
dataparser=NerfstudioDataParserConfig(),
train_num_rays_per_batch=8192,
eval_num_rays_per_batch=4096,
Expand Down Expand Up @@ -211,6 +212,48 @@
vis="viewer",
)

method_configs["nerfacto-oom"] = TrainerConfig(
method_name="nerfacto-oom",
steps_per_eval_batch=500,
steps_per_save=2000,
max_num_iterations=30000,
mixed_precision=True,
pipeline=VanillaPipelineConfig(
datamanager=VanillaDataManagerConfig(
dataparser=NerfstudioDataParserConfig(),
train_num_rays_per_batch=4096,
eval_num_rays_per_batch=4096,
use_parallel_dataloader=True,
load_from_disk=True,
dataloader_num_workers=4,
prefetch_factor=10,
train_num_images_to_sample_from=50,
train_num_times_to_repeat_images=10,
),
model=NerfactoModelConfig(
eval_num_rays_per_chunk=1 << 15,
average_init_density=0.01,
camera_optimizer=CameraOptimizerConfig(mode="SO3xR3"),
),
),
optimizers={
"proposal_networks": {
"optimizer": AdamOptimizerConfig(lr=1e-2, eps=1e-15),
"scheduler": ExponentialDecaySchedulerConfig(lr_final=0.0001, max_steps=200000),
},
"fields": {
"optimizer": AdamOptimizerConfig(lr=1e-2, eps=1e-15),
"scheduler": ExponentialDecaySchedulerConfig(lr_final=0.0001, max_steps=200000),
},
"camera_opt": {
"optimizer": AdamOptimizerConfig(lr=1e-3, eps=1e-15),
"scheduler": ExponentialDecaySchedulerConfig(lr_final=1e-4, max_steps=5000),
},
},
viewer=ViewerConfig(num_rays_per_chunk=1 << 15),
vis="viewer",
)

method_configs["depth-nerfacto"] = TrainerConfig(
method_name="depth-nerfacto",
steps_per_eval_batch=500,
Expand Down Expand Up @@ -601,8 +644,11 @@
mixed_precision=False,
pipeline=VanillaPipelineConfig(
datamanager=FullImageDatamanagerConfig(
dataparser=NerfstudioDataParserConfig(load_3D_points=True),
# _target=ParallelFullImageDatamanager[InputDataset],
dataparser=NerfstudioDataParserConfig(load_3D_points=True, downscale_factor=1),
# dataparser=NerfstudioDataParserConfig(load_3D_points=True),
cache_images_type="uint8",
use_parallel_dataloader=True,
),
model=SplatfactoModelConfig(),
),
Expand Down Expand Up @@ -697,6 +743,60 @@
vis="viewer",
)

method_configs["splatfacto-oom"] = TrainerConfig(
method_name="splatfacto-oom",
steps_per_eval_image=100,
steps_per_eval_batch=0,
steps_per_save=2000,
steps_per_eval_all_images=1000,
max_num_iterations=30000,
mixed_precision=False,
pipeline=VanillaPipelineConfig(
datamanager=FullImageDatamanagerConfig(
# _target=ParallelFullImageDatamanager[InputDataset],
dataparser=NerfstudioDataParserConfig(load_3D_points=True, downscale_factor=1),
# dataparser=NerfstudioDataParserConfig(load_3D_points=True),
cache_images_type="uint8",
use_parallel_dataloader=True,
),
model=SplatfactoModelConfig(),
),
optimizers={
"means": {
"optimizer": AdamOptimizerConfig(lr=1.6e-4, eps=1e-15),
"scheduler": ExponentialDecaySchedulerConfig(
lr_final=1.6e-6,
max_steps=30000,
),
},
"features_dc": {
"optimizer": AdamOptimizerConfig(lr=0.0025, eps=1e-15),
"scheduler": None,
},
"features_rest": {
"optimizer": AdamOptimizerConfig(lr=0.0025 / 20, eps=1e-15),
"scheduler": None,
},
"opacities": {
"optimizer": AdamOptimizerConfig(lr=0.05, eps=1e-15),
"scheduler": None,
},
"scales": {
"optimizer": AdamOptimizerConfig(lr=0.005, eps=1e-15),
"scheduler": None,
},
"quats": {"optimizer": AdamOptimizerConfig(lr=0.001, eps=1e-15), "scheduler": None},
"camera_opt": {
"optimizer": AdamOptimizerConfig(lr=1e-4, eps=1e-15),
"scheduler": ExponentialDecaySchedulerConfig(
lr_final=5e-7, max_steps=30000, warmup_steps=1000, lr_pre_warmup=0
),
},
},
viewer=ViewerConfig(num_rays_per_chunk=1 << 15),
vis="viewer",
)


def merge_methods(methods, method_descriptions, new_methods, new_descriptions, overwrite=True):
"""Merge new methods and descriptions into existing methods and descriptions.
Expand Down
Loading