mirror of https://github.com/facebookresearch/pytorch3d.git synced 2026-03-19 20:35:59 +08:00

Go to file

Nikhila Ravi aa8b03f31d Updates to support Accelerate and multigpu training (#37 )

Summary:
## Changes:
- Added Accelerate Library and refactored experiment.py to use it
- Needed to move `init_optimizer` and `ExperimentConfig` to a separate file to be compatible with submitit/hydra
- Needed to make some modifications to data loaders etc to work well with the accelerate ddp wrappers
- Loading/saving checkpoints incorporates an unwrapping step so remove the ddp wrapped model

## Tests

Tested with both `torchrun` and `submitit/hydra` on two gpus locally. Here are the commands:

**Torchrun**

Modules loaded:
```sh
1) anaconda3/2021.05   2) cuda/11.3   3) NCCL/2.9.8-3-cuda.11.3   4) gcc/5.2.0. (but unload gcc when using submit)
```

```sh
torchrun --nnodes=1 --nproc_per_node=2 experiment.py --config-path ./configs --config-name repro_singleseq_nerf_test
```

**Submitit/Hydra Local test**

```sh
~/pytorch3d/projects/implicitron_trainer$ HYDRA_FULL_ERROR=1 python3.9 experiment.py --config-name repro_singleseq_nerf_test --multirun --config-path ./configs  hydra/launcher=submitit_local hydra.launcher.gpus_per_node=2 hydra.launcher.tasks_per_node=2 hydra.launcher.nodes=1
```

**Submitit/Hydra distributed test**

```sh
~/implicitron/pytorch3d$ python3.9 experiment.py --config-name repro_singleseq_nerf_test --multirun --config-path ./configs  hydra/launcher=submitit_slurm hydra.launcher.gpus_per_node=8 hydra.launcher.tasks_per_node=8 hydra.launcher.nodes=1 hydra.launcher.partition=learnlab hydra.launcher.timeout_min=4320
```

## TODOS:
- Fix distributed evaluation: currently this doesn't work as the input format to the evaluation function is not suitable for gathering across gpus (needs to be nested list/tuple/dicts of objects that satisfy `is_torch_tensor`) and currently `frame_data`  contains `Cameras` type.
- Refactor the `accelerator` object to be accessible by all functions instead of needing to pass it around everywhere? Maybe have a `Trainer` class and add it as a method?
- Update readme with installation instructions for accelerate and also commands for running jobs with torchrun and submitit/hydra

X-link: https://github.com/fairinternal/pytorch3d/pull/37

Reviewed By: davnov134, kjchalup

Differential Revision: D37543870

Pulled By: bottler

fbshipit-source-id: be9eb4e91244d4fe3740d87dafec622ae1e0cf76

2022-07-11 19:29:58 -07:00

.circleci

cu116 docker image

2022-07-07 23:23:37 -07:00

.github

isort->usort

2022-04-26 08:34:54 -07:00

dev

move targets

2022-05-25 06:16:03 -07:00

docs

Add vis to readthedocs

2022-05-06 04:07:43 -07:00

packaging

cu116 docker image

2022-07-07 23:23:37 -07:00

projects

Updates to support Accelerate and multigpu training (#37 )

2022-07-11 19:29:58 -07:00

pytorch3d

Updates to support Accelerate and multigpu training (#37 )

2022-07-11 19:29:58 -07:00

scripts

Update license for company name

2022-01-04 11:43:38 -08:00

tests

fix test

2022-07-11 06:17:48 -07:00

website

docs: add social banner in support of Ukraine (#1101 )

2022-03-04 01:51:16 -08:00

.clang-format

Cut FOR_EACH_KV

2020-06-09 13:20:47 -07:00

.flake8

Enable black + isort fbsource linter

2020-03-29 14:51:02 -07:00

.gitignore

Shader API more consistent naming

2020-02-19 23:16:50 -08:00

INSTALL.md

Add pytorch 1.12, drop pytorch 1.7

2022-07-06 14:36:45 -07:00

LICENSE

More company name & License

2022-01-04 11:43:38 -08:00

LICENSE-3RD-PARTY

loading llff and blender datasets

2022-06-16 03:09:15 -07:00

README.md

INSTALL/README updates

2022-04-28 04:48:24 -07:00

setup.cfg

More company name & License

2022-01-04 11:43:38 -08:00

setup.py

don't check black version

2022-05-17 09:08:06 -07:00

README.md

Introduction

PyTorch3D provides efficient, reusable components for 3D Computer Vision research with PyTorch.

Key features include:

Data structure for storing and manipulating triangle meshes
Efficient operations on triangle meshes (projective transformations, graph convolution, sampling, loss functions)
A differentiable mesh renderer

PyTorch3D is designed to integrate smoothly with deep learning methods for predicting and manipulating 3D data. For this reason, all operators in PyTorch3D:

Are implemented using PyTorch tensors
Can handle minibatches of hetereogenous data
Can be differentiated
Can utilize GPUs for acceleration

Within FAIR, PyTorch3D has been used to power research projects such as Mesh R-CNN.

Installation

For detailed instructions refer to INSTALL.md.

License

PyTorch3D is released under the BSD License.

Tutorials

Get started with PyTorch3D by trying one of the tutorial notebooks.


Deform a sphere mesh to dolphin	Bundle adjustment


Render textured meshes	Camera position optimization


Render textured pointclouds	Fit a mesh with texture


Render DensePose data	Load & Render ShapeNet data


Fit Textured Volume	Fit A Simple Neural Radiance Field

Documentation

Learn more about the API by reading the PyTorch3D documentation.

We also have deep dive notes on several API components:

Overview Video

We have created a short (~14 min) video tutorial providing an overview of the PyTorch3D codebase including several code examples. Click on the image below to watch the video on YouTube:

Development

We welcome new contributions to PyTorch3D and we will be actively maintaining this library! Please refer to CONTRIBUTING.md for full instructions on how to run the code, tests and linter, and submit your pull requests.

Development and Compatibility

main branch: actively developed, without any guarantee, Anything can be broken at any time
- REMARK: this includes nightly builds which are built from main
- HINT: the commit history can help locate regressions or changes
backward-compatibility between releases: no guarantee. Best efforts to communicate breaking changes and facilitate migration of code or data (incl. models).

Contributors

PyTorch3D is written and maintained by the Facebook AI Research Computer Vision Team.

In alphabetical order:

Amitav Baruah
Steve Branson
Luya Gao
Georgia Gkioxari
Taylor Gordon
Justin Johnson
Patrick Labatut
Christoph Lassner
Wan-Yen Lo
David Novotny
Nikhila Ravi
Jeremy Reizenstein
Dave Schnizlein
Roman Shapovalov
Olivia Wiles

Citation

If you find PyTorch3D useful in your research, please cite our tech report:

@article{ravi2020pytorch3d,
    author = {Nikhila Ravi and Jeremy Reizenstein and David Novotny and Taylor Gordon
                  and Wan-Yen Lo and Justin Johnson and Georgia Gkioxari},
    title = {Accelerating 3D Deep Learning with PyTorch3D},
    journal = {arXiv:2007.08501},
    year = {2020},
}

If you are using the pulsar backend for sphere-rendering (the PulsarPointRenderer or pytorch3d.renderer.points.pulsar.Renderer), please cite the tech report:

@article{lassner2020pulsar,
    author = {Christoph Lassner and Michael Zollh\"ofer},
    title = {Pulsar: Efficient Sphere-based Neural Rendering},
    journal = {arXiv:2004.07484},
    year = {2020},
}

News

Please see below for a timeline of the codebase updates in reverse chronological order. We are sharing updates on the releases as well as research projects which are built with PyTorch3D. The changelogs for the releases are available under Releases, and the builds can be installed using conda as per the instructions in INSTALL.md.

[Dec 16th 2021]: PyTorch3D v0.6.1 released

[Oct 6th 2021]: PyTorch3D v0.6.0 released

[Aug 5th 2021]: PyTorch3D v0.5.0 released

[Feb 9th 2021]: PyTorch3D v0.4.0 released with support for implicit functions, volume rendering and a reimplementation of NeRF.

[November 2nd 2020]: PyTorch3D v0.3.0 released, integrating the pulsar backend.

[Aug 28th 2020]: PyTorch3D v0.2.5 released

[July 17th 2020]: PyTorch3D tech report published on ArXiv: https://arxiv.org/abs/2007.08501

[April 24th 2020]: PyTorch3D v0.2.0 released

[March 25th 2020]: SynSin codebase released using PyTorch3D: https://github.com/facebookresearch/synsin

[March 8th 2020]: PyTorch3D v0.1.1 bug fix release

[Jan 23rd 2020]: PyTorch3D v0.1.0 released. Mesh R-CNN codebase released: https://github.com/facebookresearch/meshrcnn

Languages

Python 80.8%

C++ 10.3%

Cuda 6.3%

C 0.9%

Shell 0.8%

Other 0.9%