Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction

This repo is the official implementation of "Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction" by Zhuofan Zong, Dongzhi Jiang, Guanglu Song, Zeyue Xue, Jingyong Su, Hongsheng Li, and Yu Liu.

News

[07/25/2023] Code for HoP on BEVDet is released!
[07/14/2023] HoP is accepted to ICCV 2023!
[04/05/2023] HoP achieves new SOTA performance on nuScenes 3D detection leaderboard with 68.5 NDS and 62.4 mAP.

Model Zoo

Result on BEVDet4D-Depth

model	backbone	pretrain	img size	Epoch	NDS	mAP	config	ckpt	log
BEVDet4D-Depth(Baseline)	Res50	ImageNet	256x704	24	0.4930	0.3848	cfg	ckpt	log
HoP_BEVDet4D-Depth	Res50	ImageNet	256x704	24	0.5099	0.3990	cfg	ckpt	log

Get Started

Install

We train our models under the following environment:

python=3.6.9
pytorch=1.8.1
torchvision=0.9.1
cuda=11.2

Other versions may possibly be imcompatible.

We use MMDetection3D V1.0.0rc4, MMDetection V2.24.0 and MMCV V1.5.0. The source code of MMDetection3D has been included in this repo.

You can take the following steps to install packages above:

Build MMCV following official instructions.
Install MMDetection by
```
pip install mmdet==2.24.0
```

Copy HoP repo and install MMDetection3D.

git clone git@github.com:Sense-X/HoP.git
cd HoP
pip install -e .

Data Preparation

Follow the steps to prepare nuScenes Dataset introduced in nuscenes_det.md and create the pkl by running:

python tools/create_data_bevdet.py

Train HoP

# single gpu
python tools/train.py configs/hop_bevdet/hop_bevdet4d-r50-depth.py
# multiple gpu
./tools/dist_train.sh configs/hop_bevdet/hop_bevdet4d-r50-depth.py $num_gpu

Eval HoP

# single gpu
python tools/test.py configs/hop_bevdet/hop_bevdet4d-r50-depth.py $checkpoint --eval bbox
# multiple gpu
./tools/dist_test.sh configs/hop_bevdet/hop_bevdet4d-r50-depth.py $checkpoint $num_gpu --eval bbox

Method

TODO

Release code for HoP on BEVFormer.

Cite HoP

If you find this repository useful, please use the following BibTeX entry for citation.

@misc{hop2023,
      title={Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction},
      author={Zhuofan Zong and Dongzhi Jiang and Guanglu Song and Zeyue Xue and Jingyong Su and Hongsheng Li and Yu Liu},
      year={2023},
      eprint={2304.00967},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

License

This project is released under the MIT license. Please see the LICENSE file for more information.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.dev_scripts		.dev_scripts
.github		.github
configs		configs
data		data
demo		demo
docker		docker
docs		docs
mmdet3d		mmdet3d
mmdet3d_plugin		mmdet3d_plugin
requirements		requirements
resources		resources
tests		tests
tools		tools
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.readthedocs.yml		.readthedocs.yml
CITATION.cff		CITATION.cff
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
README_zh-CN.md		README_zh-CN.md
model-index.yml		model-index.yml
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction

News

Model Zoo

Result on BEVDet4D-Depth

Get Started

Install

Data Preparation

Train HoP

Eval HoP

Method

TODO

Cite HoP

License

About

Releases 1

Packages

Contributors 3

Languages

License

Sense-X/HoP

Folders and files

Latest commit

History

Repository files navigation

Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction

News

Model Zoo

Result on BEVDet4D-Depth

Get Started

Install

Data Preparation

Train HoP

Eval HoP

Method

TODO

Cite HoP

License

About

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 3

Languages

Packages