By Wonhyeok Choi, Mingyu Shin, Sunghoon Im.
This repository is an official implementation of the paper 'Depth-discriminative Metric Learning for Monocular 3D Object Detection' using 'Delving into Localization Errors for Monocular 3D Detection'. In this work, we address the challenge of monocular 3D object detection in RGB images by proposing a novel metric learning scheme. Our method, which does not rely on extra parameters, modules, or data, concentrates on extracting depth-discriminative features without increasing the inference time or model size.
This repo is tested on our local environment (python=3.6, cuda=9.2, pytorch=1.10.0), and we recommend you to use anaconda to create a vitural environment:
conda create -n qi_monodle python=3.6
Then, activate the environment:
conda activate qi_monodle
Install Install PyTorch:
conda install pytorch==1.10.0 torchvision==0.11.0 torchaudio==0.10.0 cudatoolkit=10.2 -c pytorch
and other requirements:
pip install -r requirements.txt
Please download KITTI dataset and organize the data as follows:
#ROOT
|data/
|KITTI/
|ImageSets/ [already provided in this repo]
|training/
|calib/
|image_2/
|label/
|testing/
|calib/
|image_2/
Make the object-wise depth map following command:
cd #ROOT
python make_obj_depth.py
Move to the workplace and train the network:
cd #ROOT
cd experiments/example
python ../../tools/train_val.py --config kitti_example.yaml
The model will be evaluated automatically if the training completed. If you only want evaluate your trained model, you can modify the test part configuration in the .yaml file and use the following command:
python ../../tools/train_val.py --config kitti_example.yaml --e
See the below table to check the performance.
AP40@Easy | AP40@Mod. | AP40@Hard | |
---|---|---|---|
Monodle | 17.32 | 14.35 | 12.22 |
Monodle + Ours | 21.31 | 16.53 | 13.93 |
If you find our work useful in your research, please consider citing:
@inproceedings{choi2023depth,
title={Depth-discriminative Metric Learning for Monocular 3D Object Detection},
author={Choi, Wonhyeok and Shin, Mingyu and Im, Sunghoon},
booktitle={Thirty-seventh Conference on Neural Information Processing Systems},
year={2023}
}
This repo benefits from the excellent work Monodle. Please also consider citing it.
This project is released under the MIT License.
If you have any question about this project, please feel free to contact [email protected] or [email protected].