c3d

This package implements the C3D loss (continuous 3D loss) in the paper Monocular Depth Prediction Through Continuous 3D Loss. The paper is accepted by IROS 2020.

Video Demo:

This repo contains the python package for computing Continuous 3D Loss. For complete networks conducting monocular depth prediction trained with this loss, please check this DORN implementation.

Implementation of the C3D loss in BTS and Monodepth2 network will be released soon.

Overview

The interface for using the C3D loss is the C3DLoss class in c3d_loss.py. You can import the class to your script to augment your own depth prediction project with C3D loss. The implementation of its operations are in cvo_funcs.py and cvo_ops folder. The class also needs some other util classes to run properly, which you can find in utils folder.

Getting Started

Prerequisite

Python 3.6
Pytorch 1.2
PIL

This is the environment I used to work properly. Higher versions are likely to be fine, but I did encountered problem of divergence in no_grad mode when working on another environment with high version of Pytorch. The exact problem is not located yet.

Installing

To use the C3D loss, you need to install this library.

$ python setup.py install

Deploying in your own network

You will need to create an instance for these classes:

C3DLoss (for calculating the proposed loss),
CamProj (for generating camera intrinsic and extrinsics information needed by C3DLoss), and
DataReaderKITTI (enabling easier retrieval of camera intrinsic and extrinsics information from KITTI dataset).

C3DLoss can be simply initialized as c3d_loss = C3DLoss(). Optionally you can create a config text file to customize the parameters. See c3d_config_example.txt for an example. C3DLoss.parse_opts(f_input=$path_to_config_file$) set the parameters specified in the config file.
CamProj and DataReaderKITTI objects should be initialized in the dataset-handling part of your code. DataReaderKITTI takes the root path to your local KITTI dataset as initialization argument. CamProj takes the DataReaderKITTI object as initialization argument.
Camera intrinsic and extrinsics information are encapsuled in the CamInfo class, which is output by a CamProj object. CamInfo object is needed in C3DLoss forward calculation, together with RGB image, depth prediction, ground truth, and validity masks.

Please check here as an example of using the C3D loss in an existing network (DORN).

Note

This repo is still under constructions. The implementation is subject to change. More comments and instructions would come later. Your comments are welcomed.

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
c3d		c3d
.gitignore		.gitignore
README.md		README.md
c3d_config_example.txt		c3d_config_example.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

c3d

Overview

Getting Started

Prerequisite

Installing

Deploying in your own network

Note

About

Releases

Packages

Languages

minghanz/c3d

Folders and files

Latest commit

History

Repository files navigation

c3d

Overview

Getting Started

Prerequisite

Installing

Deploying in your own network

Note

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages