Skip to content

This is my Master thesis which evaluates 6D pose estimating deep learning methods for usage in an AR use case. It includes 2 new proxies for Gabriel in order to make the Deep Learning methods usable in AR-enabled devices.

License

Notifications You must be signed in to change notification settings

linhdoan8/6d-pose-estimation-with-ml-in-ar

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Implementation and Evaluation of Deep Learning Approaches for Real-time 3D Robot Localization in an Augmented Reality Use Case

External links

General information

This project is a fusion of SSPE, Betapose and Gabriel. The betapose and sspe directories are used for training and evaluation. The tools directory contrains useful tools like projectors, dataset converters and augmetors as well as PLY-manipulators which are mainly useful for data preparation. The gabriel directory contains the Gabriel code base with 2 new proxies (cognitive engines) for Betapose and SSPE.

In many subdirectories there is a commands.txt file which contains the actual commands that were used during the project. They can be used for reference.

The programs were run with Nvidia driver 418.87 and CUDA version 10.0. Betapose and SSPE (including their gabriel proxies) were run on Python 3.6.8, while the 2 other servers of Gabriel were run on Python 2.7.16.

Speed demo

Speed demo

Accuracy demo

Accuracy demo

Training

If you are going to use the already existing weights you can skip this step.

The betapose and sspe folders are used for training. To train each of the methods reformat the datasets in the needed format. You can do that by using the tools provided in this repository under tools/converters/. After your dataset is ready follow the README.md files of each method which explain how to do the training in more detail.

Notice: For training SSPE you would need a GPU with at least 8GB of memory

Evaluation

To evaluate SSPE just run the SSPE evaluation script.

To evaluate Betapose with the same metrics as the ones used by SSPE do the following:

  1. Run the betapose_evaluate.py with save option enabled
  2. Then get the outputed Betapose-results.json file and move it in the directory of the evaluation script. Then also move the SSPE label files for the same images in the same directory. Then run the evaluation script

Running the AR-service

You have to run the Gabriel servers locally. In case you're stuck with those instructions you can look at the Gabriel README for more info.

It is advised to have 3 Anaconda environments - for Betapose (Python 2.6), for SSPE (Python 2.6) and for Gabriel (Python 2.6). After having installed all needed libraries in the coresponding environments you can purceed with running the servers:

  1. Connect the computer and the AR-enabled device to the same WiFi network. It would be easiest if you make one of the devices a hotspot.
  2. (Gabriel Environment) Run the control server from the gabriel/server/bin directory
python gabriel-control -n wlp3s0 -l
  1. (Gabriel Environment) Run the ucomm server from the gabriel/server/bin directory
python gabriel-ucomm -n wlp3s0

Now we have the core of Gabriel running with no cognitive engines running. To start the SSPE congnitive engine do the following:

  1. Go to the gabriel/server/bin/example-proxies/gabriel-proxy-sspe directory
  2. Put the PLY 3D model in the ./sspd/3d_models directory
  3. Put the configuration file in the ./sspd/cfg directory
  4. Put the weights in the ./sspd/backup/kuka directory
  5. Run the following command (it might change according to the names you've used):
python proxy.py ./sspd/cfg/kuka.data ./sspd/cfg/yolo-pose-noanchor.cfg ./sspd/backup/kuka/model.weights 0.0.0.0:8021

To start the Betapose cognitive engine do the following:

  1. Go to the gabriel/server/bin/example-proxies/gabriel-proxy-betapose directory
  2. Put the PLY 3D object model in the ./betapose/models/models directory
  3. Put the PLY 3D key points model in the ./betapose/models/kpmodels directory
  4. Put the KPD weights named kpd.pkl in the ./betapose/weights directory
  5. Put the YOLO weights named yolo.weights in the ./betapose/weights directory
  6. Run the following command:
python proxy.py --control_server 0.0.0.0:8021

About

This is my Master thesis which evaluates 6D pose estimating deep learning methods for usage in an AR use case. It includes 2 new proxies for Gabriel in order to make the Deep Learning methods usable in AR-enabled devices.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 50.7%
  • C 31.3%
  • Java 7.6%
  • Cuda 4.1%
  • C# 2.8%
  • C++ 2.0%
  • Other 1.5%