Skip to content

Official repo for FEAR: Fast, Efficient, Accurate and Robust Visual Tracker (ECCV 2022)

License

Notifications You must be signed in to change notification settings

DDuranin/FEARTracker

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

FEAR: Fast, Efficient, Accurate and Robust Visual Tracker

Paper Conference

FEAR architecture

This is an official repository for the paper

FEAR: Fast, Efficient, Accurate and Robust Visual Tracker
Vasyl Borsuk, Roman Vei, Orest Kupyn, Tetiana Martyniuk, Igor Krashenyi, Jiři Matas
ECCV 2022

Environment setup

The training code is tested on Linux systems and mobile benchmarking on MacOS systems.

conda create -n py37fear python=3.7
conda activate py37fear
pip install -r requirements.txt

Note: you might need to remove xtcocotools requirement when installing environment on MacOS system for model evaluation.

FEAR Benchmark

We provide FEAR evaluation protocol implementation in evaluate/MeasurePerformance directory. Also, we provide FEAR-XS model checkpoint without the Dynamic Template Update module; the complete version of the model will be added soon. You should do the following steps on MacOS device to evaluate model on iOS device:

  1. Open evaluate/MeasurePerformance project in Xcode. You can do this by double-clicking on evaluate/MeasurePerformance/MeasurePerformance.xcodeproj file or by opening it from the Open option in Xcode.
  2. Connect iOS device to your computer and build the project into it.
  3. Select one of the benchmark options by tapping on the corresponding button on your mobile device:
    • Benchmark FPS: launches simple model benchmark that warms up the model for 20 iterations and measures average FPS across 100 model calls. The result is displayed in Xcode console.
    • Benchmark Online: launches FEAR online benchmark as described in the paper
    • Benchmark Offline: launches FEAR offline benchmark

Do the following steps on MacOS device to convert model into CoreML:

  1. To convert the model trained in PyTorch to CoreML with the following command from the project root directory. This command will produce a file with the model in CoreML format (Model.mlmodel) and a model with FP16 weight quantization (Model_quantized.mlmodel).
PYTHONPATH=. python evaluate/coreml_convert.py
  1. Move converted model into the iOS project with the following command cp Model_quantized.mlmodel evaluate/MeasurePerformance/MeasurePerformance/models/Model_quantized.mlmodel.

Count FLOPS and parameters

PYTHONPATH=. python evaluate/macs_params.py

Demo inference with Python

PYTHONPATH=. python demo_video.py --initial_bbox=[163,53,45,174] \
--video_path=assets/test.mp4 \
--output_path=outputs/test.mp4

Demo app for iOS

FEARDemo.mp4
  1. Open evaluate/FEARDemo project in Xcode.
  2. Connect iOS device to your computer and build the project.

Note: demo app does not contain all preprocessing and postprocessing steps of the tracker so its quality is not the same as in Python.

Training

Data preparation

There are two dataset configurations. Download all datasets from the configuration file you'll train with and put them into the directory specified in visual_object_tracking_datasets configuration field. You can change the value of visual_object_tracking_datasets to your local dataset path. There are two dataset configurations:

  1. Quick train on GOT-10k dataset
    Config file: model_training/config/dataset/got10k_train.yaml
  2. Full train on LaSOT, COCO2017, YouTube-BoundingBoxes, GOT-10k and ILSVRC
    Config file: model_training/config/dataset/full_train.yaml

You should create CSV annotation file for each of training datasets. We don't provide CSV annotations as some datasets have license restrictions. The annotation file for each dataset should have the following format:

  • sequence_id: str - unique identifier of video file
  • track_id: str - unique identifier of scene inside video file
  • frame_index: int - index of frame inside video
  • img_path: str - location of frame image relative to root folder with all datasets
  • bbox: Tuple[int, int, int, int] - bounding box of object in a format x, y, w, h
  • frame_shape: Tuple[int, int] - width and height of image
  • dataset: str - label to identify dataset (example: got10k)
  • presence: int - presence of the object (example, 0/1)
  • near_corner: int - is bounding box touches borders of the image (example, 0/1)

Run training

Current training code supports model training without Dynamic Template Update module, it'll be added soon. You can launch training with default configuration with the following command from the project root directory:

PYTHONPATH=. python model_training/train.py backend=2gpu
# or the following for full train
PYTHONPATH=. python model_training/train.py dataset=full_train backend=2gpu

Citation

If you use the FEAR Tracker benchmark, demo or training code (implicitly or explicitly) - for your research projects, please cite the following paper:

@article{fear_tracker,
  title={FEAR: Fast, Efficient, Accurate and Robust Visual Tracker},
  author={Borsuk, Vasyl and Vei, Roman and Kupyn, Orest and Martyniuk, Tetiana and Krashenyi, Igor and Matas, Ji{\v{r}}i},
  journal={arXiv preprint arXiv:2112.07957},
  year={2021}
}

About

Official repo for FEAR: Fast, Efficient, Accurate and Robust Visual Tracker (ECCV 2022)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 68.7%
  • Swift 31.3%