Skip to content

Yuanbo2021/General-Purpose-Sound-Recognition-Demo

 
 

Repository files navigation

General-Purpose-Sound-Recognition-Demo

General purpose sound recognition demo.

Example

We apply the audio tagging system to build a sound event detection (SED) system. The SED prediction is obtained by applying the audio tagging system on consecutive 2-second segments. The video of demo can be viewed at:
https://www.youtube.com/watch?v=7TEtDMzdLeY

Download models

This is a demo based on our AudioSet work. Please download trained models from https://zenodo.org/record/3576599, and save models in models folder.

Please also check our AudioSet work on https://github.com/qiuqiangkong/audioset_tagging_cnn, where the model of this demo is trained.

Run

Create python environment first from:

conda env create -f environment.yml

Simply run

python MSoS_demo_generalisation.py

Citation

If you use our codes in any format, please consider citing the following paper:

[1] Qiuqiang Kong, Yin Cao, Turab Iqbal, Yuxuan Wang, Wenwu Wang, Mark D. Plumbley. "PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition." arXiv preprint arXiv:1912.10211 (2019).

Authors

Yin Cao, Qiuqiang Kong, Christian Kroos, Turab Iqbal, Wenwu Wang, Mark Plumbley

About

General purpose sound recognition demo

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%