General-purpose sound recognition demo
We build a sound event detection (SED) system on top of our audio tagging system: SED predictions are obtained by applying the audio tagging model to consecutive 2-second segments of the input audio. A video of the demo can be viewed at:
https://www.youtube.com/watch?v=7TEtDMzdLeY
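The segmentation logic is straightforward: the audio is split into consecutive 2-second blocks and each block is passed through the audio tagging model independently. Below is a minimal sketch of this idea; tag_segment is a hypothetical stand-in for the tagging model loaded in MSoS_demo_generalisation.py, and the 32 kHz sample rate is an assumption matching the PANNs AudioSet models.

    import numpy as np

    SAMPLE_RATE = 32000                  # assumed; PANNs AudioSet models use 32 kHz audio
    SEGMENT_SAMPLES = 2 * SAMPLE_RATE    # consecutive 2-second segments

    def detect_events(waveform, tag_segment):
        """Obtain SED predictions by tagging consecutive 2-second segments.

        waveform: 1-D numpy array of audio samples at SAMPLE_RATE.
        tag_segment: callable mapping a (SEGMENT_SAMPLES,) array to a dict of
            {event_label: probability} - a stand-in for the audio tagging model.
        Returns a list of (segment_start_time_in_seconds, tag_probabilities).
        """
        predictions = []
        for start in range(0, len(waveform), SEGMENT_SAMPLES):
            segment = waveform[start:start + SEGMENT_SAMPLES]
            if len(segment) < SEGMENT_SAMPLES:
                # Zero-pad the final partial segment to the expected length.
                segment = np.pad(segment, (0, SEGMENT_SAMPLES - len(segment)))
            predictions.append((start / SAMPLE_RATE, tag_segment(segment)))
        return predictions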
This demo is based on our AudioSet work. Please download the trained models from https://zenodo.org/record/3576599 and save them in the models folder.
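If you prefer to script the download, a minimal Python sketch is shown below. The checkpoint filename and the Zenodo file-link pattern are assumptions; check the Zenodo record page for the exact file used by this demo.

    import os
    import urllib.parse
    import urllib.request

    # Assumed checkpoint name - verify the exact filename on the Zenodo record page.
    checkpoint_name = "Cnn14_mAP=0.431.pth"
    url = "https://zenodo.org/record/3576599/files/{}?download=1".format(
        urllib.parse.quote(checkpoint_name))

    # Save the model into the models folder expected by the demo.
    os.makedirs("models", exist_ok=True)
    destination = os.path.join("models", checkpoint_name)
    urllib.request.urlretrieve(url, destination)
    print("Saved checkpoint to {}".format(destination))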
Please also see our AudioSet repository at https://github.com/qiuqiangkong/audioset_tagging_cnn, where the model used in this demo was trained.
First, create the Python environment:
conda env create -f environment.yml
Then simply run:
python MSoS_demo_generalisation.py
If you use our code in any form, please consider citing the following paper:
[1] Qiuqiang Kong, Yin Cao, Turab Iqbal, Yuxuan Wang, Wenwu Wang, Mark D. Plumbley. "PANNs: Large-Scale Pretrained Audio Neural Networks for Audio Pattern Recognition." arXiv preprint arXiv:1912.10211 (2019).
Authors: Yin Cao, Qiuqiang Kong, Christian Kroos, Turab Iqbal, Wenwu Wang, Mark Plumbley