deep-learning-sound-source-separation

CSC 486B (Deep Learning for Computer Vision) Group Project (with Quinton Yong and Jingjing Zhu): An Exploration and Implementation of “Learning to Separate Object Sounds by Watching Unlabelled Video”

We use a reduced dataset (from the AudioSet Dataset) of 4000 samples with the following 4 instrument classes: drum, acoustic guitar, piano and violin.

We have attached a directory containing the results of the audio source separation. We included results of 3 different videos for WithDropout vs WithoutDropout comparison. The source-separated WAV files and the original 10-second mp4 clip are provided. We also include the source-separation results of one video using hard-coded ground truth labels.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
AudioResults		AudioResults
README.md		README.md
basis_disentangle.py		basis_disentangle.py
extract_test_data.py		extract_test_data.py
extract_train_data.py		extract_train_data.py
postprocessing.py		postprocessing.py
preprocessing.py		preprocessing.py
report.pdf		report.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

deep-learning-sound-source-separation

About

Releases

Packages

Languages

RLuke22/deep-learning-sound-source-separation

Folders and files

Latest commit

History

Repository files navigation

deep-learning-sound-source-separation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages