Final project for the Seminario en Aplicaciones de Redes Neuronales en la Recuperación de Información Musical (Seminar on Neural Network Applications in Music Information Retrieval). The objective is to apply a Siamese Neural Network architecture to the speaker diarization task. We use the LibriSpeech dataset for training and validation.
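For context, a Siamese network for this task is typically trained on pairs of utterances labeled same-speaker / different-speaker. The following sketch (a hypothetical helper, not code from this repo) shows one way such pairs could be drawn from a mapping of LibriSpeech speaker IDs to their utterance files:

```python
import random

def make_pairs(utterances_by_speaker, n_pairs):
    """Build (utterance_a, utterance_b, label) training pairs:
    label 1 for same-speaker pairs, 0 for different-speaker pairs."""
    speakers = list(utterances_by_speaker)
    pairs = []
    for _ in range(n_pairs // 2):
        # Positive pair: two distinct utterances from the same speaker.
        spk = random.choice(speakers)
        a, b = random.sample(utterances_by_speaker[spk], 2)
        pairs.append((a, b, 1))
        # Negative pair: one utterance from each of two different speakers.
        spk_a, spk_b = random.sample(speakers, 2)
        pairs.append((random.choice(utterances_by_speaker[spk_a]),
                      random.choice(utterances_by_speaker[spk_b]), 0))
    random.shuffle(pairs)
    return pairs
```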
The first implementation is in Keras and uses the SincNet architecture to reduce the dimensionality of the convolutional front end and work directly with the raw audio. With this approach we obtain a good training error, but the model does not generalize well and the validation error stays high.
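A minimal sketch of the Siamese setup in Keras is shown below. The SincNet front end is abstracted here as a plain Conv1D stack (SincNet is not a built-in Keras layer), and the layer sizes, embedding dimension, and margin are illustrative assumptions, not values taken from this repo:

```python
import tensorflow as tf
from tensorflow.keras import layers, Model

def build_encoder(input_len=16000):
    # Stand-in for the SincNet front end: a small Conv1D stack over raw audio.
    inp = layers.Input(shape=(input_len, 1))
    x = layers.Conv1D(80, 251, strides=5, activation="relu")(inp)
    x = layers.MaxPooling1D(3)(x)
    x = layers.Conv1D(60, 5, activation="relu")(x)
    x = layers.GlobalAveragePooling1D()(x)
    emb = layers.Dense(128)(x)  # utterance embedding
    return Model(inp, emb, name="encoder")

def euclidean_distance(tensors):
    a, b = tensors
    return tf.sqrt(tf.reduce_sum(tf.square(a - b), axis=1, keepdims=True) + 1e-9)

def contrastive_loss(y_true, d, margin=1.0):
    # Pull same-speaker pairs together, push different-speaker pairs apart.
    y_true = tf.cast(y_true, d.dtype)
    return tf.reduce_mean(y_true * tf.square(d) +
                          (1 - y_true) * tf.square(tf.maximum(margin - d, 0)))

encoder = build_encoder()          # shared weights: the "Siamese" part
in_a = layers.Input(shape=(16000, 1))
in_b = layers.Input(shape=(16000, 1))
dist = layers.Lambda(euclidean_distance)([encoder(in_a), encoder(in_b)])
siamese = Model([in_a, in_b], dist)
siamese.compile(optimizer="adam", loss=contrastive_loss)
```

The key design point is that both inputs pass through the same encoder instance, so one set of weights learns an embedding space where distance reflects speaker identity.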
The second implementation is in PyTorch and uses the Wav2Vec model to extract acoustic features from the raw audio, then works with these low-dimensional vectors for the analysis.
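As a sketch of this feature-extraction step, the snippet below pulls Wav2Vec 2.0 features using torchaudio's pretrained bundle; this is one plausible way to load the model, the repo's actual checkpoint and pooling strategy may differ, and `utterance.flac` is a placeholder file name:

```python
import torch
import torchaudio

# Pretrained Wav2Vec 2.0 base model from torchaudio (assumed source;
# the repo may load its Wav2Vec weights differently).
bundle = torchaudio.pipelines.WAV2VEC2_BASE
model = bundle.get_model().eval()

waveform, sr = torchaudio.load("utterance.flac")  # hypothetical LibriSpeech file
if sr != bundle.sample_rate:
    waveform = torchaudio.functional.resample(waveform, sr, bundle.sample_rate)

with torch.inference_mode():
    # extract_features returns a list of per-layer feature tensors,
    # each of shape (batch, frames, 768) for the base model.
    features, _ = model.extract_features(waveform)
    # Mean-pool over time to get one fixed-size vector per utterance.
    embedding = features[-1].mean(dim=1)
```

These pooled vectors can then feed the Siamese comparison in place of the raw waveform, which is what lets the downstream network stay small.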