torchaudio

Speech command classification on Speech-Command v0.02 dataset using PyTorch and torchaudio. In this example, three models have been trained using the raw signal waveforms, MFCC features and MelSpectogram features.

speech dnn speech-recognition classification pytorch-tutorial torchaudio

Updated Dec 5, 2022
Python

eonu / torch-fsdd

Star

A utility for wrapping the Free Spoken Digit Dataset into PyTorch-ready data set splits.

audio torch data-loader torchaudio pytorch-dataset pytorch-dataset-split audio-dataset pytorch-dataloader fsdd free-spoken-digit-dataset

Updated Dec 27, 2022
Python

nipponjo / tts-german-pytorch

Star

TTS (FastPitch) for German

python text-to-speech deep-learning german speech pytorch tts speech-synthesis german-language torchaudio emotional-speech hifi-gan fastpitch

Updated Sep 16, 2024
Python

CrispenGari / emotionAI

Star

(😞 😨 😄 😮 😍 😠 😐 🤮) This is a simple DL API that classifies human emotions from audios and text.

python flask machine-learning torch pytorch artificial-intelligence deeplearning torchaudio torchvision

Updated Feb 5, 2022
Jupyter Notebook

CrispenGari / animal-sound-classification

Star

this is a simple artificial neural network model using deep learning and torch-audio to classify cats and dog sounds.

audio python machine-learning deep-neural-networks deep-learning pytorch artificial-intelligence rnn artificial-neural-networks audio-processing torchaudio

Updated Jan 25, 2022
Jupyter Notebook

overcrash66 / OpenTranslator

Star

Open Translator: Speech To Speech and Speech to text Translator with voice cloning and other cool features

multilingual multi-platform translator transformers speech-to-text speaker-recognition autosub multimodal torchaudio gtts-api s2st coqui-tts whisper-ai llama2 audio-translation xttsv2 s2tt

Updated Jan 16, 2025
Python

glefundes / misophonia-bot

Star

🤖 Telegram bot powered by Deep Learning. Automatically assesses the safety of audios and voice messages for people suffering from misophonia.

audio telegram deep-learning telegram-bot pytorch telegram-bot-api audio-classification torchaudio

Updated Sep 27, 2020
Python

LumenPallidium / audio_generation

Star

Experiments in neural networks for audio generation.

pytorch autoencoder hopfield-network vector-quantization torchaudio audio-generation energy-transformer

Updated Oct 18, 2024
Python

Improve this page

Add a description, image, and links to the torchaudio topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the torchaudio topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

torchaudio

Here are 58 public repositories matching this topic...

2noise / ChatTTS

DrewThomasson / VoxNovel

ujiaqi / MusicRecommend

KentoNishi / torch-pitch-shift

nipponjo / tts-arabic-pytorch

evshiron / rocm_lab

KentoNishi / torch-time-stretch

SekiroRong / KAN-AutoEncoder

torchsmoke / Python3-Wheels

PINTO0309 / pytorch4raspberrypi

BakingBrains / Sound_Classification

LukeSutor / programmatic-pitch

aminul-huq / Speech-Command-Classification

eonu / torch-fsdd

nipponjo / tts-german-pytorch

CrispenGari / emotionAI

CrispenGari / animal-sound-classification

overcrash66 / OpenTranslator

glefundes / misophonia-bot

LumenPallidium / audio_generation

Improve this page

Add this topic to your repo