A generative speech model for daily dialogue.
Updated Jan 13, 2025 - Python
VoxNovel: generate audiobooks giving each character a different voice actor.
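⭐ Undergraduate graduation project: design and development of a content-based music recommendation system. The model training code is built with the PyTorch framework, and the front end and back end are built with Django.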
Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
TTS models for Arabic (Tacotron2, FastPitch)
Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.
KAE: KAN-based AutoEncoder (AE, VAE, VQ-VAE, RVQ, etc.)
Cross-compilation of PyTorch for armv7l (32-bit) Raspberry Pi OS
Sound classification on Urban Sound Dataset
High fidelity music synthesis using diffusion and UnivNet.
Speech command classification on the Speech Commands v0.02 dataset using PyTorch and torchaudio. In this example, three models are trained: one on raw signal waveforms, one on MFCC features, and one on MelSpectrogram features.
A utility for wrapping the Free Spoken Digit Dataset into PyTorch-ready data set splits.
TTS (FastPitch) for German
(😞 😨 😄 😮 😍 😠 😐 🤮) A simple DL API that classifies human emotions from audio and text.
A simple artificial neural network model that uses deep learning and torchaudio to classify cat and dog sounds.
Open Translator: speech-to-speech and speech-to-text translator with voice cloning and other features
🤖 Telegram bot powered by deep learning. Automatically assesses the safety of audio files and voice messages for people suffering from misophonia.
Experiments in neural networks for audio generation.