Skip to content
View jasonppy's full-sized avatar
🍗
🍗

Highlights

  • Pro

Block or report jasonppy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. VoiceCraft VoiceCraft Public

    Zero-Shot Speech Editing and Text-to-Speech in the Wild

    Jupyter Notebook 7.8k 758

  2. PromptingWhisper PromptingWhisper Public

    Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation

    Python 137 11

  3. syllable-discovery syllable-discovery Public

    Syllable Segmentation and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model

    Python 31 6

  4. word-discovery word-discovery Public

    Word Discovery in Visually Grounded, Self-Supervised Speech Models

    Jupyter Notebook 26 7

  5. FaST-VGS-Family FaST-VGS-Family Public

    Transformer-based visually grounded speech models

    Python 19 1

  6. MAE-AST-Public MAE-AST-Public Public

    Forked from AlanBaade/MAE-AST-Public

    Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer

    Python