Organizations

@pytorch


Reproduction of DeepSeek-R1

Python 168 17 Updated Mar 24, 2025

Unified automatic quality assessment for speech, music, and sound.

Python 433 27 Updated Mar 26, 2025

The first Large Audio Language Model that enables native in-depth thinking, which is trained on large-scale audio Chain-of-Thought data.

Python 202 19 Updated Mar 17, 2025

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 601 30 Updated Mar 19, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,693 101 Updated Mar 7, 2025

Implementation of the sparse attention pattern proposed by the DeepSeek team in their "Native Sparse Attention" paper

Python 567 27 Updated Mar 26, 2025

Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in PyTorch

Python 457 44 Updated Mar 12, 2025

Scalable and Performant Data Loading

Python 231 11 Updated Mar 28, 2025

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.

Python 1,269 71 Updated Feb 14, 2025

Mamba SSM architecture

Python 14,410 1,259 Updated Jan 18, 2025

An Open-source Streaming High-fidelity Neural Audio Codec

Python 461 22 Updated Mar 4, 2025
Python 33 3 Updated Mar 30, 2021

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in PyTorch

Python 3,240 264 Updated Sep 6, 2023

Audio Codec Speech processing Universal PERformance Benchmark

Python 243 22 Updated Nov 1, 2024

Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end-to-end problems using Llama mode…

Jupyter Notebook 16,547 2,394 Updated Mar 26, 2025

A multi-voice TTS system trained with an emphasis on quality

Jupyter Notebook 13,901 1,934 Updated Nov 19, 2024

FAIR Sequence Modeling Toolkit 2

Python 875 103 Updated Mar 28, 2025

Text-to-Audio/Music Generation

Python 2,395 188 Updated Sep 29, 2024

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Jupyter Notebook 21,711 2,282 Updated Mar 13, 2025

Implementation of Meta-Voicebox: the first generative AI model for speech to generalize across tasks with state-of-the-art performance.

578 32 Updated Jun 19, 2023

ImageBind: One Embedding Space to Bind Them All

Python 8,568 802 Updated Jul 31, 2024

2.5D visual sound dataset

96 15 Updated Sep 21, 2021

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Python 10,116 862 Updated Jul 6, 2024

Python loaders for many Real Room Impulse Response databases

Python 88 3 Updated Sep 30, 2024

SDX23 startkit for the Demucs baselines.

Python 27 2 Updated Mar 3, 2023

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python 562 157 Updated Aug 19, 2023

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch

Python 2,517 275 Updated Jan 12, 2025