- Meta Reality Labs
- New York
- https://nateanl.github.io
Stars
Unified automatic quality assessment for speech, music, and sound.
The first Large Audio Language Model that enables native in-depth thinking, trained on large-scale audio Chain-of-Thought data.
🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
MoBA: Mixture of Block Attention for Long-Context LLMs
Implementation of the sparse attention pattern proposed by the DeepSeek team in their "Native Sparse Attention" paper
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in PyTorch
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases. In ICML 2024.
An Open-source Streaming High-fidelity Neural Audio Codec
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in PyTorch
Audio Codec Speech processing Universal PERformance Benchmark
Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end-to-end problems using Llama mode…
A multi-voice TTS system trained with an emphasis on quality
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable… (see the MusicGen usage sketch after this list)
Implementation of Meta-Voicebox: The first generative AI model for speech to generalize across tasks with state-of-the-art performance.
ImageBind: One Embedding Space to Bind Them All
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Python loaders for many Real Room Impulse Response databases
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in PyTorch
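As a quick illustration of the Audiocraft entry above, here is a minimal text-to-music sketch using MusicGen. It assumes the `audiocraft` package is installed and that the `facebook/musicgen-small` checkpoint is available; the prompt text and output file names are made up for the example.

```python
# Minimal MusicGen sketch (assumes `pip install audiocraft`); the prompt and
# output names below are illustrative, not taken from the list above.
from audiocraft.models import MusicGen
from audiocraft.data.audio import audio_write

# Load the small pretrained checkpoint and generate ~8 seconds of audio.
model = MusicGen.get_pretrained("facebook/musicgen-small")
model.set_generation_params(duration=8)

descriptions = ["lo-fi hip hop beat with mellow piano"]  # example prompt
wav = model.generate(descriptions)  # tensor of shape [batch, channels, samples]

for idx, one_wav in enumerate(wav):
    # Writes output_0.wav (etc.) with loudness normalization.
    audio_write(f"output_{idx}", one_wav.cpu(), model.sample_rate, strategy="loudness")
```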