Add MP3 support to AudioInterface and update tests #1222

bendichter · 2025-02-22T20:01:35Z

for more information, see https://pre-commit.ci

h-mayorquin · 2025-03-01T03:15:08Z

I hadn’t seen this before opening the other ones—my mistake.

I’ve been thinking about this, and the main challenge is that many of these libraries rely on FFmpeg for reading audio files. However, packaging FFmpeg in a way that works reliably across the three major operating systems within a pip-installable framework is difficult.

To test this, I’ve created a small set of stub files for the most common archival formats (WAV, FLAC, AIFF), along with MP3 and OGG, which are also widely used. We can them put them in gin. I believe this is necessary to establish a testing framework where these formats can be accessed without requiring FFmpeg or any other dependency that isn’t pip-installable. This will also allow us to enable CI testing in an environment similar to what our users experience.

A preliminary review suggests that torchaudio could be a straightforward solution, even though it’s quite heavy. Once these test files are available on Gin, we can explore lighter alternatives if we want since PyTorch itself is very large (5GiB as a dependency as they vendorize things inside of the package).

That said, I might be overlooking a better approach. What do you think, Ben?

add ffmpeg-python

bendichter · 2025-03-01T04:17:23Z

I see. I'm not crazy about a 5GB dependency just to read MP3s. If we use ffmpeg, the downsides are that

Users will need to install ffmpeg if they have not already
This will require us to either not include this capability in GUIDE or to put some work into including ffmpeg, which will be a bit tricky because it is OS-dependent.

This may have been why we previously stopped at wav files. I think I would prefer going with librosa even given those down-sides though I agree it's not ideal

bendichter and others added 2 commits February 22, 2025 14:00

Add MP3 support to AudioInterface and update tests

106d016

[pre-commit.ci] auto fixes from pre-commit.com hooks

9ca458e

for more information, see https://pre-commit.ci

bendichter added 2 commits February 28, 2025 23:05

Update pyproject.toml

0acd841

add ffmpeg-python

Merge branch 'main' into mp3-support

e61d67e

h-mayorquin mentioned this pull request Mar 1, 2025

Add mp3 support by Claude Code with Sonet 3.7 #1224

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MP3 support to AudioInterface and update tests #1222

Add MP3 support to AudioInterface and update tests #1222

bendichter commented Feb 22, 2025

h-mayorquin commented Mar 1, 2025

bendichter commented Mar 1, 2025

Add MP3 support to AudioInterface and update tests #1222

Are you sure you want to change the base?

Add MP3 support to AudioInterface and update tests #1222

Conversation

bendichter commented Feb 22, 2025

h-mayorquin commented Mar 1, 2025

bendichter commented Mar 1, 2025