Add resampling to BlissToPcmHDFJob #568

michelwi · 2025-01-10T09:10:25Z

Instead of calling a bunch of BlissFfmpegJob before BlissToPcmHDFJob we now can resample the audio data on the fly before writing to the HDF file.
Useful for multi-bandwidth training.

Also a bit of cleanup in the Job as I was on it anyway.

michelwi · 2025-01-10T09:18:24Z

huh.. are we not having librosa in i6_core?

JackTemaki · 2025-01-10T09:22:05Z

huh.. are we not having librosa in i6_core?

No, I think so far no core job used it.

michelwi · 2025-01-10T09:26:04Z

No, I think so far no core job used it.

yes, grep told me.. I would like to change my question to "Is is available in your apptainer images / systems / etc, or would I break someones things if I merged this?"
Otherwise I could just add it to the requirements and have it available in the test pipeline.

Icemole

Looks good. Do you know how it interacts with the round_factor parameter? Should we assert, or at least warn, that only one of the two should be set at a given point?

michelwi · 2025-01-14T09:09:03Z

Do you know how it interacts with the round_factor parameter?

technically both are solving the same problem in two different ways:

if the audio was resampled externally, round_factor makes this Job read the resampled audio file as if the timestamps were calculated with the original sampling rate.
with resampling the audio file is read at the "original" sampling rate and resampled internally.

Now if you had say 48kHz data, externally resampled it to 8kHz and then internally upsampled to 16kHz again, it would make sense to set both parameters; but I admit this is a contrived scenario and in practice we would only use one of those parameters.

add resampling to BlissToPcmHDFJob

79e8080

michelwi requested review from curufinwe, christophmluscher, JackTemaki, Icemole and Atticus1806 January 10, 2025 09:10

add librosa to requirements.txt

88819b0

Icemole approved these changes Jan 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add resampling to BlissToPcmHDFJob #568

Add resampling to BlissToPcmHDFJob #568

michelwi commented Jan 10, 2025

michelwi commented Jan 10, 2025

JackTemaki commented Jan 10, 2025

michelwi commented Jan 10, 2025

Icemole left a comment

michelwi commented Jan 14, 2025

Add resampling to BlissToPcmHDFJob #568

Are you sure you want to change the base?

Add resampling to BlissToPcmHDFJob #568

Conversation

michelwi commented Jan 10, 2025

michelwi commented Jan 10, 2025

JackTemaki commented Jan 10, 2025

michelwi commented Jan 10, 2025

Icemole left a comment

Choose a reason for hiding this comment

michelwi commented Jan 14, 2025