Audio Loader

'audio.audio_loader' module within the ketos library

This module contains the utilities for loading waveforms and computing spectrograms.

The classes currently implemented in this module are:

AudioFrameLoader(duration[, step, path, ...])

Load audio segments by sliding a fixed-size frame across the recording.

AudioLoader(selection_gen[, channel, ...])

Class for loading segments of audio data from .wav files.

FrameStepper(duration[, step, path, ...])

Generates selections of uniform length 'duration', with successive selections displaced by a fixed amount 'step'. If 'step' is not specified, it is set equal to 'duration'.
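The stepping rule described above can be illustrated with a minimal, library-independent sketch. It does not use or reproduce the ketos API: the function name frame_selections and the parameter file_duration are hypothetical, and the sketch assumes that only frames fitting entirely within the recording are produced (how FrameStepper itself treats a trailing partial frame is not stated here).

```python
def frame_selections(file_duration, duration, step=None):
    """Return (start, end) offsets in seconds for fixed-length frames.

    Hypothetical helper for illustration only; not part of ketos.
    Assumption: frames that would extend past the end of the
    recording are skipped.
    """
    if step is None:
        # Default: frames are placed back to back with no overlap.
        step = duration
    if duration <= 0 or step <= 0:
        raise ValueError("duration and step must be positive")
    if file_duration < duration:
        return []
    # Number of frames that fit completely inside the recording.
    num_frames = int((file_duration - duration) // step) + 1
    # Compute each start as i * step to avoid accumulating float error.
    return [(i * step, i * step + duration) for i in range(num_frames)]


# A 10 s recording, 4 s frames, 2 s step: consecutive frames overlap by 2 s.
overlapping = frame_selections(10.0, 4.0, step=2.0)

# Without 'step', frames are displaced by 'duration'.
back_to_back = frame_selections(10.0, 5.0)
```

Choosing 'step' smaller than 'duration' gives overlapping frames, while the default gives contiguous, non-overlapping ones.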

SelectionGenerator()

Template class for selection generators.

SelectionTableIterator(data_dir, selection_table)

Iterates over entries in a selection table.
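As a rough illustration of what iterating over a selection table involves, the sketch below walks a table row by row and resolves each entry against a data directory. It is not the ketos implementation: the function iterate_selections, the row keys 'filename', 'start' and 'end', and the keys of the yielded dictionaries are all assumptions made for this example, and the table is represented as a plain list of dictionaries.

```python
import os


def iterate_selections(data_dir, selection_table):
    """Yield one audio selection per table row.

    Hypothetical helper for illustration only; not part of ketos.
    Assumption: each row is a dict with 'filename', 'start' and 'end',
    where 'start' and 'end' are offsets in seconds.
    """
    for row in selection_table:
        yield {
            # Resolve the file name relative to the data directory.
            "path": os.path.join(data_dir, row["filename"]),
            "offset": row["start"],
            "duration": row["end"] - row["start"],
        }


# Hypothetical table with two selections from two files.
table = [
    {"filename": "a.wav", "start": 1.0, "end": 3.5},
    {"filename": "b.wav", "start": 0.0, "end": 2.0},
]
selections = list(iterate_selections("recordings", table))
```

Each yielded dictionary carries what a loader would need to read one segment: the file to open, the offset at which to start, and how many seconds to read.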