Librosa

From EMC23 - Satellite Of Love
Jump to navigation Jump to search

Python library used for audio manipulation especially uselful with Deep Learning

The librosa package is structured as collection of submodules:

librosa

librosa.core[edit]

Core functionality includes functions to load audio from disk, compute various spectrogram representations, and a variety of commonly used tools for music analysis. For convenience, all functionality in this submodule is directly accessible from the top-level librosa.* namespace.

librosa.util[edit]

Helper utilities (normalization, padding, centering, etc.)

librosa.beat[edit]

Functions for estimating tempo and detecting beat events.

librosa.decompose[edit]

Functions for harmonic-percussive source separation (HPSS) and generic spectrogram decomposition using matrix decomposition methods implemented in scikit-learn.

librosa.display[edit]

Visualization and display routines using matplotlib.

librosa.effects[edit]

Time-domain audio processing, such as pitch shifting and time stretching. This submodule also provides time-domain wrappers for the decompose submodule.

librosa.feature[edit]

Feature extraction and manipulation. This includes low-level feature extraction, such as chromagrams, Mel spectrogram, MFCC, and various other spectral and rhythmic features. Also provided are feature manipulation methods, such as delta features and memory embedding.

librosa.filters[edit]

Filter-bank generation (chroma, pseudo-CQT, CQT, etc.). These are primarily internal functions used by other parts of librosa.

librosa.onset[edit]

Onset detection and onset strength computation.

librosa.segment[edit]

Functions useful for structural segmentation, such as recurrence matrix construction, time-lag representation, and sequentially constrained clustering.

librosa.sequence[edit]

Functions for sequential modeling. Various forms of Viterbi decoding, and helper functions for constructing transition matrices.