Librosa: Difference between revisions

From EMC23 - Satellite Of Love
Jump to navigation Jump to search
mNo edit summary
mNo edit summary
 
(2 intermediate revisions by the same user not shown)
Line 1: Line 1:
[[Python]] library used for audio manipulation especially uselful with [[Deep Learning]]
[[Python]] library used for audio manipulation especially uselful with [[Deep Learning]]


* Core IO and DSP
The librosa package is structured as collection of submodules:
* Display
 
* Feature extraction
librosa
* Onset detection
 
* Beat and tempo
= librosa.core =
* Spectrogram decomposition
Core functionality includes functions to load audio from disk, compute various spectrogram representations, and a variety of commonly used tools for music analysis. For convenience, all functionality in this submodule is directly accessible from the top-level librosa.* namespace.
* Effects
 
* Temporal segmentation
= librosa.util =
* Sequential modeling
Helper utilities (normalization, padding, centering, etc.)
* cUtilities
 
= librosa.beat =
Functions for estimating tempo and detecting beat events.
 
= librosa.decompose =
Functions for harmonic-percussive source separation (HPSS) and generic spectrogram decomposition using matrix decomposition methods implemented in scikit-learn.
 
= librosa.display =
Visualization and display routines using matplotlib.
 
= librosa.effects =
Time-domain audio processing, such as pitch shifting and time stretching. This submodule also provides time-domain wrappers for the decompose submodule.
 
= librosa.feature =
Feature extraction and manipulation. This includes low-level feature extraction, such as chromagrams, Mel spectrogram, MFCC, and various other spectral and rhythmic features. Also provided are feature manipulation methods, such as delta features and memory embedding.
 
= librosa.filters =
Filter-bank generation (chroma, pseudo-CQT, CQT, etc.). These are primarily internal functions used by other parts of librosa.
 
= librosa.onset =
Onset detection and onset strength computation.
 
= librosa.segment =
Functions useful for structural segmentation, such as recurrence matrix construction, time-lag representation, and sequentially constrained clustering.
 
= librosa.sequence =
Functions for sequential modeling. Various forms of Viterbi decoding, and helper functions for constructing transition matrices.

Latest revision as of 20:34, 25 August 2021

Python library used for audio manipulation especially uselful with Deep Learning

The librosa package is structured as collection of submodules:

librosa

librosa.core[edit]

Core functionality includes functions to load audio from disk, compute various spectrogram representations, and a variety of commonly used tools for music analysis. For convenience, all functionality in this submodule is directly accessible from the top-level librosa.* namespace.

librosa.util[edit]

Helper utilities (normalization, padding, centering, etc.)

librosa.beat[edit]

Functions for estimating tempo and detecting beat events.

librosa.decompose[edit]

Functions for harmonic-percussive source separation (HPSS) and generic spectrogram decomposition using matrix decomposition methods implemented in scikit-learn.

librosa.display[edit]

Visualization and display routines using matplotlib.

librosa.effects[edit]

Time-domain audio processing, such as pitch shifting and time stretching. This submodule also provides time-domain wrappers for the decompose submodule.

librosa.feature[edit]

Feature extraction and manipulation. This includes low-level feature extraction, such as chromagrams, Mel spectrogram, MFCC, and various other spectral and rhythmic features. Also provided are feature manipulation methods, such as delta features and memory embedding.

librosa.filters[edit]

Filter-bank generation (chroma, pseudo-CQT, CQT, etc.). These are primarily internal functions used by other parts of librosa.

librosa.onset[edit]

Onset detection and onset strength computation.

librosa.segment[edit]

Functions useful for structural segmentation, such as recurrence matrix construction, time-lag representation, and sequentially constrained clustering.

librosa.sequence[edit]

Functions for sequential modeling. Various forms of Viterbi decoding, and helper functions for constructing transition matrices.