Caution

You're reading an old version of this documentation. If you want up-to-date information, please have a look at 0.10.2.

Core IO and DSP

Audio loading

load(path, *[, sr, mono, offset, duration, ...])

Load an audio file as a floating point time series.

stream(path, *, block_length, frame_length, ...)

Stream audio in fixed-length buffers.

to_mono(y)

Convert an audio signal to mono by averaging samples across channels.

resample(y, *, orig_sr, target_sr[, ...])

Resample a time series from orig_sr to target_sr

get_duration(*[, y, sr, S, n_fft, ...])

Compute the duration (in seconds) of an audio time series, feature matrix, or filename.

get_samplerate(path)

Get the sampling rate for a given file.

Time-domain processing

autocorrelate(y, *[, max_size, axis])

Bounded-lag auto-correlation

lpc(y, *, order[, axis])

Linear Prediction Coefficients via Burg's method

zero_crossings(y, *[, threshold, ...])

Find the zero-crossings of a signal y: indices i such that sign(y[i]) != sign(y[j]).

mu_compress(x, *[, mu, quantize])

mu-law compression

mu_expand(x, *[, mu, quantize])

mu-law expansion

Signal generation

clicks(*[, times, frames, sr, hop_length, ...])

Construct a "click track".

tone(frequency, *[, sr, length, duration, phi])

Construct a pure tone (cosine) signal at a given frequency.

chirp(*, fmin, fmax[, sr, length, duration, ...])

Construct a "chirp" or "sine-sweep" signal.

Spectral representations

stft(y, *[, n_fft, hop_length, win_length, ...])

Short-time Fourier transform (STFT).

istft(stft_matrix, *[, hop_length, ...])

Inverse short-time Fourier transform (ISTFT).

reassigned_spectrogram(y, *[, sr, S, n_fft, ...])

Time-frequency reassigned spectrogram.

cqt(y, *[, sr, hop_length, fmin, n_bins, ...])

Compute the constant-Q transform of an audio signal.

icqt(C, *[, sr, hop_length, fmin, ...])

Compute the inverse constant-Q transform.

hybrid_cqt(y, *[, sr, hop_length, fmin, ...])

Compute the hybrid constant-Q transform of an audio signal.

pseudo_cqt(y, *[, sr, hop_length, fmin, ...])

Compute the pseudo constant-Q transform of an audio signal.

vqt(y, *[, sr, hop_length, fmin, n_bins, ...])

Compute the variable-Q transform of an audio signal.

iirt(y, *[, sr, win_length, hop_length, ...])

Time-frequency representation using IIR filters

fmt(y, *[, t_min, n_fmt, kind, beta, ...])

The fast Mellin transform (FMT)

magphase(D, *[, power])

Separate a complex-valued spectrogram D into its magnitude (S) and phase (P) components, so that D = S * P.

Phase recovery

griffinlim(S, *[, n_iter, hop_length, ...])

Approximate magnitude spectrogram inversion using the "fast" Griffin-Lim algorithm.

griffinlim_cqt(C, *[, n_iter, sr, ...])

Approximate constant-Q magnitude spectrogram inversion using the "fast" Griffin-Lim algorithm.

Harmonics

interp_harmonics(x, *, freqs, harmonics[, ...])

Compute the energy at harmonics of time-frequency representation.

salience(S, *, freqs, harmonics[, weights, ...])

Harmonic salience function.

phase_vocoder(D, *, rate[, hop_length, n_fft])

Phase vocoder.

Magnitude scaling

amplitude_to_db(S, *[, ref, amin, top_db])

Convert an amplitude spectrogram to dB-scaled spectrogram.

db_to_amplitude(S_db, *[, ref])

Convert a dB-scaled spectrogram to an amplitude spectrogram.

power_to_db(S, *[, ref, amin, top_db])

Convert a power spectrogram (amplitude squared) to decibel (dB) units

db_to_power(S_db, *[, ref])

Convert a dB-scale spectrogram to a power spectrogram.

perceptual_weighting(S, frequencies, *[, kind])

Perceptual weighting of a power spectrogram.

frequency_weighting(frequencies, *[, kind])

Compute the weighting of a set of frequencies.

multi_frequency_weighting(frequencies, *[, ...])

Compute multiple weightings of a set of frequencies.

A_weighting(frequencies, *[, min_db])

Compute the A-weighting of a set of frequencies.

B_weighting(frequencies, *[, min_db])

Compute the B-weighting of a set of frequencies.

C_weighting(frequencies, *[, min_db])

Compute the C-weighting of a set of frequencies.

D_weighting(frequencies, *[, min_db])

Compute the D-weighting of a set of frequencies.

pcen(S, *[, sr, hop_length, gain, bias, ...])

Per-channel energy normalization (PCEN)

Time unit conversion

frames_to_samples(frames, *[, hop_length, n_fft])

Converts frame indices to audio sample indices.

frames_to_time(frames, *[, sr, hop_length, ...])

Converts frame counts to time (seconds).

samples_to_frames(samples, *[, hop_length, ...])

Converts sample indices into STFT frames.

samples_to_time(samples, *[, sr])

Convert sample indices to time (in seconds).

time_to_frames(times, *[, sr, hop_length, n_fft])

Converts time stamps into STFT frames.

time_to_samples(times, *[, sr])

Convert timestamps (in seconds) to sample indices.

blocks_to_frames(blocks, *, block_length)

Convert block indices to frame indices

blocks_to_samples(blocks, *, block_length, ...)

Convert block indices to sample indices

blocks_to_time(blocks, *, block_length, ...)

Convert block indices to time (in seconds)

Frequency unit conversion

hz_to_note(frequencies, **kwargs)

Convert one or more frequencies (in Hz) to the nearest note names.

hz_to_midi(frequencies)

Get MIDI note number(s) for given frequencies

hz_to_svara_h(frequencies, *, Sa[, abbr, ...])

Convert frequencies (in Hz) to Hindustani svara

hz_to_svara_c(frequencies, *, Sa, mela[, ...])

Convert frequencies (in Hz) to Carnatic svara

midi_to_hz(notes)

Get the frequency (Hz) of MIDI note(s)

midi_to_note(midi, *[, octave, cents, key, ...])

Convert one or more MIDI numbers to note strings.

midi_to_svara_h(midi, *, Sa[, abbr, octave, ...])

Convert MIDI numbers to Hindustani svara

midi_to_svara_c(midi, *, Sa, mela[, abbr, ...])

Convert MIDI numbers to Carnatic svara within a given melakarta raga

note_to_hz(note, **kwargs)

Convert one or more note names to frequency (Hz)

note_to_midi(note, *[, round_midi])

Convert one or more spelled notes to MIDI number(s).

note_to_svara_h(notes, *, Sa[, abbr, ...])

Convert western notes to Hindustani svara

note_to_svara_c(notes, *, Sa, mela[, abbr, ...])

Convert western notes to Carnatic svara

hz_to_mel(frequencies, *[, htk])

Convert Hz to Mels

hz_to_octs(frequencies, *[, tuning, ...])

Convert frequencies (Hz) to (fractional) octave numbers.

mel_to_hz(mels, *[, htk])

Convert mel bin numbers to frequencies

octs_to_hz(octs, *[, tuning, bins_per_octave])

Convert octaves numbers to frequencies.

A4_to_tuning(A4, *[, bins_per_octave])

Convert a reference pitch frequency (e.g., A4=435) to a tuning estimation, in fractions of a bin per octave.

tuning_to_A4(tuning, *[, bins_per_octave])

Convert a tuning deviation (from 0) in fractions of a bin per octave (e.g., tuning=-0.1) to a reference pitch frequency relative to A440.

Music notation

key_to_notes(key, *[, unicode])

Lists all 12 note names in the chromatic scale, as spelled according to a given key (major or minor).

key_to_degrees(key)

Construct the diatonic scale degrees for a given key.

mela_to_svara(mela, *[, abbr, unicode])

Spell the Carnatic svara names for a given melakarta raga

mela_to_degrees(mela)

Construct the svara indices (degrees) for a given melakarta raga

thaat_to_degrees(thaat)

Construct the svara indices (degrees) for a given thaat

list_mela()

List melakarta ragas by name and index.

list_thaat()

List supported thaats by name.

Frequency range generation

fft_frequencies(*[, sr, n_fft])

Alternative implementation of np.fft.fftfreq

cqt_frequencies(n_bins, *, fmin[, ...])

Compute the center frequencies of Constant-Q bins.

mel_frequencies([n_mels, fmin, fmax, htk])

Compute an array of acoustic frequencies tuned to the mel scale.

tempo_frequencies(n_bins, *[, hop_length, sr])

Compute the frequencies (in beats per minute) corresponding to an onset auto-correlation or tempogram matrix.

fourier_tempo_frequencies(*[, sr, ...])

Compute the frequencies (in beats per minute) corresponding to a Fourier tempogram matrix.

Pitch and tuning

pyin(y, *, fmin, fmax[, sr, frame_length, ...])

Fundamental frequency (F0) estimation using probabilistic YIN (pYIN).

yin(y, *, fmin, fmax[, sr, frame_length, ...])

Fundamental frequency (F0) estimation using the YIN algorithm.

estimate_tuning(*[, y, sr, S, n_fft, ...])

Estimate the tuning of an audio time series or spectrogram input.

pitch_tuning(frequencies, *[, resolution, ...])

Given a collection of pitches, estimate its tuning offset (in fractions of a bin) relative to A440=440.0Hz.

piptrack(*[, y, sr, S, n_fft, hop_length, ...])

Pitch tracking on thresholded parabolically-interpolated STFT.

Miscellaneous

samples_like(X, *[, hop_length, n_fft, axis])

Return an array of sample indices to match the time axis from a feature matrix.

times_like(X, *[, sr, hop_length, n_fft, axis])

Return an array of time values to match the time axis from a feature matrix.

get_fftlib()

Get the FFT library currently used by librosa

set_fftlib([lib])

Set the FFT library used by librosa.