Caution
You're reading an old version of this documentation. If you want up-to-date information, please have a look at 0.9.1.
librosa.core.load¶
- librosa.core.load(path, sr=22050, mono=True, offset=0.0, duration=None, dtype=<class 'numpy.float32'>, res_type='kaiser_best')[source]¶
Load an audio file as a floating point time series.
Audio will be automatically resampled to the given rate (default sr=22050).
To preserve the native sampling rate of the file, use sr=None.
- Parameters
- pathstring, int, pathlib.Path or file-like object
path to the input file.
Any codec supported by
soundfileoraudioreadwill work.Any string file paths, or any object implementing Python’s file interface (e.g.
pathlib.Path) are supported as path.If the codec is supported by
soundfile, then path can also be an open file descriptor (int).- srnumber > 0 [scalar]
target sampling rate
‘None’ uses the native sampling rate
- monobool
convert signal to mono
- offsetfloat
start reading after this time (in seconds)
- durationfloat
only load up to this much audio (in seconds)
- dtypenumeric type
data type of y
- res_typestr
resample type (see note)
Note
By default, this uses
resampy’s high-quality mode (‘kaiser_best’).For alternative resampling modes, see
resampleNote
audioreadmay truncate the precision of the audio data to 16 bits.See https://librosa.github.io/librosa/ioformats.html for alternate loading methods.
- Returns
- ynp.ndarray [shape=(n,) or (2, n)]
audio time series
- srnumber > 0 [scalar]
sampling rate of y
Examples
>>> # Load an ogg vorbis file >>> filename = librosa.util.example_audio_file() >>> y, sr = librosa.load(filename) >>> y array([ -4.756e-06, -6.020e-06, ..., -1.040e-06, 0.000e+00], dtype=float32) >>> sr 22050
>>> # Load a file and resample to 11 KHz >>> filename = librosa.util.example_audio_file() >>> y, sr = librosa.load(filename, sr=11025) >>> y array([ -2.077e-06, -2.928e-06, ..., -4.395e-06, 0.000e+00], dtype=float32) >>> sr 11025
>>> # Load 5 seconds of a file, starting 15 seconds in >>> filename = librosa.util.example_audio_file() >>> y, sr = librosa.load(filename, offset=15.0, duration=5.0) >>> y array([ 0.069, 0.1 , ..., -0.101, 0. ], dtype=float32) >>> sr 22050