Compute mel spectrogram (S3Gen compatible)
Description
Computes a mel spectrogram from audio samples using parameters compatible with S3Gen. The defaults target 24 kHz audio.
Usage
compute_mel_spectrogram(
y,
n_fft = 1920,
n_mels = 80,
sr = 24000,
hop_size = 480,
win_size = 1920,
fmin = 0,
fmax = 8000,
center = FALSE
)
Arguments
y: Audio samples as torch tensor or numeric vector
n_fft: FFT size (default 1920 for 24 kHz)
n_mels: Number of mel bins (default 80)
sr: Sample rate (default 24000)
hop_size: Hop size (default 480)
win_size: Window size (default 1920)
fmin: Minimum frequency (default 0)
fmax: Maximum frequency (default 8000)
center: Whether to center frames (default FALSE)
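The defaults imply a specific time resolution, which can be worked out from the sample rate, hop size, and window size alone. The sketch below is plain base R arithmetic on the values from the Usage block. The variable names are illustrative and not part of the function's interface, and the millisecond and Hz interpretations assume that hop_size and win_size are in samples and that sr, fmin, and fmax are in Hz.

```r
# Default parameters, copied from the Usage block
sr       <- 24000
hop_size <- 480
win_size <- 1920
fmax     <- 8000

# One frame is produced every hop_size samples
frames_per_second <- sr / hop_size

# Hop and analysis window durations in milliseconds
hop_ms    <- 1000 * hop_size / sr
window_ms <- 1000 * win_size / sr

# Fraction of each window shared with the next one
overlap <- 1 - hop_size / win_size

# fmax should not exceed the Nyquist frequency of the input
nyquist <- sr / 2
stopifnot(fmax <= nyquist)
```

With these defaults the frame rate is 50 frames per second, each window spans 80 ms, and consecutive windows overlap by 75%. If sr is changed, hop_size and win_size would likely need to be scaled to keep the same timing.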
Value
A mel spectrogram tensor with shape (batch, n_mels, time)
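A minimal usage sketch follows, assuming the package that provides compute_mel_spectrogram() is already attached. The test signal (one second of a 440 Hz sine) is made up for illustration, and the call is guarded so the snippet still runs when the function is not available. The length of the time dimension depends on how the function pads the input, which this page does not state, so no expected shape is shown.

```r
sr <- 24000

# One second of a 440 Hz sine wave as a plain numeric vector (illustrative input)
t <- (seq_len(sr) - 1) / sr
y <- 0.5 * sin(2 * pi * 440 * t)

# Guarded call: only runs if the function is available in the session
if (exists("compute_mel_spectrogram")) {
  mel <- compute_mel_spectrogram(y, sr = sr)
  # Per the Value section, the dimensions are (batch, n_mels, time)
  print(mel$shape)
}
```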