compute_mel_spectrogram

Compute mel spectrogram (S3Gen compatible)

Description

Compute mel spectrogram (S3Gen compatible)

Usage

compute_mel_spectrogram(
  y,
  n_fft = 1920,
  n_mels = 80,
  sr = 24000,
  hop_size = 480,
  win_size = 1920,
  fmin = 0,
  fmax = 8000,
  center = FALSE
)

Arguments

  • y: Audio samples as torch tensor or numeric vector
  • n_fft: FFT size (default 1920 for 24kHz)
  • n_mels: Number of mel bins (default 80)
  • sr: Sample rate (default 24000)
  • hop_size: Hop size (default 480)
  • win_size: Window size (default 1920)
  • fmin: Minimum frequency (default 0)
  • fmax: Maximum frequency (default 8000)
  • center: Whether to center frames (default FALSE)

Value

Mel spectrogram tensor (batch, n_mels, time)