Skip to content

Visualization

Requirements#

Requires the audio-dataset-converter-visualization library.

Plugins#

For the following examples, data from the LJ Speech Dataset was used.

Mel spectrogram#

to-mel-spectrogram - outputs Mel spectrogram images

adc-convert \
  -l INFO \
    from-data \
      -l INFO \
      -i "./input/*.wav" \
      -t sp \
    to-mel-spectrogram \
      -l INFO \
      -o ./output

Mel spectrogram example plot

MFCC spectrogram#

to-mfcc-spectrogram - outputs Mel-frequency cepstral coefficients images

adc-convert \
  -l INFO \
    from-data \
      -l INFO \
      -i "./input/*.wav" \
      -t sp \
    to-mfcc-spectrogram \
      -l INFO \
      -o ./output

MFCC spectrogram example plot

STFT spectrogram#

to-stft-spectrogram - outputs short time fourier transform (STFT) spectrogram images

adc-convert \
  -l INFO \
    from-data \
      -l INFO \
      -i "./input/*.wav" \
      -t sp \
    to-stft-spectrogram \
      -l INFO \
      -o ./output

STFT spectrogram example plot