Audio Data Augmentation with Spectrogram
In the previous chapter, we visualized the sound using the Waveform graph. An audio spectrogram is another visualizing method for seeing the audio components. The inputs to the Spectrogram are a one-dimensional array of amplitude values and the sampling rate. They are the same inputs as the Waveform graph.
An audio spectrogram is sometimes called a sonograph, sonogram, voiceprint, or voicegram. The Spectrogram is a more detailed representation of sound than the Waveform graph. It shows a correlation between frequency and amplitude (loudness) over time, which helps visualize the frequency content in a signal. Spectrograms make it easier to identify musical elements, detect melodic patterns, recognize frequency-based effects, and compare the results of different volume settings. Additionally, the Spectrogram can be more helpful in identifying non-musical aspects of a signal, such as noise and interference from other frequencies.