Standard audio augmentation techniques
Similar to image augmentation in Chapter 3, various audio libraries provide many more functions than are necessary for augmentation. Therefore, we will only cover some of the methods available in the chosen audio library.
In image augmentation, the term safe level is defined as not altering or distorting the original image beyond an acceptable level. There is no standard terminology for deforming the original audio signal beyond a permissible point. Thus, the term safe or true will be used interchangeably to denote a limit point for the audio signal.
Here is a thought experiment: all audio files are represented as numbers in time series format. Thus, can you create a statistically valid augmentation method that does not consider human hearing perception? In other words, use math to manipulate a statistically valid number array, but never listen to the before and after effects. After all, audio augmentation aims to have more...