Deep Neural Audio

  1. Audio deep learning made simple by Ketan Doshi

    1. State-of-the-Art Techniquesarrow-up-right (What is sound and how it is digitized. What problems is audio deep learning solving in our daily lives. What are Spectrograms and why they are all-important.)

    2. Why Mel Spectrograms perform betterarrow-up-right (Processing audio data in Python. What are Mel Spectrograms and how to generate them)

    3. Data Preparation and Augmentationarrow-up-right (Enhance Spectrograms features for optimal performance by hyper-parameter tuning and data augmentation)

    4. Sound Classificationarrow-up-right (End-to-end example and architecture to classify ordinary sounds. Foundational application for a range of scenarios.)

    5. Beam Searcharrow-up-right (Algorithm commonly used by Speech-to-Text and NLP applications to enhance predictions)

  2. Wav2Vec - paper Youtubearrow-up-right

  3. Whisperarrow-up-right

    1. ZACarrow-up-right (Zero-shot Audio Classification using Whisper) allows you to assign audio files to ANY class you want without training.

  4. Neural Amp Modelerarrow-up-right - is an open-source project that uses deep learning to create models of guitar amplifiers and pedals with state-of-the-art accuracy. Training notebookarrow-up-right

Last updated

Was this helpful?