Deep Learning Pre-trained model for real-time audio classification

mistafo11 · October 26, 2024, 2:05pm

What is the best pre-trained deep learning model for a real-time audio classification system on a small single-board computer such as a Raspberry Pi 3? Added bonus if the model can be audio-to-audio, meaning the output depends on whether the audio passes the classification or not.

Nevermnd · October 26, 2024, 2:36pm

@mistafo11 it is not clear from your statement what you intend by ‘audio classification’, but I’ve played around a bit with Spotify’s ‘Basic Pitch’.

It is still not ‘real-time’, though; you would need a few more steps and perhaps a condensed model.

https://basicpitch.spotify.com/

If you mean voice (in English) then check out OpenAI’s Whisper.

https://openai.com/index/whisper/

That one is also not real-time, at least not ‘out of the box’.
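To give a rough idea of what the extra steps for ‘real-time’ might look like, here is a minimal sketch in plain Python: incoming audio is handled frame by frame, a classifier looks at the last few frames, and the newest frame is either passed through or replaced with silence (the ‘audio-to-audio’ idea from the question). Everything here is an assumption for illustration: `gate_stream`, `rms_classifier`, and the threshold are made-up names and values, and the loudness check is only a stand-in for a real pre-trained model.

```python
from collections import deque
from typing import Callable, Deque, Iterable, Iterator, List

Frame = List[float]  # one chunk of mono samples, assumed to be in [-1.0, 1.0]


def rms_classifier(window: List[float], threshold: float = 0.1) -> bool:
    """Hypothetical stand-in for a real model: accept windows that are loud enough."""
    if not window:
        return False
    rms = (sum(s * s for s in window) / len(window)) ** 0.5
    return rms >= threshold


def gate_stream(
    frames: Iterable[Frame],
    classify: Callable[[List[float]], bool],
    context_frames: int = 4,
) -> Iterator[Frame]:
    """Yield one output frame per input frame.

    The classifier sees the most recent `context_frames` frames. If it accepts
    them, the newest frame is passed through; otherwise silence of the same
    length is emitted.
    """
    history: Deque[Frame] = deque(maxlen=context_frames)
    for frame in frames:
        history.append(frame)
        # Flatten the recent frames into one analysis window.
        window = [sample for f in history for sample in f]
        if classify(window):
            yield list(frame)
        else:
            yield [0.0] * len(frame)
```

On a real device, `frames` would come from the capture device and `classify` would wrap whichever model you end up using; the model has to finish each window faster than one frame of audio arrives, which is where a condensed model helps.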

Nevermnd · October 28, 2024, 1:43pm

@mistafo11 though hmmm; News to me, so due caution:

mistafo11 · October 30, 2024, 8:34am

@Nevermnd thanks a lot for the reply!! I will check them out, play around with them, and see if it is possible.

Nevermnd · October 30, 2024, 9:16am

@mistafo11 I would just add that I have a RasPi 3 for a very similar project I was trying to work on, and if I remember correctly the built-in audio jack has no microphone input. It is ‘audio out’ only.

So you will need a cheap USB soundcard to get it to work. Almost any one will do, but I’d research a little and just check the chipset is supported by whatever variant of 'nix you’re running on it.
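One way to check that the soundcard was picked up is to look at the capture devices ALSA reports. The sketch below assumes the `arecord` tool from alsa-utils is installed and that `arecord -l` prints lines of the form `card N: Name [Description], device M: ...`. The function name `parse_capture_devices` and the sample text are made up for illustration, and the sample is not output captured from a real Pi.

```python
import re
from typing import List, Tuple

# Matches lines such as:
#   card 1: Device [USB Audio Device], device 0: USB Audio [USB Audio]
_CARD_LINE = re.compile(r"^card (\d+): (\S+) \[(.*?)\], device (\d+): ")

# Illustrative sample only (assumed format, not real captured output).
SAMPLE_OUTPUT = """**** List of CAPTURE Hardware Devices ****
card 1: Device [USB Audio Device], device 0: USB Audio [USB Audio]
  Subdevices: 1/1
  Subdevice #0: subdevice #0
"""


def parse_capture_devices(arecord_output: str) -> List[Tuple[int, int, str]]:
    """Return (card number, device number, description) for each capture device."""
    devices = []
    for line in arecord_output.splitlines():
        match = _CARD_LINE.match(line)
        if match:
            card, _name, description, device = match.groups()
            devices.append((int(card), int(device), description))
    return devices


# On the Pi itself, the text could be obtained with something like:
#   import subprocess
#   text = subprocess.run(["arecord", "-l"], capture_output=True, text=True).stdout
#   print(parse_capture_devices(text))
```

If the list comes back empty with the soundcard plugged in, the chipset is probably not supported by the installed kernel or drivers.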
