Need help: I am working on a voice identification system. What features of a human voice are distinct from person to person?

Vedant_Vasaikar · April 25, 2024, 1:56pm

Voice identification!

paulinpaloalto · April 25, 2024, 3:29pm

How much experience do you have with ML and DL? The way you phrased the question in the title sounds like a “callback” to the classic “feature engineering” approach that people used from the beginning of the computer era through maybe the late 1990s, before ML based on Neural Networks really flowered. ML has changed everything. Now you don’t have to figure out what the features are: you just create a neural network of the appropriate style and complexity, define your goals in terms of objective functions and then throw a massive amount of labelled training data at the network and let it figure out what the features are by learning through back propagation.

Voice recognition is a very well developed field at this point and there is a lot of literature out there about how to approach this and lots of existing products that you can use. So it depends on what your goals are here:

Are you trying to build an actual application and does it work for you to use someone else’s system as the basis and just put your application specific UI in front of it?

Or is this some kind of academic project and you have to build a system yourself to demonstrate that you understand the concepts?

Obviously those are totally different scenarios that are the endpoints of a large span and there are other points between those extremes.

If you’d like to get some sense of what commercial products are out there, just googling “voice recognition” will get you started. If you want to know more about the academic state of the art, start with Papers with Code and search for voice.

Vedant_Vasaikar · April 25, 2024, 3:51pm

Sir, I am just a beginner, a 20-year-old engineering student. Training a neural network on voice data would work, but the human voice changes with age, so it won’t be a perfect solution. What I am trying to do is extract features and store them. Then, at identification time, I take the user’s voice, extract the same features, and compare them with the stored ones. If they match above a certain threshold, access is allowed, and the stored features are replaced with the new ones after each successful identification of that user. (Features would be extracted for a fixed phrase such as “Hello my name is Vedant”.)
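A minimal sketch of the enroll, compare, and update loop described above may make the idea concrete. It assumes the features have already been extracted into a fixed-length list of numbers (the extraction step itself is not shown), and it uses cosine similarity as the comparison, which is only one possible choice. The names `VoiceTemplateStore`, `threshold`, and `update_rate` are hypothetical and chosen for illustration.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an all-zero vector carries no information
    return dot / (norm_a * norm_b)


class VoiceTemplateStore:
    """Hypothetical store of one feature template per user."""

    def __init__(self, threshold=0.8, update_rate=1.0):
        # threshold: minimum similarity needed to grant access (assumed value).
        # update_rate=1.0 replaces the stored template outright, as described
        # in the post; a smaller value would blend old and new (an assumption).
        self.threshold = threshold
        self.update_rate = update_rate
        self.templates = {}

    def enroll(self, user_id, features):
        """Store the features extracted from the enrollment phrase."""
        self.templates[user_id] = list(features)

    def verify(self, user_id, features):
        """Return True and refresh the template if the features match."""
        stored = self.templates.get(user_id)
        if stored is None:
            return False  # unknown user
        if cosine_similarity(stored, features) < self.threshold:
            return False  # failed attempts never modify the template
        r = self.update_rate
        self.templates[user_id] = [
            (1 - r) * s + r * f for s, f in zip(stored, features)
        ]
        return True
```

A possible usage: call `store.enroll("vedant", features)` once, then `store.verify("vedant", new_features)` on each access attempt. Updating only after a successful match lets the template follow slow changes in the voice, though any real threshold would have to be tuned on recorded data.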

paulinpaloalto · April 25, 2024, 4:34pm

Well, if your voice can change with age, then why can’t the features change also if you are thinking of this in terms of features?

I would say that changes to the inputs over time are an orthogonal problem. Your face changes over time also, but are there enough features that are retained that a face recognition algorithm would continue to work? It’s an interesting question and I don’t claim to know the answer, but I’ll bet we can find some papers about that.

My suggestion would be first to solve the base problem without considering the “evolution over time” issue. Then once you have a valid solution to the base problem, you can consider how to evaluate the behavior of your model over time. Of course this will only be possible if you have a dataset that has samples of given people’s voices that are also over time.

Your particular application sounds exactly analogous to one of the topics about Face Recognition that is covered here by Prof Andrew Ng in the Deep Learning Specialization Course 4 about Convolutional Nets. He has a section in Week 4 about Face Recognition that covers both the “one shot” case (“is this person at the door someone in my database”) and the general problem (“do these two pictures represent the same person even if it’s not someone from my database”). If you are just getting started on ML/DL, one good approach would be to take some of the courses offered here. If you’re starting from scratch, the normal path would be to take the Machine Learning Specialization first and then do the Deep Learning Specialization.
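To make the analogy concrete, here is a rough sketch of the two cases applied to voice. It assumes some trained network has already mapped each recording to a fixed-length embedding vector (that network is not shown), and it compares embeddings by Euclidean distance against a threshold. The function names and the threshold value are hypothetical.

```python
import math


def euclidean_distance(a, b):
    """Euclidean (L2) distance between two equal-length embeddings."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def same_speaker(emb_a, emb_b, threshold=0.5):
    """General case: do two recordings come from the same person?"""
    return euclidean_distance(emb_a, emb_b) < threshold


def identify(probe, database, threshold=0.5):
    """Database case: return the closest enrolled name, or None.

    `database` maps a name to a single stored embedding, so one
    enrollment sample per person is enough.
    """
    best_name, best_dist = None, float("inf")
    for name, emb in database.items():
        dist = euclidean_distance(probe, emb)
        if dist < best_dist:
            best_name, best_dist = name, dist
    # Reject the probe if even the nearest match is too far away.
    return best_name if best_dist < threshold else None
```

With this framing, adding a new person only means adding one embedding to `database`; nothing has to be retrained.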

Vedant_Vasaikar · April 25, 2024, 5:13pm

Thank you for the guidance, sir. I am currently doing the Machine Learning Specialization and will then continue with the Deep Learning Specialization. Also, can you suggest a website where I can find research papers and other people’s work on machine learning topics?

TMosh · April 25, 2024, 5:15pm

You will find an internet search to be very useful. There is no single website that provides all possible links to research topics.

paulinpaloalto · April 25, 2024, 5:17pm

Check the Papers with Code link that I gave you earlier, but as Tom says, the Internet is a big place and, as they say, “Google is your friend!”

nehasgh2014 · July 12, 2024, 2:21pm

@vedant: Google Scholar is one resource where you can find research papers.

Vedant_Vasaikar · July 14, 2024, 1:32pm

Ohh, I’ll check it out. Thank you!
