Speaker Identification in Multispeaker Environment using Deep Neural Networks

Abstract

Human beings are capable of performing unfathomable tasks. A human being is able to focus on a single person’s voice in an environment of simultaneous conversations. We have tried to emulate this particular skill through an artificial intelligence system. Our system identifies an audio file as a single or multi-speaker file as the first step and then recognizes the speaker(s). Our approach towards the desired solution was to first conduct pre-processing of the audio (input) file where it is subjected to reduction and silence removal, framing, windowing and DCT calculation, all of which is used to extract its features. Mel Frequency Cepstral Coefficients (MFCC) technique was used for feature extraction. The extracted features are then used to train the system via neural networks using the Error Back Propagation Training Algorithm (EBPTA). One of the many applications of our model is in biometric systems such as telephone banking, authentication and surveillance.

Keywords: Speaker identification, neural network, Multi- Speaker, Mel Frequency Cepstral Coefficients (MFCC).

Research Paper published in Springer Journal.

For more details: download file ResearchPaper.pdf, projectreport

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.idea		.idea
BackPropogationAlgo		BackPropogationAlgo
DeepNeuralNetPython		DeepNeuralNetPython
Error Back Propogation Training Algorithm		Error Back Propogation Training Algorithm
mfcc		mfcc
.gitignore		.gitignore
LICENSE		LICENSE
ProjectReport.pdf		ProjectReport.pdf
Research paper.pdf		Research paper.pdf
english.wav		english.wav
readme.md		readme.md
readme.pdf		readme.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speaker Identification in Multispeaker Environment using Deep Neural Networks

About

Releases

Packages

Languages

License

huangyiting111/speakerIdentificationNeuralNetworks

Folders and files

Latest commit

History

Repository files navigation

Speaker Identification in Multispeaker Environment using Deep Neural Networks

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages