Development of Speech Recognition AI Project with Python

Methodology

Working on the Speech Recognition Python Project. Design and Development of Speech Recognition AI Project with Python Source code, report, and ppt using NLP, PLP, and Deep Neural Networks.

Speak– The assistant will speak the following introduction, the output, and the following things according to which good is given. It will use the laptop microphone to hear the input from the user and later recognize the voice said by the user and match the code words and if anything matches it will show the output.

Wish Me-The assistant will speak the Message included in the introduction even if it will wish the morning afternoon and even the evening depending upon the real-time based scenario. It will wish the morning from 04HH to 11HH 59MM. It will wish the afternoon from 12HH to 17HH 59MM. It will wish the evening from 18HH to 03HH 59MM.

Take Command– The assistant will take microphone(speech) input from the user and returns string output. It will be sub-divide into many different parts as described below. Listening-The assistant will open the microphone and try to hear what the user wants to convey to it.

Recognizing– The assistant will try to recognize the input spoken by the user and then check the code whether the word that is recognized by the assistant is there or not if the input matches it will show the output otherwise it will speak “Say that again please” this line which means to give the input again by the user. If the word is correctly recognized, it will follow the instructions assigned to it.

Wikipedia– If the word is recognized as “Wikipedia” it will search Wikipedia according to the input given by the user. E.g. if we say Narendra Modi Wikipedia so the assistant will speak “searching Wikipedia Narendra Modi” and then after it “According to Wikipedia…” and the details of that particular person. Youtube- If the word is recognized as “YouTube”, it will open the internet explorer and directly start opening the default web browser by the link “youtube.com”.

Google– If the word is recognized as “Google”, it will open the internet explorer and directly start opening the Google by the link “google.com”.

Train Information– If the word is recognized as “Train info”. It will fetch the detail from a CSV file and returns the detail of all the train and display them on the terminal. Stack Overflow- If the word is recognized as “Stack Over Flow” it will open the internet explorer and directly start opening the Stack Over Flow website by the link “stackoverflow.com”.

Play Music– If the word is recognized as “Play Music” it will search the .mp3 or .mp4 file in the default path of the device that is provided by the programmer in the programming. E.g. if we say Play Music so the assistant will search in the path like “D:\\Non Critical\\songs\\Favourite Songs2” and it will play that particular song. The Time- If the word is recognized as “The Time” it will check the real-time from the device and speak the same in terms of “HH:MM: SS”. E.g. if we say the time so the assistant will check the time and if the time is 08:14:21 P.M. it will speak “Sir, the time is 20HH:14MM:21SS”.

Open Code– If the word is recognized as “Open Code” it will search the .java or .py file in the default path of the device that is provided by the programmer in the programming. E.g. if we say Open Code so the assistant will search in the path like “C:\\Users\\XYZ\\AppData\\Local\\Programs\\project.py” and it will open the code. Stop- If the word is recognized as “Stop” it will speak “Quitting sir thanks for your time” and the code terminates.

Code-Snippet

Speech Recognition Project Coding

Algorithms used in Speech Recognition

NLP (Natural Language Processing) & Tokenization
PLP
Deep Neural Networks
Discrimination training
WFST Frameworks etc;

The following must be installed-:

1. sudo pip install SpeechRecognition.
2. Sudo apt-get installs python-pyaudio python3-pyaudio or pip install pyaudio.
This is the most important module in your project as it provides the main functionality in our project to convert speech into text.

Future Scope

This specific area of AI ends up being productive in each specialized field. We have additionally actualized this to show how it is valuable in various fields as we have made a little undertaking to exhibit its use in various documented, for example, railroad, looking through feed and so on; Like PCs began to play chess better than human, speech recognition before long will be improved by PCs as well. Critically, that will include some significant information about nature in general and the human mind specifically. So speech recognition is a significant advance in our investigation of natural laws. Our venture can be utilized by railroads and another center point to show distinctive data utilizing speech recognition.