BingSpeechRecognition

About

This project includes two parts:

Text to Speech Conversion
Speech to Text Conversion

The main purpose of this project, however, is to make transcribing voice recordings easier by generating a well-organized file that lists each sentence in your voice recording files.

Requirements

Python 3
Speech API token of Microsoft Project Oxford
Bing Voice Recognition token

Tokens

Save your API keys and tokens in tokens_sample.py, then rename the file to tokens.py.

Text to Speech Conversion

In this part, we use the Speech API of Microsoft Project Oxford to synthesize speech from text input. You can subscribe to the free plan of the Speech APIs, which includes 5000 free API calls per month.

Usage: python synthesizer.py "Never gonna give you up, never gonna let you down."

Output: synthesized.wav
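
As a rough illustration of what happens to the text input before synthesis: speech synthesis services of this kind generally take the text wrapped in an SSML document. The sketch below only builds such a document and makes no network call. The function name build_ssml and the default voice, language, and gender values are assumptions for illustration, not taken from synthesizer.py.

```python
from xml.sax.saxutils import escape, quoteattr


def build_ssml(text, lang="en-US", gender="Female",
               voice="Microsoft Server Speech Text to Speech Voice (en-US, ZiraRUS)"):
    """Wrap plain text in a minimal SSML document.

    The defaults are assumptions for illustration; replace them with a
    language and voice that your subscription supports.
    """
    # escape() protects &, < and > in the spoken text;
    # quoteattr() escapes and quotes each attribute value.
    return (
        f"<speak version='1.0' xml:lang={quoteattr(lang)}>"
        f"<voice xml:lang={quoteattr(lang)} xml:gender={quoteattr(gender)} "
        f"name={quoteattr(voice)}>{escape(text)}</voice>"
        "</speak>"
    )
```

Calling build_ssml("Never gonna give you up") returns a single string that could serve as a request body. Escaping matters here because input such as "Tom & Jerry" would otherwise produce invalid XML.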

Speech to Text Recognition

For this part, you need a Bing Voice Recognition token. As with the Speech API, the free plan allows 5000 calls per month. You can then access your keys from this page.

Since the Bing Voice Recognition service can't recognize a whole voice recording file at once, we should split the file into multiple parts first. We suggest using Audacity for track editing.

Separate Voice Recording File

Open your file with Audacity and choose "Analyze → Silence Finder..."

Change the settings, then press "OK"

Click "File → Export Multiple..."

Check the export format and name the files according to the following settings, then export all files to the same directory
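
After exporting, it can help to confirm that no clip is still too long to send in one request. The following is a minimal sketch using only the Python standard library. The helper names and the max_seconds threshold are hypothetical; the source does not state the service's actual limit, so choose the value yourself.

```python
import wave
from pathlib import Path


def clip_seconds(path):
    """Return the duration of an uncompressed (PCM) .wav file in seconds."""
    with wave.open(str(path), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def long_clips(folder, max_seconds):
    """List (file name, duration) for clips longer than max_seconds.

    max_seconds is a caller-chosen threshold, not a documented limit.
    """
    found = []
    for path in sorted(Path(folder).glob("*.wav")):
        seconds = clip_seconds(path)
        if seconds > max_seconds:
            found.append((path.name, seconds))
    return found
```

Any clip reported by long_clips can be split again in Audacity by adjusting the Silence Finder settings.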

Usage

Put all .wav files in the same directory, then run python recognizer.py FOLDERNAME to generate the results. The script tries to recognize each sentence in your voice files.
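
Because each row of the output corresponds to one clip, the order in which the files are processed matters. The sketch below shows one way to gather the .wav files of a folder in numeric-aware order, so that a name ending in 2 comes before one ending in 10. It is an illustration only: the helper names are hypothetical, and recognizer.py may order files differently.

```python
import re
from pathlib import Path


def natural_key(name):
    """Split digit runs from text so that 'part-2' sorts before 'part-10'."""
    # re.split with a capture group alternates text and digit tokens,
    # so keys from different names stay comparable position by position.
    return [int(tok) if tok.isdigit() else tok.lower()
            for tok in re.split(r"(\d+)", name)]


def collect_wav_files(folder):
    """Return the .wav files in `folder` in natural (numeric-aware) order."""
    paths = [p for p in Path(folder).iterdir() if p.suffix.lower() == ".wav"]
    return sorted(paths, key=lambda p: natural_key(p.name))
```

A plain alphabetical sort would place part-10.wav before part-2.wav, which would scramble the sentence order unless the exported names are zero-padded.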

Results

The output of recognizer.py is a .csv file that includes all sentences in your voice recording files. Each row holds the recognition output for the corresponding voice recording. If recognition fails, the row records the reason and the error code instead.
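
To make the row layout concrete, here is a minimal sketch of writing such a file with Python's csv module. The column names file, text, error_code, and reason are hypothetical; the actual columns produced by recognizer.py may differ.

```python
import csv


def write_results(rows, out_file):
    """Write one CSV row per voice clip.

    Each item in `rows` is a dict using the hypothetical keys
    'file', 'text', 'error_code' and 'reason'; missing keys become
    empty cells, so successes and failures share one layout.
    """
    fields = ["file", "text", "error_code", "reason"]
    writer = csv.DictWriter(out_file, fieldnames=fields, restval="")
    writer.writeheader()
    for row in rows:
        writer.writerow(row)
```

When writing to disk, open the target with open("results.csv", "w", newline="", encoding="utf-8") so the csv module controls line endings and non-ASCII text is preserved.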

License

MIT License

