Speech-To-Text engine based on Baidu's Deep Speech

Installation

Download the pre-trained model (1.8GB):

wget https://github.com/mozilla/DeepSpeech/releases/download/v0.4.1/deepspeech-0.4.1-models.tar.gz
tar xvfz deepspeech-0.4.1-models.tar.gz

Move the contents of file to /models

Build the docker file :

$ docker build .

Run the docker container:

$ docker run -d -p 3001:3000 {id_of_container}

Run API

Now you can use postman or similar tools to upload the audio/video file to the service. Upload audio/video file to http://localhost:3001/ using form-data file with the data key name and wait for the json result.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
Readme.md		Readme.md
app.js		app.js
index.js		index.js
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Speech-To-Text engine based on Baidu's Deep Speech

Installation

Run API

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Speech-To-Text engine based on Baidu's Deep Speech

Installation

Run API

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages