Speech Libraries - Page 2

WaveRNN by fatchord

Python 1953 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

WaveRNN Vocoder + TTS

gTTS by pndurette

Python 1886 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

Python library and CLI tool to interface with Google Translate's text-to-speech API

STT by coqui-ai

C++ 1886 Version:Current
License: Weak Copyleft (MPL-2.0)
Updated: 2 y ago

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

tacotron by Kyubyong

Python 1813 Version:Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

A TensorFlow Implementation of Tacotron: A Fully End-to-End Text-To-Speech Synthesis Model

VITS-fast-fine-tuning by Plachtaa

Python 1809 Version:Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

A VITS fine-tuning pipeline for fast speaker-adaptation TTS and many-to-many voice conversion

asteroid by asteroid-team

Python 1801 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

The PyTorch-based audio source separation toolkit for researchers

deepvoice3_pytorch by r9y9

Python 1777 Version:Current
License: Proprietary (Proprietary)
Updated: 2 y ago

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

masr by nobody132

Python 1708 Version:Current
License: No License (No License)
Updated: 2 y ago

Mandarin Automatic Speech Recognition

julius by julius-speech

C 1671 Version:Current
License: Permissive (BSD-3-Clause)
Updated: 2 y ago

Open-Source Large Vocabulary Continuous Speech Recognition Engine

project_alias by bjoernkarmann

Python 1648 Version:Current
License: Strong Copyleft (GPL-3.0)
Updated: 2 y ago

Alias is a teachable “parasite” that is designed to give users more control over their smart assistants, both when it comes to customisation and privacy. Through a simple app, the user can train Alias to react to a custom wake-word/sound, and once trained, Alias can take control over the user's home assistant by activating it for them.

kalliope by kalliope-project

Python 1622 Version:Current
License: Strong Copyleft (GPL-3.0)
Updated: 2 y ago

Kalliope is a framework that will help you to create your own personal assistant.

Digital_Life_Server by zixiiu

Python 1606 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

Yet another voice assistant, but alive.

pyttsx3 by nateshmbhat

Python 1571 Version:Current
License: Weak Copyleft (MPL-2.0)
Updated: 2 y ago

Offline text-to-speech synthesis for Python

delta by Delta-ML

Python 1549 Version:Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

DELTA is a deep learning based natural language and speech processing platform.

OpenSeq2Seq by NVIDIA

Python 1508 Version:Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP

DeepSpeech by PaddlePaddle

Python 1470 Version:Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

A Speech Toolkit based on PaddlePaddle.

DeepSpeechRecognition by audier

Python 1454 Version:Current
License: No License (No License)
Updated: 4 y ago

A Chinese deep speech recognition system, including a deep-learning-based acoustic model and a deep-learning-based language model

free-chatgpt-client-pub by akl7777777

JavaScript 1445 Version:Current
License: No License (No License)
Updated: 2 y ago

ShellGPT is a free ChatGPT client that now supports online search. No key and no login are needed. It offers automatic multi-node speed testing and switching, long-text translation with no word limit, and AI image generation.

say.js by Marak

JavaScript 1427 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

TTS (text to speech) for Node.js. Send text from Node.js to your speakers.

web-speech-api by mdn

JavaScript 1408 Version:Current
License: Permissive (CC0-1.0)
Updated: 2 y ago

A repository for demos illustrating features of the Web Speech API. See https://developer.mozilla.org/en-US/docs/Web/API/Web_Speech_API for more details.

sphinx4 by cmusphinx

Java 1350 Version:Current
License: Proprietary (Proprietary)
Updated: 2 y ago

Pure Java speech recognition library

voice-elements by zenorocha

HTML 1344 Version:Current
License: No License (No License)
Updated: 2 y ago

Web Component wrapper for the Web Speech API that allows you to do voice recognition and speech synthesis using Polymer

live-transcribe-speech-engine by google

Java 1327 Version:Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

Live Transcribe is an Android application that provides real-time captioning for people who are deaf or hard of hearing. This repository contains the Android client libraries for communicating with Google's Cloud Speech API that are used in Live Transcribe.

ParallelWaveGAN by kan-bayashi

Jupyter Notebook 1324 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with PyTorch

delta by didi

Python 1289 Version:Current
License: Permissive (Apache-2.0)
Updated: 5 y ago

DELTA is a deep learning based natural language and speech processing platform.

hifi-gan by jik876

Python 1266 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

merlin by CSTR-Edinburgh

Python 1260 Version:Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

This is now the official location of the Merlin project.

RHVoice by RHVoice

C++ 1255 Version:Current
License: Strong Copyleft (GPL-2.0)
Updated: 2 y ago

A free and open source speech synthesizer for Russian and other languages

FastSpeech2 by ming024

Python 1246 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

speak.js by kripken

C++ 1234 Version:Current
License: Strong Copyleft (GPL-3.0)
Updated: 2 y ago

Text-to-Speech in JavaScript using eSpeak

so-vits-svc-5.0 by PlayVoice

Python 1207 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

Core Engine of Singing Voice Conversion & Singing Voice Clone

pororo by kakaobrain

Python 1199 Version:Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

PORORO: Platform Of neuRal mOdels for natuRal language prOcessing

artyom.js by sdkcarlos

JavaScript 1165 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

A voice control, voice commands, speech recognition, and speech synthesis JavaScript library. Create your own Siri, Google Now, or Cortana with Google Chrome within your website.

dc_tts by Kyubyong

Python 1133 Version:Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

A TensorFlow Implementation of DC-TTS: yet another text-to-speech model

praat by praat

C 1117 Version:Current
License: No License (No License)
Updated: 2 y ago

Praat: Doing Phonetics By Computer

XZVoice by bawangxx

JavaScript 1117 Version:Current
License: No License (No License)
Updated: 2 y ago

Free and open source text-to-speech software

edge-tts by rany2

Python 1056 Version:Current
License: Strong Copyleft (GPL-3.0)
Updated: 2 y ago

Use Microsoft Edge's online text-to-speech service from Python (without needing Microsoft Edge/Windows or an API key)

SAM by s-macke

C 1054 Version:Current
License: No License (No License)
Updated: 2 y ago

Software Automatic Mouth - Tiny Speech Synthesizer

speech-to-text-nodejs by watson-developer-cloud

JavaScript 1050 Version:Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

Sample Node.js Application for the IBM Watson Speech to Text Service

svoice by facebookresearch

Python 1029 Version:Current
License: Proprietary (Proprietary)
Updated: 2 y ago

We provide a PyTorch implementation of the paper "Voice Separation with an Unknown Number of Multiple Speakers", in which we present a new method for separating a mixed audio sequence in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.

voice2json by synesthesiam

Python 1028 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

Command-line tools for speech and intent recognition on Linux

SincNet by mravanelli

Python 1017 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

SincNet is a neural architecture for efficiently processing raw audio samples.

World by mmorise

C++ 1017 Version:Current
License: Proprietary (Proprietary)
Updated: 2 y ago

A high-quality speech analysis, manipulation and synthesis system

kaldi-gstreamer-server by alumae

Python 1015 Version:Current
License: Permissive (BSD-2-Clause)
Updated: 2 y ago

Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framework.

NeuralSpeech by microsoft

Python 999 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

vall-e by lifeiteng

Python 996 Version:Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Montreal-Forced-Aligner by MontrealCorpusTools

Python 991 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

Command line utility for forced alignment using Kaldi

ffmpeg-normalize by slhck

Python 982 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

Audio Normalization for Python/ffmpeg

TransformerTTS by as-ideas

Python 974 Version:Current
License: Proprietary (Proprietary)
Updated: 2 y ago

🤖💬 Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.

nlp-paper by DengBoCong

Python 960 Version:Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

Papers in the field of natural language processing (with reading notes), reproduced models, data processing, and more (code in both TensorFlow and PyTorch versions)



