Speech Libraries - Page 10

react-speech-kit by MikeyParton

JavaScript 168 Version: Current
License: No License (No License)
Updated: 2 y ago

React hooks for Speech Recognition and Speech Synthesis

TensorVox by ZDisket

C++ 168 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Desktop application for neural speech synthesis written in C++

Bing-GPT-Voice-Assistant by Ai-Austin

Python 168 Version: Current
License: No License (No License)
Updated: 2 y ago

A Python voice assistant that takes two different wake words: one prompts Bing AI using EdgeGPT, and the other prompts the GPT-3.5-Turbo API.

python-speech-recognition by realpython

Python 167 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

Speech Recognition with Python examples

gst-deepspeech by Elleo

C++ 166 Version: Current
License: Proprietary (Proprietary)
Updated: 2 y ago

NOTE: This plugin is now deprecated in favour of the coqui-stt branch in gst-plugins-bad: https://gitlab.freedesktop.org/philn/gstreamer/-/tree/coqui-stt/subprojects/gst-plugins-bad/ext/coqui

TTS_TFLite by tulasiram58827

Jupyter Notebook 166 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

This repository is a collection of TTS Models in TFLite

tacotron_asr by Kyubyong

Python 165 Version: Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

Speech Recognition Using Tacotron

noise_reduction by dodiku

HTML 165 Version: Current
License: No License (No License)
Updated: 2 y ago

Speech noise reduction using existing post-production techniques, implemented in Python

crank by k2kobayashi

Python 164 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

A toolkit for non-parallel voice conversion based on a vector-quantized variational autoencoder

Beamforming-for-speech-enhancement by AkojimaSLP

Python 164 Version: Current
License: No License (No License)
Updated: 2 y ago

Simple delay-and-sum, MVDR and CGMM-MVDR beamformers

cotatron by mindslab-ai

Python 164 Version: Current
License: Permissive (BSD-3-Clause)
Updated: 4 y ago

Official code for Cotatron @ INTERSPEECH 2020

norbert by sigsep

Python 164 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Painless Wiener filters for audio separation

SPTK by sp-nitech

C++ 164 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

A suite of speech signal processing tools

StyleSpeech by keonlee9420

Python 164 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

PyTorch Implementation of Meta-StyleSpeech: Multi-Speaker Adaptive Text-to-Speech Generation

voice_chatgpt by nickbild

Python 164 Version: Current
License: No License (No License)
Updated: 2 y ago

VoiceGPT is a voice assistant that leverages the powerful ChatGPT chatbot to answer your questions.

myprosody by Shahabks

Python 163 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech.

source_separation by AppleHolic

Python 163 Version: Current
License: Permissive (Apache-2.0)
Updated: 5 y ago

Deep-learning-based speech source separation using PyTorch

speech-router by lukasolson

JavaScript 163 Version: Current
License: Permissive (MIT)
Updated: 5 y ago

A way to utilize Chrome's speech recognition APIs to perform actions when specific text is heard.

kaldi-tuda-de by uhh-lt

Shell 163 Version: Current
License: Permissive (Apache-2.0)
Updated: 3 y ago

Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.

aimybox-android-assistant by just-ai

Kotlin 163 Version: Current
License: Permissive (Apache-2.0)
Updated: 3 y ago

Embeddable custom voice assistant for Android applications

LiveWhisper by Nikorasu

Python 163 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install.

pykaldi2 by jzlianglu

Python 162 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

Yet another speech toolkit based on Kaldi and PyTorch

python-pesq by ludlows

C 162 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)

Uni-SVC by PlayVoice

Python 162 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Uni-SVC, based on Whisper, for singing voice conversion and singing voice cloning. LoRA for SVC.

FullSubNet-plus by RookieJunChen

Python 162 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

Talkify by Hagsten

JavaScript 160 Version: Current
License: No License (No License)
Updated: 3 y ago

JavaScript text-to-speech library

electron-speech by noffle

JavaScript 160 Version: Current
License: No License (No License)
Updated: 4 y ago

Easy speech recognition in Node!

electron-speech by hackergrrl

JavaScript 160 Version: Current
License: No License (No License)
Updated: 4 y ago

Easy speech recognition in Node!

HateSonar by Hironsan

Jupyter Notebook 160 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Hate Speech Detection Library for Python.

SiFiGAN by chomeyama

Python 159 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Official implementation of the source-filter HiFiGAN vocoder

voicetools by namco1992

Python 158 Version: Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

All-in-one voice processing library

ClovaCall by clovaai

Python 158 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

ClovaCall dataset and PyTorch LAS baseline code (Interspeech 2020)

rnn-transducer by ZhengkunTian

Python 157 Version: Current
License: No License (No License)
Updated: 4 y ago

A PyTorch Implementation of Transducer Model for End-to-End Speech Recognition

speech-to-text by akras14

Python 157 Version: Current
License: No License (No License)
Updated: 2 y ago

Example of transcribing an audio file (speech) to text with the Google Cloud Speech API and Python

py-kaldi-asr by gooofy

C++ 157 Version: Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.

Voice_Activity_Detector by eesungkim

Jupyter Notebook 157 Version: Current
License: No License (No License)
Updated: 2 y ago

Statistical model-based voice activity detection

Speech_Enhancement_DNN_NMF by eesungkim

Python 156 Version: Current
License: No License (No License)
Updated: 2 y ago

Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF

code-by-voice by simianhacker

Python 156 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

All the support files for my code-by-voice setup using Dragon NaturallySpeaking and Dragonfly

gpt-voice-conversation-chatbot by Adri6336

Python 156 Version: Current
License: Strong Copyleft (GPL-3.0)
Updated: 2 y ago

Allows you to have an engaging and safely emotive spoken / CLI conversation with the AI ChatGPT / GPT-4 while giving you the option to let it remember things discussed.

conv-tasnet by funcwj

Python 153 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation"

speechT by louiskirsch

Python 153 Version: Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

Open-source speech-to-text software written in TensorFlow

TLSphinx by tryolabs

C++ 153 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

Swift wrapper around Pocketsphinx

DiffSinger by keonlee9420

Python 152 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

PyTorch implementation of DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (focused on DiffSpeech)

MsEdgeTTS by Migushthe2nd

TypeScript 152 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

A simple Azure Speech Service module that uses the Microsoft Edge Read Aloud API

Chinese-automatic-speech-recognition by chenmingxiang110

Jupyter Notebook 152 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Chinese speech recognition

JARVIS-ChatGPT by gia-guar

Python 152 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

A Conversational Assistant equipped with synthetic voices including J.A.R.V.I.S's. Powered by OpenAI and IBM Watson APIs and a Tacotron model for voice generation.

mitzuli by artetxem

C 151 Version: Current
License: Strong Copyleft (GPL-2.0)
Updated: 4 y ago

The open, easy-to-use and powerful translator app for Android

muavic by facebookresearch

Python 151 Version: Current
License: Proprietary (Proprietary)
Updated: 2 y ago

MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation

jPTDP by datquocnguyen

Python 150 Version: Current
License: Proprietary (Proprietary)
Updated: 4 y ago

Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)

VoiceSplit by Edresson

Python 150 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
