Speech Libraries - Page 20

easy-kaldiby JRMeyer

Shell 54 Version:Current
License: Permissive (Apache-2.0)

Use your data to create a speech recognition system in Kaldi. Fast.

Support

Quality

Security

License

Reuse

tap-pluginsby tomszilagyi

C 54 Version:Current
License: Strong Copyleft (GPL-2.0)

Tom's Audio Processing LADSPA plugins

Support

Quality

Security

License

Reuse

rsrganby wangkenpu

Shell 54 Version:Current
License: Permissive (MIT)

Robust Speech Recognition Using Generative Adversarial Networks (GAN)

Support

Quality

Security

License

Reuse

iOS-Speech-To-Textby mzeeshanid

C 54 Version:Current
License: Proprietary (Proprietary)

This library use the Google Voice API and the Speex audio codec for speech-to-text on iOS

Support

Quality

Security

License

Reuse

ASR-Wav2vec-Finetuneby khanld

Python 54 Version:Current
License: No License (No License)

:zap: Finetune Wa2vec 2.0 For Speech Recognition

Support

Quality

Security

License

Reuse

pieby baidubce

Java 53 Version:Current
License: No License (No License)

百度云流式语音识别客户端 SDK

Support

Quality

Security

License

Reuse

MAX-Speech-to-Text-Converterby IBM

Python 53 Version:Current
License: Permissive (Apache-2.0)

Converts spoken words into text form.

Support

Quality

Security

License

Reuse

Co-Speech_Gesture_Generationby youngwoo-yoon

Python 53 Version:Current
License: Proprietary (Proprietary)

This is an implementation of Robots learn social skills: End-to-end learning of co-speech gesture generation for humanoid robots.

Support

Quality

Security

License

Reuse

talkieby joelpurra

TypeScript 53 Version:Current
License: Strong Copyleft (GPL-3.0)

Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.

Support

Quality

Security

License

Reuse

Unity-Text-to-Speechby ActiveNick

C# 53 Version:Current
License: Permissive (MIT)

Sample app used to demonstrate the use of Microsoft Cognitive Services Text-to-Speech APIs (aka Speech Synthesis) from within Unity.

Support

Quality

Security

License

Reuse

EA-SVCby hhguo

Python 53 Version:Current
License: Permissive (MIT)

An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"

Support

Quality

Security

License

Reuse

silent_speechby dgaddy

Python 53 Version:Current
License: No License (No License)

Code for the papers "Digital Voicing of Silent Speech" at EMNLP 2020 and "An Improved Model for Voicing Silent Speech" at ACL 2021.

Support

Quality

Security

License

Reuse

orangetextby hrbrmstr

R 53 Version:Current
License: No License (No License)

🍊📄 : An #rstats project to keep track of The 🍊 One's speeches

Support

Quality

Security

License

Reuse

Reverb.jsby burnson

HTML 53 Version:Current
License: Proprietary (Proprietary)

Reverb.js is a Web Audio API extension for creating reverb nodes and an accompanying impulse-response reverb library.

Support

Quality

Security

License

Reuse

Inter-SubNetby RookieJunChen

Python 53 Version:Current
License: Permissive (Apache-2.0)

The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.

Support

Quality

Security

License

Reuse

Speech_Recognitionby drbinliang

Python 52 Version:Current
License: No License (No License)

A simple speech recognition using HMM (python)

Support

Quality

Security

License

Reuse

Speech-Accent-Recognitionby yatharthgarg

Python 52 Version:Current
License: No License (No License)

Support

Quality

Security

License

Reuse

pkwrapby idiap

Python 52 Version:Current
License: Proprietary (Proprietary)

A pytorch wrapper for LF-MMI training and parallel training in Kaldi

Support

Quality

Security

License

Reuse

text-to-speech-jsby IonicaBizau

JavaScript 52 Version:Current
License: Permissive (MIT)

:v: A small JavaScript library that provides a text to speech conversion using tts-api.com service.

Support

Quality

Security

License

Reuse

Unity-MS-SpeechSDKby ActiveNick

C# 52 Version:Current
License: Permissive (MIT)

Sample Unity project used to demonstrate Speech Recognition using the new Microsoft Speech Service (Preview) via WebSockets.

Support

Quality

Security

License

Reuse

WG-WaveNetby BogiHsu

Python 52 Version:Current
License: Permissive (MIT)

Real-Time High-Fidelity Speech Synthesis without GPU

Support

Quality

Security

License

Reuse

klangsyntheseby 200sc

Go 52 Version:Current
License: Permissive (Apache-2.0)

Waveform and Audio Synthesis library in Go

Support

Quality

Security

License

Reuse

multi-task-kaldiby JRMeyer

Shell 52 Version:Current
License: Permissive (Apache-2.0)

An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 training.

Support

Quality

Security

License

Reuse

cs224n-gpu-that-talksby akashmjn

Jupyter Notebook 52 Version:Current
License: No License (No License)

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Support

Quality

Security

License

Reuse

VoiceGANby Yolanda-Gao

Jupyter Notebook 52 Version:Current
License: No License (No License)

These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB

Support

Quality

Security

License

Reuse

spot-cpp-sdkby boston-dynamics

C++ 52 Version:Current
License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

McNetby Audio-WestlakeU

Python 52 Version:Current
License: No License (No License)

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement" submitted to ICASSP 2023

Support

Quality

Security

License

Reuse

GANSynthby skmhrk1209

Python 51 Version:Current
License: No License (No License)

TensorFlow implementation of "GANSynth: Adversarial Neural Audio Synthesis"

Support

Quality

Security

License

Reuse

alex-asrby UFAL-DSG

Python 51 Version:Current
License: Proprietary (Proprietary)

Online decoder for Kaldi NNET2 and GMM speech recognition models with Python bindings.

Support

Quality

Security

License

Reuse

mxnet-seq2seqby yoosan

Python 51 Version:Current
License: No License (No License)

Sequence to sequence learning with MXNET

Support

Quality

Security

License

Reuse

obviby googlecreativelab

JavaScript 51 Version:Current
License: Permissive (Apache-2.0)

A Polymer 3+ webcomponent / button for doing speech recognition

Support

Quality

Security

License

Reuse

sova-tts-engineby sovaai

Python 51 Version:Current
License: Permissive (Apache-2.0)

Tacotron2 based engine for the SOVA-TTS project

Support

Quality

Security

License

Reuse

voice-commandby PRFTDigitalLabs

JavaScript 51 Version:Current
License: No License (No License)

A simple no-API voice command assitant

Support

Quality

Security

License

Reuse

ion-avpby pion

Go 51 Version:Current
License: Permissive (MIT)

Audio/Video Processing Service

Support

Quality

Security

License

Reuse

DISSCby gallilmaimon

Python 51 Version:Current
License: Permissive (MIT)

Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units". https://arxiv.org/abs/2212.09730

Support

Quality

Security

License

Reuse

Synthalinguaby cyberofficial

Python 51 Version:Current
License: Strong Copyleft (GPL-3.0)

Synthalingua - Real Time Translation

Support

Quality

Security

License

Reuse

turkish-pos-taggerby onuryilmaz

Python 50 Version:Current
License: Permissive (Apache-2.0)

Part-of-Speech (POS) Tagger for Turkish

Support

Quality

Security

License

Reuse

deep_avsrby lordmartian

Python 50 Version:Current
License: Permissive (MIT)

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Support

Quality

Security

License

Reuse

voice-commands.jsby jimmybyrum

HTML 50 Version:Current
License: Permissive (MIT)

Simple wrapper for Javascript Speech-to-text to add voice commands.

Support

Quality

Security

License

Reuse

Emovoxby KunZhou9646

Python 50 Version:Current
License: No License (No License)

This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".

Support

Quality

Security

License

Reuse

VITSby zassou65535

Python 50 Version:Current
License: Permissive (MIT)

VITSによるテキスト読み上げ器&ボイスチェンジャー

Support

Quality

Security

License

Reuse

RuntimeSpeechRecognizerby gtreshchev

C++ 50 Version:Current
License: Permissive (MIT)

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.

Support

Quality

Security

License

Reuse

vocosby charactr-platform

Python 50 Version:Current
License: Permissive (MIT)

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Support

Quality

Security

License

Reuse

ncnn-android-mobilenetssdby nihui

Java 49 Version:Current
License: No License (No License)

The mobilenetssd object detection android example

Support

Quality

Security

License

Reuse

keras-sincnetby grausof

Python 49 Version:Current
License: No License (No License)

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Support

Quality

Security

License

Reuse

clari_wavenet_vocoderby HaiFengZeng

Python 49 Version:Current
License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

RNN-Transducerby sooftware

Python 49 Version:Current
License: Permissive (Apache-2.0)

PyTorch implementation of RNN-Transducer(RNN-T).

Support

Quality

Security

License

Reuse

LAS_Mandarin_PyTorchby jackaduma

Python 49 Version:Current
License: Permissive (MIT)

Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)

Support

Quality

Security

License

Reuse

vo-aacencby mstorsjo

C 49 Version:Current
License: Permissive (Apache-2.0)

VisualOn AAC encoder from Android

Support

Quality

Security

License

Reuse

hey-victoriaby sk89q

C 49 Version:Current
License: No License (No License)

TeamSpeak bot w/ speech recognition (like Siri, OK Google, Cortana, etc.)

Support

Quality

Security

License

Reuse

easy-kaldiby JRMeyer

Use your data to create a speech recognition system in Kaldi. Fast.

Shell

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

tap-pluginsby tomszilagyi

Tom's Audio Processing LADSPA plugins

Updated: 4 y ago

License: Strong Copyleft (GPL-2.0)

Support

Quality

Security

License

Reuse

rsrganby wangkenpu

Robust Speech Recognition Using Generative Adversarial Networks (GAN)

Shell

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

iOS-Speech-To-Textby mzeeshanid

This library use the Google Voice API and the Speex audio codec for speech-to-text on iOS

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

ASR-Wav2vec-Finetuneby khanld

:zap: Finetune Wa2vec 2.0 For Speech Recognition

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

pieby baidubce

百度云流式语音识别客户端 SDK

Java

Updated: 3 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

MAX-Speech-to-Text-Converterby IBM

Converts spoken words into text form.

Python

Updated: 3 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

Co-Speech_Gesture_Generationby youngwoo-yoon

This is an implementation of Robots learn social skills: End-to-end learning of co-speech gesture generation for humanoid robots.

Python

Updated: 1 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

talkieby joelpurra

Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.

TypeScript

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

Unity-Text-to-Speechby ActiveNick

Sample app used to demonstrate the use of Microsoft Cognitive Services Text-to-Speech APIs (aka Speech Synthesis) from within Unity.

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

EA-SVCby hhguo

An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"

Python

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

silent_speechby dgaddy

Code for the papers "Digital Voicing of Silent Speech" at EMNLP 2020 and "An Improved Model for Voicing Silent Speech" at ACL 2021.

Python

Updated: 3 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

orangetextby hrbrmstr

🍊📄 : An #rstats project to keep track of The 🍊 One's speeches

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

Reverb.jsby burnson

Reverb.js is a Web Audio API extension for creating reverb nodes and an accompanying impulse-response reverb library.

HTML

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

Inter-SubNetby RookieJunChen

The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.

Python

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

Speech_Recognitionby drbinliang

A simple speech recognition using HMM (python)

Python

Updated: 3 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

Speech-Accent-Recognitionby yatharthgarg

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

pkwrapby idiap

A pytorch wrapper for LF-MMI training and parallel training in Kaldi

Python

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

text-to-speech-jsby IonicaBizau

:v: A small JavaScript library that provides a text to speech conversion using tts-api.com service.

JavaScript

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Unity-MS-SpeechSDKby ActiveNick

Sample Unity project used to demonstrate Speech Recognition using the new Microsoft Speech Service (Preview) via WebSockets.

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

WG-WaveNetby BogiHsu

Real-Time High-Fidelity Speech Synthesis without GPU

Python

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

klangsyntheseby 200sc

Waveform and Audio Synthesis library in Go

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

multi-task-kaldiby JRMeyer

An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 training.

Shell

Updated: 3 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

cs224n-gpu-that-talksby akashmjn

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

Jupyter Notebook

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

VoiceGANby Yolanda-Gao

These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB

Jupyter Notebook

Updated: 3 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

spot-cpp-sdkby boston-dynamics

C++

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

McNetby Audio-WestlakeU

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement" submitted to ICASSP 2023

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

GANSynthby skmhrk1209

TensorFlow implementation of "GANSynth: Adversarial Neural Audio Synthesis"

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

alex-asrby UFAL-DSG

Online decoder for Kaldi NNET2 and GMM speech recognition models with Python bindings.

Python

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

mxnet-seq2seqby yoosan

Sequence to sequence learning with MXNET

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

obviby googlecreativelab

A Polymer 3+ webcomponent / button for doing speech recognition

JavaScript

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

sova-tts-engineby sovaai

Tacotron2 based engine for the SOVA-TTS project

Python

Updated: 3 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

voice-commandby PRFTDigitalLabs

A simple no-API voice command assitant

JavaScript

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

ion-avpby pion

Audio/Video Processing Service

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

DISSCby gallilmaimon

Official repository for "Speaking Style Conversion With Discrete Self-Supervised Units". https://arxiv.org/abs/2212.09730

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Synthalinguaby cyberofficial

Synthalingua - Real Time Translation

Python

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

turkish-pos-taggerby onuryilmaz

Part-of-Speech (POS) Tagger for Turkish

Python

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

deep_avsrby lordmartian

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Python

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

voice-commands.jsby jimmybyrum

Simple wrapper for Javascript Speech-to-text to add voice commands.

HTML

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Emovoxby KunZhou9646

This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

VITSby zassou65535

VITSによるテキスト読み上げ器&ボイスチェンジャー

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

RuntimeSpeechRecognizerby gtreshchev

Cross-platform, real-time, offline speech recognition plugin for Unreal Engine. Based on Whisper OpenAI technology, whisper.cpp.

C++

Updated: 1 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

vocosby charactr-platform

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python

Updated: 1 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

ncnn-android-mobilenetssdby nihui

The mobilenetssd object detection android example

Java

Updated: 3 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

keras-sincnetby grausof

Keras (tensorflow) implementation of SincNet (Mirco Ravanelli, Yoshua Bengio - https://github.com/mravanelli/SincNet)

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

clari_wavenet_vocoderby HaiFengZeng

Python

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

RNN-Transducerby sooftware

PyTorch implementation of RNN-Transducer(RNN-T).

Python

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

LAS_Mandarin_PyTorchby jackaduma

Listen, attend and spell Model and a Chinese Mandarin Pretrained model (中文-普通话 ASR模型)

Python

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

vo-aacencby mstorsjo

VisualOn AAC encoder from Android

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

hey-victoriaby sk89q

TeamSpeak bot w/ speech recognition (like Siri, OK Google, Cortana, etc.)

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

Open Weaver – Develop Applications Faster with Open Source

Terms
Privacy policy

Speech Libraries - Page 20

easy-kaldiby JRMeyer

Shell 54 Version:Current License: Permissive (Apache-2.0)

Use your data to create a speech recognition system in Kaldi. Fast.

tap-pluginsby tomszilagyi

C 54 Version:Current License: Strong Copyleft (GPL-2.0)

Tom's Audio Processing LADSPA plugins

rsrganby wangkenpu

Shell 54 Version:Current License: Permissive (MIT)

Robust Speech Recognition Using Generative Adversarial Networks (GAN)

iOS-Speech-To-Textby mzeeshanid

C 54 Version:Current License: Proprietary (Proprietary)

This library use the Google Voice API and the Speex audio codec for speech-to-text on iOS

ASR-Wav2vec-Finetuneby khanld

Python 54 Version:Current License: No License (No License)

:zap: Finetune Wa2vec 2.0 For Speech Recognition

pieby baidubce

Java 53 Version:Current License: No License (No License)

百度云流式语音识别客户端 SDK

MAX-Speech-to-Text-Converterby IBM

Python 53 Version:Current License: Permissive (Apache-2.0)

Converts spoken words into text form.

Co-Speech_Gesture_Generationby youngwoo-yoon

Python 53 Version:Current License: Proprietary (Proprietary)

This is an implementation of Robots learn social skills: End-to-end learning of co-speech gesture generation for humanoid robots.

talkieby joelpurra

TypeScript 53 Version:Current License: Strong Copyleft (GPL-3.0)

Text-to-speech browser extension button. Select text on any web page, and have the computer read it out loud for you by simply clicking the Talkie button.

Unity-Text-to-Speechby ActiveNick

C# 53 Version:Current License: Permissive (MIT)

Sample app used to demonstrate the use of Microsoft Cognitive Services Text-to-Speech APIs (aka Speech Synthesis) from within Unity.

EA-SVCby hhguo

Python 53 Version:Current License: Permissive (MIT)

An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"

silent_speechby dgaddy

Python 53 Version:Current License: No License (No License)

Code for the papers "Digital Voicing of Silent Speech" at EMNLP 2020 and "An Improved Model for Voicing Silent Speech" at ACL 2021.

orangetextby hrbrmstr

R 53 Version:Current License: No License (No License)

🍊📄 : An #rstats project to keep track of The 🍊 One's speeches

Reverb.jsby burnson

HTML 53 Version:Current License: Proprietary (Proprietary)

Reverb.js is a Web Audio API extension for creating reverb nodes and an accompanying impulse-response reverb library.

Inter-SubNetby RookieJunChen

Python 53 Version:Current License: Permissive (Apache-2.0)

The official PyTorch implementation of "Inter-SubNet: Speech Enhancement with Subband Interaction", accepted by ICASSP 2023.

Speech_Recognitionby drbinliang

Python 52 Version:Current License: No License (No License)

A simple speech recognition using HMM (python)

Speech-Accent-Recognitionby yatharthgarg

Python 52 Version:Current License: No License (No License)

pkwrapby idiap

Python 52 Version:Current License: Proprietary (Proprietary)

A pytorch wrapper for LF-MMI training and parallel training in Kaldi

text-to-speech-jsby IonicaBizau

JavaScript 52 Version:Current License: Permissive (MIT)

:v: A small JavaScript library that provides a text to speech conversion using tts-api.com service.

Unity-MS-SpeechSDKby ActiveNick

C# 52 Version:Current License: Permissive (MIT)

Sample Unity project used to demonstrate Speech Recognition using the new Microsoft Speech Service (Preview) via WebSockets.

WG-WaveNetby BogiHsu

Python 52 Version:Current License: Permissive (MIT)

Real-Time High-Fidelity Speech Synthesis without GPU

klangsyntheseby 200sc

Go 52 Version:Current License: Permissive (Apache-2.0)

Waveform and Audio Synthesis library in Go

multi-task-kaldiby JRMeyer

Shell 52 Version:Current License: Permissive (Apache-2.0)

An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 training.

cs224n-gpu-that-talksby akashmjn

Jupyter Notebook 52 Version:Current License: No License (No License)

Attention, I'm Trying to Speak: End-to-end speech synthesis (CS224n '18)

VoiceGANby Yolanda-Gao

Jupyter Notebook 52 Version:Current License: No License (No License)

These are the results for VoiceGAN voice transformation. You can hear the audios which are in folder A-AB-ABA/B-BA-BAB

spot-cpp-sdkby boston-dynamics

C++ 52 Version:Current License: Proprietary (Proprietary)

McNetby Audio-WestlakeU

Python 52 Version:Current License: No License (No License)

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement" submitted to ICASSP 2023

Shell 54 Version:Current
License: Permissive (Apache-2.0)

C 54 Version:Current
License: Strong Copyleft (GPL-2.0)

Shell 54 Version:Current
License: Permissive (MIT)

C 54 Version:Current
License: Proprietary (Proprietary)

Python 54 Version:Current
License: No License (No License)

Java 53 Version:Current
License: No License (No License)

Python 53 Version:Current
License: Permissive (Apache-2.0)

Python 53 Version:Current
License: Proprietary (Proprietary)

TypeScript 53 Version:Current
License: Strong Copyleft (GPL-3.0)

C# 53 Version:Current
License: Permissive (MIT)

Python 53 Version:Current
License: Permissive (MIT)

Python 53 Version:Current
License: No License (No License)

R 53 Version:Current
License: No License (No License)

HTML 53 Version:Current
License: Proprietary (Proprietary)

Python 53 Version:Current
License: Permissive (Apache-2.0)

Python 52 Version:Current
License: No License (No License)

Python 52 Version:Current
License: No License (No License)

Python 52 Version:Current
License: Proprietary (Proprietary)

JavaScript 52 Version:Current
License: Permissive (MIT)

C# 52 Version:Current
License: Permissive (MIT)

Python 52 Version:Current
License: Permissive (MIT)

Go 52 Version:Current
License: Permissive (Apache-2.0)

Shell 52 Version:Current
License: Permissive (Apache-2.0)

Jupyter Notebook 52 Version:Current
License: No License (No License)

Jupyter Notebook 52 Version:Current
License: No License (No License)

C++ 52 Version:Current
License: Proprietary (Proprietary)

Python 52 Version:Current
License: No License (No License)

Python 51 Version:Current
License: No License (No License)

Python 51 Version:Current
License: Proprietary (Proprietary)

Python 51 Version:Current
License: No License (No License)

JavaScript 51 Version:Current
License: Permissive (Apache-2.0)

Python 51 Version:Current
License: Permissive (Apache-2.0)

JavaScript 51 Version:Current
License: No License (No License)

Go 51 Version:Current
License: Permissive (MIT)

Python 51 Version:Current
License: Permissive (MIT)

Python 51 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 50 Version:Current
License: Permissive (Apache-2.0)

Python 50 Version:Current
License: Permissive (MIT)

HTML 50 Version:Current
License: Permissive (MIT)

Python 50 Version:Current
License: No License (No License)

Python 50 Version:Current
License: Permissive (MIT)

C++ 50 Version:Current
License: Permissive (MIT)

Python 50 Version:Current
License: Permissive (MIT)

Java 49 Version:Current
License: No License (No License)

Python 49 Version:Current
License: No License (No License)

Python 49 Version:Current
License: Proprietary (Proprietary)

Python 49 Version:Current
License: Permissive (Apache-2.0)

Python 49 Version:Current
License: Permissive (MIT)

C 49 Version:Current
License: Permissive (Apache-2.0)

C 49 Version:Current
License: No License (No License)