Speech Libraries - Page 12

UniVoiceby adrenak

C# 132 Version:Current
License: Permissive (MIT)

P2P VoIP in Unity

Support

Quality

Security

License

Reuse

KontinuousSpeechRecognizerby StephenVinouze

Kotlin 132 Version:Current
License: Permissive (Apache-2.0)

A Kotlin Speech Recognizer that runs continuously and is triggered with an activation keyword

Support

Quality

Security

License

Reuse

graph-based-code-modellingby microsoft

C# 131 Version:Current
License: Permissive (MIT)

Code for "Generative Code Modeling with Graphs" (ICLR'19)

Support

Quality

Security

License

Reuse

pytorch-dc-ttsby tugstugi

Jupyter Notebook 131 Version:Current
License: Permissive (MIT)

Text to Speech with PyTorch (English and Mongolian)

Support

Quality

Security

License

Reuse

mongolian-nlpby tugstugi

Jupyter Notebook 131 Version:Current
License: No License (No License)

Useful resources for Mongolian NLP

Support

Quality

Security

License

Reuse

TDNNby cvqluu

Python 130 Version:Current
License: No License (No License)

Time delay neural network (TDNN) implementation in Pytorch using unfold method

Support

Quality

Security

License

Reuse

vae-npvcby JeremyCCHsu

Python 130 Version:Current
License: Proprietary (Proprietary)

Re-implementation the code used in Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder

Support

Quality

Security

License

Reuse

Chinese-speech-to-textby liangstein

Python 129 Version:Current
License: Permissive (Apache-2.0)

Chinese Speech To Text Using Wavenet

Support

Quality

Security

License

Reuse

openspeechby sooftware

Python 129 Version:Current
License: Proprietary (Proprietary)

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Support

Quality

Security

License

Reuse

FastSpeech2by rishikksh20

Jupyter Notebook 129 Version:Current
License: Permissive (Apache-2.0)

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

Support

Quality

Security

License

Reuse

ukrainian-ttsby robinhad

Jupyter Notebook 129 Version:Current
License: Permissive (MIT)

Ukrainian TTS (text-to-speech) using ESPNET

Support

Quality

Security

License

Reuse

Multi-Singerby Rongjiehuang

Python 129 Version:Current
License: Permissive (MIT)

PyTorch Implementation of Multi-Singer (ACM-MM'21)

Support

Quality

Security

License

Reuse

VI-SVSby PlayVoice

Python 129 Version:Current
License: Permissive (Apache-2.0)

Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.

Support

Quality

Security

License

Reuse

ei-keyword-spottingby ShawnHymel

C 128 Version:Current
License: No License (No License)

Support

Quality

Security

License

Reuse

tensorflow-ctc-speech-recognitionby philipperemy

Python 127 Version:Current
License: Permissive (Apache-2.0)

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

Support

Quality

Security

License

Reuse

CNN-for-single-channel-speech-enhancementby zhr1201

Python 126 Version:Current
License: No License (No License)

Convolutional neural nets for single channel speech enhancement

Support

Quality

Security

License

Reuse

sova-ttsby sovaai

Python 126 Version:Current
License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

Crystalby thuhcsi

C++ 126 Version:Current
License: Permissive (Apache-2.0)

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

Support

Quality

Security

License

Reuse

bark-voice-cloning-HuBERT-quantizerby gitmylo

Python 126 Version:Current
License: Permissive (MIT)

The code for the bark-voicecloning model. Training and inference.

Support

Quality

Security

License

Reuse

fdndlpby helianvine

Python 125 Version:Current
License: Permissive (MIT)

A speech dereverberation algorithm, also called wpe

Support

Quality

Security

License

Reuse

Tacotron-Wavenet-Vocoder-Koreanby hccho2

Python 125 Version:Current
License: Permissive (MIT)

Tacotron, Korean, Wavenet-Vocoder, Korean TTS

Support

Quality

Security

License

Reuse

Saiy-PSby brandall76

Java 125 Version:Current
License: Strong Copyleft (AGPL-3.0)

Saiy Android Play Services dependencies

Support

Quality

Security

License

Reuse

quran-alignby cpfair

C++ 125 Version:Current
License: Permissive (MIT)

Word-accurate timestamps for Qur'anic audio.

Support

Quality

Security

License

Reuse

keras-kaldiby dspavankumar

Python 124 Version:Current
License: Strong Copyleft (GPL-3.0)

Keras Interface for Kaldi ASR

Support

Quality

Security

License

Reuse

at16kby at16k

Python 124 Version:Current
License: Permissive (MIT)

Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.

Support

Quality

Security

License

Reuse

STYLERby keonlee9420

Python 124 Version:Current
License: Permissive (MIT)

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021

Support

Quality

Security

License

Reuse

Twistby DCubix

C 124 Version:Current
License: No License (No License)

Twist - node-based audio synthesizer

Support

Quality

Security

License

Reuse

python-google-speech-scriptsby jeysonmc

Python 123 Version:Current
License: Proprietary (Proprietary)

Simple scripts to interact with Google's speech services

Support

Quality

Security

License

Reuse

scribeby VikParuchuri

Python 123 Version:Current
License: No License (No License)

Simple speech recognition using your microphone.

Support

Quality

Security

License

Reuse

end-to-end-lipreadingby mpc001

Python 123 Version:Current
License: No License (No License)

Pytorch code for End-to-End Audiovisual Speech Recognition

Support

Quality

Security

License

Reuse

fac-via-ppgby guanlongzhao

Python 123 Version:Current
License: Permissive (Apache-2.0)

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)

Support

Quality

Security

License

Reuse

DurIANby ivanvovk

Python 123 Version:Current
License: Permissive (BSD-3-Clause)

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Support

Quality

Security

License

Reuse

3D-Speakerby alibaba-damo-academy

Python 123 Version:Current
License: Permissive (Apache-2.0)

A repository for single- and multi-modal speaker verification, speaker recognition, and speaker diarization.

Support

Quality

Security

License

Reuse

phasenby huyanxin

Python 122 Version:Current
License: No License (No License)

A unofficial Pytorch implementation of Microsoft's PHASEN

Support

Quality

Security

License

Reuse

audiofileby mpruett

C++ 122 Version:Current
License: Proprietary (Proprietary)

Audio File Library

Support

Quality

Security

License

Reuse

MTFAA-Netby echocatzh

Python 122 Version:Current
License: Permissive (MIT)

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

Support

Quality

Security

License

Reuse

pennby interactiveaudiolab

Python 122 Version:Current
License: Permissive (MIT)

Pitch Estimating Neural Networks (PENN)

Support

Quality

Security

License

Reuse

beamformersby Enny1991

Python 121 Version:Current
License: Permissive (MIT)

Easy to use Beamformers for multi-channel speech separation/enhancement

Support

Quality

Security

License

Reuse

SpleeterRTby james34602

C 121 Version:Current
License: Strong Copyleft (GPL-3.0)

Real time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.

Support

Quality

Security

License

Reuse

mongolian-speech-recognitionby tugstugi

Python 120 Version:Current
License: No License (No License)

Mongolian speech recognition with PyTorch

Support

Quality

Security

License

Reuse

howlby castorini

Python 120 Version:Current
License: Weak Copyleft (MPL-2.0)

Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.

Support

Quality

Security

License

Reuse

cmusphinxby cjac

C 120 Version:Current
License: No License (No License)

CMU Sphinx - Speech Recognition Toolkit

Support

Quality

Security

License

Reuse

UEAzSpeechby lucoiso

C++ 120 Version:Current
License: Permissive (MIT)

This plugin integrates Azure Speech Cognitive Services in Unreal Engine.

Support

Quality

Security

License

Reuse

RNN-Transducerby HawkAaron

Python 119 Version:Current
License: No License (No License)

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Support

Quality

Security

License

Reuse

gst-interpipeby RidgeRun

C 119 Version:Current
License: Proprietary (Proprietary)

GStreamer plug-in for interpipeline communication

Support

Quality

Security

License

Reuse

gocapby cugu

Go 119 Version:Current
License: Strong Copyleft (GPL-3.0)

List your dependencies capabilities and monitor if updates require more capabilities.

Support

Quality

Security

License

Reuse

HTML5-overviewby dret

HTML 118 Version:Current
License: Permissive (Unlicense)

Overview of HTML5 Standardization Activities.

Support

Quality

Security

License

Reuse

OpenSpeechby sooftware

Python 118 Version:Current
License: Proprietary (Proprietary)

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Support

Quality

Security

License

Reuse

Voice-Conversion-GANby pritishyuvraj

Jupyter Notebook 118 Version:Current
License: Permissive (Unlicense)

Voice Conversion using Cycle GAN's For Non-Parallel Data

Support

Quality

Security

License

Reuse

DroidSpeechby vikramezhil

Java 117 Version:Current
License: Permissive (Apache-2.0)

Android library for continuous speech recognition

Support

Quality

Security

License

Reuse

UniVoiceby adrenak

P2P VoIP in Unity

132

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

KontinuousSpeechRecognizerby StephenVinouze

A Kotlin Speech Recognizer that runs continuously and is triggered with an activation keyword

Kotlin

132

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

graph-based-code-modellingby microsoft

Code for "Generative Code Modeling with Graphs" (ICLR'19)

131

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

pytorch-dc-ttsby tugstugi

Text to Speech with PyTorch (English and Mongolian)

Jupyter Notebook

131

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

mongolian-nlpby tugstugi

Useful resources for Mongolian NLP

Jupyter Notebook

131

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

TDNNby cvqluu

Time delay neural network (TDNN) implementation in Pytorch using unfold method

Python

130

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

vae-npvcby JeremyCCHsu

Re-implementation the code used in Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder

Python

130

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

Chinese-speech-to-textby liangstein

Chinese Speech To Text Using Wavenet

Python

129

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

openspeechby sooftware

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python

129

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

FastSpeech2by rishikksh20

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

Jupyter Notebook

129

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

ukrainian-ttsby robinhad

Ukrainian TTS (text-to-speech) using ESPNET

Jupyter Notebook

129

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Multi-Singerby Rongjiehuang

PyTorch Implementation of Multi-Singer (ACM-MM'21)

Python

129

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

VI-SVSby PlayVoice

Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.

Python

129

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

ei-keyword-spottingby ShawnHymel

128

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

tensorflow-ctc-speech-recognitionby philipperemy

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

Python

127

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

CNN-for-single-channel-speech-enhancementby zhr1201

Convolutional neural nets for single channel speech enhancement

Python

126

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

sova-ttsby sovaai

Python

126

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

Crystalby thuhcsi

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

C++

126

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

bark-voice-cloning-HuBERT-quantizerby gitmylo

The code for the bark-voicecloning model. Training and inference.

Python

126

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

fdndlpby helianvine

A speech dereverberation algorithm, also called wpe

Python

125

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Tacotron-Wavenet-Vocoder-Koreanby hccho2

Tacotron, Korean, Wavenet-Vocoder, Korean TTS

Python

125

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Saiy-PSby brandall76

Saiy Android Play Services dependencies

Java

125

Updated: 4 y ago

License: Strong Copyleft (AGPL-3.0)

Support

Quality

Security

License

Reuse

quran-alignby cpfair

Word-accurate timestamps for Qur'anic audio.

C++

125

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

keras-kaldiby dspavankumar

Keras Interface for Kaldi ASR

Python

124

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

at16kby at16k

Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.

Python

124

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

STYLERby keonlee9420

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021

Python

124

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Twistby DCubix

Twist - node-based audio synthesizer

124

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

python-google-speech-scriptsby jeysonmc

Simple scripts to interact with Google's speech services

Python

123

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

scribeby VikParuchuri

Simple speech recognition using your microphone.

Python

123

Updated: 6 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

end-to-end-lipreadingby mpc001

Pytorch code for End-to-End Audiovisual Speech Recognition

Python

123

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

fac-via-ppgby guanlongzhao

Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)

Python

123

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

DurIANby ivanvovk

Implementation of "Duration Informed Attention Network for Multimodal Synthesis" (https://arxiv.org/pdf/1909.01700.pdf) paper.

Python

123

Updated: 4 y ago

License: Permissive (BSD-3-Clause)

Support

Quality

Security

License

Reuse

3D-Speakerby alibaba-damo-academy

A repository for single- and multi-modal speaker verification, speaker recognition, and speaker diarization.

Python

123

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

phasenby huyanxin

A unofficial Pytorch implementation of Microsoft's PHASEN

Python

122

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

audiofileby mpruett

Audio File Library

C++

122

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

MTFAA-Netby echocatzh

Multi-Scale Temporal Frequency Convolutional Network With Axial Attention for Speech Enhancement

Python

122

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

pennby interactiveaudiolab

Pitch Estimating Neural Networks (PENN)

Python

122

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

beamformersby Enny1991

Easy to use Beamformers for multi-channel speech separation/enhancement

Python

121

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

SpleeterRTby james34602

Real time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.

121

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

mongolian-speech-recognitionby tugstugi

Mongolian speech recognition with PyTorch

Python

120

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

howlby castorini

Wake word detection modeling toolkit for Firefox Voice, supporting open datasets like Speech Commands and Common Voice.

Python

120

Updated: 3 y ago

License: Weak Copyleft (MPL-2.0)

Support

Quality

Security

License

Reuse

cmusphinxby cjac

CMU Sphinx - Speech Recognition Toolkit

120

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

UEAzSpeechby lucoiso

This plugin integrates Azure Speech Cognitive Services in Unreal Engine.

C++

120

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

RNN-Transducerby HawkAaron

MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks

Python

119

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

gst-interpipeby RidgeRun

GStreamer plug-in for interpipeline communication

119

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

gocapby cugu

List your dependencies capabilities and monitor if updates require more capabilities.

119

Updated: 3 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

HTML5-overviewby dret

Overview of HTML5 Standardization Activities.

HTML

118

Updated: 2 y ago

License: Permissive (Unlicense)

Support

Quality

Security

License

Reuse

OpenSpeechby sooftware

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Python

118

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

Voice-Conversion-GANby pritishyuvraj

Voice Conversion using Cycle GAN's For Non-Parallel Data

Jupyter Notebook

118

Updated: 2 y ago

License: Permissive (Unlicense)

Support

Quality

Security

License

Reuse

DroidSpeechby vikramezhil

Android library for continuous speech recognition

Java

117

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

Open Weaver – Develop Applications Faster with Open Source

Terms
Privacy policy

Speech Libraries - Page 12

UniVoiceby adrenak

C# 132 Version:Current License: Permissive (MIT)

P2P VoIP in Unity

KontinuousSpeechRecognizerby StephenVinouze

Kotlin 132 Version:Current License: Permissive (Apache-2.0)

A Kotlin Speech Recognizer that runs continuously and is triggered with an activation keyword

graph-based-code-modellingby microsoft

C# 131 Version:Current License: Permissive (MIT)

Code for "Generative Code Modeling with Graphs" (ICLR'19)

pytorch-dc-ttsby tugstugi

Jupyter Notebook 131 Version:Current License: Permissive (MIT)

Text to Speech with PyTorch (English and Mongolian)

mongolian-nlpby tugstugi

Jupyter Notebook 131 Version:Current License: No License (No License)

Useful resources for Mongolian NLP

TDNNby cvqluu

Python 130 Version:Current License: No License (No License)

Time delay neural network (TDNN) implementation in Pytorch using unfold method

vae-npvcby JeremyCCHsu

Python 130 Version:Current License: Proprietary (Proprietary)

Re-implementation the code used in Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder

Chinese-speech-to-textby liangstein

Python 129 Version:Current License: Permissive (Apache-2.0)

Chinese Speech To Text Using Wavenet

openspeechby sooftware

Python 129 Version:Current License: Proprietary (Proprietary)

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

FastSpeech2by rishikksh20

Jupyter Notebook 129 Version:Current License: Permissive (Apache-2.0)

PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech

ukrainian-ttsby robinhad

Jupyter Notebook 129 Version:Current License: Permissive (MIT)

Ukrainian TTS (text-to-speech) using ESPNET

Multi-Singerby Rongjiehuang

Python 129 Version:Current License: Permissive (MIT)

PyTorch Implementation of Multi-Singer (ACM-MM'21)

VI-SVSby PlayVoice

Python 129 Version:Current License: Permissive (Apache-2.0)

Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.

ei-keyword-spottingby ShawnHymel

C 128 Version:Current License: No License (No License)

tensorflow-ctc-speech-recognitionby philipperemy

Python 127 Version:Current License: Permissive (Apache-2.0)

Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).

CNN-for-single-channel-speech-enhancementby zhr1201

Python 126 Version:Current License: No License (No License)

Convolutional neural nets for single channel speech enhancement

sova-ttsby sovaai

Python 126 Version:Current License: Permissive (Apache-2.0)

Crystalby thuhcsi

C++ 126 Version:Current License: Permissive (Apache-2.0)

Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.

bark-voice-cloning-HuBERT-quantizerby gitmylo

Python 126 Version:Current License: Permissive (MIT)

The code for the bark-voicecloning model. Training and inference.

fdndlpby helianvine

Python 125 Version:Current License: Permissive (MIT)

A speech dereverberation algorithm, also called wpe

Tacotron-Wavenet-Vocoder-Koreanby hccho2

Python 125 Version:Current License: Permissive (MIT)

Tacotron, Korean, Wavenet-Vocoder, Korean TTS

Saiy-PSby brandall76

Java 125 Version:Current License: Strong Copyleft (AGPL-3.0)

Saiy Android Play Services dependencies

quran-alignby cpfair

C++ 125 Version:Current License: Permissive (MIT)

Word-accurate timestamps for Qur'anic audio.

keras-kaldiby dspavankumar

Python 124 Version:Current License: Strong Copyleft (GPL-3.0)

Keras Interface for Kaldi ASR

at16kby at16k

Python 124 Version:Current License: Permissive (MIT)

Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.

STYLERby keonlee9420

Python 124 Version:Current License: Permissive (MIT)

Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllable Neural Text to Speech, INTERSPEECH 2021

Twistby DCubix

C 124 Version:Current License: No License (No License)

Twist - node-based audio synthesizer

C# 132 Version:Current
License: Permissive (MIT)

Kotlin 132 Version:Current
License: Permissive (Apache-2.0)

C# 131 Version:Current
License: Permissive (MIT)

Jupyter Notebook 131 Version:Current
License: Permissive (MIT)

Jupyter Notebook 131 Version:Current
License: No License (No License)

Python 130 Version:Current
License: No License (No License)

Python 130 Version:Current
License: Proprietary (Proprietary)

Python 129 Version:Current
License: Permissive (Apache-2.0)

Python 129 Version:Current
License: Proprietary (Proprietary)

Jupyter Notebook 129 Version:Current
License: Permissive (Apache-2.0)

Jupyter Notebook 129 Version:Current
License: Permissive (MIT)

Python 129 Version:Current
License: Permissive (MIT)

Python 129 Version:Current
License: Permissive (Apache-2.0)

C 128 Version:Current
License: No License (No License)

Python 127 Version:Current
License: Permissive (Apache-2.0)

Python 126 Version:Current
License: No License (No License)

Python 126 Version:Current
License: Permissive (Apache-2.0)

C++ 126 Version:Current
License: Permissive (Apache-2.0)

Python 126 Version:Current
License: Permissive (MIT)

Python 125 Version:Current
License: Permissive (MIT)

Python 125 Version:Current
License: Permissive (MIT)

Java 125 Version:Current
License: Strong Copyleft (AGPL-3.0)

C++ 125 Version:Current
License: Permissive (MIT)

Python 124 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 124 Version:Current
License: Permissive (MIT)

Python 124 Version:Current
License: Permissive (MIT)

C 124 Version:Current
License: No License (No License)

Python 123 Version:Current
License: Proprietary (Proprietary)

Python 123 Version:Current
License: No License (No License)

Python 123 Version:Current
License: No License (No License)

Python 123 Version:Current
License: Permissive (Apache-2.0)

Python 123 Version:Current
License: Permissive (BSD-3-Clause)

Python 123 Version:Current
License: Permissive (Apache-2.0)

Python 122 Version:Current
License: No License (No License)

C++ 122 Version:Current
License: Proprietary (Proprietary)

Python 122 Version:Current
License: Permissive (MIT)

Python 122 Version:Current
License: Permissive (MIT)

Python 121 Version:Current
License: Permissive (MIT)

C 121 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 120 Version:Current
License: No License (No License)

Python 120 Version:Current
License: Weak Copyleft (MPL-2.0)

C 120 Version:Current
License: No License (No License)

C++ 120 Version:Current
License: Permissive (MIT)

Python 119 Version:Current
License: No License (No License)

C 119 Version:Current
License: Proprietary (Proprietary)

Go 119 Version:Current
License: Strong Copyleft (GPL-3.0)

HTML 118 Version:Current
License: Permissive (Unlicense)

Python 118 Version:Current
License: Proprietary (Proprietary)

Jupyter Notebook 118 Version:Current
License: Permissive (Unlicense)

Java 117 Version:Current
License: Permissive (Apache-2.0)