php-speech-to-text by rogerthomas84
PHP implementation of the Google Speech to Text API
PHP | 39 | Updated: 4 y ago | License: No License
LowkeySpeech by steelejay
Google Speech API in Unity (C#)
C# | 39 | Updated: 4 y ago | License: No License
sova-tts-tps by sovaai
NLP-preprocessor for the SOVA-TTS project
Python | 39 | Updated: 4 y ago | License: Permissive (Apache-2.0)
transcribe4all by hack4impact-upenn
Painless speech-to-text transcription
Go | 39 | Updated: 4 y ago | License: Permissive (MIT)
asterisk-unimrcp by unispeech
UniMRCP modules for Asterisk
C | 39 | Updated: 4 y ago | License: Strong Copyleft (GPL-2.0)
qtspeech by yshurik
QtSpeech is a cross-platform library based on Qt that provides a common API for accessing system TTS (Text-to-Speech) engines on Windows (SAPI), Mac (SpeechSynthesis), and Linux (Festival). It is licensed under the LGPL, so it can be used in both open-source and commercial products.
C++ | 39 | Updated: 4 y ago | License: Weak Copyleft (LGPL-3.0)
citar-cxx by danieldk
Citar part-of-speech tagger
C++ | 39 | Updated: 4 y ago | License: Proprietary
esp32-flite by alkhimey
Speech synthesis running on ESP32 based on Flite engine.
C | 39 | Updated: 2 y ago | License: No License
ppg-vc by liusongxiang
PPG-Based Voice Conversion
Python | 39 | Updated: 3 y ago | License: Permissive (Apache-2.0)
VQMIVC by Wendison
Official implementation of VQMIVC: One-shot Voice Conversion @ Interspeech 2021
Python | 39 | Updated: 3 y ago | License: Permissive (MIT)
kaldi-for-russian by grib0ed0v
Jupyter Notebook | 39 | Updated: 3 y ago | License: No License
xstatic-angular by openstack
Angular JavaScript library packaged as XStatic. Mirror of code maintained at opendev.org.
Python | 38 | Updated: 3 y ago | License: Permissive (MIT)
audio_degrader by emilio-molina
Audio degradation toolbox in Python, with a command-line tool. It is useful for applying controlled degradations to audio, e.g. for data augmentation or evaluation in noisy conditions.
Python | 38 | Updated: 3 y ago | License: Strong Copyleft (GPL-3.0)
deepspeech-cleaner by silenterus
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
Python | 38 | Updated: 4 y ago | License: Permissive (MIT)
gmvae_tacotron by rishikksh20
Gaussian Mixture VAE Tacotron
Python | 38 | Updated: 3 y ago | License: Permissive (MIT)
Python-Voice-Assistant by rollingstarky
A Python-based voice assistant like Siri
Python | 38 | Updated: 1 y ago | License: No License
sppas by brigittebigi
SPPAS: the automatic annotation and analysis of speech software
Python | 38 | Updated: 4 y ago | License: No License
qfp by mbortnyck
Quad-based audio fingerprinting and recognition in Python
Python | 38 | Updated: 1 y ago | License: No License
txt2speech by rudkovskyi
Convert text to speech using Google Translate API
Ruby | 38 | Updated: 5 y ago | License: No License
Strugatzki by Sciss
Algorithms for matching audio file similarities.
Scala | 38 | Updated: 4 y ago | License: Proprietary
idlak by bpotard
This repository is now obsolete. Please go to https://github.com/idlak/idlak instead.
C++ | 38 | Updated: 4 y ago | License: Proprietary
Voice-Privacy-Challenge-2022 by Voice-Privacy-Challenge
Baseline Recipe for VoicePrivacy Challenge 2022: anonymization systems and evaluation software
Python | 38 | Updated: 2 y ago | License: No License
ASRT_SDK_WinClient by nl8590687
A Windows client SDK and demo software for the ASRT speech recognition system.
C# | 38 | Updated: 3 y ago | License: Permissive (Apache-2.0)
VI-SVC by dtx525942103
VITS singing voice conversion based on PPG and HuBERT; singing voice cloning.
Python | 38 | Updated: 2 y ago | License: Permissive (MIT)
diff-svc by innnky
An Implementation of Singing Voice Conversion Based on Diffsinger
Python | 38 | Updated: 2 y ago | License: Permissive (MIT)
voice-assistant by tiansztiansz
A lightweight intelligent voice assistant built on tools such as Snowboy, Whisper, ChatYuan and Azure TTS
C++ | 38 | Updated: 1 y ago | License: Permissive (MIT)
psola by diguo2046
Python package implementing the TD-PSOLA algorithm for speech processing
Python | 37 | Updated: 4 y ago | License: Permissive (MIT)
magic-mirror-base by Techblogogy
Smart Mirror that helps you pick clothes
Python | 37 | Updated: 4 y ago | License: Strong Copyleft (GPL-3.0)
semi-tts by ttaoREtw
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
Python | 37 | Updated: 3 y ago | License: Permissive (MIT)
meta-transfer-learning by audioku
Implementation of meta-transfer-learning for ASR and LM (ACL 2020)
Python | 37 | Updated: 4 y ago | License: Permissive (MIT)
node-bandwidth by Bandwidth
NodeJS Client library for Bandwidth Voice and Messaging APIs
JavaScript | 37 | Updated: 4 y ago | License: Permissive (MIT)
atom-solargraph by castwide
An Atom package for Solargraph.
JavaScript | 37 | Updated: 4 y ago | License: Permissive (MIT)
web-speech-cognitive-services by compulim
Polyfill Web Speech API with Cognitive Services Bing Speech for both speech-to-text and text-to-speech services.
JavaScript | 37 | Updated: 2 y ago | License: Permissive (MIT)
ERISHA by ajinkyakulkarni14
ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer expressivity to the voice of a speaker for whom no expressive speech corpus is available.
Python | 37 | Updated: 3 y ago | License: Proprietary
Asterisk-eSpeak by zaf
Asterisk module that provides the "eSpeak" dialplan application. It allows you to use the eSpeak text-to-speech synthesizer. Works with Asterisk 1.6 or newer.
C | 37 | Updated: 4 y ago | License: Strong Copyleft (GPL-2.0)
voxceleb-ivector by swshon
VoxCeleb1 i-vector-based speaker recognition system
Perl | 37 | Updated: 3 y ago | License: Permissive (MIT)
torch-asg by zh217
Auto Segmentation Criterion (ASG) implemented in PyTorch
C++ | 37 | Updated: 4 y ago | License: Strong Copyleft (GPL-3.0)
german-pos-dict by languagetool-org
German part-of-speech dictionary
Shell | 37 | Updated: 4 y ago | License: Strong Copyleft (CC-BY-SA-4.0)
Python-Speech-Recognition- by Kalebu
Basic examples of performing speech recognition in Python using the Google Speech Recognition engine
Python | 37 | Updated: 2 y ago | License: No License
speech2affective_gestures by UttaranB127
This is the official implementation of the paper "Speech2AffectiveGestures: Synthesizing Co-Speech Gestures with Generative Adversarial Affective Expression Learning".
Python | 37 | Updated: 1 y ago | License: Permissive (MIT)
msspeech by alekssamos
Unofficial API for Microsoft speech synthesis, as used by the Read Aloud feature of the Microsoft Edge web browser
Python | 37 | Updated: 2 y ago | License: No License
pytorch_audio by diggerdu
Audio processing module for PyTorch: STFT, ISTFT
Python | 36 | Updated: 2 y ago | License: No License
DCUnet.pytorch by chanil1218
Phase-Aware Speech Enhancement with Deep Complex U-Net
Python | 36 | Updated: 3 y ago | License: No License
ctc_segmentation by cornerfarmer
Segment a given audio into utterances using a trained end-to-end ASR model.
Python | 36 | Updated: 3 y ago | License: Permissive (Apache-2.0)
pyvocoder by shamidreza
A Python library for different types of vocoders such as LPC, MCEP, and PSOLA.
Python | 36 | Updated: 4 y ago | License: Strong Copyleft (GPL-2.0)
AcousticFeatureExtraction by Zhangtingyuxuan
Simple acoustic feature extraction using the Librosa audio processing library and the openSMILE toolkit.
Python | 36 | Updated: 4 y ago | License: Strong Copyleft (GPL-3.0)
multi_speaker_tts by CODEJIN
Implementation of multi-speaker TTS
Python | 36 | Updated: 4 y ago | License: No License
impress-audio by danielsimons1
Small addition to impress.js that allows HTML5 audio in your presentation
JavaScript | 36 | Updated: 4 y ago | License: No License
BaiduASRAndTTS by heartsuit
Using the Baidu API for ASR (Automatic Speech Recognition) and TTS (Text To Speech).
C# | 36 | Updated: 4 y ago | License: No License
LightSpeech by rishikksh20
LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search
Python | 36 | Updated: 3 y ago | License: Permissive (Apache-2.0)