Support
Quality
Security
License
Reuse
Explore Text-To-Speech
Support
Quality
Security
License
Reuse
Voice conversion based on deep learning method
Support
Quality
Security
License
Reuse
TTS(Text to speech) GUI using Baidu TTS api, currently only support Chinese; 将文字转换为语音mp3文件,自动拆分较长文本文件,适合用于生成有声小说
Support
Quality
Security
License
Reuse
tacotron-2(tensorflow) + wavernn(pytorch) chinese TTS
Support
Quality
Security
License
Reuse
Android Voice Recognition(CMU Sphinx + Google Voice) Like Siri
Support
Quality
Security
License
Reuse
wavenet vocoder using tensorflow
Support
Quality
Security
License
Reuse
This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervision".
Support
Quality
Security
License
Reuse
s
speech-emotion-recognition-using-self-attentionby KrishnaDN
Python 28 Version:Current License: No License (No License)
Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From INTERSPEECH 2019
Support
Quality
Security
License
Reuse
Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion
Support
Quality
Security
License
Reuse
This is my public repository for LaunchBar 6 Actions.
Support
Quality
Security
License
Reuse
🗨🎙📚 generate acoustic model adaptation datasets for Custom Speech Service
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
convert subtitle (.srt) to speech (.wav) using google API
Support
Quality
Security
License
Reuse
L
Python 28 Version:Current License: No License (No License)
[NeurIPS 2020] Official repository for the project "Listening to Sound of Silence for Speech Denoising"
Support
Quality
Security
License
Reuse
MemLock: Memory Usage Guided Fuzzing
Support
Quality
Security
License
Reuse
This is Malayalam Speech Recognition model developed for CMUSphinx. This is now used for Google Summer Code 2016
Support
Quality
Security
License
Reuse
Create and edit DDC headset correction files
Support
Quality
Security
License
Reuse
Toolkit to asses speech impairments in patients with neurological disorders
Support
Quality
Security
License
Reuse
VisualOn AMR-WB encoder from Android
Support
Quality
Security
License
Reuse
Speech Recognition and Text to Speech for Android with FireMonkey (written in Object Pascal and Delphi XE6)
Support
Quality
Security
License
Reuse
Smart loader handler to manage loaders everywhere in Angular apps.
Support
Quality
Security
License
Reuse
M
Multi-Hotword_Spottingby aishoot
Jupyter Notebook 28 Version:Current License: No License (No License)
Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?
Support
Quality
Security
License
Reuse
s
sinhalese_language_racism_detectionby renuka-fernando
Jupyter Notebook 28 Version:Current License: Permissive (MIT)
Sinhalese Language based Hate Speech Detection
Support
Quality
Security
License
Reuse
CLI tool for macOS that transcribes speech from the microphone using Apple’s speech recognition API, SFSpeechRecognizer. (help.)
Support
Quality
Security
License
Reuse
Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
A framework for automatic speech recognition
Support
Quality
Security
License
Reuse
Code repository for the book Make Python Talk
Support
Quality
Security
License
Reuse
《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm
Support
Quality
Security
License
Reuse
Android Things project demonstrating Text-To-Speech and Speech-To-Text
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Automatic Speech Recognition with deepspeech2 model in pytorch with support from Zakuro AI.
Support
Quality
Security
License
Reuse
Learning Lip Sync of Obama from Speech Audio
Support
Quality
Security
License
Reuse
C
Conv-Tasnet-for-speech-enchancement-and-seperationby runninging
Python 27 Version:Current License: No License (No License)
The state-of-art time domain network for speech separation, and it performs well on speech enhancement and music separation
Support
Quality
Security
License
Reuse
ABX and kaldi experiments on speech corpora made easy
Support
Quality
Security
License
Reuse
Convert phoneme codes and lexicon formats for English speech synths
Support
Quality
Security
License
Reuse
Deep Convolution Text to Speech
Support
Quality
Security
License
Reuse
component for Chrome speech recognition wrapper
Support
Quality
Security
License
Reuse
Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition
Support
Quality
Security
License
Reuse
a PyTorch implementation of Lip2Wav
Support
Quality
Security
License
Reuse
PyTorch implementation of Densely Connected Time Delay Neural Network
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
An implementation of a phase vocoder that uses the Fast Lifting Wavelet Transform for pitch detection and TD-PSOLA for pitch correction
Support
Quality
Security
License
Reuse
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020
Support
Quality
Security
License
Reuse
Text-to-Speech tutorial at SLTU 2016
Support
Quality
Security
License
Reuse
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
Support
Quality
Security
License
Reuse
GUI application for submitting audio fingerprints to AcoustID
Support
Quality
Security
License
Reuse
Scripts for LIUM SpkDiarization tools
Support
Quality
Security
License
Reuse
An automatic speech recognition API
Support
Quality
Security
License
Reuse
T
TTSby macdonst
Java 28Updated: 8 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
R
Support
Quality
Security
License
Reuse
V
VoiceConversionby SerialLain3170
Voice conversion based on deep learning method
Python 28Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
c
cytron_tts_guiby eisneim
TTS(Text to speech) GUI using Baidu TTS api, currently only support Chinese; 将文字转换为语音mp3文件,自动拆分较长文本文件,适合用于生成有声小说
Python 28Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
t
tacotron-2_wavernnby wqt2019
tacotron-2(tensorflow) + wavernn(pytorch) chinese TTS
Python 28Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
V
VoiceRecognition_siriby jimin530
Android Voice Recognition(CMU Sphinx + Google Voice) Like Siri
Java 28Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
t
tensorflow_wavenet_vocoderby azraelkuan
wavenet vocoder using tensorflow
Python 28Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
A
AAS_enhancementby lifelongeek
This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervision".
Python 28Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
speech-emotion-recognition-using-self-attentionby KrishnaDN
Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From INTERSPEECH 2019
Python 28Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
T
TransferLearning-CLVCby cjerry1243
Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion
Python 28Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
M
MyLaunchBarActionsby raguay
This is my public repository for LaunchBar 6 Actions.
JavaScript 28Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
a
acoustic-model-machineby noopkat
🗨🎙📚 generate acoustic model adaptation datasets for Custom Speech Service
JavaScript 28Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
a
asterisk-external-mediaby asterisk
JavaScript 28Updated: 3 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
s
subtitle_to_speechby bryan-brancotte
convert subtitle (.srt) to speech (.wav) using google API
Python 28Updated: 2 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
L
Listening-to-Sound-of-Silence-for-Speech-Denoisingby henryxrl
[NeurIPS 2020] Official repository for the project "Listening to Sound of Silence for Speech Denoising"
Python 28Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
M
MemLockby ICSE2020-MemLock
MemLock: Memory Usage Guided Fuzzing
C 28Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
m
ml-am-lm-cmusphinxby sreecodeslayer
This is Malayalam Speech Recognition model developed for CMUSphinx. This is now used for Google Summer Code 2016
Perl 28Updated: 4 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
D
DDCToolboxby ThePBone
Create and edit DDC headset correction files
C++ 28Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
N
NeuroSpeechby jcvasquezc
Toolkit to asses speech impairments in patients with neurological disorders
C++ 28Updated: 4 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
v
vo-amrwbencby mstorsjo
VisualOn AMR-WB encoder from Android
C 28Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
F
FireMonkey-Android-Voiceby jimmckeeth
Speech Recognition and Text to Speech for Android with FireMonkey (written in Object Pascal and Delphi XE6)
C++ 28Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
n
ngx-smart-loaderby maximelafarie
Smart loader handler to manage loaders everywhere in Angular apps.
TypeScript 28Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
M
Multi-Hotword_Spottingby aishoot
Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection?
Jupyter Notebook 28Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
sinhalese_language_racism_detectionby renuka-fernando
Sinhalese Language based Hate Speech Detection
Jupyter Notebook 28Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
t
transcribeby dtinth
CLI tool for macOS that transcribes speech from the microphone using Apple’s speech recognition API, SFSpeechRecognizer. (help.)
Swift 28Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
v
vocalistby vskadandale
Official repository for the paper VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices
Python 28Updated: 2 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
V
VoiceToJapaneseby 0Xiaohei0
Python 28Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
S
SpeeQby msalhab96
A framework for automatic speech recognition
Python 28Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
m
mptby markhliu
Code repository for the book Make Python Talk
Python 28Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
S
SpeechPrompt-v2by ga642381
《SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks》Speech processing with prompting paradigm
Python 28Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
audiofun-androidthingsby Nilhcem
Android Things project demonstrating Text-To-Speech and Speech-To-Text
Java 27Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
G
GELPby ljuvela
Python 27Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
A
ASRDeepSpeechby JeanMaximilienCadic
Automatic Speech Recognition with deepspeech2 model in pytorch with support from Zakuro AI.
Python 27Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
L
Learning-Lip-Sync-from-Audioby amtsai96
Learning Lip Sync of Obama from Speech Audio
Python 27Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
C
Conv-Tasnet-for-speech-enchancement-and-seperationby runninging
The state-of-art time domain network for speech separation, and it performs well on speech enhancement and music separation
Python 27Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
abkhaziaby bootphon
ABX and kaldi experiments on speech corpora made easy
Python 27Updated: 4 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
l
lexconvertby ssb22
Convert phoneme codes and lexicon formats for English speech synths
Python 27Updated: 1 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
d
dctts2by eazhary
Deep Convolution Text to Speech
Python 27Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
speechby yyx990803
component for Chrome speech recognition wrapper
JavaScript 27Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
m
modulated_fusion_transformerby jbdel
Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition
Python 27Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
L
Lip2Wav-pytorchby joannahong
a PyTorch implementation of Lip2Wav
Python 27Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
D
D-TDNNby yuyq96
PyTorch implementation of Densely Connected Time Delay Neural Network
Python 27Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
A
ALFFA_PUBLICby getalp
Shell 27Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
P
Phase-Vocoderby terrykong
An implementation of a phase vocoder that uses the Fast Lifting Wavelet Transform for pitch detection and TD-PSOLA for pitch correction
HTML 27Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
V
VoxSRC2020by a-nagrani
Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020
Perl 27Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
t
tts-tutorialby mjansche
Text-to-Speech tutorial at SLTU 2016
C++ 27Updated: 3 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
D
DeepSpeech-APIby AASHISHAG
The code enables users to use Mozilla's Deep Speech model over the Web Browser.
TypeScript 27Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
acoustid-fingerprinterby acoustid
GUI application for submitting audio fingerprints to AcoustID
C++ 27Updated: 4 y ago License: Strong Copyleft (GPL-2.0)
Support
Quality
Security
License
Reuse
L
LIUMby StevenLOL
Scripts for LIUM SpkDiarization tools
Shell 27Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
l
linto-platform-sttby linto-ai
An automatic speech recognition API
Python 27Updated: 2 y ago License: Strong Copyleft (AGPL-3.0)
Support
Quality
Security
License
Reuse