deepbeam by auspicious3000
Deep learning based speech beamforming
Jupyter Notebook 49 | Updated: 4 y ago | License: No License

TDANet by JusperLee
An efficient speech separation method
Python 49 | Updated: 2 y ago | License: Permissive (Apache-2.0)

1D-Speech-Emotion-Recognition by vandana-rajan
Speech emotion recognition from raw speech signals using a 1D CNN-LSTM
Python 48 | Updated: 3 y ago | License: No License

Java-Speech-Recognizer-Tutorial--Calculator by goxr3plus
Java-Speech-Recognizer-Tutorial--Calculator
Java 48 | Updated: 4 y ago | License: Permissive (Apache-2.0)

SpeechRecognition by MauryaRitesh
Use the Google Speech Recognition API to convert your speech into text
Python 48 | Updated: 3 y ago | License: No License

GAN-Voice-Conversion by njellinas
Implementation of GAN architectures for voice conversion
Python 48 | Updated: 4 y ago | License: Permissive (MIT)

streaming-attention by HaoranMiao
Streaming attention networks for end-to-end automatic speech recognition
Python 48 | Updated: 3 y ago | License: Permissive (Apache-2.0)

voxceleb_unsupervised by joonson
Baseline for the VoxSRC 2020 self-supervised speaker verification
Python 48 | Updated: 3 y ago | License: No License

py-nltools by gooofy
A collection of basic Python modules for spoken natural language processing
Python 48 | Updated: 4 y ago | License: Permissive (Apache-2.0)

denoising_DIHARD18 by staplesinLA
Python 48 | Updated: 4 y ago | License: No License

Real-Time-Voice-Cloning by IMLHF
Multi-speaker TTS
Python 48 | Updated: 3 y ago | License: Proprietary

Wavelet-denoising by MProx
A script that uses the PyWavelets library to denoise a signal through multi-level decomposition with a discrete wavelet transform.
Python 48 | Updated: 4 y ago | License: No License

anycontrol by KaiWedekind
Voice control for your websites and applications
JavaScript 48 | Updated: 3 y ago | License: Permissive (MIT)

Train-Custom-Speech-Model by IBM
Create a custom Watson Speech to Text model using specialized domain data
JavaScript 48 | Updated: 4 y ago | License: Permissive (Apache-2.0)

AGAIN-VC by KimythAnly
Official implementation of the paper "AGAIN-VC: A One-shot Voice Conversion using Activation Guidance and Adaptive Instance Normalization".
Python 48 | Updated: 3 y ago | License: Permissive (MIT)

lingvo-lab by google-research
Demos, samples, and experimental code for Lingvo.
HTML 48 | Updated: 4 y ago | License: Permissive (Apache-2.0)

esp8266-google-tts by horihiro
C++ 48 | Updated: 3 y ago | License: Permissive (MIT)

EMU-webApp by IPS-LMU
The EMU-webApp is an online and offline web application for labeling, visualizing and correcting speech and derived speech data.
TypeScript 48 | Updated: 1 y ago | License: Permissive (MIT)

QuiFFT by mileshenrichs
Amazingly simple Fourier transform library for Java
Java 47 | Updated: 2 y ago | License: Permissive (MIT)

ContinuesVoiceRecognition by galrom
Continuous recognition using Android SpeechRecognition
Java 47 | Updated: 4 y ago | License: Permissive (MIT)

SpeechToText by smartherd
Speech to text in Android
Java 47 | Updated: 4 y ago | License: No License

vae_tacotron by yanggeng1995
Python 47 | Updated: 4 y ago | License: Permissive (MIT)

TiramisuASR by usimarit
Automatic speech recognition as sweet as tiramisu cake, built with TensorFlow 2. Supports languages with small character sets, such as English, Vietnamese, and German.
Python 47 | Updated: 4 y ago | License: Permissive (Apache-2.0)

Speaker-independent-emotional-voice-conversion-based-on-conditional-VAW-GAN-and-CWT by KunZhou9646
Implementation of the paper "Converting anyone's emotion: towards speaker-independent emotional voice conversion".
Python 47 | Updated: 3 y ago | License: No License

wavenet-classifier by mjpyeon
Keras implementation of DeepMind's WaveNet for supervised learning tasks
Python 47 | Updated: 4 y ago | License: No License

speech-recognition by capacitor-community
Java 47 | Updated: 2 y ago | License: Permissive (MIT)

AESRC2020 by R1ckShi
Data preparation scripts, training pipeline, and baseline experiment results for the Interspeech 2020 Accented English Speech Recognition Challenge (AESRC).
Python 47 | Updated: 1 y ago | License: Permissive (Apache-2.0)

Kaldi_NL by opensource-spraakherkenning-nl
Code related to the Dutch instance and user groups of the Kaldi speech recognition toolkit
Shell 47 | Updated: 3 y ago | License: Permissive (Apache-2.0)

bigfft by remyoudompheng
Big integer multiplication library for Go using the fast Fourier transform
Go 47 | Updated: 4 y ago | License: Permissive (BSD-3-Clause)

AmazonSpeechTranslator by mobilequickie
End-to-end solution for speech recognition, text translation, and text-to-speech on iOS, using Amazon Translate and Amazon Polly as AWS managed machine learning services.
Swift 47 | Updated: 3 y ago | License: Proprietary

ebur128 by sdroege
Implementation of the EBU R128 loudness standard
Rust 47 | Updated: 4 y ago | License: Permissive (MIT)

vad by shiweixingcn
A VAD (voice activity detection) implementation, used for Baidu voice recognition
C 47 | Updated: 4 y ago | License: No License

EasyEspnet by jindongwang
Making ESPnet easier to use
Python 47 | Updated: 2 y ago | License: Permissive (Apache-2.0)

speech-recognition-in-javascript by zolomohan
Final code for the "Speech Recognition in JavaScript" tutorial.
JavaScript 47 | Updated: 2 y ago | License: No License

asrecognition by jonatasgrosman
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
Python 47 | Updated: 3 y ago | License: Permissive (MIT)

Daft-Exprt by keonlee9420
PyTorch implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
Python 47 | Updated: 2 y ago | License: Permissive (MIT)

MediumVC by BrightGu
Any-to-any voice conversion using synthetic specific-speaker speeches as intermedium features
Python 47 | Updated: 2 y ago | License: No License

fasttext-langdetect by zafercavdar
80x faster and 95% accurate language identification with fastText
Python 47 | Updated: 2 y ago | License: Permissive (MIT)

react-native-spokestack by spokestack
Spokestack: give your React Native app a voice interface!
TypeScript 46 | Updated: 3 y ago | License: Permissive (Apache-2.0)

automatic-speech-recognition by brianlan
Automatic speech recognition using TensorFlow
Python 46 | Updated: 4 y ago | License: No License

srtsync by pums974
Automatic synchronizer of subtitles based on voice activity in the video
Python 46 | Updated: 4 y ago | License: Strong Copyleft (GPL-3.0)

DeepSpeech_Frontend by AccelerateNetworks
A webpage and API for using Mozilla DeepSpeech
Python 46 | Updated: 4 y ago | License: Strong Copyleft (GPL-3.0)

XSound.js by Korilakkuma
HTML5 Web Audio API library
JavaScript 46 | Updated: 6 y ago | License: Proprietary

react-native-speech-bubble by charpeni
A speech bubble dialog component for React Native.
JavaScript 46 | Updated: 4 y ago | License: Permissive (MIT)

wink-pos-tagger by winkjs
English part-of-speech (POS) tagger
JavaScript 46 | Updated: 3 y ago | License: Permissive (MIT)

django-deepspeech-server by ashwan1
Mozilla DeepSpeech server implemented in Django.
JavaScript 46 | Updated: 4 y ago | License: Permissive (MIT)

TFGAN by rishikksh20
TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis
Python 46 | Updated: 3 y ago | License: Permissive (Apache-2.0)

SoundTouch-Android by VladimirKulyk
An Android wrapper for the C++ SoundTouch audio processing library
C++ 46 | Updated: 4 y ago | License: No License

tacotron2 by A-Jacobson
PyTorch Tacotron 2 (https://arxiv.org/pdf/1712.05884.pdf)
Jupyter Notebook 46 | Updated: 4 y ago | License: No License

Deep-Learning-Speech-Recognition by AKBoles
Project to learn about speech recognition, covering both speaker diarization and other speech recognition applications.
Jupyter Notebook 46 | Updated: 4 y ago | License: No License