Repository for the CLiPS HAte speech DEtection System [HADES].
Support
Quality
Security
License
Reuse
An open-source tool for automatic speech recognition ASR quality estimation.
Support
Quality
Security
License
Reuse
Speech Commands Recognition using end-to-end deep learning models in pytorch
Support
Quality
Security
License
Reuse
Neural speaker recognition/verification system based on Kaldi and Tensorflow
Support
Quality
Security
License
Reuse
Python library and CLI for Speechmatics
Support
Quality
Security
License
Reuse
Meta-learning model agnostic (MAML) implementation for cross-accented ASR
Support
Quality
Security
License
Reuse
node.js module for Google speech systems (ASR & TTS)
Support
Quality
Security
License
Reuse
WARNING: THIS REPOSITORY IS NO LONGER BEING UPDATED THEREFORE USE THE OTHER REPOSITORY: https://github.com/MrWall112/advanced-discord-bot-easy-install
Support
Quality
Security
License
Reuse
Practice your speech level in any language using speech recognition
Support
Quality
Security
License
Reuse
AI grand challenge 2020 Repo (Speech Recognition Track)
Support
Quality
Security
License
Reuse
中文语音识别,windows可用,准确率较为有限,无需配置gpu
Support
Quality
Security
License
Reuse
a
asterisk-voicekit-modulesby TinkoffCreditSystems
Shell 23 Version:Current License: No License (No License)
Non-blocking Asterisk modules for accessing VoiceKit services for speech recognition and speech synthesis.
Support
Quality
Security
License
Reuse
A
ASR-System-for-Hindi-Languageby KunalDhawan
Shell 23 Version:Current License: No License (No License)
The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project description is available at :- https://kunal-dhawan.weebly.com/asr-system-for-hindi-language-from-scratch.html) : It contains the code for the following systems - 1) Monophone-HMM system built using HTK toolkit , 2)Monophone-HMM system built using Kaldi toolkit, 3)Triphone-HMM system built using Kaldi toolkit and 4)DNN-HMM system built using Kaldi toolkit
Support
Quality
Security
License
Reuse
Tianbot Rosecho (Tianecho),中文语音人机交互模块,支持ROS即插即用
Support
Quality
Security
License
Reuse
java JNI 调用C实现.speex转换为.wav;使用场景:微信高清语音.speex解码为.wav
Support
Quality
Security
License
Reuse
24-hour Automatic Speech Recognition
Support
Quality
Security
License
Reuse
Kaldi Speech Processing Tools
Support
Quality
Security
License
Reuse
Brian King's VocalKit for voice recognition for iOS devices - redesigned, stripped, better
Support
Quality
Security
License
Reuse
A wavelet audio denoiser done in python
Support
Quality
Security
License
Reuse
Simple speech recognition using dynamic time warping with examples
Support
Quality
Security
License
Reuse
S
Singing-Voice-Conversionby solalala-12
Jupyter Notebook 23 Version:Current License: No License (No License)
2019/04~2019/09 투빅스 Singing Voice Conversion
Support
Quality
Security
License
Reuse
api.audio Python SDK
Support
Quality
Security
License
Reuse
Sample C++ command-line Riva clients.
Support
Quality
Security
License
Reuse
Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets
Support
Quality
Security
License
Reuse
ASRT Speech Recognition SDK for Java. 用于ASRT语音识别系统的Java SDK
Support
Quality
Security
License
Reuse
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
Support
Quality
Security
License
Reuse
Experiments to test different speech recognition systems for SEPIA Framework
Support
Quality
Security
License
Reuse
Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION
Support
Quality
Security
License
Reuse
Implementation of a light-weighted Latent-Composer in PyTorch based on "Composer: Creative and Controllable Image Synthesis with Composable Conditions".
Support
Quality
Security
License
Reuse
A Processing library to interface with the SuperCollider synthesis engine.
Support
Quality
Security
License
Reuse
Generation of speech using Yandex SpeechKit.
Support
Quality
Security
License
Reuse
Simple Text-To-Speech (TTS) interface library with multi-language and multi-engine support.
Support
Quality
Security
License
Reuse
Voice Activity Detection
Support
Quality
Security
License
Reuse
Speaker Verification
Support
Quality
Security
License
Reuse
DDAE speech enhancement on spectrogram domain using Keras
Support
Quality
Security
License
Reuse
A demo for simple isolated Chinese speech word recognition using GMMHMM in Python
Support
Quality
Security
License
Reuse
TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Custom wakeup-words for an Android app
Support
Quality
Security
License
Reuse
tacotron for research on Chinese speech synthesis and Taiwanese speech synthesis from Chinese input text sequence with different granularities
Support
Quality
Security
License
Reuse
text-to-speech notification
Support
Quality
Security
License
Reuse
Voice-Control for the MagicMirror based in Google Speech Recognizer (annyang)
Support
Quality
Security
License
Reuse
A simple demo project for the Microsoft Cognitive Services Speaker Recognition APIs
Support
Quality
Security
License
Reuse
External Audio Analyzer for Unity
Support
Quality
Security
License
Reuse
An application that demostrate the usage of Syn.Speech library for Speech Recognition
Support
Quality
Security
License
Reuse
Shazam's music identification functionality ported to Windows
Support
Quality
Security
License
Reuse
💬 A wrapper for popular TTS services to create a more simple & uniform API. Currently, only AWS Polly is supported.
Support
Quality
Security
License
Reuse
Python wrapper for kaldi's arpa2fst
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Multilingual speech translation
Support
Quality
Security
License
Reuse
h
hadesby clips
Repository for the CLiPS HAte speech DEtection System [HADES].
Python 23Updated: 3 y ago License: Weak Copyleft (LGPL-3.0)
Support
Quality
Security
License
Reuse
T
TranscRaterby hlt-mt
An open-source tool for automatic speech recognition ASR quality estimation.
Python 23Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
g
gcommandsby jarfo
Speech Commands Recognition using end-to-end deep learning models in pytorch
Python 23Updated: 1 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
t
tf-kaldi-speakerby mycrazycracy
Neural speaker recognition/verification system based on Kaldi and Tensorflow
Python 23Updated: 3 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
s
speechmatics-pythonby speechmatics
Python library and CLI for Speechmatics
Python 23Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
c
cross-accent-maml-asrby audioku
Meta-learning model agnostic (MAML) implementation for cross-accented ASR
Python 23Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
g
google-speechby antirek
node.js module for Google speech systems (ASR & TTS)
JavaScript 23Updated: 5 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
A
Advanced-Discord-Bot-js8by MrWall112
WARNING: THIS REPOSITORY IS NO LONGER BEING UPDATED THEREFORE USE THE OTHER REPOSITORY: https://github.com/MrWall112/advanced-discord-bot-easy-install
JavaScript 23Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
g
good-speech-web-clientby GoodSpeech
Practice your speech level in any language using speech recognition
JavaScript 23Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
A
AI-Grand-Challenge-2020by yschoi-nisp
AI grand challenge 2020 Repo (Speech Recognition Track)
Python 23Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
C
Chinese_speech_recognitionby sethGu
中文语音识别,windows可用,准确率较为有限,无需配置gpu
Python 23Updated: 1 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
a
asterisk-voicekit-modulesby TinkoffCreditSystems
Non-blocking Asterisk modules for accessing VoiceKit services for speech recognition and speech synthesis.
Shell 23Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
A
ASR-System-for-Hindi-Languageby KunalDhawan
The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project description is available at :- https://kunal-dhawan.weebly.com/asr-system-for-hindi-language-from-scratch.html) : It contains the code for the following systems - 1) Monophone-HMM system built using HTK toolkit , 2)Monophone-HMM system built using Kaldi toolkit, 3)Triphone-HMM system built using Kaldi toolkit and 4)DNN-HMM system built using Kaldi toolkit
Shell 23Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
r
rosechoby tianbot
Tianbot Rosecho (Tianecho),中文语音人机交互模块,支持ROS即插即用
C 23Updated: 4 y ago License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
J
JSpeex-utilby guoguo11
java JNI 调用C实现.speex转换为.wav;使用场景:微信高清语音.speex解码为.wav
C 23Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
a
asr24by uiuc-sst
24-hour Automatic Speech Recognition
C++ 23Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
f
featxtraby mvansegbroeck-zz
Kaldi Speech Processing Tools
C++ 23Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
V
VocalKitby H2CO3
Brian King's VocalKit for voice recognition for iOS devices - redesigned, stripped, better
C 23Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
w
wavelet-denoiserby actondev
A wavelet audio denoiser done in python
Python 23Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
d
dtwby crawles
Simple speech recognition using dynamic time warping with examples
Jupyter Notebook 23Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
S
Singing-Voice-Conversionby solalala-12
2019/04~2019/09 투빅스 Singing Voice Conversion
Jupyter Notebook 23Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
apiaudio-pythonby aflorithmic
api.audio Python SDK
Python 23Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
c
cpp-clientsby nvidia-riva
Sample C++ command-line Riva clients.
C++ 23Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
M
MT4SSLby ddlBoJack
Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets
Python 23Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
A
ASRT_SDK_Javaby nl8590687
ASRT Speech Recognition SDK for Java. 用于ASRT语音识别系统的Java SDK
Java 23Updated: 2 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
a
asr-corpus-creatorby egorsmkv
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
Python 23Updated: 2 y ago License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
s
speech-recognition-experimentsby fquirin
Experiments to test different speech recognition systems for SEPIA Framework
Python 23Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
u
ucla-phonetic-corpusby xinjli
Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION
Python 23Updated: 2 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
L
Latent-Composer-pytorchby aartykov
Implementation of a light-weighted Latent-Composer in PyTorch based on "Composer: Creative and Controllable Image Synthesis with Composable Conditions".
Jupyter Notebook 23Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
p
processing-scby ideoforms
A Processing library to interface with the SuperCollider synthesis engine.
Java 22Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
y
yandex_speechby art1415926535
Generation of speech using Yandex SpeechKit.
Python 22Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
t
talkeyby grigi
Simple Text-To-Speech (TTS) interface library with multi-language and multi-engine support.
Python 22Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
V
Support
Quality
Security
License
Reuse
H
Hackbright-Projectby ritchieleeann
Speaker Verification
Python 22Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
D
DDAEby jerrygood0703
DDAE speech enhancement on spectrogram domain using Keras
Python 22Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
h
hmm_speech_recognition_demoby wblgers
A demo for simple isolated Chinese speech word recognition using GMMHMM in Python
Python 22Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
attentive-modality-hopping-for-SERby david-yoon
TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20
Python 22Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
A
Acoustic_Indicesby patriceguyot
Python 22Updated: 4 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
h
hotwordby wolfpaulus
Custom wakeup-words for an Android app
Java 22Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
t
tacotronby HappyBall
tacotron for research on Chinese speech synthesis and Taiwanese speech synthesis from Chinese input text sequence with different granularities
Python 22Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
t
Support
Quality
Security
License
Reuse
M
MMM-Hello-Mirrorby Matzefication
Voice-Control for the MagicMirror based in Google Speech Recognizer (annyang)
JavaScript 22Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
speaker-recognition-apiby rposbo
A simple demo project for the Microsoft Cognitive Services Speaker Recognition APIs
JavaScript 22Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
A
AudioJackby keijiro
External Audio Analyzer for Unity
C# 22Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
syn-speech-samplesby SynHub
An application that demostrate the usage of Syn.Speech library for Speech Recognition
C# 22Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
w
windows-shazamby tomer8007
Shazam's music identification functionality ported to Windows
C# 22Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
l
laravel-text-to-speechby meemalabs
💬 A wrapper for popular TTS services to create a more simple & uniform API. Currently, only AWS Polly is supported.
PHP 22Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
k
kaldilmby csukuangfj
Python wrapper for kaldi's arpa2fst
C++ 22Updated: 3 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
I
ISSAI_SAIDA_Kazakh_ASRby IS2AI
Shell 22Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
speech-translationby formiel
Multilingual speech translation
Shell 22Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse