Add voice commands to your website with easy and simple way
Support
Quality
Security
License
Reuse
📱Voice transcription iOS app powered by IBM Watson
Support
Quality
Security
License
Reuse
React app using the Watson Speech to Text service to transform voice audio into written text.
Support
Quality
Security
License
Reuse
A Web Application that implements Speech Recognition and Speech Synthesis using Web APIs, Angular, TypeScript, RxJS, and Angular Material
Support
Quality
Security
License
Reuse
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
Support
Quality
Security
License
Reuse
A
Adaptive-MultiSpeaker-Separationby Totoketchup
Jupyter Notebook 43 Version:Current License: Permissive (MIT)
Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem
Support
Quality
Security
License
Reuse
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis”
Support
Quality
Security
License
Reuse
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Support
Quality
Security
License
Reuse
A highly customisable Intelligent Personal Assistant
Support
Quality
Security
License
Reuse
s
speech-emotion-recognitionby PiotrSobczak
Python 42 Version:Current License: No License (No License)
Multi-modal Speech Emotion Recogniton on IEMOCAP dataset
Support
Quality
Security
License
Reuse
Tensorflow implementation for Speech Enhancement (DDAE)
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Инструмент для тестирования и отладки СanvasApps — навыков семейства Виртуальных Ассистентов "Салют"
Support
Quality
Security
License
Reuse
A tool to generate Audio files from text strings in WAV and MP3 format, using various TTS engines as the source
Support
Quality
Security
License
Reuse
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
Support
Quality
Security
License
Reuse
Old Angular Prototyping Project (no longer maintained)
Support
Quality
Security
License
Reuse
The Cainteoir Text-to-Speech core engine
Support
Quality
Security
License
Reuse
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Support
Quality
Security
License
Reuse
MelGAN implementation with Multi-Band and Full Band supports...
Support
Quality
Security
License
Reuse
使用C++ OnnxRuntime 重构了Tacotron2的推理,使用Libtorch实现了VITS单角色和多角色模型推理的集成UI软件
Support
Quality
Security
License
Reuse
My Part of Speech Tagger
Support
Quality
Security
License
Reuse
:speaker: :e-mail: Voice Based Email for (Blinds?)
Support
Quality
Security
License
Reuse
一句代码搞定语音合成,文字转语音
Support
Quality
Security
License
Reuse
Contains code for our work on speech to singing conversion (ICASSP 2020)
Support
Quality
Security
License
Reuse
generates transcript for video from link
Support
Quality
Security
License
Reuse
node.js module for Yandex speech systems (ASR & TTS)
Support
Quality
Security
License
Reuse
simplistic Digital Audio Workstation written with React/Redux and Electron
Support
Quality
Security
License
Reuse
Kaldi based speaker verification
Support
Quality
Security
License
Reuse
N
Neural-Voice-Cloningby IEEE-NITK
Jupyter Notebook 41 Version:Current License: No License (No License)
Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-speaker generative model with a few cloning samples.
Support
Quality
Security
License
Reuse
w
whisper-openai-gradio-implementationby innovatorved
Python 41 Version:Current License: No License (No License)
Whisper is an automatic speech recognition (ASR) system Gradio Web UI Implementation
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Fast Fourier Transform algorithms from Processing's Minim audio library, adapted to work in android
Support
Quality
Security
License
Reuse
A statistical model-based Speech Enhancement Using MMSE-STSA
Support
Quality
Security
License
Reuse
Chatbot in russian with speech recognition using PocketSphinx and speech synthesis using RHVoice. The AttentionSeq2Seq model is used. Imlemented using Python3+TensorFlow+Keras.
Support
Quality
Security
License
Reuse
Sound augmentation using Large-scale audio dataset (Audioset)
Support
Quality
Security
License
Reuse
PyTorch end-to-end speech recognition
Support
Quality
Security
License
Reuse
Scripts for training Mozilla's DeepSpeech using german speech data
Support
Quality
Security
License
Reuse
Speech Signal Processing - a small collection of routines in Python to do signal processing
Support
Quality
Security
License
Reuse
p
pitch-detection-librosa-pythonby miromasat
Python 40 Version:Current License: No License (No License)
A simple example for extracting a pitch of a voice-track using a python library called librosa. The script is generating smoothed graphs of pitch.
Support
Quality
Security
License
Reuse
A HTML widget for speech recognition from audio or video
Support
Quality
Security
License
Reuse
iOS application to tell the time in the British way 🇬🇧⏰
Support
Quality
Security
License
Reuse
A command line tool to read text with Microsoft Speech API (SAPI).
Support
Quality
Security
License
Reuse
C++ 11 algorithm implementation for voice conversion using harmonic plus stochastic models
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Zero-Shot Text-To-Speech
Support
Quality
Security
License
Reuse
S
Spoken_language_identificationby SpeechFlow-io
Python 40 Version:Current License: Permissive (Apache-2.0)
A TensorFlow-based spoken language identification
Support
Quality
Security
License
Reuse
A
Automatic-Speech-Recognitionby 30stomercury
Python 39 Version:Current License: No License (No License)
End-to-End Speech Recognition Using Tensorflow
Support
Quality
Security
License
Reuse
A python package that make tensorflow be able to read "Kaldi" scp/ark in an elegant way. May kaldi user happy to enter tensorflow world.
Support
Quality
Security
License
Reuse
Use ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别
Support
Quality
Security
License
Reuse
wwdc 2017 video subtitles
Support
Quality
Security
License
Reuse
v
voiceCmdrby jj09
Add voice commands to your website with easy and simple way
JavaScript 43Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
m
meowth-iosby yrezgui
📱Voice transcription iOS app powered by IBM Watson
JavaScript 43Updated: 5 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
s
speech-to-text-code-patternby IBM
React app using the Watson Speech to Text service to transform voice audio into written text.
JavaScript 43Updated: 2 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
w
web-speech-angularby luixaviles
A Web Application that implements Speech Recognition and Speech Synthesis using Web APIs, Angular, TypeScript, RxJS, and Angular Material
TypeScript 43Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
v
voxcelebby cyrta
mirror of VoxCeleb dataset - a large-scale speaker identification dataset
Shell 43Updated: 4 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
A
Adaptive-MultiSpeaker-Separationby Totoketchup
Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem
Jupyter Notebook 43Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
C
CDFSE_FastSpeech2by Labmem-Zhouyx
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis”
Python 43Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
S
SpecAugmentby pyyush
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
Python 42Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
T
Tzara---A-Personal-Assistantby Suman7495
A highly customisable Intelligent Personal Assistant
Python 42Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
speech-emotion-recognitionby PiotrSobczak
Multi-modal Speech Emotion Recogniton on IEMOCAP dataset
Python 42Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
D
DeepDenoisingAutoencoderby jonlu0602
Tensorflow implementation for Speech Enhancement (DDAE)
Python 42Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
u
url-bar-gamesby MatthewRayfield
JavaScript 42Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
assistant-clientby sberdevices
Инструмент для тестирования и отладки СanvasApps — навыков семейства Виртуальных Ассистентов "Салют"
TypeScript 42Updated: 3 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
T
TTSAutomateby CaffeineAU
A tool to generate Audio files from text strings in WAV and MP3 format, using various TTS engines as the source
C# 42Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
L
LVCNetby ZENGZHEN-TTS
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
Python 42Updated: 3 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
t
tractorby micahgodbolt
Old Angular Prototyping Project (no longer maintained)
CSS 42Updated: 7 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
c
cainteoir-engineby rhdunn
The Cainteoir Text-to-Speech core engine
C++ 42Updated: 4 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
l
lightning-asrby sooftware
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
Python 42Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
m
melganby rishikksh20
MelGAN implementation with Multi-Band and Full Band supports...
Jupyter Notebook 42Updated: 3 y ago License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
M
MioTTSby NaruseMioShirakana
使用C++ OnnxRuntime 重构了Tacotron2的推理,使用Libtorch实现了VITS单角色和多角色模型推理的集成UI软件
C++ 42Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
f
fasttag_v2by mark-watson
My Part of Speech Tagger
Java 41Updated: 6 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
V
Voicemailby thegenuinegourav
:speaker: :e-mail: Voice Based Email for (Blinds?)
Java 41Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
T
Support
Quality
Security
License
Reuse
s
sp2si-codeby jayneelparekh
Contains code for our work on speech to singing conversion (ICASSP 2020)
Python 41Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
S
Subtitles-generatorby nestyme
generates transcript for video from link
Python 41Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
y
yandex-speechby antirek
node.js module for Yandex speech systems (ASR & TTS)
JavaScript 41Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
schmixby ramirezd42
simplistic Digital Audio Workstation written with React/Redux and Electron
JavaScript 41Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
K
KaldiBasedSpeakerVerificationby qianhwan
Kaldi based speaker verification
C++ 41Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
N
Neural-Voice-Cloningby IEEE-NITK
Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-speaker generative model with a few cloning samples.
Jupyter Notebook 41Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
w
whisper-openai-gradio-implementationby innovatorved
Whisper is an automatic speech recognition (ASR) system Gradio Web UI Implementation
Python 41Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
U
UUVCby b04901014
Python 41Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
a
android_fft_minimby dasaki
Fast Fourier Transform algorithms from Processing's Minim audio library, adapted to work in android
Java 40Updated: 4 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
S
Speech_Enhancement_MMSE-STSAby eesungkim
A statistical model-based Speech Enhancement Using MMSE-STSA
Python 40Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
V
Voice_ChatBotby Desklop
Chatbot in russian with speech recognition using PocketSphinx and speech synthesis using RHVoice. The AttentionSeq2Seq model is used. Imlemented using Python3+TensorFlow+Keras.
Python 40Updated: 3 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
a
audioset_augmentorby AppleHolic
Sound augmentation using Large-scale audio dataset (Audioset)
Python 40Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
o
open_stt_e2eby 1ytic
PyTorch end-to-end speech recognition
Python 40Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
d
deepspeech-germanby ynop
Scripts for training Mozilla's DeepSpeech using german speech data
Python 40Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
sspby idiap
Speech Signal Processing - a small collection of routines in Python to do signal processing
Python 40Updated: 4 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
p
pitch-detection-librosa-pythonby miromasat
A simple example for extracting a pitch of a voice-track using a python library called librosa. The script is generating smoothed graphs of pitch.
Python 40Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
H
HTML5-Speech-to-Textby pulipulichen
A HTML widget for speech recognition from audio or video
JavaScript 40Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
t
telltimeby renaudjenny
iOS application to tell the time in the British way 🇬🇧⏰
Swift 40Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
t
ttsby brookhong
A command line tool to read text with Microsoft Speech API (SAPI).
C++ 40Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
v
vchsmby ShuhuaGao
C++ 11 algorithm implementation for voice conversion using harmonic plus stochastic models
C++ 40Updated: 3 y ago License: Weak Copyleft (LGPL-3.0)
Support
Quality
Security
License
Reuse
k
kiirkirjutajaby alumae
Python 40Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
v
Support
Quality
Security
License
Reuse
S
Spoken_language_identificationby SpeechFlow-io
A TensorFlow-based spoken language identification
Python 40Updated: 2 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
A
Automatic-Speech-Recognitionby 30stomercury
End-to-End Speech Recognition Using Tensorflow
Python 39Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
t
tf_kaldi_ioby open-speech
A python package that make tensorflow be able to read "Kaldi" scp/ark in an elegant way. May kaldi user happy to enter tensorflow world.
Python 39Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
speech_recognition_ctcby EliasCai
Use ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别
Python 39Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
w
wwdc2017_videos_subtitlesby jianpx
wwdc 2017 video subtitles
JavaScript 39Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse