Sphinx speech recognition for unity
Support
Quality
Security
License
Reuse
A public domain single speaker Japanese speech dataset
Support
Quality
Security
License
Reuse
T
Text-to-speech-in-pythonby vishwajeetanand21
Python 21 Version:Current License: No License (No License)
Text to speech is a process to convert any text into voice. Text to speech project takes words on digital devices and convert them into audio. Here I have used Google-text-to-speech library popularly known as gTTS library to convert text file to .mp3 file. Hope you like my project!
Support
Quality
Security
License
Reuse
Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.
Support
Quality
Security
License
Reuse
Code for the paper: Separate but TOGETHER: UNSUPERVISED FEDERATED LEARNING FOR SPEECH ENHANCEMENT FROM NON-IID DATA
Support
Quality
Security
License
Reuse
Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS
Support
Quality
Security
License
Reuse
Collection of papers, datasets and tools on the topic of Speech Dereverberation and Speech Enhancement
Support
Quality
Security
License
Reuse
Beam-guided TasNet
Support
Quality
Security
License
Reuse
Simple and free Text-to-Speech (TTS) engine that reads for you any text on your screen with high-quality voices powered by AI models.
Support
Quality
Security
License
Reuse
VN Like Interface for Chatbots
Support
Quality
Security
License
Reuse
A toolkit for processing speech data and creating speech datasets
Support
Quality
Security
License
Reuse
Speech to Text for Processing
Support
Quality
Security
License
Reuse
Kaldi gstreamer android client
Support
Quality
Security
License
Reuse
Android library for offline speech recognition base on Pocketsphinx engine. Add speech recognition feature into your Android app with easier implementations.
Support
Quality
Security
License
Reuse
An Android application which performs SpeechToText
Support
Quality
Security
License
Reuse
Dee - the DeepLens Educating Entertainer #AWSDeepLensChallenge
Support
Quality
Security
License
Reuse
Java+MaryTTS=Java Text To Speech
Support
Quality
Security
License
Reuse
一个基于Java的粤语发音TTS,文字转语音.
Support
Quality
Security
License
Reuse
Python script that lets you easily convert passed text to synthesized audio files, with the help of Amazon's IVONA.
Support
Quality
Security
License
Reuse
G
GAN-based-speech-enhancement-Keras-by fy378968174
Python 20 Version:Current License: No License (No License)
Keras implementation of speech enhancement based on LSGAN
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Linux Voice Assistant for to Make Your Work Easier
Support
Quality
Security
License
Reuse
Text frontend for ESPnet tts recipes
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Wordlists by part of speech and syllable count
Support
Quality
Security
License
Reuse
This application is developed to help speechless people interact with others with ease. It detects voice and converts the input speech into a sign language based video.
Support
Quality
Security
License
Reuse
A Python script to speech some text with Google Translate.
Support
Quality
Security
License
Reuse
Microsoft Speech-Client
Support
Quality
Security
License
Reuse
A Ruby library for consuming the AT&T Speech API for speech to text.
Support
Quality
Security
License
Reuse
s
speech-to-text-websockets-rubyby watson-developer-cloud
Ruby 20 Version:Current License: Permissive (Apache-2.0)
Ruby client that interacts with the IBM Watson Speech to Text service through its WebSockets interface
Support
Quality
Security
License
Reuse
plugin to add part-of-speech (POS) tags
Support
Quality
Security
License
Reuse
A webapp for collecting speech samples for voice recognition testing and training
Support
Quality
Security
License
Reuse
JavaScript MIDI-to-WAV synthesizer
Support
Quality
Security
License
Reuse
WARNING: This repository is no longer maintained ⚠️ This repository will not be updated. The repository will be kept available in read-only mode.
Support
Quality
Security
License
Reuse
Live Transcription based on Speech Recognition API
Support
Quality
Security
License
Reuse
Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.
Support
Quality
Security
License
Reuse
d
dialogflow-audio-recorder-nodejsby googlearchive
JavaScript 20 Version:Current License: Permissive (Apache-2.0)
A simple web app to record audio and a Dialogflow agent to playback the audio as an Action.
Support
Quality
Security
License
Reuse
Binary parsers for arcsecond!
Support
Quality
Security
License
Reuse
Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone
Support
Quality
Security
License
Reuse
R
Robust_Fine_Grained_Prosody_Controlby keonlee9420
Python 20 Version:Current License: Permissive (BSD-3-Clause)
Pytorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
Support
Quality
Security
License
Reuse
CVC: Contrastive Learning for Non-parallel Voice Conversion (in PyTorch)
Support
Quality
Security
License
Reuse
An end-to-end speech recognition system with Wavenet. Built using C++ and python.
Support
Quality
Security
License
Reuse
This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline
Support
Quality
Security
License
Reuse
A simple gstreamer wrapper around Chromium Embedded Framework
Support
Quality
Security
License
Reuse
Android kotlin library for continuous speech recognition with localisations.
Support
Quality
Security
License
Reuse
Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux
Support
Quality
Security
License
Reuse
Repository for the web pages and scripts associated with OpenSLR: the open speech and language repository
Support
Quality
Security
License
Reuse
Antlr parser for NLU examples
Support
Quality
Security
License
Reuse
scripts to align a given wave to its transcription using trained models by Kaldi
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
u
unitysphinxby irllabs
Sphinx speech recognition for unity
C 21Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
K
Kokoro-Speech-Datasetby kaiidams
A public domain single speaker Japanese speech dataset
Python 21Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
T
Text-to-speech-in-pythonby vishwajeetanand21
Text to speech is a process to convert any text into voice. Text to speech project takes words on digital devices and convert them into audio. Here I have used Google-text-to-speech library popularly known as gTTS library to convert text file to .mp3 file. Hope you like my project!
Python 21Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
v
voice100by kaiidams
Voice100 includes neural TTS/ASR models. Inference of Voice100 is low cost as its models are tiny and only depend on CNN without autoregression.
Python 21Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
f
fedenhanceby etzinis
Code for the paper: Separate but TOGETHER: UNSUPERVISED FEDERATED LEARNING FOR SPEECH ENHANCEMENT FROM NON-IID DATA
Jupyter Notebook 21Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
c
coqui_tts_koreaby ttop32
Korean TTS using coqui TTS (glowtts and multiband melgan) - 한국어 TTS
Jupyter Notebook 21Updated: 2 y ago License: Weak Copyleft (MPL-2.0)
Support
Quality
Security
License
Reuse
s
speech-enhancementby jonashaag
Collection of papers, datasets and tools on the topic of Speech Dereverberation and Speech Enhancement
Jupyter Notebook 21Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
B
Beam-Guided-TasNetby hangtingchen
Beam-guided TasNet
Python 21Updated: 2 y ago License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
V
Verbify-TTSby MattePalte
Simple and free Text-to-Speech (TTS) engine that reads for you any text on your screen with high-quality voices powered by AI models.
Python 21Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
R
RenAI-Chatby Rubiksman78
VN Like Interface for Chatbots
Python 21Updated: 2 y ago License: Permissive (CC0-1.0)
Support
Quality
Security
License
Reuse
N
NeMo-speech-data-processorby NVIDIA
A toolkit for processing speech data and creating speech datasets
Python 21Updated: 2 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
S
STTby getflourish
Speech to Text for Processing
Java 20Updated: 6 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
k
kaldi-gstreamer-android-clientby truongdo
Kaldi gstreamer android client
Java 20Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
R
RapidSphinxby odetoyama
Android library for offline speech recognition base on Pocketsphinx engine. Add speech recognition feature into your Android app with easier implementations.
Java 20Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
V
Voicesby DroidDip
An Android application which performs SpeechToText
Java 20Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
d
deeby matthew1000
Dee - the DeepLens Educating Entertainer #AWSDeepLensChallenge
Python 20Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
J
Java-Text-To-Speech-Tutorialby goxr3plus
Java+MaryTTS=Java Text To Speech
Java 20Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
j
Support
Quality
Security
License
Reuse
i
ivona-speakby Pythonity
Python script that lets you easily convert passed text to synthesized audio files, with the help of Amazon's IVONA.
Python 20Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
G
GAN-based-speech-enhancement-Keras-by fy378968174
Keras implementation of speech enhancement based on LSGAN
Python 20Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
I
ISGANby b04901014
Python 20Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
L
LinuxVoiceAssistantby aydinnyunus
Linux Voice Assistant for to Make Your Work Easier
Python 20Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
e
espnet_tts_frontendby espnet
Text frontend for ESPnet tts recipes
Python 20Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
N
Neural-mask-estimationby AkojimaSLP
Python 20Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
syllposby mewo2
Wordlists by part of speech and syllable count
Python 20Updated: 5 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
S
SignDetectby salil-gtm
This application is developed to help speechless people interact with others with ease. It detects voice and converts the input speech into a sign language based video.
Python 20Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
G
Google-Text-To-Speechby JulienD
A Python script to speech some text with Google Translate.
Python 20Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
c
cruzhacks2018by jjuraska
Microsoft Speech-Client
Python 20Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
att_speechby adhearsion
A Ruby library for consuming the AT&T Speech API for speech to text.
Ruby 20Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
speech-to-text-websockets-rubyby watson-developer-cloud
Ruby client that interacts with the IBM Watson Speech to Text service through its WebSockets interface
Ruby 20Updated: 6 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
r
retext-posby retextjs
plugin to add part-of-speech (POS) tags
JavaScript 20Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
m
murmurby mozilla
A webapp for collecting speech samples for voice recognition testing and training
JavaScript 20Updated: 5 y ago License: Weak Copyleft (MPL-2.0)
Support
Quality
Security
License
Reuse
s
synth-jsby patrickroberts
JavaScript MIDI-to-WAV synthesizer
JavaScript 20Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
t
tjbot-sports-buddyby IBM
WARNING: This repository is no longer maintained ⚠️ This repository will not be updated. The repository will be kept available in read-only mode.
JavaScript 20Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
t
transcriptionby marktnoonan
Live Transcription based on Speech Recognition API
JavaScript 20Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
w
web-speech-demoby gladchinda
Learn how to build a simple text-to-speech voice app for the web using the Web Speech API.
JavaScript 20Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
d
dialogflow-audio-recorder-nodejsby googlearchive
A simple web app to record audio and a Dialogflow agent to playback the audio as an Action.
JavaScript 20Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
a
arcsecond-binaryby francisrstokes
Binary parsers for arcsecond!
JavaScript 20Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
k
kaldi-model-serverby uhh-lt
Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone
JavaScript 20Updated: 3 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
R
Robust_Fine_Grained_Prosody_Controlby keonlee9420
Pytorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis
Python 20Updated: 3 y ago License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
C
CVCby Tinglok
CVC: Contrastive Learning for Non-parallel Voice Conversion (in PyTorch)
Python 20Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
w
wavenet-sttby Narasimha1997
An end-to-end speech recognition system with Wavenet. Built using C++ and python.
Python 20Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
a
audio-to-speech-pipelineby Open-Speech-EkStep
This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline
Python 20Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
g
gstcefsrcby centricular
A simple gstreamer wrapper around Chromium Embedded Framework
C++ 20Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
D
DroidSpeech2.0by vikramezhil
Android kotlin library for continuous speech recognition with localisations.
Kotlin 20Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
T
Termux-DeepSpeechby T-vK
Open source offline speech recognition for Android using Mozilla's DeepSpeech in Termux
Shell 20Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
o
openslrby danpovey
Repository for the web pages and scripts associated with OpenSLR: the open speech and language repository
HTML 20Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
n
nlu-example-parserby speechly
Antlr parser for NLU examples
Go 20Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
k
kaldi-allignerby amirharati
scripts to align a given wave to its transcription using trained models by Kaldi
Shell 20Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
m
mixupby speechpro
C++ 20Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse