S
SpeechControlledCommandLineSystemby debojeetjha10
Python 3 Version:Current License: No License (No License)
Support
Quality
Security
License
Reuse
Pronounce and Speech Text - Enter Word and Get the Pronunciation and Speech Text.
Support
Quality
Security
License
Reuse
Control Thymio Robot via Voice Commands
Support
Quality
Security
License
Reuse
Bemba ASR model obtained by fine-tuning a well performing DeepSpeech English pretrained model.
Support
Quality
Security
License
Reuse
This is a mandarin version of speech separation dataset like WSJMix and LibriMix
Support
Quality
Security
License
Reuse
c
cognitive-services-speech-sdk-rsby jabber-tools
Rust 3 Version:Current License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
Low latency multi-thread audio transforms and conversions
Support
Quality
Security
License
Reuse
Basketball themed voice assistant created using Speech Recognition API and Google Text-to-speech API
Support
Quality
Security
License
Reuse
Find duplicate audio files using fingerprints
Support
Quality
Security
License
Reuse
Y
Youtube-video-transcriptorby labrijisaad
Jupyter Notebook 3 Version:Current License: No License (No License)
In this notebook, I tried to write a script capable of transcribing youtube videos (audios in general) using the google's speech-to-text.
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
bridge between the discourse and the Telegram | Matrix
Support
Quality
Security
License
Reuse
User interface for the ASR system
Support
Quality
Security
License
Reuse
Unofficial implementations of environmental sound synthesis system with Transformer
Support
Quality
Security
License
Reuse
ANDROID APP that can RECOGNIZE ANY LIVE AUDIO/VIDEO STREAMING (using free VOSK Speech Recognition API) then TRANSLATE (using unofficial online Google Translate API) and display it as LIVE CAPTION / LIVE SUBTITLE
Support
Quality
Security
License
Reuse
I
IBM-Project-35339-1660283665by IBM-EPBL
Jupyter Notebook 3 Version:Current License: No License (No License)
AI based discourse for Banking Industry
Support
Quality
Security
License
Reuse
N
Non-English-Tacotron-2-Training-Notebookby Mildemelwe
Jupyter Notebook 3 Version:Current License: Permissive (MIT)
Tacotron 2 training notebook supporting Japanese, French, and Mandarin
Support
Quality
Security
License
Reuse
a
amazon-live-translation-polly-transcribeby aws-samples
Python 3 Version:Current License: Permissive (MIT-0)
Support
Quality
Security
License
Reuse
a
audio_transcription_deepspeechby NathaliaBarreiros
Python 3 Version:Current License: Permissive (MIT)
Transcription .wav audio files with DeepSpeech library
Support
Quality
Security
License
Reuse
Speech-to-Speech translation dataset for German and English (text and speech quadruplets). Currently bussy uploading the audiofiles
Support
Quality
Security
License
Reuse
I
IBM-Project-1463-1658388896by IBM-EPBL
Jupyter Notebook 3 Version:Current License: No License (No License)
AI based discourse for Banking Industry
Support
Quality
Security
License
Reuse
Text-to-Speech for Crimean Tatar language
Support
Quality
Security
License
Reuse
A locally hosted TTS solution for Neos
Support
Quality
Security
License
Reuse
K
Kaggle-Speech-Recognition-Challengeby benjaminlq
Jupyter Notebook 3 Version:Current License: Permissive (MIT)
Speech Recognition
Support
Quality
Security
License
Reuse
Audio Share can share Windows computer's audio to Android phone over network, so your phone becomes the speaker of computer. (You needn't to buy a new speaker😄.)
Support
Quality
Security
License
Reuse
It's our Internship work with Traivis UK
Support
Quality
Security
License
Reuse
Articulatory Speech Synthesis
Support
Quality
Security
License
Reuse
Android app demonstrating on-device wake word voice recognition using the PocketSphinx engine
Support
Quality
Security
License
Reuse
simple live-speech-recognition implementation in python
Support
Quality
Security
License
Reuse
A simple but powerful voice assistant.
Support
Quality
Security
License
Reuse
c++ WAV class for my processing modules
Support
Quality
Security
License
Reuse
The online office hours queue for 15-122 @ CMU
Support
Quality
Security
License
Reuse
Speech synthesis software without voice actors
Support
Quality
Security
License
Reuse
REST API to check text for spelling issues and give suggestions to the user.
Support
Quality
Security
License
Reuse
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
Support
Quality
Security
License
Reuse
Automatic speech transcription and speaker identification pipeline based on Kaldi and Nextflow
Support
Quality
Security
License
Reuse
turn text into speech
Support
Quality
Security
License
Reuse
Vosk (offline speech recognition library) binding for Lua
Support
Quality
Security
License
Reuse
D
Deep_Learning_Speech_Recognition_Modelsby ylei532
Python 3 Version:Current License: No License (No License)
Developed DL models for speech recognition systems for speech impaired individuals. Models were trained and evaluated on the TORGO and UASpeech datasets
Support
Quality
Security
License
Reuse
Change utterance's gender of an audio file
Support
Quality
Security
License
Reuse
source for for the blog "Use Amazon Polly to create voice messages from your IoT messages"
Support
Quality
Security
License
Reuse
Adds two-way radios with different upgrades.
Support
Quality
Security
License
Reuse
speech recognition sysytem
Support
Quality
Security
License
Reuse
ASR with Speaker Diarization API
Support
Quality
Security
License
Reuse
A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Speech (gTTS) to play out the translation.
Support
Quality
Security
License
Reuse
S
SpeechRecognitionComparisonRussianby Mike-Kuznetsov
Python 3 Version:Current License: No License (No License)
I tested the most popular russian speech recognizers
Support
Quality
Security
License
Reuse
Official repository for paper "MATERobot: Material Recognition in Wearable Robotics for People with Visual Impairments"
Support
Quality
Security
License
Reuse
Speech to text search and text to speech voice over using Google API
Support
Quality
Security
License
Reuse
Amazon Polly Plugin for Fonoster Voice Apps
Support
Quality
Security
License
Reuse
Simple Python wrapper for ChatGPT with speech modules (STT, TTS)
Support
Quality
Security
License
Reuse
S
SpeechControlledCommandLineSystemby debojeetjha10
Python 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
p
pronounce-and-speechby mskian
Pronounce and Speech Text - Enter Word and Get the Pronunciation and Speech Text.
JavaScript 3Updated: 1 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
c
control-thymio-via-voiceby ahmad081177
Control Thymio Robot via Voice Commands
Python 3Updated: 2 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
b
bembaspeech-expsby csikasote
Bemba ASR model obtained by fine-tuning a well performing DeepSpeech English pretrained model.
Jupyter Notebook 3Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
A
Aishell1Mixby huangzj421
This is a mandarin version of speech separation dataset like WSJMix and LibriMix
Python 3Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
c
cognitive-services-speech-sdk-rsby jabber-tools
Rust 3Updated: 2 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
n
nwaveby ionite34
Low latency multi-thread audio transforms and conversions
Python 3Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
B
Basketbot-Voice-Assistantby IshaanPuri
Basketball themed voice assistant created using Speech Recognition API and Google Text-to-speech API
Python 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
soundalikeby derat
Find duplicate audio files using fingerprints
Go 3Updated: 2 y ago License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
Y
Youtube-video-transcriptorby labrijisaad
In this notebook, I tried to write a script capable of transcribing youtube videos (audios in general) using the google's speech-to-text.
Jupyter Notebook 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
t
text_to_speechby yui-mhcp
Python 3Updated: 2 y ago License: Strong Copyleft (AGPL-3.0)
Support
Quality
Security
License
Reuse
d
discourse-bridgeby aosus
bridge between the discourse and the Telegram | Matrix
JavaScript 3Updated: 2 y ago License: Strong Copyleft (AGPL-3.0)
Support
Quality
Security
License
Reuse
e
est-asr-uiby taltechnlp
User interface for the ASR system
TypeScript 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
o
onoma-to-wave_transformerby tam17aki
Unofficial implementations of environmental sound synthesis system with Transformer
Python 3Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
V
VOSK-Powered-Live-Subtitle-V3by botbahlul
ANDROID APP that can RECOGNIZE ANY LIVE AUDIO/VIDEO STREAMING (using free VOSK Speech Recognition API) then TRANSLATE (using unofficial online Google Translate API) and display it as LIVE CAPTION / LIVE SUBTITLE
Java 3Updated: 1 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
I
IBM-Project-35339-1660283665by IBM-EPBL
AI based discourse for Banking Industry
Jupyter Notebook 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
N
Non-English-Tacotron-2-Training-Notebookby Mildemelwe
Tacotron 2 training notebook supporting Japanese, French, and Mandarin
Jupyter Notebook 3Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
a
amazon-live-translation-polly-transcribeby aws-samples
Python 3Updated: 2 y ago License: Permissive (MIT-0)
Support
Quality
Security
License
Reuse
a
audio_transcription_deepspeechby NathaliaBarreiros
Transcription .wav audio files with DeepSpeech library
Python 3Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
L
LibriS2Sby PedroDKE
Speech-to-Speech translation dataset for German and English (text and speech quadruplets). Currently bussy uploading the audiofiles
Python 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
I
IBM-Project-1463-1658388896by IBM-EPBL
AI based discourse for Banking Industry
Jupyter Notebook 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
q
qirimtatar-ttsby robinhad
Text-to-Speech for Crimean Tatar language
Python 3Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
n
neos-local-ttsby Zetaphor
A locally hosted TTS solution for Neos
Python 3Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
K
Kaggle-Speech-Recognition-Challengeby benjaminlq
Speech Recognition
Jupyter Notebook 3Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
a
audio-shareby mkckr0
Audio Share can share Windows computer's audio to Android phone over network, so your phone becomes the speaker of computer. (You needn't to buy a new speaker😄.)
C++ 3Updated: 2 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
T
Traivisby Tihsrah
It's our Internship work with Traivis UK
Jupyter Notebook 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
artspeechby vribeiro1
Articulatory Speech Synthesis
Python 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
w
wakewordappby tuckercr
Android app demonstrating on-device wake word voice recognition using the PocketSphinx engine
Kotlin 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
l
live-speech-recognitionby Bratet
simple live-speech-recognition implementation in python
Python 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
akulai_coreby Akul-AI
A simple but powerful voice assistant.
Python 3Updated: 1 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
W
WAVby kooBH
c++ WAV class for my processing modules
C++ 3Updated: 2 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
q
q-primeby cmu15122
The online office hours queue for 15-122 @ CMU
TypeScript 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
p
poinoby KoharuYuzuki
Speech synthesis software without voice actors
TypeScript 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
spelling-checkerby djdelima
REST API to check text for spelling issues and give suggestions to the user.
TypeScript 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
T
Turkish-Text-to-Speechby Rumeysakeskin
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
Python 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
e
est-asr-pipelineby taltechnlp
Automatic speech transcription and speaker identification pipeline based on Kaldi and Nextflow
Python 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
speech-synthesisby butolinka
turn text into speech
CSS 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
l
lua-voskby igor725
Vosk (offline speech recognition library) binding for Lua
C 3Updated: 1 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
D
Deep_Learning_Speech_Recognition_Modelsby ylei532
Developed DL models for speech recognition systems for speech impaired individuals. Models were trained and evaluated on the TORGO and UASpeech datasets
Python 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
v
voice-gender-changerby radinshayanfar
Change utterance's gender of an audio file
Python 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
i
iot-pollyby aws-samples
source for for the blog "Use Amazon Polly to create voice messages from your IoT messages"
Python 3Updated: 2 y ago License: Permissive (MIT-0)
Support
Quality
Security
License
Reuse
w
walkie-talkie-modby Flaton1
Adds two-way radios with different upgrades.
Java 3Updated: 1 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
speech-recognition-by sanusinghmon
speech recognition sysytem
Python 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
w
whisper-diar-apiby marccgrau
ASR with Speaker Diarization API
Python 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
r
real-time-translatorby lperezmo
A quick app to translate speech in real time using the Whisper API for transcribing audio, translating, and then using Google Text-to-Speech (gTTS) to play out the translation.
Python 3Updated: 1 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
S
SpeechRecognitionComparisonRussianby Mike-Kuznetsov
I tested the most popular russian speech recognizers
Python 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
M
MATERobotby JunweiZheng93
Official repository for paper "MATERobot: Material Recognition in Wearable Robotics for People with Visual Impairments"
Python 3Updated: 1 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
p
python-web-quick-searchby szcharlesji
Speech to text search and text to speech voice over using Google API
Python 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
p
pollyttsby fonoster
Amazon Polly Plugin for Fonoster Voice Apps
TypeScript 3Updated: 1 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
speech2speech_chatGPTby Janghyun1230
Simple Python wrapper for ChatGPT with speech modules (STT, TTS)
Python 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse