Clara Project for Theremin
Support
Quality
Security
License
Reuse
A
Audio-Book-Corpus-for-European-Languages-by ajinkyakulkarni14
Jupyter Notebook 3 Version:Current License: Strong Copyleft (GPL-2.0)
Audio Book Corpus (ABC) project has been developed to aid linguist researchers in the field of text to speech for purely academic purposes. In the current form, the corpus consists approximately 200 minutes of speech data in German language. Besides German, we are also in the process of developing Corpus Portuguese and Italian langugae. Future versions of the corpus shall encompass most European languages such as French, Spanish, Czech, Dutch, Polish, Romanian.
Support
Quality
Security
License
Reuse
L
Language-Translator-Using-Pythonby NightWalker110
Jupyter Notebook 3 Version:Current License: No License (No License)
Support
Quality
Security
License
Reuse
L
Language-Translator-Using-Pythonby jahanvisharma-dotcom
Jupyter Notebook 3 Version:Current License: No License (No License)
Support
Quality
Security
License
Reuse
a
automatic-speech-recognitionby ChristophSchmidl
Jupyter Notebook 3 Version:Current License: No License (No License)
Support
Quality
Security
License
Reuse
m
music-audio-recsys-wavenetby dmarcous
Jupyter Notebook 3 Version:Current License: Permissive (Apache-2.0)
Music recommender system based solely on song audio using Wavenet embeddings
Support
Quality
Security
License
Reuse
C
Cycle-Consitency-Audio-Noise-Filterby bboycoi
Jupyter Notebook 3 Version:Current License: No License (No License)
Support
Quality
Security
License
Reuse
T
TriggerWordAssistantby susantabiswas
Jupyter Notebook 3 Version:Current License: No License (No License)
A RNN based voice application which can do tasks when it recognizes the user speaking the Trigger word. Here the trigger word is "activate".
Support
Quality
Security
License
Reuse
Project of noise reduction/speech enhancement in the context of a course in Audio and Acoustic Signal Processing
Support
Quality
Security
License
Reuse
Dataset creation for hate speech detection in Romanian
Support
Quality
Security
License
Reuse
F
Frequency-of-Parts-of-Speech-POS-by abhisheksaxena1998
Jupyter Notebook 3 Version:Current License: No License (No License)
Python code to determine Frequency of Parts of Speech(POS)
Support
Quality
Security
License
Reuse
S
Speech-To-Textby Gopi-Durgaprasad
Jupyter Notebook 3 Version:Current License: No License (No License)
End-to-End Speech Recognition
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Automatic removal of copyrighted music from audio streams.
Support
Quality
Security
License
Reuse
Aim to implement a classifier which classifies an audio sample into speech or music.
Support
Quality
Security
License
Reuse
Transform beatboxing performed by a human to drum patterns.
Support
Quality
Security
License
Reuse
A
Action-Points-retrieval-from-meeting-transcriptby RayuduAdabala
Jupyter Notebook 3 Version:Current License: Permissive (MIT)
Classifying the dialogues speech from the meeting transcript using transformer based XLNet pretrained model fine tuning with AMI meeting corpus
Support
Quality
Security
License
Reuse
Kaggle Google TensorFlow Speech Recognition competition
Support
Quality
Security
License
Reuse
S
Speech-Music_Classification_NeuralNetby nataliest
Jupyter Notebook 3 Version:Current License: No License (No License)
Neural network music vs. speech recognition using GTZAN dataset.
Support
Quality
Security
License
Reuse
s
spectrograms_speech_classificationby edumunozsala
Jupyter Notebook 3 Version:Current License: Strong Copyleft (GPL-3.0)
A Keras Tensorflow model trained on Azure Machine Learning Services to identify accents in spectrograms of speech
Support
Quality
Security
License
Reuse
K
Kaldi_ASR_Tutorialby nessessence
Jupyter Notebook 3 Version:Current License: No License (No License)
speech recognition using Kaldi framework
Support
Quality
Security
License
Reuse
s
speech-recognitionby ankitsingh1240
Jupyter Notebook 3 Version:Current License: No License (No License)
Support
Quality
Security
License
Reuse
a
analyzing-audio-databy HashimovH
Jupyter Notebook 3 Version:Current License: No License (No License)
Analyzing Audio Data with DTW Algorithm. Algorithmics 2020 Project
Support
Quality
Security
License
Reuse
s
speech-enhancement-CNNby sathvikyesprabhu
Jupyter Notebook 3 Version:Current License: No License (No License)
Denoising autoencoders based on DNN and CNN topologies for speech enhancement
Support
Quality
Security
License
Reuse
autonomous system for hate speech moderation for an inclusive work-space
Support
Quality
Security
License
Reuse
Middleware module for our speech synthesis systems
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
I/O Voice Recognition using Conditional Rendering
Support
Quality
Security
License
Reuse
A simple speech to text web app
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
Implementation of TTS with combination of Tacotron2 and HiFi-GAN
Support
Quality
Security
License
Reuse
Bijan ASR Engine Based-on KalD
Support
Quality
Security
License
Reuse
A simple python based virtual voice assistant that can take and execute the commands
Support
Quality
Security
License
Reuse
Utilities for transcribing a set of audio files with IBM Watson Speech to Text (STT), then analyzing the error rate of the STT transcription against a known-good transcription
Support
Quality
Security
License
Reuse
.NET SDK for Deepgram's automated speech recognition APIs.
Support
Quality
Security
License
Reuse
Transcribe Videos with Deepgram
Support
Quality
Security
License
Reuse
This is the repository for One More Voice. One More Voice is a digital humanities recovery project that identifies, documents, and critically engages with the voices of racialized creators in British imperial and colonial archives. The voices take multiple forms and appear in multiple genres. Our project seeks to introduce these rich and diverse materials to broad academic and public audiences. Recourse to the voices promises to transform our understanding of imperial and colonial history and literature while foregrounding perspectives that scholarship in majority has hitherto overlooked or silenced.
Support
Quality
Security
License
Reuse
o
ovos-tts-plugin-responsivevoiceby OpenVoiceOS
Python 3 Version:Current License: No License (No License)
responsive voice TTS plugin for mycroft
Support
Quality
Security
License
Reuse
A pipeline to isolate and transcribe one language in mixed-language speech
Support
Quality
Security
License
Reuse
Speech Booster is Learning Speech by Recording Video
Support
Quality
Security
License
Reuse
SNER = Speech Naturalness and Emotion Recognition
Support
Quality
Security
License
Reuse
Create automatically diarized and transcribed oTranscribe templates
Support
Quality
Security
License
Reuse
An app for non-native English speakers to practice creating English speeches and get feedback from native English teachers. Student users record their voice and upload audio files. Teachers listen and re-record the same speech with corrected English. Built with Ruby on Rails and Javascript for the media recording.
Support
Quality
Security
License
Reuse
.NET SDK for Deepgram's automated speech recognition APIs.
Support
Quality
Security
License
Reuse
Modular audio workbench.
Support
Quality
Security
License
Reuse
Audio language processing tools
Support
Quality
Security
License
Reuse
p
polly-translate-realtime-speech-translatorby aws-samples
Java 3 Version:Current License: Permissive (MIT-0)
Support
Quality
Security
License
Reuse
Rudimentary Chatterbot written in Python
Support
Quality
Security
License
Reuse
The NAVI's Text-To-Speech System for VLSP 2021
Support
Quality
Security
License
Reuse
Вирівнювання довгих аудіофайлів з текстом
Support
Quality
Security
License
Reuse
c
claraby fortachong
Clara Project for Theremin
Jupyter Notebook 3Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
A
Audio-Book-Corpus-for-European-Languages-by ajinkyakulkarni14
Audio Book Corpus (ABC) project has been developed to aid linguist researchers in the field of text to speech for purely academic purposes. In the current form, the corpus consists approximately 200 minutes of speech data in German language. Besides German, we are also in the process of developing Corpus Portuguese and Italian langugae. Future versions of the corpus shall encompass most European languages such as French, Spanish, Czech, Dutch, Polish, Romanian.
Jupyter Notebook 3Updated: 6 y ago License: Strong Copyleft (GPL-2.0)
Support
Quality
Security
License
Reuse
L
Language-Translator-Using-Pythonby NightWalker110
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
L
Language-Translator-Using-Pythonby jahanvisharma-dotcom
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
automatic-speech-recognitionby ChristophSchmidl
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
m
music-audio-recsys-wavenetby dmarcous
Music recommender system based solely on song audio using Wavenet embeddings
Jupyter Notebook 3Updated: 3 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
C
Cycle-Consitency-Audio-Noise-Filterby bboycoi
Jupyter Notebook 3Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
T
TriggerWordAssistantby susantabiswas
A RNN based voice application which can do tasks when it recognizes the user speaking the Trigger word. Here the trigger word is "activate".
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
aasp-noise-redby lcolbois
Project of noise reduction/speech enhancement in the context of a course in Audio and Acoustic Signal Processing
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
h
hate-speech-roby andra-pumnea
Dataset creation for hate speech detection in Romanian
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
F
Frequency-of-Parts-of-Speech-POS-by abhisheksaxena1998
Python code to determine Frequency of Parts of Speech(POS)
Jupyter Notebook 3Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
S
Speech-To-Textby Gopi-Durgaprasad
End-to-End Speech Recognition
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
G
GAN-TTS-repl2by rickyHong
Jupyter Notebook 3Updated: 4 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
T
TrackSubtractby Mitchellpkt
Automatic removal of copyrighted music from audio streams.
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
m
music-speech-classifierby r10a
Aim to implement a classifier which classifies an audio sample into speech or music.
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
b
beat2dby jrobchin
Transform beatboxing performed by a human to drum patterns.
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
A
Action-Points-retrieval-from-meeting-transcriptby RayuduAdabala
Classifying the dialogues speech from the meeting transcript using transformer based XLNet pretrained model fine tuning with AMI meeting corpus
Jupyter Notebook 3Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
S
SpeechCNNby kwsumlin
Kaggle Google TensorFlow Speech Recognition competition
Jupyter Notebook 3Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
S
Speech-Music_Classification_NeuralNetby nataliest
Neural network music vs. speech recognition using GTZAN dataset.
Jupyter Notebook 3Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
spectrograms_speech_classificationby edumunozsala
A Keras Tensorflow model trained on Azure Machine Learning Services to identify accents in spectrograms of speech
Jupyter Notebook 3Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
K
Kaldi_ASR_Tutorialby nessessence
speech recognition using Kaldi framework
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
speech-recognitionby ankitsingh1240
Jupyter Notebook 3Updated: 5 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
analyzing-audio-databy HashimovH
Analyzing Audio Data with DTW Algorithm. Algorithmics 2020 Project
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
speech-enhancement-CNNby sathvikyesprabhu
Denoising autoencoders based on DNN and CNN topologies for speech enhancement
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
safeSpaceby Mou97
autonomous system for hate speech moderation for an inclusive work-space
Jupyter Notebook 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
t
tts-middlewareby skit-ai
Middleware module for our speech synthesis systems
Python 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
g
goEagiby andrewyang17
Go 3Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
A
AI-Assistantby RFebrians
I/O Voice Recognition using Conditional Rendering
Python 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
S
Speech-to-textby samuelajala01
A simple speech to text web app
HTML 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
K
Kaleemby Afn4nz
Swift 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
T
Tacotron2-Mandarin-HiFiGANby zsl24
Implementation of TTS with combination of Tacotron2 and HiFi-GAN
Python 3Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
B
Benjaminby bijanbina
Bijan ASR Engine Based-on KalD
C++ 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
P
Python-Virtual-assistantby 4BH1J337
A simple python based virtual voice assistant that can take and execute the commands
Python 3Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
w
watson-stt-wer-pythonby IBM
Utilities for transcribing a set of audio files with IBM Watson Speech to Text (STT), then analyzing the error rate of the STT transcription against a known-good transcription
Python 3Updated: 2 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
d
deepgram-dotnet-sdkby deepgram-devs
.NET SDK for Deepgram's automated speech recognition APIs.
C# 3Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
t
transcribe-videosby deepgram-devs
Transcribe Videos with Deepgram
JavaScript 3Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
o
onemorevoiceby livingstoneonline
This is the repository for One More Voice. One More Voice is a digital humanities recovery project that identifies, documents, and critically engages with the voices of racialized creators in British imperial and colonial archives. The voices take multiple forms and appear in multiple genres. Our project seeks to introduce these rich and diverse materials to broad academic and public audiences. Recourse to the voices promises to transform our understanding of imperial and colonial history and literature while foregrounding perspectives that scholarship in majority has hitherto overlooked or silenced.
HTML 3Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
o
ovos-tts-plugin-responsivevoiceby OpenVoiceOS
responsive voice TTS plugin for mycroft
Python 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
v
vad-sli-asrby CoEDL
A pipeline to isolate and transcribe one language in mixed-language speech
Python 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
speech-boosterby amirisback
Speech Booster is Learning Speech by Recording Video
Kotlin 3Updated: 2 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
s
snerby bagustris
SNER = Speech Naturalness and Emotion Recognition
Python 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
autotemplaterby translatorswb
Create automatically diarized and transcribed oTranscribe templates
Python 3Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
M
My-English-Speechby tmravic
An app for non-native English speakers to practice creating English speeches and get feedback from native English teachers. Student users record their voice and upload audio files. Teachers listen and re-record the same speech with corrected English. Built with Ruby on Rails and Javascript for the media recording.
Ruby 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
d
deepgram-dotnet-sdkby deepgram
.NET SDK for Deepgram's automated speech recognition APIs.
C# 3Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
m
Support
Quality
Security
License
Reuse
s
songby ontocord
Audio language processing tools
Python 3Updated: 2 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
p
polly-translate-realtime-speech-translatorby aws-samples
Java 3Updated: 1 y ago License: Permissive (MIT-0)
Support
Quality
Security
License
Reuse
H
Hamster-Bot-Prototypeby TheMonocledHamster
Rudimentary Chatterbot written in Python
Python 3Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
N
NAVI-TTS.github.ioby NAVI-TTS
The NAVI's Text-To-Speech System for VLSP 2021
JavaScript 3Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
u
ukby proger
Вирівнювання довгих аудіофайлів з текстом
Python 3Updated: 2 y ago License: Permissive (BSD-2-Clause)
Support
Quality
Security
License
Reuse