Speech Libraries - Page 16

LPCTronby alokprasad

C 81 Version:Current
License: No License (No License)

Tacotron2 + LPCNET for complete End-to-End TTS System

Support

Quality

Security

License

Reuse

android-vadby gkonovalov

C 81 Version:Current
License: Permissive (MIT)

This VAD library is designed to process audio in real-time and detect human speech in audio samples that have a mix of speech and noise. It supports both DNN-based Silero VAD and GMM-based WebRTC VAD models.

Support

Quality

Security

License

Reuse

NBSSby Audio-WestlakeU

Python 81 Version:Current
License: No License (No License)

The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".

Support

Quality

Security

License

Reuse

control-vcby MelissaChen15

Python 81 Version:Current
License: Proprietary (Proprietary)

This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"

Support

Quality

Security

License

Reuse

deepspeech-websocket-serverby daanzu

Python 80 Version:Current
License: Weak Copyleft (MPL-2.0)

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

Support

Quality

Security

License

Reuse

Python 80 Version:Current
License: Weak Copyleft (MPL-2.0)

Tool for creation, manipulation and maintenance of voice corpora

Support

Quality

Security

License

Reuse

fftby j2kun

Python 80 Version:Current
License: No License (No License)

Python code and wav files for the post "The Fast Fourier Transform Algorithm, and Denoising a Sound Clip"

Support

Quality

Security

License

Reuse

idearby OpenASR

Kotlin 80 Version:Current
License: Permissive (Apache-2.0)

🎙️ Handsfree Audio Development Interface

Support

Quality

Security

License

Reuse

FFTNetby fatchord

Jupyter Notebook 80 Version:Current
License: No License (No License)

Pytorch Implementation of FFTNet

Support

Quality

Security

License

Reuse

GPTalkby 0ut0flin3

Python 80 Version:Current
License: Permissive (Apache-2.0)

GPT-3 client for Windows and Unix with memories management that supports both text and speech in any language.

Support

Quality

Security

License

Reuse

Python 80 Version:Current
License: Permissive (MIT)

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

Support

Quality

Security

License

Reuse

Wav2Letterby LearnedVector

Python 79 Version:Current
License: No License (No License)

Speech Recognition model based off of FAIR research paper built using Pytorch.

Support

Quality

Security

License

Reuse

tacotron2by nii-yamagishilab

Python 79 Version:Current
License: Permissive (BSD-3-Clause)

An implementation of Tacotron and Tacotron2

Support

Quality

Security

License

Reuse

C 79 Version:Current
License: No License (No License)

Emotion recognition by speech in android.

Support

Quality

Security

License

Reuse

OpenASRby by2101

Python 78 Version:Current
License: Permissive (Apache-2.0)

A pytorch based end2end speech recognition system.

Support

Quality

Security

License

Reuse

deepstoryby thetobysiu

Python 78 Version:Current
License: No License (No License)

Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.

Support

Quality

Security

License

Reuse

PHP 78 Version:Current
License: Permissive (BSD-2-Clause)

Let’s Create a Speech Synthesizer

Support

Quality

Security

License

Reuse

voice-generator-webuiby log1stics

Jupyter Notebook 78 Version:Current
License: Permissive (MIT)

A multi-speaker, multilingual speech generation tool

Support

Quality

Security

License

Reuse

speechutilsby Kaljurand

Java 77 Version:Current
License: Permissive (Apache-2.0)

Android library for speech-to-text and text-to-speech apps

Support

Quality

Security

License

Reuse

text-to-speech-sampleby alexram1313

Python 77 Version:Current
License: Permissive (Apache-2.0)

Python3 Text to Speech Video Sample

Support

Quality

Security

License

Reuse

magphaseby CSTR-Edinburgh

Python 77 Version:Current
License: Permissive (Apache-2.0)

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Support

Quality

Security

License

Reuse

Deep-Clustering-for-Speech-Separationby JusperLee

Python 77 Version:Current
License: No License (No License)

Pytorch implements Deep Clustering: Discriminative Embeddings For Segmentation And Separation

Support

Quality

Security

License

Reuse

pySpeechRevby mravanelli

Python 77 Version:Current
License: No License (No License)

This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of acoustic impulse responses.

Support

Quality

Security

License

Reuse

C++ 77 Version:Current
License: Permissive (Apache-2.0)

Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.

Support

Quality

Security

License

Reuse

kaldi-serveby Vernacular-ai

C++ 77 Version:Current
License: Permissive (Apache-2.0)

Server framework for Kaldi ASR Toolkit

Support

Quality

Security

License

Reuse

C++ 77 Version:Current
License: Permissive (MIT)

Custom decoders for Kaldi

Support

Quality

Security

License

Reuse

easy-speechby jankapunkt

JavaScript 77 Version:Current
License: No License (No License)

Cross browser Speech Synthesis; no dependencies

Support

Quality

Security

License

Reuse

C++ 77 Version:Current
License: Permissive (MIT)

speech recognition in dart support all audio format and support server side client side, + support all language, only support in cpu only

Support

Quality

Security

License

Reuse

Python 77 Version:Current
License: Permissive (MIT)

Official Implementation of StyleTTS-VC

Support

Quality

Security

License

Reuse

emotional-voice-conversion-with-CycleGAN-and-CWT-for-Spectrum-and-F0by KunZhou9646

Python 76 Version:Current
License: No License (No License)

This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-parallel training data".

Support

Quality

Security

License

Reuse

vggvox-speaker-identificationby linhdvu14

Python 76 Version:Current
License: No License (No License)

Speaker identification with VGGVox network

Support

Quality

Security

License

Reuse

Shell 76 Version:Current
License: Weak Copyleft (MPL-2.0)

🐸TTS recipes for different datasets

Support

Quality

Security

License

Reuse

TalkNet2-pytorchby rishikksh20

Python 76 Version:Current
License: Permissive (MIT)

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.

Support

Quality

Security

License

Reuse

manim-voiceoverby ManimCommunity

Python 76 Version:Current
License: Permissive (MIT)

Manim plugin for all things voiceover

Support

Quality

Security

License

Reuse

russian_stt_text_normalizationby snakers4

Python 75 Version:Current
License: Strong Copyleft (GPL-3.0)

Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks

Support

Quality

Security

License

Reuse

C# 75 Version:Current
License: Permissive (MIT)

Easy to use cross platform speech recognition (speech to text) plugin for Xamarin & UWP

Support

Quality

Security

License

Reuse

C++ 75 Version:Current
License: No License (No License)

The Office Assistant was an intelligent user interface for Microsoft Office. The code written in C++ is now avalible for anyone to use that agrees to the licence. Enjoy

Support

Quality

Security

License

Reuse

mcseby DistantSpeechRecognition

Python 74 Version:Current
License: No License (No License)

Multi-channel speech enhancement system (MVDR beamformer + several postfilters)

Support

Quality

Security

License

Reuse

Python 74 Version:Current
License: Permissive (MIT)

A fast cnn-based vocoder

Support

Quality

Security

License

Reuse

JavaScript 74 Version:Current
License: Strong Copyleft (GPL-3.0)

A programmable version of Neil Thapen's Pink Trombone

Support

Quality

Security

License

Reuse

zhttsby Jackiexiao

Python 74 Version:Current
License: Permissive (MIT)

A demo of zh/Chinese Text to Speech system run on CPU in real time. 中文实时语音合成系统Demo

Support

Quality

Security

License

Reuse

AaltoASRby aalto-speech

C++ 74 Version:Current
License: Permissive (BSD-3-Clause)

Aalto Automatic Speech Recognition tools

Support

Quality

Security

License

Reuse

larynx2by rhasspy

C++ 74 Version:Current
License: Permissive (MIT)

A fast, local neural text to speech system

Support

Quality

Security

License

Reuse

ASR_benchmarkby Franck-Dernoncourt

Python 73 Version:Current
License: No License (No License)

Program to benchmark various speech recognition APIs

Support

Quality

Security

License

Reuse

Python 73 Version:Current
License: No License (No License)

A voice-enabled chatbot application built using of 🦜️🔗 LangChain, text-to-speech, and speech-to-text models from 🤗 Hugging Face, and 🍱 BentoML.

Support

Quality

Security

License

Reuse

Android-Speech-Recognitionby maxwellobi

Java 72 Version:Current
License: Permissive (MIT)

Continuous speech recognition library for Android with options to use GoogleVoiceIme dialog and offline mode.

Support

Quality

Security

License

Reuse

2dtanby ChenJoya

Python 72 Version:Current
License: No License (No License)

An optimized re-implementation for 2D-TAN: Learning 2D Temporal Localization Networks for Moment Localization with Natural Language (AAAI'2020).

Support

Quality

Security

License

Reuse

audio_recogition_systemby baliksjosay

Python 72 Version:Current
License: No License (No License)

An audio recognition system

Support

Quality

Security

License

Reuse

speech-emotion-recognition-exerciseby YJango

Python 72 Version:Current
License: No License (No License)

2018年7⽉30⽇-8⽉13⽇持续2周的AI训练营中语⾳情感识别营的项目报告。

Support

Quality

Security

License

Reuse

LocalSTTby ccoreilly

Java 72 Version:Current
License: Strong Copyleft (GPL-3.0)

Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech

Support

Quality

Security

License

Reuse

LPCTronby alokprasad

Tacotron2 + LPCNET for complete End-to-End TTS System

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

android-vadby gkonovalov

This VAD library is designed to process audio in real-time and detect human speech in audio samples that have a mix of speech and noise. It supports both DNN-based Silero VAD and GMM-based WebRTC VAD models.

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

NBSSby Audio-WestlakeU

The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

control-vcby MelissaChen15

This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"

Python

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

deepspeech-websocket-serverby daanzu

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

Python

Updated: 4 y ago

License: Weak Copyleft (MPL-2.0)

Support

Quality

Security

License

Reuse

voice-corpus-toolby mozilla

Tool for creation, manipulation and maintenance of voice corpora

Python

Updated: 4 y ago

License: Weak Copyleft (MPL-2.0)

Support

Quality

Security

License

Reuse

fftby j2kun

Python code and wav files for the post "The Fast Fourier Transform Algorithm, and Denoising a Sound Clip"

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

idearby OpenASR

🎙️ Handsfree Audio Development Interface

Kotlin

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

FFTNetby fatchord

Pytorch Implementation of FFTNet

Jupyter Notebook

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

GPTalkby 0ut0flin3

GPT-3 client for Windows and Unix with memories management that supports both text and speech in any language.

Python

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

PitchExtractorby yl4579

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Wav2Letterby LearnedVector

Speech Recognition model based off of FAIR research paper built using Pytorch.

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

tacotron2by nii-yamagishilab

An implementation of Tacotron and Tacotron2

Python

Updated: 4 y ago

License: Permissive (BSD-3-Clause)

Support

Quality

Security

License

Reuse

VokaturiAndroidby alshell7

Emotion recognition by speech in android.

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

OpenASRby by2101

A pytorch based end2end speech recognition system.

Python

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

deepstoryby thetobysiu

Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

speech_synth_seriesby bisqwit

Let’s Create a Speech Synthesizer

PHP

Updated: 4 y ago

License: Permissive (BSD-2-Clause)

Support

Quality

Security

License

Reuse

voice-generator-webuiby log1stics

A multi-speaker, multilingual speech generation tool

Jupyter Notebook

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

speechutilsby Kaljurand

Android library for speech-to-text and text-to-speech apps

Java

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

text-to-speech-sampleby alexram1313

Python3 Text to Speech Video Sample

Python

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

magphaseby CSTR-Edinburgh

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Python

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

Deep-Clustering-for-Speech-Separationby JusperLee

Pytorch implements Deep Clustering: Discriminative Embeddings For Segmentation And Separation

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

pySpeechRevby mravanelli

This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of acoustic impulse responses.

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

Crystal.TTVSby thuhcsi

Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.

C++

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

kaldi-serveby Vernacular-ai

Server framework for Kaldi ASR Toolkit

C++

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

kaldi-decodersby jpuigcerver

Custom decoders for Kaldi

C++

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

easy-speechby jankapunkt

Cross browser Speech Synthesis; no dependencies

JavaScript

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

whisper_dartby azkadev

speech recognition in dart support all audio format and support server side client side, + support all language, only support in cpu only

C++

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

StyleTTS-VCby yl4579

Official Implementation of StyleTTS-VC

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

emotional-voice-conversion-with-CycleGAN-and-CWT-for-Spectrum-and-F0by KunZhou9646

This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-parallel training data".

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

vggvox-speaker-identificationby linhdvu14

Speaker identification with VGGVox network

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

TTS-recipesby coqui-ai

🐸TTS recipes for different datasets

Shell

Updated: 2 y ago

License: Weak Copyleft (MPL-2.0)

Support

Quality

Security

License

Reuse

TalkNet2-pytorchby rishikksh20

TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

manim-voiceoverby ManimCommunity

Manim plugin for all things voiceover

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

russian_stt_text_normalizationby snakers4

Russian text normalization pipeline for speech-to-text and other applications based on tagging s2s networks

Python

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

speechrecognitionby aritchie

Easy to use cross platform speech recognition (speech to text) plugin for Xamarin & UWP

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

OfficeAssistantby thebeebs

The Office Assistant was an intelligent user interface for Microsoft Office. The code written in C++ is now avalible for anyone to use that agrees to the licence. Enjoy

C++

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

mcseby DistantSpeechRecognition

Multi-channel speech enhancement system (MVDR beamformer + several postfilters)

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

cnn_vocoderby tuan3w

A fast cnn-based vocoder

Python

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Pink-Tromboneby zakaton

A programmable version of Neil Thapen's Pink Trombone

JavaScript

Updated: 3 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

zhttsby Jackiexiao

A demo of zh/Chinese Text to Speech system run on CPU in real time. 中文实时语音合成系统Demo

Python

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

AaltoASRby aalto-speech

Aalto Automatic Speech Recognition tools

C++

Updated: 4 y ago

License: Permissive (BSD-3-Clause)

Support

Quality

Security

License

Reuse

larynx2by rhasspy

A fast, local neural text to speech system

C++

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

ASR_benchmarkby Franck-Dernoncourt

Program to benchmark various speech recognition APIs

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

BentoChainby ssheng

A voice-enabled chatbot application built using of 🦜️🔗 LangChain, text-to-speech, and speech-to-text models from 🤗 Hugging Face, and 🍱 BentoML.

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

Android-Speech-Recognitionby maxwellobi

Continuous speech recognition library for Android with options to use GoogleVoiceIme dialog and offline mode.

Java

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

2dtanby ChenJoya

An optimized re-implementation for 2D-TAN: Learning 2D Temporal Localization Networks for Moment Localization with Natural Language (AAAI'2020).

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

audio_recogition_systemby baliksjosay

An audio recognition system

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

speech-emotion-recognition-exerciseby YJango

2018年7⽉30⽇-8⽉13⽇持续2周的AI训练营中语⾳情感识别营的项目报告。

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

LocalSTTby ccoreilly

Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech

Java

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

Open Weaver – Develop Applications Faster with Open Source

Terms
Privacy policy

Speech Libraries - Page 16

LPCTronby alokprasad

C 81 Version:Current License: No License (No License)

Tacotron2 + LPCNET for complete End-to-End TTS System

android-vadby gkonovalov

C 81 Version:Current License: Permissive (MIT)

This VAD library is designed to process audio in real-time and detect human speech in audio samples that have a mix of speech and noise. It supports both DNN-based Silero VAD and GMM-based WebRTC VAD models.

NBSSby Audio-WestlakeU

Python 81 Version:Current License: No License (No License)

The official repo of "Multi-channel Narrow-band Deep Speech Separation with Full-band Permutation Invariant Training", "Multichannel Speech Separation with Narrow-band Conformer" and "NBC2: Multichannel Speech Separation with Revised Narrow-band Conformer".

control-vcby MelissaChen15

Python 81 Version:Current License: Proprietary (Proprietary)

This is the implementation for "ControlVC: Zero-Shot Voice Conversion with Time-Varying Controls on Pitch and Rhythm"

deepspeech-websocket-serverby daanzu

Python 80 Version:Current License: Weak Copyleft (MPL-2.0)

Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments

voice-corpus-toolby mozilla

Python 80 Version:Current License: Weak Copyleft (MPL-2.0)

Tool for creation, manipulation and maintenance of voice corpora

fftby j2kun

Python 80 Version:Current License: No License (No License)

Python code and wav files for the post "The Fast Fourier Transform Algorithm, and Denoising a Sound Clip"

idearby OpenASR

Kotlin 80 Version:Current License: Permissive (Apache-2.0)

🎙️ Handsfree Audio Development Interface

FFTNetby fatchord

Jupyter Notebook 80 Version:Current License: No License (No License)

Pytorch Implementation of FFTNet

GPTalkby 0ut0flin3

Python 80 Version:Current License: Permissive (Apache-2.0)

GPT-3 client for Windows and Unix with memories management that supports both text and speech in any language.

PitchExtractorby yl4579

Python 80 Version:Current License: Permissive (MIT)

Deep Neural Pitch Extractor for Voice Conversion and TTS Training

Wav2Letterby LearnedVector

Python 79 Version:Current License: No License (No License)

Speech Recognition model based off of FAIR research paper built using Pytorch.

tacotron2by nii-yamagishilab

Python 79 Version:Current License: Permissive (BSD-3-Clause)

An implementation of Tacotron and Tacotron2

VokaturiAndroidby alshell7

C 79 Version:Current License: No License (No License)

Emotion recognition by speech in android.

OpenASRby by2101

Python 78 Version:Current License: Permissive (Apache-2.0)

A pytorch based end2end speech recognition system.

deepstoryby thetobysiu

Python 78 Version:Current License: No License (No License)

Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.

speech_synth_seriesby bisqwit

PHP 78 Version:Current License: Permissive (BSD-2-Clause)

Let’s Create a Speech Synthesizer

voice-generator-webuiby log1stics

Jupyter Notebook 78 Version:Current License: Permissive (MIT)

A multi-speaker, multilingual speech generation tool

speechutilsby Kaljurand

Java 77 Version:Current License: Permissive (Apache-2.0)

Android library for speech-to-text and text-to-speech apps

text-to-speech-sampleby alexram1313

Python 77 Version:Current License: Permissive (Apache-2.0)

Python3 Text to Speech Video Sample

magphaseby CSTR-Edinburgh

Python 77 Version:Current License: Permissive (Apache-2.0)

MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.

Deep-Clustering-for-Speech-Separationby JusperLee

Python 77 Version:Current License: No License (No License)

Pytorch implements Deep Clustering: Discriminative Embeddings For Segmentation And Separation

pySpeechRevby mravanelli

Python 77 Version:Current License: No License (No License)

This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of acoustic impulse responses.

Crystal.TTVSby thuhcsi

C++ 77 Version:Current License: Permissive (Apache-2.0)

Crystal TTVS engine is a real-time audio-visual Multilingual speech synthesizer with a 3D expressive avatar.

kaldi-serveby Vernacular-ai

C++ 77 Version:Current License: Permissive (Apache-2.0)

Server framework for Kaldi ASR Toolkit

kaldi-decodersby jpuigcerver

C++ 77 Version:Current License: Permissive (MIT)

Custom decoders for Kaldi

easy-speechby jankapunkt

C 81 Version:Current
License: No License (No License)

C 81 Version:Current
License: Permissive (MIT)

Python 81 Version:Current
License: No License (No License)

Python 81 Version:Current
License: Proprietary (Proprietary)

Python 80 Version:Current
License: Weak Copyleft (MPL-2.0)

Python 80 Version:Current
License: Weak Copyleft (MPL-2.0)

Python 80 Version:Current
License: No License (No License)

Kotlin 80 Version:Current
License: Permissive (Apache-2.0)

Jupyter Notebook 80 Version:Current
License: No License (No License)

Python 80 Version:Current
License: Permissive (Apache-2.0)

Python 80 Version:Current
License: Permissive (MIT)

Python 79 Version:Current
License: No License (No License)

Python 79 Version:Current
License: Permissive (BSD-3-Clause)

C 79 Version:Current
License: No License (No License)

Python 78 Version:Current
License: Permissive (Apache-2.0)

Python 78 Version:Current
License: No License (No License)

PHP 78 Version:Current
License: Permissive (BSD-2-Clause)

Jupyter Notebook 78 Version:Current
License: Permissive (MIT)

Java 77 Version:Current
License: Permissive (Apache-2.0)

Python 77 Version:Current
License: Permissive (Apache-2.0)

Python 77 Version:Current
License: Permissive (Apache-2.0)

Python 77 Version:Current
License: No License (No License)

Python 77 Version:Current
License: No License (No License)

C++ 77 Version:Current
License: Permissive (Apache-2.0)

C++ 77 Version:Current
License: Permissive (Apache-2.0)

C++ 77 Version:Current
License: Permissive (MIT)

JavaScript 77 Version:Current
License: No License (No License)

C++ 77 Version:Current
License: Permissive (MIT)

Python 77 Version:Current
License: Permissive (MIT)

Python 76 Version:Current
License: No License (No License)

Python 76 Version:Current
License: No License (No License)

Shell 76 Version:Current
License: Weak Copyleft (MPL-2.0)

Python 76 Version:Current
License: Permissive (MIT)

Python 76 Version:Current
License: Permissive (MIT)

Python 75 Version:Current
License: Strong Copyleft (GPL-3.0)

C# 75 Version:Current
License: Permissive (MIT)

C++ 75 Version:Current
License: No License (No License)

Python 74 Version:Current
License: No License (No License)

Python 74 Version:Current
License: Permissive (MIT)

JavaScript 74 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 74 Version:Current
License: Permissive (MIT)

C++ 74 Version:Current
License: Permissive (BSD-3-Clause)

C++ 74 Version:Current
License: Permissive (MIT)

Python 73 Version:Current
License: No License (No License)

Python 73 Version:Current
License: No License (No License)

Java 72 Version:Current
License: Permissive (MIT)

Python 72 Version:Current
License: No License (No License)

Python 72 Version:Current
License: No License (No License)

Python 72 Version:Current
License: No License (No License)

Java 72 Version:Current
License: Strong Copyleft (GPL-3.0)