Speech Libraries - Page 13

WaveVAE by ksw0306

A PyTorch implementation of WaveVAE ("Parallel Neural Text-to-Speech")

Python 116
Updated: 4 y ago
License: Permissive (MIT)

tfg-voice-conversion by albertaparicio

Deep Learning-based Voice Conversion system

Python 116
Updated: 4 y ago
License: Strong Copyleft (GPL-3.0)

panns_inference by qiuqiangkong

Python 116
Updated: 2 y ago
License: Permissive (MIT)

HoloBot by ActiveNick

HoloBot is a reusable 3D interface that allows HoloLens & VR users to interact with any bot using Mixed Reality & Speech.

C# 116
Updated: 4 y ago
License: Permissive (MIT)

kaldipdnn by yajiemiao

Kaldi+PDNN: Building DNN-based ASR Systems with Kaldi and PDNN

Shell 116
Updated: 4 y ago
License: Permissive (Apache-2.0)

desktopAssistant by jg-fisher

Desktop assistant that uses speech recognition and gTTS to execute commands and talk back to the user.

Python 115
Updated: 4 y ago
License: No License (No License)

SpeakerRecognition_tutorial by jymsuper

Simple d-vector based Speaker Recognition (verification and identification) using PyTorch

Python 115
Updated: 4 y ago
License: Permissive (MIT)

DCCRN by maggie0830

PyTorch implementation of "DCCRN: Deep Complex Convolution Recurrent Network for Phase-Aware Speech Enhancement"

Python 115
Updated: 2 y ago
License: No License (No License)

WaveEdit by AndrewBelt

Synthesis Technology WaveEdit for the E370 and E352 Eurorack synthesizer modules

C++ 115
Updated: 4 y ago
License: Proprietary (Proprietary)

LIA_RAL by ALIZE-Speaker-Recognition

A high-level toolkit for speaker recognition, built on top of ALIZE-Core.

C++ 115
Updated: 4 y ago
License: Weak Copyleft (LGPL-3.0)

Jarvis by thevickypedia

Fully Functional Voice Based Natural Language UI

Python 113
Updated: 2 y ago
License: Permissive (MIT)

audioswitch by twilio

An Android audio management library for real-time communication apps.

Kotlin 112
Updated: 2 y ago
License: Permissive (Apache-2.0)

speech-emotion-recognition by amanbasu

Detecting emotions from MFCC features of human speech using deep learning

Jupyter Notebook 112
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)

athena by didi

A release version for https://github.com/athena-team/athena

Python 111
Updated: 3 y ago
License: Permissive (Apache-2.0)

vcc20_baseline_cyclevae by bigpon

Voice Conversion Challenge 2020 CycleVAE baseline system

Python 111
Updated: 4 y ago
License: Permissive (MIT)

spokestack-python by spokestack

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

Python 111
Updated: 4 y ago
License: Permissive (Apache-2.0)

pytorch-kaldi-neural-speaker-embeddings by jefflai108

Lightweight neural speaker embedding extraction based on Kaldi and PyTorch.

Perl 111
Updated: 4 y ago
License: Permissive (BSD-3-Clause)

FreeSWITCH-ASR by cdevelop

FreeSWITCH ASR APP

C++ 111
Updated: 4 y ago
License: No License (No License)

picopi by DougGore

Port of Android Pico TTS to the Raspberry Pi

C 111
Updated: 4 y ago
License: No License (No License)

py_speech_seg by wblgers

A toolkit for speech segmentation based on BIC and neural networks such as BiLSTM

Python 110
Updated: 4 y ago
License: No License (No License)

vid2speech by arielephrat

Code for "Vid2speech: Speech Reconstruction from Silent Video" ICASSP '17

Python 110
Updated: 4 y ago
License: No License (No License)

Voice-synthesis by smoke-trees

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real time. SV2TTS is a three-stage deep learning framework that creates a numerical representation of a voice from a few seconds of audio and uses it to condition a text-to-speech model trained to generalize to new voices.

Python 110
Updated: 2 y ago
License: No License (No License)

midi2voice by mathigatti

Singing synthesis from MIDI file

Python 110
Updated: 4 y ago
License: Permissive (MIT)

htgo-tts by hegedustibor

Text to speech package for Golang.

Go 110
Updated: 2 y ago
License: Permissive (MIT)

picotts by naggety

Pico TTS: text-to-speech voice synthesizer from SVox, included in Android AOSP

C 110
Updated: 2 y ago
License: No License (No License)

VIMABench by vimalabs

Official Task Suite Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"

Python 110
Updated: 2 y ago
License: Permissive (MIT)

resampler by cpuimage

A Simple and Efficient Audio Resampler Implementation in C

C 109
Updated: 2 y ago
License: Permissive (MIT)

CASR-DEMO by lihanghang

A Flask web-based demo system for Chinese automatic speech recognition, including speech recognition, speech synthesis, and voiceprint-based speaker recognition.

CSS 109
Updated: 2 y ago
License: No License (No License)

browser-client by speechly

The web browser client library for Speechly API

TypeScript 109
Updated: 4 y ago
License: Permissive (MIT)

Bangla-deep-speech-Recognition by Qyum

Jupyter Notebook 109
Updated: 4 y ago
License: No License (No License)

cobra by Picovoice

On-device voice activity detection (VAD) powered by deep learning.

Python 109
Updated: 2 y ago
License: Permissive (Apache-2.0)

talk-with-gpt3 by JavaFXpert

App that leverages GPT-3 to facilitate new language listening and speaking practice.

JavaScript 109
Updated: 2 y ago
License: No License (No License)

VI-SVC by PlayVoice

VITS singing voice conversion based on PPG and HuBERT; singing voice cloning.

Python 109
Updated: 2 y ago
License: Permissive (MIT)

Factorized-TDNN by cvqluu

PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and Kaldi

Python 108
Updated: 4 y ago
License: Permissive (MIT)

E2E-ASR by HawkAaron

PyTorch Implementations for End-to-End Automatic Speech Recognition

Python 108
Updated: 4 y ago
License: No License (No License)

speechd by brailcom

Common high-level interface to speech synthesis

C 108
Updated: 2 y ago
License: Strong Copyleft (GPL-2.0)

kaldi-gop by jimbozhang

Computes the GMM-based Goodness of Pronunciation (GOP). Based on Kaldi.

C++ 108
Updated: 4 y ago
License: Proprietary (Proprietary)

wavegrad2 by mindslab-ai

Unofficial PyTorch Implementation of WaveGrad2

Jupyter Notebook 108
Updated: 2 y ago
License: Permissive (BSD-3-Clause)

python-Speech_Recognition by zthxxx

A simple example of using the Baidu speech recognition API with Python.

Python 107
Updated: 4 y ago
License: No License (No License)

ctc-asr by mdangschat

End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.

Python 107
Updated: 4 y ago
License: Permissive (MIT)

Aurio by protyposis

Audio Fingerprinting & Retrieval for .NET

C# 107
Updated: 2 y ago
License: Strong Copyleft (AGPL-3.0)

mlp-singer by neosapience

Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis (IEEE MLSP 2021)

Python 107
Updated: 2 y ago
License: Permissive (MIT)

editts by neosapience

Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)

Python 107
Updated: 2 y ago
License: Proprietary (Proprietary)

TTSController by ksasao

A library for controlling various Text-to-Speech engines through a unified interface.

C# 106
Updated: 2 y ago
License: Permissive (Apache-2.0)

Translator by muaz-khan

Translator.js is a JavaScript library built on top of the Google Speech-Recognition & Translation API to transcribe and translate voice and text. It supports many locales and brings globalization to WebRTC! https://www.webrtc-experiment.com/Translator/

HTML 106
Updated: 4 y ago
License: No License (No License)

voice_conversion by ebadawy

Python 106
Updated: 2 y ago
License: Permissive (MIT)

Tacotron-pytorch by ttaoREtw

A PyTorch Implementation of Tacotron: End-to-end Text-to-speech Deep-Learning Model

Python 105
Updated: 4 y ago
License: Permissive (MIT)

WebAudioEvaluationTool by BrechtDeMan

A tool based on the HTML5 Web Audio API to perform perceptual audio evaluation tests locally or on remote machines over the web.

JavaScript 105
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)

Tacotron2-PyTorch by BogiHsu

Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.

Python 104
Updated: 3 y ago
License: Permissive (MIT)

KTSpeechCrawler by EgorLakomkin

Automatically constructing a corpus for automatic speech recognition from YouTube videos

Python 104
Updated: 4 y ago
License: Permissive (MIT)

Open Weaver – Develop Applications Faster with Open Source
