Speech Libraries - Page 15

speaker_extraction by xuchenglin28

Python 92 Version:Current
License: Strong Copyleft (GPL-3.0)

target speaker extraction and verification for multi-talker speech

vad.js by kdavis-mozilla

JavaScript 92 Version:Current
License: Permissive (BSD-3-Clause)

Voice activity detection in JavaScript

efficientspeech by roatienza

Jupyter Notebook 92 Version:Current
License: Permissive (Apache-2.0)

PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.

vue-pwa-speech by aofdev

JavaScript 90 Version:Current
License: Permissive (MIT)

A Vue 2 Progressive Web App that performs synchronous speech recognition (speech to text) with Google Cloud Speech

Chrome-Web-Speech-API by bensonruan

JavaScript 90 Version:Current
License: Permissive (MIT)

Chrome Web Speech API

speaker-recognition-papers by bjfu-ai-institute

Python 89 Version:Current
License: No License (No License)

Share some recent speaker recognition papers and their implementations.

uPIT-for-speech-separation by funcwj

Python 89 Version:Current
License: No License (No License)

Speech separation with utterance-level PIT experiments

TheWarholOutLoud by CMP-Studio

JavaScript 89 Version:Current
License: Permissive (MIT)

An inclusive audio guide for The Andy Warhol Museum

speechless by juliuskunze

Python 89 Version:Current
License: Permissive (MIT)

Speech-to-text based on wav2letter built for transfer learning

recasepunc by benob

Python 89 Version:Current
License: Permissive (BSD-3-Clause)

Model for recasing and repunctuating ASR transcripts

ElevateAIDotNetSDK by NICEElevateAI

C# 89 Version:Current
License: Permissive (MIT)

.Net core 6 SDK for ElevateAI

PLDA by RicherMans

Python 88 Version:Current
License: No License (No License)

An LDA/PLDA estimator using Kaldi in Python for speaker verification tasks

speechless by JuliusKunze

Python 88 Version:Current
License: Permissive (MIT)

Speech-to-text based on wav2letter built for transfer learning

MaskCycleGAN-VC by GANtastic3

Python 88 Version:Current
License: Permissive (MIT)

Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.

vits-mandarin-biaobei by AlexandaJerry

Jupyter Notebook 88 Version:Current
License: Permissive (MIT)

Application of VITS to Mandarin TTS

festvox by festvox

Python 87 Version:Current
License: Proprietary (Proprietary)

Festvox voice building tools

speech2singing by ericwudayi

Python 87 Version:Current
License: No License (No License)

Implementation of the speech-to-singing paper from Interspeech 2020.

hermod by syntithenai

Python 87 Version:Current
License: Proprietary (Proprietary)

voice services stack from audio hardware through hotword, ASR, NLU, AI routing and TTS bound by messaging protocol over MQTT

WaveNet-Enhancement by auspicious3000

Python 87 Version:Current
License: No License (No License)

Speech Enhancement using Bayesian WaveNet

Deep-Expression by ttsunion

Python 87 Version:Current
License: No License (No License)

An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!

voice-activity-detection by Jam3

JavaScript 87 Version:Current
License: Permissive (MIT)

Voice activity detection

ParallelTTS by atomicoo

Python 87 Version:Current
License: Permissive (MIT)

A fast Text-to-Speech (TTS) model. Works well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far).

whisper-vits-japanese by AlexandaJerry

Jupyter Notebook 87 Version:Current
License: Permissive (MIT)

VITS Japanese with Whisper as the data processor (you can train your VITS even if you only have audio)

B.E.N.J.I. by the-ethan-hunt

Python 86 Version:Current
License: Permissive (MIT)

B.E.N.J.I.- The Impossible Missions Force's digital assistant

tts by c2h2

Ruby 86 Version:Current
License: Permissive (MIT)

A Ruby gem for text-to-speech using the Google Translate service.

tts by eheikes

JavaScript 86 Version:Current
License: Permissive (Apache-2.0)

Tools to convert text to speech

nativescript-speech-recognition by EddyVerbruggen

TypeScript 86 Version:Current
License: Proprietary (Proprietary)

Speech to text, using the awesome engines readily available on the device.

torch-pitch-shift by KentoNishi

Python 86 Version:Current
License: Permissive (MIT)

Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

april-asr by abb128

C 86 Version:Current
License: Strong Copyleft (GPL-3.0)

Speech-to-text library in C

face-vid2vid by zhengkw18

Python 86 Version:Current
License: No License (No License)

Unofficial implementation of One-Shot Free-View Neural Talking Head Synthesis

idiolect by OpenASR

Kotlin 86 Version:Current
License: Permissive (Apache-2.0)

🎙️ Handsfree Audio Development Interface

VBDiarization by Jamiroquai88

Python 85 Version:Current
License: Permissive (Apache-2.0)

Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data

audio-visual-speech-enhancement by avivga

Python 85 Version:Current
License: No License (No License)

Official Implementation of "Visual Speech Enhancement", Interspeech 2018.

SEGAN by leftthomas

Python 85 Version:Current
License: No License (No License)

A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network"

pyfasst by wslihgt

Python 85 Version:Current
License: Strong Copyleft (GPL-2.0)

Python implementation of the Flexible Audio Source Separation Toolbox (FASST)

butler by 720kb

JavaScript 85 Version:Current
License: Permissive (MIT)

I/O customizable voice driven butler - http://720kb.github.io/butler/

arabic-tacotron-tts by youssefsharief

Python 84 Version:Current
License: Permissive (MIT)

End to end Arabic TTS system based on tacotron

Android-TTS-STT by hiteshsahu

Kotlin 84 Version:Current
License: No License (No License)

One-line solution for the Android text-to-speech (TTS) and speech-to-text (STT) translation problem

OSSSpeechKit by AppDevGuy

Swift 84 Version:Current
License: Permissive (MIT)

OSSSpeechKit offers a native iOS Speech wrapper for AVFoundation and Apple's Speech.

tacotron2 by ide8

Jupyter Notebook 84 Version:Current
License: Permissive (BSD-3-Clause)

Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow

laibot-client by jjwang

Python 83 Version:Current
License: Permissive (MIT)

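Open-source AI: build voice-dialogue robots, smart speakers, and more on open-source software and hardware. Human-machine dialogue, natural interaction: Laibot (来宝) has unlimited possibilities. Note that Laibot runs on Python 3!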

voicer by antirek

JavaScript 83 Version:Current
License: Permissive (MIT)

AGI-server voice recognizer for #Asterisk

ms-bing-speech-service by noopkat

JavaScript 83 Version:Current
License: Permissive (MIT)

NodeJS service wrapper for Microsoft Speech API and Custom Speech Service

FFTNet by erogol

Jupyter Notebook 83 Version:Current
License: Weak Copyleft (MPL-2.0)

FFTNet vocoder implementation

SRD-VC by YoungSeng

Python 83 Version:Current
License: No License (No License)

Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)

icassp19 by edufonseca

Python 82 Version:Current
License: Permissive (MIT)

Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"

odmpy by ping

Python 82 Version:Current
License: Strong Copyleft (GPL-3.0)

A simple command line manager for OverDrive/Libby loans. Download your library loans from the command line.

sail_align by nassosoassos

Perl 82 Version:Current
License: No License (No License)

SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition and text alignment scheme that allows for the processing of very long (and possibly noisy) audio and is robust to transcription errors. It is mainly written as a perl library but its functionality also depends on freely available software, namely HTK, srilm and sclite.

VectorQuantizedCPC by bshall

Python 81 Version:Current
License: Permissive (MIT)

Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion

pykaldi by UFAL-DSG

Python 81 Version:Current
License: Proprietary (Proprietary)

Python wrapper for Kaldi decoders (Kaldi https://sourceforge.net/projects/kaldi/)

speaker_extraction by xuchenglin28

target speaker extraction and verification for multi-talker speech

Python

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

vad.js by kdavis-mozilla

Voice activity detection in JavaScript

JavaScript

Updated: 4 y ago

License: Permissive (BSD-3-Clause)

efficientspeech by roatienza

PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.

Jupyter Notebook

Updated: 2 y ago

License: Permissive (Apache-2.0)

vue-pwa-speech by aofdev

A Vue 2 Progressive Web App that performs synchronous speech recognition (speech to text) with Google Cloud Speech

JavaScript

Updated: 4 y ago

License: Permissive (MIT)

Chrome-Web-Speech-API by bensonruan

Chrome Web Speech API

JavaScript

Updated: 2 y ago

License: Permissive (MIT)

speaker-recognition-papers by bjfu-ai-institute

Share some recent speaker recognition papers and their implementations.

Python

Updated: 4 y ago

License: No License (No License)

uPIT-for-speech-separation by funcwj

Speech separation with utterance-level PIT experiments

Python

Updated: 4 y ago

License: No License (No License)

TheWarholOutLoud by CMP-Studio

An inclusive audio guide for The Andy Warhol Museum

JavaScript

Updated: 5 y ago

License: Permissive (MIT)

speechless by juliuskunze

Speech-to-text based on wav2letter built for transfer learning

Python

Updated: 4 y ago

License: Permissive (MIT)

recasepunc by benob

Model for recasing and repunctuating ASR transcripts

Python

Updated: 2 y ago

License: Permissive (BSD-3-Clause)

ElevateAIDotNetSDK by NICEElevateAI

.Net core 6 SDK for ElevateAI

C#

Updated: 2 y ago

License: Permissive (MIT)

PLDA by RicherMans

An LDA/PLDA estimator using Kaldi in Python for speaker verification tasks

Python

Updated: 4 y ago

License: No License (No License)

speechless by JuliusKunze

Speech-to-text based on wav2letter built for transfer learning

Python

Updated: 4 y ago

License: Permissive (MIT)

MaskCycleGAN-VC by GANtastic3

Implementation of Kaneko et al.'s MaskCycleGAN-VC model for non-parallel voice conversion.

Python

Updated: 2 y ago

License: Permissive (MIT)

vits-mandarin-biaobei by AlexandaJerry

Application of VITS to Mandarin TTS

Jupyter Notebook

Updated: 2 y ago

License: Permissive (MIT)

festvox by festvox

Festvox voice building tools

Python

Updated: 2 y ago

License: Proprietary (Proprietary)

speech2singing by ericwudayi

Implementation of the speech-to-singing paper from Interspeech 2020.

Python

Updated: 3 y ago

License: No License (No License)

hermod by syntithenai

voice services stack from audio hardware through hotword, ASR, NLU, AI routing and TTS bound by messaging protocol over MQTT

Python

Updated: 2 y ago

License: Proprietary (Proprietary)

WaveNet-Enhancement by auspicious3000

Speech Enhancement using Bayesian WaveNet

Python

Updated: 4 y ago

License: No License (No License)

Deep-Expression by ttsunion

An Attention Based Open-Source End to End Speech Synthesis Framework, No CNN, No RNN, No MFCC!!!

Python

Updated: 4 y ago

License: No License (No License)

voice-activity-detection by Jam3

Voice activity detection

JavaScript

Updated: 4 y ago

License: Permissive (MIT)

ParallelTTS by atomicoo

A fast Text-to-Speech (TTS) model. Works well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far).

Python

Updated: 4 y ago

License: Permissive (MIT)

whisper-vits-japanese by AlexandaJerry

VITS Japanese with Whisper as the data processor (you can train your VITS even if you only have audio)

Jupyter Notebook

Updated: 2 y ago

License: Permissive (MIT)

B.E.N.J.I. by the-ethan-hunt

B.E.N.J.I.- The Impossible Missions Force's digital assistant

Python

Updated: 3 y ago

License: Permissive (MIT)

tts by c2h2

A Ruby gem for text-to-speech using the Google Translate service.

Ruby

Updated: 4 y ago

License: Permissive (MIT)

tts by eheikes

Tools to convert text to speech

JavaScript

Updated: 3 y ago

License: Permissive (Apache-2.0)

nativescript-speech-recognition by EddyVerbruggen

Speech to text, using the awesome engines readily available on the device.

TypeScript

Updated: 2 y ago

License: Proprietary (Proprietary)

torch-pitch-shift by KentoNishi

Pitch-shift audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

Python

Updated: 2 y ago

License: Permissive (MIT)

april-asr by abb128

Speech-to-text library in C

C

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

face-vid2vid by zhengkw18

Unofficial implementation of One-Shot Free-View Neural Talking Head Synthesis

Python

Updated: 2 y ago

License: No License (No License)

idiolect by OpenASR

🎙️ Handsfree Audio Development Interface

Kotlin

Updated: 2 y ago

License: Permissive (Apache-2.0)

VBDiarization by Jamiroquai88

Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data

Python

Updated: 4 y ago

License: Permissive (Apache-2.0)

audio-visual-speech-enhancement by avivga

Official Implementation of "Visual Speech Enhancement", Interspeech 2018.

Python

Updated: 5 y ago

License: No License (No License)

SEGAN by leftthomas

A PyTorch implementation of SEGAN based on INTERSPEECH 2017 paper "SEGAN: Speech Enhancement Generative Adversarial Network"

Python

Updated: 4 y ago

License: No License (No License)

pyfasst by wslihgt

Python implementation of the Flexible Audio Source Separation Toolbox (FASST)

Python

Updated: 4 y ago

License: Strong Copyleft (GPL-2.0)

butler by 720kb

I/O customizable voice driven butler - http://720kb.github.io/butler/

JavaScript

Updated: 4 y ago

License: Permissive (MIT)

arabic-tacotron-tts by youssefsharief

End to end Arabic TTS system based on tacotron

Python

Updated: 4 y ago

License: Permissive (MIT)

Android-TTS-STT by hiteshsahu

One-line solution for the Android text-to-speech (TTS) and speech-to-text (STT) translation problem

Kotlin

Updated: 2 y ago

License: No License (No License)

OSSSpeechKit by AppDevGuy

OSSSpeechKit offers a native iOS Speech wrapper for AVFoundation and Apple's Speech.

Swift

Updated: 2 y ago

License: Permissive (MIT)

tacotron2 by ide8

Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow

Jupyter Notebook

Updated: 4 y ago

License: Permissive (BSD-3-Clause)

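laibot-client by jjwang

Open-source AI: build voice-dialogue robots, smart speakers, and more on open-source software and hardware. Human-machine dialogue, natural interaction: Laibot (来宝) has unlimited possibilities. Note that Laibot runs on Python 3!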

Python

Updated: 4 y ago

License: Permissive (MIT)

voicer by antirek

AGI-server voice recognizer for #Asterisk

JavaScript

Updated: 2 y ago

License: Permissive (MIT)

ms-bing-speech-service by noopkat

NodeJS service wrapper for Microsoft Speech API and Custom Speech Service

JavaScript

Updated: 4 y ago

License: Permissive (MIT)

FFTNet by erogol

FFTNet vocoder implementation

Jupyter Notebook

Updated: 4 y ago

License: Weak Copyleft (MPL-2.0)

SRD-VC by YoungSeng

Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)

Python

Updated: 2 y ago

License: No License (No License)

icassp19 by edufonseca

Public repository for the paper "Learning Sound Event Classifiers from Web Audio with Noisy Labels"

Python

Updated: 4 y ago

License: Permissive (MIT)

odmpy by ping

A simple command line manager for OverDrive/Libby loans. Download your library loans from the command line.

Python

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

sail_align by nassosoassos

SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognition and text alignment scheme that allows for the processing of very long (and possibly noisy) audio and is robust to transcription errors. It is mainly written as a perl library but its functionality also depends on freely available software, namely HTK, srilm and sclite.

Perl

Updated: 4 y ago

License: No License (No License)

VectorQuantizedCPC by bshall

Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion

Python

Updated: 4 y ago

License: Permissive (MIT)

pykaldi by UFAL-DSG

Python wrapper for Kaldi decoders (Kaldi https://sourceforge.net/projects/kaldi/)

Python

Updated: 4 y ago

License: Proprietary (Proprietary)
