Speech Libraries - Page 3

nerd-dictation by ideasman42

Python 956 Version: Current
License: Strong Copyleft (GPL-3.0)
Updated: 2 y ago

Simple, hackable offline speech to text - using the VOSK-API.

subsync by sc0ty

C++ 943 Version: Current
License: Strong Copyleft (GPL-3.0)
Updated: 2 y ago

Subtitle Speech Synchronizer

botium-speech-processing by codeforequity-at

JavaScript 939 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Botium Speech Processing

pykaldi by pykaldi

Python 936 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

A Python wrapper for Kaldi

voicefilter by mindslab-ai

Python 912 Version: Current
License: No License (No License)
Updated: 2 y ago

Unofficial PyTorch implementation of Google AI's VoiceFilter system

audiogrep by antiboredom

Python 912 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

Creates audio supercuts.

zhrtvc by KuangDD

Python 890 Version: Current
License: No License (No License)
Updated: 4 y ago

Chinese real time voice cloning (VC) and Chinese text to speech (TTS). An easy-to-use Chinese voice cloning and Chinese speech synthesis system, including a speech encoder, a speech synthesizer, a vocoder, and a visualization module.

espresso by freewym

Python 887 Version: Current
License: Proprietary (Proprietary)
Updated: 3 y ago

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

athena by athena-team

C++ 869 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

An open-source implementation of a sequence-to-sequence based speech processing engine

amodem by romanz

Python 864 Version: Current
License: Proprietary (Proprietary)
Updated: 2 y ago

Audio MODEM Communication Library in Python

speechpy by astorfi

Python 839 Version: Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

SpeechPy - A Library for Speech Processing and Recognition: http://speechpy.readthedocs.io/en/latest/

TensorFlowASR by TensorSpeech

Jupyter Notebook 839 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in TensorFlow 2. Supports languages that can use characters or subwords.

waveform-data.js by bbc

JavaScript 824 Version: Current
License: Weak Copyleft (LGPL-3.0)
Updated: 3 y ago

Audio Waveform Data Manipulation API – resample, offset and segment waveform data in JavaScript.

flowtron by NVIDIA

Jupyter Notebook 817 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style transfer

eesen by srvk

C++ 816 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

The official repository of the Eesen project

piper by rhasspy

C++ 794 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

A fast, local neural text to speech system

stephanie-va by SlapBot

Python 788 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Stephanie is an open-source platform built specifically for voice-controlled applications as well as to automate daily tasks, imitating much of a virtual assistant's work.

FastSpeech by xcmyz

Python 785 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

An implementation of FastSpeech based on PyTorch.

OBS-captions-plugin by ratwithacompiler

C++ 785 Version: Current
License: Strong Copyleft (GPL-2.0)
Updated: 2 y ago

Closed Captioning OBS plugin using Google Speech Recognition

jarvis by alexylem

Shell 780 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Jarvis.sh is a simple configurable multi-lang assistant.

mellotron by NVIDIA

Jupyter Notebook 773 Version: Current
License: Permissive (BSD-3-Clause)
Updated: 2 y ago

Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing training data

CTCDecoder by githubharald

Python 766 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing. Implemented in Python.

larynx by rhasspy

Python 766 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

End to end text to speech system using gruut and ONNX

quillman by modal-labs

JavaScript 730 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

A chat app that transcribes audio in real-time, streams back a response from a language model, and synthesizes this response as natural-sounding speech.

whisper-asr-webservice by ahmetoner

Python 728 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

OpenAI Whisper ASR Webservice API

open_stt by snakers4

Python 727 Version: Current
License: Proprietary (Proprietary)
Updated: 2 y ago

Open STT

ekho by hgneng

C++ 719 Version: Current
License: Proprietary (Proprietary)
Updated: 4 y ago

Chinese text-to-speech engine

speech_recognition by xxbb1234021

Python 716 Version: Current
License: No License (No License)
Updated: 2 y ago

Chinese speech recognition

Speech-Transformer by kaituoxu

Python 709 Version: Current
License: No License (No License)
Updated: 2 y ago

A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.

adapt by MycroftAI

Python 704 Version: Current
License: Permissive (Apache-2.0)
Updated: 3 y ago

Adapt Intent Parser

stream-audio-fingerprint by adblockradio

JavaScript 704 Version: Current
License: Weak Copyleft (MPL-2.0)
Updated: 4 y ago

Audio landmark fingerprinting as a Node Stream module

mycroft-precise by MycroftAI

Python 700 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

A lightweight, simple-to-use, RNN wake word listener

wenet by mobvoi

Python 687 Version: Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

Production First and Production Ready End-to-End Speech Recognition Toolkit

lhotse by lhotse-speech

Python 686 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

Tools for handling speech data in machine learning projects.

nodejs-speech by googleapis

TypeScript 683 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

Node.js client for Google Cloud Speech: Speech to text conversion powered by machine learning.

mimic3 by MycroftAI

Python 676 Version: Current
License: Strong Copyleft (AGPL-3.0)
Updated: 2 y ago

A fast local neural text to speech engine for Mycroft

ttskit by kuangdd

Python 661 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Text to speech toolkit. An easy-to-use Chinese speech synthesis toolbox, including a speech encoder, a speech synthesizer, a vocoder, and a visualization module.

Python-ai-assistant by ggeop

Python 660 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Python AI assistant 🧠

voice-assistant-scripts by alan-ai

JavaScript 659 Version: Current
License: No License (No License)
Updated: 2 y ago

Example scripts for voice assistants created with the Alan AI Platform.

speech by awni

Python 654 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

A PyTorch Implementation of End-to-End Models for Speech-to-Text

speaker-recognition by ppwwyyxx

C++ 645 Version: Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

A Speaker Recognition System

LibreASR by iceychris

Python 642 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

An On-Premises, Streaming Speech Recognition System

Cognitive-Speech-TTS by Azure-Samples

C# 635 Version: Current
License: Proprietary (Proprietary)
Updated: 2 y ago

Microsoft Text-to-Speech API sample code in several languages, part of Cognitive Services.

mysam by mysamai

JavaScript 628 Version: Current
License: Proprietary (Proprietary)
Updated: 4 y ago

An open "intelligent" assistant for the web that can listen to you and learn.

auditok by amsehili

Python 626 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

An audio/acoustic activity detection and audio segmentation tool

speech-demo by Baidu-AIP

Java 623 Version: Current
License: No License (No License)
Updated: 2 y ago

Speech API examples

kuromoji.js by takuyaa

JavaScript 620 Version: Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

JavaScript implementation of a Japanese morphological analyzer

aiexperiments-drum-machine by googlecreativelab

JavaScript 615 Version: Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

Thousands of everyday sounds, organized using machine learning.

mimic1 by MycroftAI

C 615 Version: Current
License: Proprietary (Proprietary)
Updated: 4 y ago

Mycroft's TTS engine, based on CMU's Flite (Festival Lite)

Lip2Wav by Rudrabha

Python 613 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

This is the repository containing the code for our CVPR 2020 paper titled "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"


Open Weaver – Develop Applications Faster with Open Source


