Speech Libraries - Page 18

cloud-asr by UFAL-DSG

Python 65 Version:Current
License: Permissive (Apache-2.0)

Cloud-based Automatic Speech Recognition (ASR) platform and a public ASR webservice.

UnityGoogleStreamingSpeechToText by oshoham

C# 65 Version:Current
License: Permissive (MIT)

A Unity plugin for real-time, indefinite speech-to-text transcription from a microphone using Google Cloud Speech-to-Text.

80speak by connornishijima

C 65 Version:Current
License: Strong Copyleft (GPL-3.0)

80speak is an online speech synthesizer based on DECtalk, famously used by Professor Stephen Hawking, The US National Weather Service, Back To The Future Part II, and Benny Benassi.

SAPI4 by TETYYS

C++ 65 Version:Current
License: Permissive (MIT)

Web interface for Microsoft Sam & friends

mimic3-voices by MycroftAI

HTML 65 Version:Current
License: Strong Copyleft (CC-BY-SA-4.0)

Voice models for the Mimic 3 text-to-speech system

Working-with-the-Web-Audio-API by joshreiss

HTML 65 Version:Current
License: No License (No License)

Various simple Web Audio API examples

willow-inference-server by toverainc

Python 65 Version:Current
License: No License (No License)

Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS

whisper-standalone-win by Purfview

Python 65 Version:Current
License: No License (No License)

Standalone executables for those who don't want to bother with Python.

LAVSE by kagaminccino

Python 64 Version:Current
License: Permissive (MIT)

Python code for Lite Audio-Visual Speech Enhancement.

Inimesed by Kaljurand

Java 64 Version:Current
License: Permissive (Apache-2.0)

An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.

speech-recorder by serenadeai

C++ 64 Version:Current
License: Permissive (MIT)

speech-recorder is a node.js module for streaming audio from a device's microphone and filtering for speech.

Speech by Microsoft

cloud_api 64 Version:Current
License: Proprietary (Proprietary)

Convert audio to text, understand intent, and convert text back to speech for natural responsiveness.

resemble-alexa by resemble-ai

Python 64 Version:Current
License: Permissive (MIT)

This is sample code for an Alexa skill that uses realistic voice cloning powered by Resemble AI's text-to-speech API, and Open AI’s GPT-3 AI engine.

Transformer-Transducer by oshindow

Python 64 Version:Current
License: No License (No License)

A pytorch_lightning reimplementation of the Transducer module from ESPnet.

watson-speech-translator by IBM

JavaScript 63 Version:Current
License: Permissive (Apache-2.0)

Use Watson Speech to Text, Language Translator, and Text to Speech in a web app with React components

speechmarkdown-js by speechmarkdown

TypeScript 63 Version:Current
License: Permissive (MIT)

Speech Markdown grammar, parser, and formatters for use with JavaScript.

tacotron-tts-cpp by syoyo

C++ 63 Version:Current
License: Permissive (MIT)

Tacotron text-to-speech in C++ (synthesis only)

mopo by mtytel

C++ 63 Version:Current
License: Strong Copyleft (GPL-3.0)

Modular and Polyphonic audio synthesis library

Cognitive-Services-Voice-Assistant by Azure-Samples

C++ 63 Version:Current
License: Permissive (MIT)

Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building a client application for your bot or Custom Command service. You will also be able to easily deploy a working Custom Command-based Voice Assistant to your own Azure subscription.

FLEX by purvaten

Python 63 Version:Current
License: No License (No License)

Code for our CVPR'23 paper - "FLEX: Full-Body Grasping Without Full-Body Grasps"

TasNet by kaituoxu

Python 62 Version:Current
License: No License (No License)

A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.

Word-to-Number-Russian by SergeyShk

Python 62 Version:Current
License: Permissive (MIT)

A project for converting numbers written out in words in Russian.

TextToSpeechPlugin by jamesmontemagno

C# 62 Version:Current
License: Permissive (MIT)

Text to Speech Plugin for Xamarin and Windows

ios-client by speechly

Swift 62 Version:Current
License: Permissive (MIT)

The iOS client library for Speechly API

voxseg by NickWilkinson37

Python 62 Version:Current
License: Permissive (MIT)

A python library for voice activity detection (VAD) for speech/non-speech segmentation.

Reverb.js by andibrae

HTML 62 Version:Current
License: Permissive (CC0-1.0)

Reverb.js is a Web Audio API extension for creating reverb nodes and an accompanying impulse-response reverb library.

audio.whisper by bnosac

C 62 Version:Current
License: Proprietary (Proprietary)

Transcribe audio files using the "Whisper" Automatic Speech Recognition model from R

rtx-voice-script by amirldn

Python 62 Version:Current
License: Strong Copyleft (GPL-3.0)

A python script that takes an input MP3/FLAC and outputs an acapella/background noise stripped WAV using the power of NVIDIA's RTX Voice

simple-speech-recognition by kelvinguu

Python 61 Version:Current
License: No License (No License)

A complete speech recognition system you can deploy with just a few lines of Python, built on CMU Sphinx-4.

TD-PSOLA by sannawag

Python 61 Version:Current
License: Permissive (MIT)

A simple pitch shifting script (Time-Domain Pitch-Synchronous Overlap and Add)

asr_preprocessing by hirofumi0810

Python 61 Version:Current
License: Permissive (MIT)

Python implementation of pre-processing for End-to-End speech recognition

MMM-voice by fewieden

JavaScript 61 Version:Current
License: Permissive (MIT)

Offline Voice Recognition Module for MagicMirror²

honkling by castorini

JavaScript 61 Version:Current
License: Permissive (MIT)

Web app for keyword spotting using TensorflowJS

spoken by stephenlb

JavaScript 61 Version:Current
License: No License (No License)

Spoken - JavaScript Text-to-Speech and Speech-to-Text for Artificial Intelligence (AI) Apps

voice-input-button2 by ferrinweb

JavaScript 61 Version:Current
License: No License (No License)

New version of the voice input button, a Vue component built on iFlytek's new voice dictation API (the streaming version).

lessampler by YuzukiTsuru

C++ 61 Version:Current
License: Weak Copyleft (LGPL-3.0)

lessampler is a Singing Voice Synthesizer

EasyVC by MingjieChen

Python 61 Version:Current
License: Permissive (Apache-2.0)

JVoiceXML by JVoiceXML

Java 60 Version:Current
License: Weak Copyleft (LGPL-2.1)

Open Source VoiceXML interpreter

GST-tacotron by acetylSv

Python 60 Version:Current
License: No License (No License)

Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09017.pdf)

kaldi-python by janchorowski

C++ 60 Version:Current
License: Permissive (Apache-2.0)

Python wrappers for Kaldi data

speechrtc by andrenatal

C 60 Version:Current
License: No License (No License)

Speech recognition using webrtc for FirefoxOS

musicg by loisaidasam

Java 59 Version:Current
License: No License (No License)

Automatically exported from code.google.com/p/musicg

FFTNet by azraelkuan

Python 59 Version:Current
License: No License (No License)

FFTNet: a Real-Time Speaker-Dependent Neural Vocoder

avsr-tf1 by georgesterpu

Python 59 Version:Current
License: Strong Copyleft (GPL-3.0)

Audio-Visual Speech Recognition using Sequence to Sequence Models

DiscordEarsBot by healzer

JavaScript 59 Version:Current
License: Permissive (MIT)

A speech-to-text framework and bot for Discord. Take control of your Discord server using speech and voice commands. Can also be useful for hearing impaired and deaf people.

mage by numediart

C++ 59 Version:Current
License: Proprietary (Proprietary)

MAGE is a C/C++ software toolkit for reactive implementation of HMM-based speech and singing synthesis.

Nakloid by acknak

C++ 59 Version:Current
License: Permissive (MIT)

Nakloid: Unit-waveform-oriented Singing Voice Synthesis System

StageMate by Langhalsdino

HTML 59 Version:Current
License: Permissive (MIT)

StageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.

kaldi.js by adrianbg

C++ 59 Version:Current
License: Proprietary (Proprietary)

Parallel-Tacotron2 by keonlee9420

Python 59 Version:Current
License: Permissive (MIT)

Pytorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

cloud-asr by UFAL-DSG

Cloud-based Automatic Speech Recognition (ASR) platform and a public ASR webservice.

Python

Updated: 4 y ago

License: Permissive (Apache-2.0)

UnityGoogleStreamingSpeechToText by oshoham

A Unity plugin for real-time, indefinite speech-to-text transcription from a microphone using Google Cloud Speech-to-Text.

Updated: 2 y ago

License: Permissive (MIT)

80speak by connornishijima

80speak is an online speech synthesizer based on DECtalk, famously used by Professor Stephen Hawking, The US National Weather Service, Back To The Future Part II, and Benny Benassi.

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

SAPI4 by TETYYS

Web interface for Microsoft Sam & friends

C++

Updated: 2 y ago

License: Permissive (MIT)

mimic3-voices by MycroftAI

Voice models for the Mimic 3 text-to-speech system

HTML

Updated: 2 y ago

License: Strong Copyleft (CC-BY-SA-4.0)

Working-with-the-Web-Audio-API by joshreiss

Various simple Web Audio API examples

HTML

Updated: 2 y ago

License: No License (No License)

willow-inference-server by toverainc

Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS

Python

Updated: 2 y ago

License: No License (No License)

whisper-standalone-win by Purfview

Standalone executables for those who don't want to bother with Python.

Python

Updated: 2 y ago

License: No License (No License)

LAVSE by kagaminccino

Python code for Lite Audio-Visual Speech Enhancement.

Python

Updated: 4 y ago

License: Permissive (MIT)

Inimesed by Kaljurand

An Android app that lets you search your contacts by voice. Internet not required. Based on Pocketsphinx. Uses Estonian acoustic models.

Java

Updated: 5 y ago

License: Permissive (Apache-2.0)

speech-recorder by serenadeai

speech-recorder is a node.js module for streaming audio from a device's microphone and filtering for speech.

C++

Updated: 2 y ago

License: Permissive (MIT)

Speech by Microsoft

Convert audio to text, understand intent, and convert text back to speech for natural responsiveness.

cloud_api

Updated: Current

License: Proprietary (Proprietary)

resemble-alexa by resemble-ai

This is sample code for an Alexa skill that uses realistic voice cloning powered by Resemble AI's text-to-speech API, and Open AI’s GPT-3 AI engine.

Python

Updated: 2 y ago

License: Permissive (MIT)

Transformer-Transducer by oshindow

A pytorch_lightning reimplementation of the Transducer module from ESPnet.

Python

Updated: 4 y ago

License: No License (No License)

watson-speech-translator by IBM

Use Watson Speech to Text, Language Translator, and Text to Speech in a web app with React components

JavaScript

Updated: 3 y ago

License: Permissive (Apache-2.0)

speechmarkdown-js by speechmarkdown

Speech Markdown grammar, parser, and formatters for use with JavaScript.

TypeScript

Updated: 2 y ago

License: Permissive (MIT)

tacotron-tts-cpp by syoyo

Tacotron text-to-speech in C++ (synthesis only)

C++

Updated: 4 y ago

License: Permissive (MIT)

mopo by mtytel

Modular and Polyphonic audio synthesis library

C++

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

Cognitive-Services-Voice-Assistant by Azure-Samples

Welcome to the Microsoft Voice Assistant samples repository! Here you will find samples to help you get started building a client application for your bot or Custom Command service. You will also be able to easily deploy a working Custom Command-based Voice Assistant to your own Azure subscription.

C++

Updated: 2 y ago

License: Permissive (MIT)

FLEX by purvaten

Code for our CVPR'23 paper - "FLEX: Full-Body Grasping Without Full-Body Grasps"

Python

Updated: 2 y ago

License: No License (No License)

TasNet by kaituoxu

A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.

Python

Updated: 4 y ago

License: No License (No License)

Word-to-Number-Russian by SergeyShk

A project for converting numbers written out in words in Russian.

Python

Updated: 4 y ago

License: Permissive (MIT)

TextToSpeechPlugin by jamesmontemagno

Text to Speech Plugin for Xamarin and Windows

Updated: 2 y ago

License: Permissive (MIT)

ios-client by speechly

The iOS client library for Speechly API

Swift

Updated: 4 y ago

License: Permissive (MIT)

voxseg by NickWilkinson37

A python library for voice activity detection (VAD) for speech/non-speech segmentation.

Python

Updated: 2 y ago

License: Permissive (MIT)

Reverb.js by andibrae

Reverb.js is a Web Audio API extension for creating reverb nodes and an accompanying impulse-response reverb library.

HTML

Updated: 2 y ago

License: Permissive (CC0-1.0)

audio.whisper by bnosac

Transcribe audio files using the "Whisper" Automatic Speech Recognition model from R

Updated: 2 y ago

License: Proprietary (Proprietary)

rtx-voice-script by amirldn

A python script that takes an input MP3/FLAC and outputs an acapella/background noise stripped WAV using the power of NVIDIA's RTX Voice

Python

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

simple-speech-recognition by kelvinguu

A complete speech recognition system you can deploy with just a few lines of Python, built on CMU Sphinx-4.

Python

Updated: 4 y ago

License: No License (No License)

TD-PSOLA by sannawag

A simple pitch shifting script (Time-Domain Pitch-Synchronous Overlap and Add)

Python

Updated: 4 y ago

License: Permissive (MIT)

asr_preprocessing by hirofumi0810

Python implementation of pre-processing for End-to-End speech recognition

Python

Updated: 4 y ago

License: Permissive (MIT)

MMM-voice by fewieden

Offline Voice Recognition Module for MagicMirror²

JavaScript

Updated: 4 y ago

License: Permissive (MIT)

honkling by castorini

Web app for keyword spotting using TensorflowJS

JavaScript

Updated: 3 y ago

License: Permissive (MIT)

spoken by stephenlb

Spoken - JavaScript Text-to-Speech and Speech-to-Text for Artificial Intelligence (AI) Apps

JavaScript

Updated: 2 y ago

License: No License (No License)

voice-input-button2 by ferrinweb

New version of the voice input button, a Vue component built on iFlytek's new voice dictation API (the streaming version).

JavaScript

Updated: 2 y ago

License: No License (No License)

lessampler by YuzukiTsuru

lessampler is a Singing Voice Synthesizer

C++

Updated: 2 y ago

License: Weak Copyleft (LGPL-3.0)

EasyVC by MingjieChen

Python

Updated: 2 y ago

License: Permissive (Apache-2.0)

JVoiceXML by JVoiceXML

Open Source VoiceXML interpreter

Java

Updated: 2 y ago

License: Weak Copyleft (LGPL-2.1)

GST-tacotron by acetylSv

Reproducing Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis (https://arxiv.org/pdf/1803.09017.pdf)

Python

Updated: 4 y ago

License: No License (No License)

kaldi-python by janchorowski

Python wrappers for Kaldi data

C++

Updated: 4 y ago

License: Permissive (Apache-2.0)

speechrtc by andrenatal

Speech recognition using webrtc for FirefoxOS

Updated: 6 y ago

License: No License (No License)

musicg by loisaidasam

Automatically exported from code.google.com/p/musicg

Java

Updated: 4 y ago

License: No License (No License)

FFTNet by azraelkuan

FFTNet: a Real-Time Speaker-Dependent Neural Vocoder

Python

Updated: 5 y ago

License: No License (No License)

avsr-tf1 by georgesterpu

Audio-Visual Speech Recognition using Sequence to Sequence Models

Python

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

DiscordEarsBot by healzer

A speech-to-text framework and bot for Discord. Take control of your Discord server using speech and voice commands. Can also be useful for hearing impaired and deaf people.

JavaScript

Updated: 4 y ago

License: Permissive (MIT)

mage by numediart

MAGE is a C/C++ software toolkit for reactive implementation of HMM-based speech and singing synthesis.

C++

Updated: 4 y ago

License: Proprietary (Proprietary)

Nakloid by acknak

Nakloid: Unit-waveform-oriented Singing Voice Synthesis System

C++

Updated: 4 y ago

License: Permissive (MIT)

StageMate by Langhalsdino

StageMate is the smart assistant for your presentation. It will cover all aspects of your pitch from skipping slides to reminding you if you miss some major point.

HTML

Updated: 5 y ago

License: Permissive (MIT)

kaldi.js by adrianbg

C++

Updated: 5 y ago

License: Proprietary (Proprietary)

Parallel-Tacotron2 by keonlee9420

Pytorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling

Python

Updated: 4 y ago

License: Permissive (MIT)
