Speech Libraries - Page 30

UDP-CPP by UnknownDetectionParty

C++ 25 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

Unknown Detection Party

DroidAudio by yqpan1991

C 25 Version: Current
License: Permissive (Apache-2.0)
Updated: 3 y ago

Useful for learning the Android audio system

Listen-Attend-Spell-v2 by foamliu

Shell 25 Version: Current
License: Permissive (MIT)
Updated: 3 y ago

PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).

Khronos by syb0rg

C 25 Version: Current
License: Strong Copyleft (GPL-2.0)
Updated: 5 y ago

The open source intelligent personal assistant

voice-commands by baitsart

Shell 25 Version: Current
License: No License (No License)
Updated: 4 y ago

Voice commands (command your PC with spoken commands)

tritium by syb0rg

C 25 Version: Current
License: Permissive (MIT)
Updated: 3 y ago

A free, premium quality speech synthesis engine written completely in C.

tf-flowavenet by gvashkevich

Jupyter Notebook 25 Version: Current
License: Permissive (MIT)
Updated: 3 y ago

Tensorflow implementation of "FloWaveNet: A Generative Flow for Raw Audio"

SRT-To-SSML by ThioJoe

Python 25 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Converts SRT subtitle file to SSML file with speech durations

pc-ddsp by yxlllc

Python 25 Version: Current
License: Permissive (MIT)
Updated: 1 y ago

Pitch Controllable DDSP Vocoders

Only-Noisy-Training by liqingchunnnn

Python 25 Version: Current
License: No License (No License)
Updated: 2 y ago

A self-supervised speech denoising strategy named Only-Noisy Training (ONT), which solves the speech denoising problem with only noisy audio signals in audio space for the first time.

Google-speech-to-text-python-websocket-server-using-microphone-stream by dawntcherian

Python 24 Version: Current
License: No License (No License)
Updated: 4 y ago

Python WebSocket server which converts input audio stream from microphone to text using Google speech to text

ppt_presenter by chaonan99

Python 24 Version: Current
License: No License (No License)
Updated: 4 y ago

Convert ppt to video with audio track, using text to speech synthesis

koremo by warnikchow

Python 24 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

5-class Korean speech emotion classifier

PyHAL by jfach

Python 24 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

HAL-9000 Speech Simulator

speech_separation by xuchenglin28

Python 24 Version: Current
License: No License (No License)
Updated: 3 y ago

Constrained Permutation Invariant Training, Speech Separation

speech-training-recorder by daanzu

Python 24 Version: Current
License: Strong Copyleft (AGPL-3.0)
Updated: 4 y ago

A simple GUI application to help record audio dictated from given text prompts, for use with training speech recognition.

voice-conversion by azraelkuan

Python 24 Version: Current
License: No License (No License)
Updated: 4 y ago

A tutorial implementation of voice conversion using PyTorch

nala by jim-schwoebel

Python 24 Version: Current
License: Proprietary (Proprietary)
Updated: 4 y ago

🦁 Nala is an agile open-source voice assistant framework (20+ actions).

nlp_speech by nguyenhuyanhh

Python 24 Version: Current
License: No License (No License)
Updated: 4 y ago

Speech recognition using Google Cloud Speech API

watson-ipa-web-nodejs by biosopher

JavaScript 24 Version: Current
License: Permissive (Apache-2.0)
Updated: 6 y ago

Create a web-based intelligent personal assistant (IPA) in NodeJS using two Watson services: Natural Language Classifier (NLC) and Dialog. https://www.ibm.com/smarterplanet/us/en/ibmwatson/developercloud/doc/nl-classifier/ www.ibm.com/smarterplanet/us/en/ibmwatson/developercloud/dialog.html

text2wav.node.js by abbr

JavaScript 24 Version: Current
License: Permissive (MIT)
Updated: 3 y ago

Self-contained multilingual TTS speech synthesizer for Node.js in pure JS

voice-command by baluubas

C# 24 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

Node module for voice commands using native, offline speech recognition.

RestSharpPolly by yuessir

C# 24 Version: Current
License: No License (No License)
Updated: 3 y ago

RestSharp with Polly

WeBAD by solyarisoftware

JavaScript 24 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Web Browser Audio Detection/Speech Recording Events API

ngx-speech-recognition by kamiazya

TypeScript 24 Version: Current
License: Permissive (MIT)
Updated: 3 y ago

Angular 5+ speech recognition service (based on browser implementation such as Chrome).

tf_multispeakerTTS_fc by caizexin

Python 24 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

The TensorFlow version of multi-speaker TTS training with feedback constraint

AI-Grand-Challenge-2020 by NeuroAI-PI

Python 24 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

AI grand challenge 2020 Repo (Speech Recognition Track)

ExtensibleTTS-PyTorch by huiw39

Python 24 Version: Current
License: No License (No License)
Updated: 4 y ago

An extensible speech synthesis system built with PyTorch; the original code is from r9y9's https://github.com/r9y9/nnmnkwii_gallery

hassio-addons by rhasspy

Shell 24 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Add-ons for Home Assistant's Hass.IO

KWS-Scripts by lallubharteja

Shell 24 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

Keyword Search Recipe for Subword ASR

Say by youknowone

Swift 24 Version: Current
License: Strong Copyleft (GPL-3.0)
Updated: 4 y ago

Convert text to audible speech. Play it or save it to an audio file.

Few-Shot-KWS by ArchitParnami

Jupyter Notebook 24 Version: Current
License: No License (No License)
Updated: 3 y ago

Few-Shot Keyword Spotting

asterisk-voicekit-modules by Tinkoff

Shell 24 Version: Current
License: No License (No License)
Updated: 3 y ago

Non-blocking Asterisk modules for accessing VoiceKit services for speech recognition and speech synthesis.

SpeechSplit2 by biggytruck

Python 24 Version: Current
License: No License (No License)
Updated: 3 y ago

Official implementation of SpeechSplit2

speaker-anonymization by DigitalPhonetics

Python 24 Version: Current
License: Strong Copyleft (GPL-3.0)
Updated: 2 y ago

Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.

spaceslounge by avie-dev

JavaScript 24 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Twitter Spaces Host and Speaker's Lounge

ConvS2S-VC by kamepong

Python 24 Version: Current
License: No License (No License)
Updated: 2 y ago

OneReality by DogeLord081

Python 24 Version: Current
License: Strong Copyleft (GPL-3.0)
Updated: 2 y ago

A virtual waifu that you can speak to through your mic and it'll speak back to you!

miPhysics_Processing by mi-creative

Java 23 Version: Current
License: Strong Copyleft (GPL-3.0)
Updated: 4 y ago

Mass interaction physics library for Processing, including Audio and Haptic capabilities. Latest compiled release for Processing environment: https://github.com/mi-creative/miPhysics_Processing/releases/tag/2.0.0

sbrt2017 by igormq

Python 23 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

Towards an end-to-end speech recognizer for Portuguese using deep neural networks

py-espeak-ng by gooofy

Python 23 Version: Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possible.

snickery by CSTR-Edinburgh

Python 23 Version: Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

Hybrid speech synthesiser

Speech-Separation-TF2 by r06944010

Python 23 Version: Current
License: No License (No License)
Updated: 4 y ago

Tensorflow 2 implementation of Speech Separation Methods

PAGAN by Zihang97

Python 23 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

PAGAN: a phase-adapted GAN for speech enhancement

tts-kor by jw9730

Python 23 Version: Current
License: No License (No License)
Updated: 3 y ago

[KAIST CS420] Transfer Learning from Speaker Verification to Zero-Shot Multispeaker Korean Text-To-Speech Synthesis

subword-kaldi by aalto-speech

Python 23 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

Properly handle position-dependent phones in a subword lexicon FST

Joint-Slot-Filling by pengshuang

Python 23 Version: Current
License: No License (No License)
Updated: 4 y ago

Pytorch implementation of "Attention-Based Recurrent Neural Network Models for Joint Intent Detection and Slot Filling"

Speaker-Recognition by mialrr

Python 23 Version: Current
License: No License (No License)
Updated: 3 y ago

Voiceprint Recognition (VPR), also known as Speaker Recognition, comes in two types: Speaker Identification and Speaker Verification.

ConvolutionaNeuralNetworksToEnhanceCodedSpeech by ansleliu

Python 23 Version: Current
License: Permissive (BSD-3-Clause)
Updated: 4 y ago

In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the cepstral domain to enhance the coded speech without any modification of the codecs. The time domain approach follows an end-to-end fashion, while the cepstral domain approach uses analysis-synthesis with cepstral domain features. The proposed postprocessors in both domains are evaluated for various narrowband and wideband speech codecs in a wide range of conditions. The proposed postprocessor improves speech quality (PESQ) by up to 0.25 MOS-LQO points for G.711, 0.30 points for G.726, 0.82 points for G.722, and 0.26 points for adaptive multirate wideband codec (AMR-WB). In a subjective CCR listening test, the proposed postprocessor on G.711-coded speech exceeds the speech quality of an ITU-T-standardized postfilter by 0.36 CMOS points, and obtains a clear preference of 1.77 CMOS points compared to G.711, even en par with uncoded speech.

wavelet-denoiser by actonDev

Python 23 Version: Current
License: No License (No License)
Updated: 4 y ago

A wavelet audio denoiser done in Python
