Speech Libraries - Page 32

go-speakby nicolaifsf

Go 22 Version:Current
License: Permissive (MIT)

Speech to Text for Golang. Capable of continuous speech. Also wrappers for various Speech to Text APIs

Support

Quality

Security

License

Reuse

gildas-aiby gildasch

Go 22 Version:Current
License: Permissive (MIT)

Easy AI from your friends & family

Support

Quality

Security

License

Reuse

xunfei_ttsby goldengrape

HTML 22 Version:Current
License: Permissive (MIT)

Python library for interfacing with the XunFei text-to-speech API

Support

Quality

Security

License

Reuse

kaldi-ioby open-speech

C 22 Version:Current
License: Permissive (MIT)

c++ Kaldi IO lib (static and dynamic).

Support

Quality

Security

License

Reuse

Shell 22 Version:Current
License: Permissive (Apache-2.0)

nao robot speech recognition module. online file:

Support

Quality

Security

License

Reuse

Go 22 Version:Current
License: Permissive (MIT)

The official Houndify SDK for Go

Support

Quality

Security

License

Reuse

C 22 Version:Current
License: Strong Copyleft (GPL-2.0)

Asterisk module for adjusting pitch of voices

Support

Quality

Security

License

Reuse

fast-gpioby OnionIoT

C++ 22 Version:Current
License: Strong Copyleft (GPL-3.0)

Provides access to GPIOs by directly writing to the hw registers, implements sw PWM as well

Support

Quality

Security

License

Reuse

bubbleby r-lyeh-archived

C++ 22 Version:Current
License: Permissive (Zlib)

:speech_balloon: A simple and lightweight C++11 dialog library (for Windows)

Support

Quality

Security

License

Reuse

Python 22 Version:Current
License: Permissive (MIT)

A Hackable speech recognition library.

Support

Quality

Security

License

Reuse

Thai_TTSby Prim9000

Jupyter Notebook 22 Version:Current
License: Permissive (Apache-2.0)

Thai_TTS is the project about training "Text to Speech in Thai" using Tacotron2 by NVIDIA.

Support

Quality

Security

License

Reuse

REPET-Pythonby zafarrafii

Jupyter Notebook 22 Version:Current
License: No License (No License)

REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REPET-SIM, online REPET-SIM

Support

Quality

Security

License

Reuse

Comprehensive-Tacotron2by keonlee9420

Python 22 Version:Current
License: Permissive (MIT)

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Support

Quality

Security

License

Reuse

TSJ-TTSby FujiwaraShirakana

C++ 22 Version:Current
License: Strong Copyleft (GPL-3.0)

使用C++ OnnxRuntime 重构了Tacotron2的推理，使用Libtorch实现了VITS单角色和多角色模型推理的集成UI软件

Support

Quality

Security

License

Reuse

Unified-Enhance-Separationby YUCHEN005

Python 22 Version:Current
License: Permissive (Apache-2.0)

Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"

Support

Quality

Security

License

Reuse

ld3320by hepingood

C 22 Version:Current
License: Permissive (MIT)

LD3320 full function driver for general MCU and Linux.

Support

Quality

Security

License

Reuse

MB-iSTFT-VITS-44100Hz-Jaby AcogiMin

Jupyter Notebook 22 Version:Current
License: Permissive (Apache-2.0)

【44100Hz and Ja Support】Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Support

Quality

Security

License

Reuse

max-vcby PlayVoice

Python 22 Version:Current
License: Permissive (MIT)

singing voice conversion without f0

Support

Quality

Security

License

Reuse

AudioProcessorby alexanderchiu

Java 21 Version:Current
License: Permissive (MIT)

Java library for speech enhancement

Support

Quality

Security

License

Reuse

python-google-transcribeby korylprince

Python 21 Version:Current
License: No License (No License)

Simple voice to speech transcription using Google

Support

Quality

Security

License

Reuse

Java 21 Version:Current
License: Strong Copyleft (AGPL-3.0)

PMML evaluator library for the Android operating system (http://www.android.com/)

Support

Quality

Security

License

Reuse

Speaker-Recognition-System-using-GMMby genzen2103

Python 21 Version:Current
License: No License (No License)

System for identifying speaker from given speech signal using MFCC,LPC features and Gaussian Mixture Models

Support

Quality

Security

License

Reuse

Python 21 Version:Current
License: Permissive (Apache-2.0)

Example of using Watson's Streaming Speech to Text websockets interface for real time transcription. Written in Python. WARNING: This repository is no longer maintained ⚠️ This repository will not be updated. The repository will be kept available in read-only mode.

Support

Quality

Security

License

Reuse

Python 21 Version:Current
License: No License (No License)

A lightning strike detector using the AS9535 sensor from AMS, that tweets about detected storms.

Support

Quality

Security

License

Reuse

KeenASR-Android-PoCby keenresearch

Java 21 Version:Current
License: No License (No License)

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Support

Quality

Security

License

Reuse

cmusphinx-modelsby collectivat

Python 21 Version:Current
License: Strong Copyleft (AGPL-3.0)

Acoustic and language models for minorised languages.

Support

Quality

Security

License

Reuse

spoken_language_datasetby tomasz-oponowicz

Python 21 Version:Current
License: Permissive (MIT)

The dataset with English, German and Spanish speech samples.

Support

Quality

Security

License

Reuse

PySpeakby johnwyles

Python 21 Version:Current
License: Strong Copyleft (GPL-3.0)

Python Speech Recognition, Voice Recognition, Text-to-Speech and Voice Command Engine

Support

Quality

Security

License

Reuse

Python 21 Version:Current
License: Permissive (MIT)

Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.

Support

Quality

Security

License

Reuse

eloquence_thresholdby pumper42nickel

Python 21 Version:Current
License: No License (No License)

Eloquence synthesizer NVDA add-on compatible with threshold versions of NVDA (2019.3 and later). Supports Python 3 and new NVDA speech framework.

Support

Quality

Security

License

Reuse

ReversoAPIby demian-wolf

Python 21 Version:Current
License: Permissive (MIT)

Reverso API for Python. Currently available for Reverso Context and Reverso Voice

Support

Quality

Security

License

Reuse

gladys-voiceby GladysAssistant

JavaScript 21 Version:Current
License: Permissive (MIT)

[DEPRECATED] To make it possible to speak to Gladys !

Support

Quality

Security

License

Reuse

JavaScript 21 Version:Current
License: Permissive (MIT)

Framework/Library agnostic paystack wrapper

Support

Quality

Security

License

Reuse

C# 21 Version:Current
License: Weak Copyleft (LGPL-3.0)

Encode EAS (Emergency Alert System - United States) audio messages with valid SAME (Specific Area Message Encoding) headers, EBS (Emergency Broadcast System) attention tones, NWS (National Weather Service) attention tones, and/or spoken announcement, which is synthesized by Microsoft SAPI TTS voices. Supports output to .wav/.mp3 file or MemoryStream.

Support

Quality

Security

License

Reuse

asr-rescoringby diego-fustes

Python 21 Version:Current
License: Permissive (Apache-2.0)

Rescoring methods for end-to-end Automatic Speech Recognition

Support

Quality

Security

License

Reuse

C++ 21 Version:Current
License: Permissive (MIT)

Japanese text-to-speech engine binding for NodeJS

Support

Quality

Security

License

Reuse

TypeScript 21 Version:Current
License: Permissive (MIT)

🔛 Angular 5+ Detect online/offline state

Support

Quality

Security

License

Reuse

TypeScript 21 Version:Current
License: Permissive (MIT)

Node.js SDK for the Rev AI API

Support

Quality

Security

License

Reuse

TypeScript 21 Version:Current
License: Permissive (MIT)

This is a Polyfill for the HTML5 Speech Recognition API. It uses Microsoft's Cognitive Services as a backend. All Browsers supporting WebRTC will be supported by this Polyfill.

Support

Quality

Security

License

Reuse

mp3netby korneelvdbroek

Python 21 Version:Current
License: Permissive (MIT)

A convolutional generative audio synthesis model

Support

Quality

Security

License

Reuse

DORiby crodriguezo

Python 21 Version:Current
License: No License (No License)

Public repository for DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video Code accompanying the paper

Support

Quality

Security

License

Reuse

my_hmm_gmm_speech_recognitionby cc8848

Python 21 Version:Current
License: No License (No License)

基于python的hmm-gmm声学模型

Support

Quality

Security

License

Reuse

sepia-stt-serverby SEPIA-Framework

Python 21 Version:Current
License: Permissive (MIT)

SEPIA server to support open-source speech recognition via WebSocket connection.

Support

Quality

Security

License

Reuse

DNN-HSMMby sp-nitech

Python 21 Version:Current
License: Permissive (BSD-3-Clause)

pytorch implementation of DNN-HSMM for TTS

Support

Quality

Security

License

Reuse

porfirby morfeusys

Kotlin 21 Version:Current
License: Permissive (Apache-2.0)

Голосовой ассистент Порфирьевич

Support

Quality

Security

License

Reuse

vadby dreamflyforever

C 21 Version:Current
License: No License (No License)

Vad (Voice Activity Detection ) is for embeded system.

Support

Quality

Security

License

Reuse

C 21 Version:Current
License: Proprietary (Proprietary)

AMBE/AMBE+ Vocoder implementation/decoding library.

Support

Quality

Security

License

Reuse

asterisk-eagi-google-speech-recognitionby phsultan

Shell 21 Version:Current
License: No License (No License)

An example of how to use Asterisk EAGI along with Google Speech recognition to transcribe voice to text

Support

Quality

Security

License

Reuse

kanji-handwriting-swiftby tuanna-hsp

C++ 21 Version:Current
License: No License (No License)

Kanji handwriting recognition for iOS using Zinnia.

Support

Quality

Security

License

Reuse

srvk-eesen-offline-transcriberby srvk

Shell 21 Version:Current
License: No License (No License)

Top level code to transcribe English audio/video files into text/subtitles

Support

Quality

Security

License

Reuse

go-speakby nicolaifsf

Speech to Text for Golang. Capable of continuous speech. Also wrappers for various Speech to Text APIs

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

gildas-aiby gildasch

Easy AI from your friends & family

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

xunfei_ttsby goldengrape

Python library for interfacing with the XunFei text-to-speech API

HTML

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

kaldi-ioby open-speech

c++ Kaldi IO lib (static and dynamic).

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

ALSoundRecognitionby zyqzyq

nao robot speech recognition module. online file:

Shell

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

houndify-sdk-goby soundhound

The official Houndify SDK for Go

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

asterisk-voicechangerby jart

Asterisk module for adjusting pitch of voices

Updated: 4 y ago

License: Strong Copyleft (GPL-2.0)

Support

Quality

Security

License

Reuse

fast-gpioby OnionIoT

Provides access to GPIOs by directly writing to the hw registers, implements sw PWM as well

C++

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

bubbleby r-lyeh-archived

:speech_balloon: A simple and lightweight C++11 dialog library (for Windows)

C++

Updated: 5 y ago

License: Permissive (Zlib)

Support

Quality

Security

License

Reuse

thunder-speechby scart97

A Hackable speech recognition library.

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Thai_TTSby Prim9000

Thai_TTS is the project about training "Text to Speech in Thai" using Tacotron2 by NVIDIA.

Jupyter Notebook

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

REPET-Pythonby zafarrafii

REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REPET-SIM, online REPET-SIM

Jupyter Notebook

Updated: 1 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

Comprehensive-Tacotron2by keonlee9420

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

Python

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

TSJ-TTSby FujiwaraShirakana

使用C++ OnnxRuntime 重构了Tacotron2的推理，使用Libtorch实现了VITS单角色和多角色模型推理的集成UI软件

C++

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

Unified-Enhance-Separationby YUCHEN005

Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"

Python

Updated: 1 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

ld3320by hepingood

LD3320 full function driver for general MCU and Linux.

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

MB-iSTFT-VITS-44100Hz-Jaby AcogiMin

【44100Hz and Ja Support】Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

Jupyter Notebook

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

max-vcby PlayVoice

singing voice conversion without f0

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

AudioProcessorby alexanderchiu

Java library for speech enhancement

Java

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

python-google-transcribeby korylprince

Simple voice to speech transcription using Google

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

jpmml-androidby jpmml

PMML evaluator library for the Android operating system (http://www.android.com/)

Java

Updated: 4 y ago

License: Strong Copyleft (AGPL-3.0)

Support

Quality

Security

License

Reuse

Speaker-Recognition-System-using-GMMby genzen2103

System for identifying speaker from given speech signal using MFCC,LPC features and Gaussian Mixture Models

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

watson-streaming-sttby IBM

Example of using Watson's Streaming Speech to Text websockets interface for real time transcription. Written in Python. WARNING: This repository is no longer maintained ⚠️ This repository will not be updated. The repository will be kept available in read-only mode.

Python

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

LightningTweeterby Hexalyse

A lightning strike detector using the AS9535 sensor from AMS, that tweets about detected storms.

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

KeenASR-Android-PoCby keenresearch

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

Java

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

cmusphinx-modelsby collectivat

Acoustic and language models for minorised languages.

Python

Updated: 4 y ago

License: Strong Copyleft (AGPL-3.0)

Support

Quality

Security

License

Reuse

spoken_language_datasetby tomasz-oponowicz

The dataset with English, German and Spanish speech samples.

Python

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

PySpeakby johnwyles

Python Speech Recognition, Voice Recognition, Text-to-Speech and Voice Command Engine

Python

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

CS-Tacotron-Pytorchby andi611

Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

eloquence_thresholdby pumper42nickel

Eloquence synthesizer NVDA add-on compatible with threshold versions of NVDA (2019.3 and later). Supports Python 3 and new NVDA speech framework.

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

ReversoAPIby demian-wolf

Reverso API for Python. Currently available for Reverso Context and Reverso Voice

Python

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

gladys-voiceby GladysAssistant

[DEPRECATED] To make it possible to speak to Gladys !

JavaScript

Updated: 5 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

paystack-simpleby ashinzekene

Framework/Library agnostic paystack wrapper

JavaScript

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

EAS-Encoderby SotaJoe

Encode EAS (Emergency Alert System - United States) audio messages with valid SAME (Specific Area Message Encoding) headers, EBS (Emergency Broadcast System) attention tones, NWS (National Weather Service) attention tones, and/or spoken announcement, which is synthesized by Microsoft SAPI TTS voices. Supports output to .wav/.mp3 file or MemoryStream.

Updated: 3 y ago

License: Weak Copyleft (LGPL-3.0)

Support

Quality

Security

License

Reuse

asr-rescoringby diego-fustes

Rescoring methods for end-to-end Automatic Speech Recognition

Python

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

node-openjtalkby TanUkkii007

Japanese text-to-speech engine binding for NodeJS

C++

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

ngx-online-statusby VadimDez

🔛 Angular 5+ Detect online/offline state

TypeScript

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

revai-node-sdkby revdotcom

Node.js SDK for the Rev AI API

TypeScript

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

speech-polyfillby anteloe

This is a Polyfill for the HTML5 Speech Recognition API. It uses Microsoft's Cognitive Services as a backend. All Browsers supporting WebRTC will be supported by this Polyfill.

TypeScript

Updated: 5 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

mp3netby korneelvdbroek

A convolutional generative audio synthesis model

Python

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

DORiby crodriguezo

Public repository for DORi: Discovering Object Relationships for Moment Localization of a Natural Language Query in a Video Code accompanying the paper

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

my_hmm_gmm_speech_recognitionby cc8848

基于python的hmm-gmm声学模型

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

sepia-stt-serverby SEPIA-Framework

SEPIA server to support open-source speech recognition via WebSocket connection.

Python

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

DNN-HSMMby sp-nitech

pytorch implementation of DNN-HSMM for TTS

Python

Updated: 3 y ago

License: Permissive (BSD-3-Clause)

Support

Quality

Security

License

Reuse

porfirby morfeusys

Голосовой ассистент Порфирьевич

Kotlin

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

vadby dreamflyforever

Vad (Voice Activity Detection ) is for embeded system.

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

mbelib-testingby pbarfuss

AMBE/AMBE+ Vocoder implementation/decoding library.

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

asterisk-eagi-google-speech-recognitionby phsultan

An example of how to use Asterisk EAGI along with Google Speech recognition to transcribe voice to text

Shell

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

kanji-handwriting-swiftby tuanna-hsp

Kanji handwriting recognition for iOS using Zinnia.

C++

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

srvk-eesen-offline-transcriberby srvk

Top level code to transcribe English audio/video files into text/subtitles

Shell

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

Open Weaver – Develop Applications Faster with Open Source

Terms
Privacy policy

Speech Libraries - Page 32

go-speakby nicolaifsf

Go 22 Version:Current License: Permissive (MIT)

Speech to Text for Golang. Capable of continuous speech. Also wrappers for various Speech to Text APIs

gildas-aiby gildasch

Go 22 Version:Current License: Permissive (MIT)

Easy AI from your friends & family

xunfei_ttsby goldengrape

HTML 22 Version:Current License: Permissive (MIT)

Python library for interfacing with the XunFei text-to-speech API

kaldi-ioby open-speech

C 22 Version:Current License: Permissive (MIT)

c++ Kaldi IO lib (static and dynamic).

ALSoundRecognitionby zyqzyq

Shell 22 Version:Current License: Permissive (Apache-2.0)

nao robot speech recognition module. online file:

houndify-sdk-goby soundhound

Go 22 Version:Current License: Permissive (MIT)

The official Houndify SDK for Go

asterisk-voicechangerby jart

C 22 Version:Current License: Strong Copyleft (GPL-2.0)

Asterisk module for adjusting pitch of voices

fast-gpioby OnionIoT

C++ 22 Version:Current License: Strong Copyleft (GPL-3.0)

Provides access to GPIOs by directly writing to the hw registers, implements sw PWM as well

bubbleby r-lyeh-archived

C++ 22 Version:Current License: Permissive (Zlib)

:speech_balloon: A simple and lightweight C++11 dialog library (for Windows)

thunder-speechby scart97

Python 22 Version:Current License: Permissive (MIT)

A Hackable speech recognition library.

Thai_TTSby Prim9000

Jupyter Notebook 22 Version:Current License: Permissive (Apache-2.0)

Thai_TTS is the project about training "Text to Speech in Thai" using Tacotron2 by NVIDIA.

REPET-Pythonby zafarrafii

Jupyter Notebook 22 Version:Current License: No License (No License)

REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REPET-SIM, online REPET-SIM

Comprehensive-Tacotron2by keonlee9420

Python 22 Version:Current License: Permissive (MIT)

PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supports both single-, multi-speaker TTS and several techniques to enforce the robustness and efficiency of the model.

TSJ-TTSby FujiwaraShirakana

C++ 22 Version:Current License: Strong Copyleft (GPL-3.0)

使用C++ OnnxRuntime 重构了Tacotron2的推理，使用Libtorch实现了VITS单角色和多角色模型推理的集成UI软件

Unified-Enhance-Separationby YUCHEN005

Python 22 Version:Current License: Permissive (Apache-2.0)

Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"

ld3320by hepingood

C 22 Version:Current License: Permissive (MIT)

LD3320 full function driver for general MCU and Linux.

MB-iSTFT-VITS-44100Hz-Jaby AcogiMin

Jupyter Notebook 22 Version:Current License: Permissive (Apache-2.0)

【44100Hz and Ja Support】Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform

max-vcby PlayVoice

Python 22 Version:Current License: Permissive (MIT)

singing voice conversion without f0

AudioProcessorby alexanderchiu

Java 21 Version:Current License: Permissive (MIT)

Java library for speech enhancement

python-google-transcribeby korylprince

Python 21 Version:Current License: No License (No License)

Simple voice to speech transcription using Google

jpmml-androidby jpmml

Java 21 Version:Current License: Strong Copyleft (AGPL-3.0)

PMML evaluator library for the Android operating system (http://www.android.com/)

Speaker-Recognition-System-using-GMMby genzen2103

Python 21 Version:Current License: No License (No License)

System for identifying speaker from given speech signal using MFCC,LPC features and Gaussian Mixture Models

watson-streaming-sttby IBM

Python 21 Version:Current License: Permissive (Apache-2.0)

Example of using Watson's Streaming Speech to Text websockets interface for real time transcription. Written in Python. WARNING: This repository is no longer maintained ⚠️ This repository will not be updated. The repository will be kept available in read-only mode.

LightningTweeterby Hexalyse

Python 21 Version:Current License: No License (No License)

A lightning strike detector using the AS9535 sensor from AMS, that tweets about detected storms.

KeenASR-Android-PoCby keenresearch

Java 21 Version:Current License: No License (No License)

A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html

cmusphinx-modelsby collectivat

Python 21 Version:Current License: Strong Copyleft (AGPL-3.0)

Acoustic and language models for minorised languages.

spoken_language_datasetby tomasz-oponowicz

Go 22 Version:Current
License: Permissive (MIT)

Go 22 Version:Current
License: Permissive (MIT)

HTML 22 Version:Current
License: Permissive (MIT)

C 22 Version:Current
License: Permissive (MIT)

Shell 22 Version:Current
License: Permissive (Apache-2.0)

Go 22 Version:Current
License: Permissive (MIT)

C 22 Version:Current
License: Strong Copyleft (GPL-2.0)

C++ 22 Version:Current
License: Strong Copyleft (GPL-3.0)

C++ 22 Version:Current
License: Permissive (Zlib)

Python 22 Version:Current
License: Permissive (MIT)

Jupyter Notebook 22 Version:Current
License: Permissive (Apache-2.0)

Jupyter Notebook 22 Version:Current
License: No License (No License)

Python 22 Version:Current
License: Permissive (MIT)

C++ 22 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 22 Version:Current
License: Permissive (Apache-2.0)

C 22 Version:Current
License: Permissive (MIT)

Jupyter Notebook 22 Version:Current
License: Permissive (Apache-2.0)

Python 22 Version:Current
License: Permissive (MIT)

Java 21 Version:Current
License: Permissive (MIT)

Python 21 Version:Current
License: No License (No License)

Java 21 Version:Current
License: Strong Copyleft (AGPL-3.0)

Python 21 Version:Current
License: No License (No License)

Python 21 Version:Current
License: Permissive (Apache-2.0)

Python 21 Version:Current
License: No License (No License)

Java 21 Version:Current
License: No License (No License)

Python 21 Version:Current
License: Strong Copyleft (AGPL-3.0)

Python 21 Version:Current
License: Permissive (MIT)

Python 21 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 21 Version:Current
License: Permissive (MIT)

Python 21 Version:Current
License: No License (No License)

Python 21 Version:Current
License: Permissive (MIT)

JavaScript 21 Version:Current
License: Permissive (MIT)

JavaScript 21 Version:Current
License: Permissive (MIT)

C# 21 Version:Current
License: Weak Copyleft (LGPL-3.0)

Python 21 Version:Current
License: Permissive (Apache-2.0)

C++ 21 Version:Current
License: Permissive (MIT)

TypeScript 21 Version:Current
License: Permissive (MIT)

TypeScript 21 Version:Current
License: Permissive (MIT)

TypeScript 21 Version:Current
License: Permissive (MIT)

Python 21 Version:Current
License: Permissive (MIT)

Python 21 Version:Current
License: No License (No License)

Python 21 Version:Current
License: No License (No License)

Python 21 Version:Current
License: Permissive (MIT)

Python 21 Version:Current
License: Permissive (BSD-3-Clause)

Kotlin 21 Version:Current
License: Permissive (Apache-2.0)

C 21 Version:Current
License: No License (No License)

C 21 Version:Current
License: Proprietary (Proprietary)

Shell 21 Version:Current
License: No License (No License)

C++ 21 Version:Current
License: No License (No License)

Shell 21 Version:Current
License: No License (No License)