Speech Libraries - Page 26

ITRI-speech-recognition-dataset-generationby khuangaf

Jupyter Notebook 33 Version:Current
License: No License (No License)

Automatic Speech Recognition Dataset Generation

Support

Quality

Security

License

Reuse

SpeechRecognitionAIby viktorvano

Java 33 Version:Current
License: Permissive (Apache-2.0)

Speech recognition AI based on FFNN in Java

Support

Quality

Security

License

Reuse

CIF-PyTorchby MingLunHan

Python 33 Version:Current
License: Permissive (Apache-2.0)

[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).

Support

Quality

Security

License

Reuse

FG-transformer-TTSby b04901014

Python 33 Version:Current
License: Permissive (MIT)

Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.

Support

Quality

Security

License

Reuse

Python 33 Version:Current
License: No License (No License)

Causality Check in Frame-online Speech Separation

Support

Quality

Security

License

Reuse

Python 33 Version:Current
License: No License (No License)

ChatGPT + Google T2S + Google S2T

Support

Quality

Security

License

Reuse

PPSpeechby rishikksh20

Python 32 Version:Current
License: No License (No License)

PPSpeech: Phrase based Parallel End-to-End TTS System

Support

Quality

Security

License

Reuse

Java 32 Version:Current
License: Permissive (MIT)

Java wrapper around the famous sox (sound-exchange) audio processing utility

Support

Quality

Security

License

Reuse

Python 32 Version:Current
License: No License (No License)

Wake-Up-Word Keyword Spotting implemented in Keras

Support

Quality

Security

License

Reuse

RTNetby Andong-Li-speech

Python 32 Version:Current
License: No License (No License)

implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain

Support

Quality

Security

License

Reuse

Python 32 Version:Current
License: No License (No License)

A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'

Support

Quality

Security

License

Reuse

speech-emotion-recognition-exerciseby lmingde

Python 32 Version:Current
License: No License (No License)

2018年7⽉30⽇-8⽉13⽇持续2周的好未来AI训练营中语⾳情感识别营的项目报告

Support

Quality

Security

License

Reuse

vasisualyby Oknolaz

Python 32 Version:Current
License: Strong Copyleft (GPL-3.0)

Vasisualy it's a simple Russian voice assistant written on Python for GNU/Linux, Windows and Android.

Support

Quality

Security

License

Reuse

bingspeech-api-clientby palmerabollo

TypeScript 32 Version:Current
License: Proprietary (Proprietary)

Microsoft Bing Speech API client in node.js

Support

Quality

Security

License

Reuse

go-webrtcvadby maxhawkins

C 32 Version:Current
License: Proprietary (Proprietary)

cgo interface to WebRTC Voice Activity Dectection

Support

Quality

Security

License

Reuse

noise-gateby Michael-F-Bryan

Rust 32 Version:Current
License: Proprietary (Proprietary)

A simple Noise Gate algorithm for splitting an audio stream into chunks based on volume/silence

Support

Quality

Security

License

Reuse

kaldi-brby falabrasil

Shell 32 Version:Current
License: Permissive (MIT)

☕🇧🇷 Scripts para o Kaldi em Português Brasileiro

Support

Quality

Security

License

Reuse

Voicenetby Robofied

Jupyter Notebook 32 Version:Current
License: Permissive (BSD-3-Clause)

Comprehensive Python library for speech and voice.

Support

Quality

Security

License

Reuse

C# 32 Version:Current
License: Permissive (MIT)

.NET library to easily create Voice Command Control feature.

Support

Quality

Security

License

Reuse

carter-voice-assistantby huwprosser

Python 32 Version:Current
License: Permissive (MIT)

An example project showing how to use www.carterapi.com as a voice assistant.

Support

Quality

Security

License

Reuse

vosk-rsby Bear-03

Rust 32 Version:Current
License: Permissive (MIT)

Rust bindings to the Vosk API Speech Recognition library

Support

Quality

Security

License

Reuse

Python 32 Version:Current
License: No License (No License)

A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis

Support

Quality

Security

License

Reuse

RecordDialogby IvanSotelo

Java 31 Version:Current
License: Permissive (MIT)

A Simple Wav audio recorder dialog

Support

Quality

Security

License

Reuse

Python 31 Version:Current
License: No License (No License)

Phase Vocoder In Python

Support

Quality

Security

License

Reuse

deafby yandex

Java 31 Version:Current
License: Proprietary (Proprietary)

Android App for Deaf

Support

Quality

Security

License

Reuse

Jarvisby m4n3dw0lf

Python 31 Version:Current
License: Strong Copyleft (GPL-3.0)

Voice command assistant

Support

Quality

Security

License

Reuse

Translation-Augmented-LibriSpeech-Corpusby alicank

Python 31 Version:Current
License: No License (No License)

Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and contains English utterances (from audiobooks) automatically aligned with French text. Our dataset offers ~236h of speech aligned to translated text.

Support

Quality

Security

License

Reuse

Python 31 Version:Current
License: No License (No License)

Google's TPGST reimplementation.

Support

Quality

Security

License

Reuse

Java 31 Version:Current
License: No License (No License)

program, which helps people to communicate with speech disorders

Support

Quality

Security

License

Reuse

Python 31 Version:Current
License: No License (No License)

Listen, Attend and spell model for E2E ASR. Implementation in Pytorch

Support

Quality

Security

License

Reuse

Python 31 Version:Current
License: Permissive (MIT)

Official Implementation of "Seeing Through Noise: Speaker Separation and Enhancement using Visually-derived Speech", ICASSP 2018.

Support

Quality

Security

License

Reuse

CNTNby candlewill

Python 31 Version:Current
License: No License (No License)

ChiNese Text Normalization (CNTN) tool for Text-to-speech system

Support

Quality

Security

License

Reuse

tUnE.jsby LevyGuy

JavaScript 31 Version:Current
License: No License (No License)

Web Speech recognition grammar POC for webkit using the Levenshtein distance algorithm

Support

Quality

Security

License

Reuse

Swift 31 Version:Current
License: Permissive (Apache-2.0)

Spokestack: give your iOS app a voice interface!

Support

Quality

Security

License

Reuse

Perl 31 Version:Current
License: No License (No License)

THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.

Support

Quality

Security

License

Reuse

STEMMby ictnlp

Python 31 Version:Current
License: Permissive (MIT)

Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

Support

Quality

Security

License

Reuse

neon-tts-plugin-coquiby NeonGeckoCom

Python 31 Version:Current
License: Proprietary (Proprietary)

Coqui AI TTS plugin

Support

Quality

Security

License

Reuse

gpt_chatbotby 1nnovat1on

Python 31 Version:Current
License: No License (No License)

This chatbot lets you use your microphone to communicate with GPT-4. It uses the Windows TTS to respond with a voice. It uses Pinecone to store long term information and retrieves it to create context. API keys for OpenAI and Pinecone required. Tested on Windows

Support

Quality

Security

License

Reuse

subui-speech-assistantby python019

Python 31 Version:Current
License: Permissive (MIT)

Python AI project

Support

Quality

Security

License

Reuse

TypeScript 31 Version:Current
License: Permissive (MIT)

ChatGPT

Support

Quality

Security

License

Reuse

Audio-Speech-To-Sign-Language-Converterby jigargajjar55

HTML 31 Version:Current
License: Permissive (MIT)

A web based application which accepts Audio speech or Text as input and converts it to corresponding Indian Sign Language for impaired of speaking or impaired of hearing and deaf people.

Support

Quality

Security

License

Reuse

Python 30 Version:Current
License: No License (No License)

This Repository includes four different implementations of the Speaker Verification task including the GMM_UBM, Ivector, Deep-Speaker, and voice-vector

Support

Quality

Security

License

Reuse

Java 30 Version:Current
License: No License (No License)

Command line and webapp application for driving Sonos boxes

Support

Quality

Security

License

Reuse

Java 30 Version:Current
License: Strong Copyleft (GPL-3.0)

Android App to translate text conversations, supporting 90 languages with Speech-To-Text and Text-to-Speech features for ease of accessibility.

Support

Quality

Security

License

Reuse

SNR-Based-Progressive-Learning-of-Deep-Neural-Network-for-Speech-Enhancementby haoxiangsnr

Python 30 Version:Current
License: Permissive (MIT)

Implementation of the paper "SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement."

Support

Quality

Security

License

Reuse

Python 30 Version:Current
License: No License (No License)

The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"

Support

Quality

Security

License

Reuse

DACSby vikolss

Python 30 Version:Current
License: Permissive (MIT)

Code from the paper "DACS: Domain Adaptation via Cross-domain Mixed Sampling"

Support

Quality

Security

License

Reuse

LAS-SpeechRecognitionby PengdaLiu

Python 30 Version:Current
License: No License (No License)

Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).

Support

Quality

Security

License

Reuse

Python 30 Version:Current
License: Permissive (Apache-2.0)

A collection of useful tools for handling speech recognition data

Support

Quality

Security

License

Reuse

pamboxby achabotl

Python 30 Version:Current
License: Permissive (BSD-3-Clause)

Python auditory modeling toolbox.

Support

Quality

Security

License

Reuse

ITRI-speech-recognition-dataset-generationby khuangaf

Automatic Speech Recognition Dataset Generation

Jupyter Notebook

Updated: 3 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

SpeechRecognitionAIby viktorvano

Speech recognition AI based on FFNN in Java

Java

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

CIF-PyTorchby MingLunHan

[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).

Python

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

FG-transformer-TTSby b04901014

Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.

Python

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

CausalityCheckby zqwang7

Causality Check in Frame-online Speech Separation

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

chatGPT_Talkingby ch-tseng

ChatGPT + Google T2S + Google S2T

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

PPSpeechby rishikksh20

PPSpeech: Phrase based Parallel End-to-End TTS System

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

sox-wrapper-javaby corballis

Java wrapper around the famous sox (sound-exchange) audio processing utility

Java

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

keyword-spottingby rajathkmp

Wake-Up-Word Keyword Spotting implemented in Keras

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

RTNetby Andong-Li-speech

implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

localatt_emorecogby gogyzzz

A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

speech-emotion-recognition-exerciseby lmingde

2018年7⽉30⽇-8⽉13⽇持续2周的好未来AI训练营中语⾳情感识别营的项目报告

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

vasisualyby Oknolaz

Vasisualy it's a simple Russian voice assistant written on Python for GNU/Linux, Windows and Android.

Python

Updated: 3 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

bingspeech-api-clientby palmerabollo

Microsoft Bing Speech API client in node.js

TypeScript

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

go-webrtcvadby maxhawkins

cgo interface to WebRTC Voice Activity Dectection

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

noise-gateby Michael-F-Bryan

A simple Noise Gate algorithm for splitting an audio stream into chunks based on volume/silence

Rust

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

kaldi-brby falabrasil

☕🇧🇷 Scripts para o Kaldi em Português Brasileiro

Shell

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Voicenetby Robofied

Comprehensive Python library for speech and voice.

Jupyter Notebook

Updated: 4 y ago

License: Permissive (BSD-3-Clause)

Support

Quality

Security

License

Reuse

VoiceNET.Libraryby nhannt201

.NET library to easily create Voice Command Control feature.

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

carter-voice-assistantby huwprosser

An example project showing how to use www.carterapi.com as a voice assistant.

Python

Updated: 1 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

vosk-rsby Bear-03

Rust bindings to the Vosk API Speech Recognition library

Rust

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

PPG-GradVCby seahore

A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis

Python

Updated: 1 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

RecordDialogby IvanSotelo

A Simple Wav audio recorder dialog

Java

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

phasevocoderby haoyu987

Phase Vocoder In Python

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

deafby yandex

Android App for Deaf

Java

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

Jarvisby m4n3dw0lf

Voice command assistant

Python

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

Translation-Augmented-LibriSpeech-Corpusby alicank

Large scale (>200h) and publicly available read audio book corpus. This corpus is an augmentation of LibriSpeech ASR Corpus (1000h) and contains English utterances (from audiobooks) automatically aligned with French text. Our dataset offers ~236h of speech aligned to translated text.

Python

Updated: 3 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

TPGST-Tacotronby Yangyangii

Google's TPGST reimplementation.

Python

Updated: 3 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

linkatype-androidby linkasu

program, which helps people to communicate with speech disorders

Java

Updated: 3 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

las-pytorchby jiwidi

Listen, Attend and spell model for E2E ASR. Implementation in Pytorch

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

cocktail-partyby avivga

Official Implementation of "Seeing Through Noise: Speaker Separation and Enhancement using Visually-derived Speech", ICASSP 2018.

Python

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

CNTNby candlewill

ChiNese Text Normalization (CNTN) tool for Text-to-speech system

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

tUnE.jsby LevyGuy

Web Speech recognition grammar POC for webkit using the Levenshtein distance algorithm

JavaScript

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

spokestack-iosby spokestack

Spokestack: give your iOS app a voice interface!

Swift

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

theano-kaldi-rnnby mravanelli

THEANO-KALDI-RNNs is a project implementing various Recurrent Neural Networks (RNNs) for RNN-HMM speech recognition. The Theano Code is coupled with the Kaldi decoder.

Perl

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

STEMMby ictnlp

Code for ACL 2022 main conference paper "STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation".

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

neon-tts-plugin-coquiby NeonGeckoCom

Coqui AI TTS plugin

Python

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

gpt_chatbotby 1nnovat1on

This chatbot lets you use your microphone to communicate with GPT-4. It uses the Windows TTS to respond with a voice. It uses Pinecone to store long term information and retrieves it to create context. API keys for OpenAI and Pinecone required. Tested on Windows

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

subui-speech-assistantby python019

Python AI project

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

chatgpt-asr-ttsby liuw5367

ChatGPT

TypeScript

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Audio-Speech-To-Sign-Language-Converterby jigargajjar55

A web based application which accepts Audio speech or Text as input and converts it to corresponding Indian Sign Language for impaired of speaking or impaired of hearing and deaf people.

HTML

Updated: 1 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Master-Voice_Printsby prajual

This Repository includes four different implementations of the Speaker Verification task including the GMM_UBM, Ivector, Deep-Speaker, and voice-vector

Python

Updated: 3 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

sonos-javaby SR-G

Command line and webapp application for driving Sonos boxes

Java

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

TranslateAppby apaar97

Android App to translate text conversations, supporting 90 languages with Speech-To-Text and Text-to-Speech features for ease of accessibility.

Java

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

SNR-Based-Progressive-Learning-of-Deep-Neural-Network-for-Speech-Enhancementby haoxiangsnr

Implementation of the paper "SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement."

Python

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

SemanticMaskby MarkWuNLP

The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"

Python

Updated: 3 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

DACSby vikolss

Code from the paper "DACS: Domain Adaptation via Cross-domain Mixed Sampling"

Python

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

LAS-SpeechRecognitionby PengdaLiu

Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

greenkey-asrtoolkitby finos

A collection of useful tools for handling speech recognition data

Python

Updated: 3 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

pamboxby achabotl

Python auditory modeling toolbox.

Python

Updated: 4 y ago

License: Permissive (BSD-3-Clause)

Support

Quality

Security

License

Reuse

Open Weaver – Develop Applications Faster with Open Source

Terms
Privacy policy

Speech Libraries - Page 26

ITRI-speech-recognition-dataset-generationby khuangaf

Jupyter Notebook 33 Version:Current License: No License (No License)

Automatic Speech Recognition Dataset Generation

SpeechRecognitionAIby viktorvano

Java 33 Version:Current License: Permissive (Apache-2.0)

Speech recognition AI based on FFNN in Java

CIF-PyTorchby MingLunHan

Python 33 Version:Current License: Permissive (Apache-2.0)

[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).

FG-transformer-TTSby b04901014

Python 33 Version:Current License: Permissive (MIT)

Official implementation for the paper Fine-grained style control in transformer-based text-to-speech synthesis.

CausalityCheckby zqwang7

Python 33 Version:Current License: No License (No License)

Causality Check in Frame-online Speech Separation

chatGPT_Talkingby ch-tseng

Python 33 Version:Current License: No License (No License)

ChatGPT + Google T2S + Google S2T

PPSpeechby rishikksh20

Python 32 Version:Current License: No License (No License)

PPSpeech: Phrase based Parallel End-to-End TTS System

sox-wrapper-javaby corballis

Java 32 Version:Current License: Permissive (MIT)

Java wrapper around the famous sox (sound-exchange) audio processing utility

keyword-spottingby rajathkmp

Python 32 Version:Current License: No License (No License)

Wake-Up-Word Keyword Spotting implemented in Keras

RTNetby Andong-Li-speech

Python 32 Version:Current License: No License (No License)

implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain

localatt_emorecogby gogyzzz

Python 32 Version:Current License: No License (No License)

A Pytorch implementation of 'AUTOMATIC SPEECH EMOTION RECOGNITION USING RECURRENT NEURAL NETWORKS WITH LOCAL ATTENTION'

speech-emotion-recognition-exerciseby lmingde

Python 32 Version:Current License: No License (No License)

2018年7⽉30⽇-8⽉13⽇持续2周的好未来AI训练营中语⾳情感识别营的项目报告

vasisualyby Oknolaz

Python 32 Version:Current License: Strong Copyleft (GPL-3.0)

Vasisualy it's a simple Russian voice assistant written on Python for GNU/Linux, Windows and Android.

bingspeech-api-clientby palmerabollo

TypeScript 32 Version:Current License: Proprietary (Proprietary)

Microsoft Bing Speech API client in node.js

go-webrtcvadby maxhawkins

C 32 Version:Current License: Proprietary (Proprietary)

cgo interface to WebRTC Voice Activity Dectection

noise-gateby Michael-F-Bryan

Rust 32 Version:Current License: Proprietary (Proprietary)

A simple Noise Gate algorithm for splitting an audio stream into chunks based on volume/silence

kaldi-brby falabrasil

Shell 32 Version:Current License: Permissive (MIT)

☕🇧🇷 Scripts para o Kaldi em Português Brasileiro

Voicenetby Robofied

Jupyter Notebook 32 Version:Current License: Permissive (BSD-3-Clause)

Comprehensive Python library for speech and voice.

VoiceNET.Libraryby nhannt201

C# 32 Version:Current License: Permissive (MIT)

.NET library to easily create Voice Command Control feature.

carter-voice-assistantby huwprosser

Python 32 Version:Current License: Permissive (MIT)

An example project showing how to use www.carterapi.com as a voice assistant.

vosk-rsby Bear-03

Rust 32 Version:Current License: Permissive (MIT)

Rust bindings to the Vosk API Speech Recognition library

PPG-GradVCby seahore

Python 32 Version:Current License: No License (No License)

A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis

RecordDialogby IvanSotelo

Java 31 Version:Current License: Permissive (MIT)

A Simple Wav audio recorder dialog

phasevocoderby haoyu987

Python 31 Version:Current License: No License (No License)

Phase Vocoder In Python

deafby yandex

Java 31 Version:Current License: Proprietary (Proprietary)

Android App for Deaf

Jarvisby m4n3dw0lf

Python 31 Version:Current License: Strong Copyleft (GPL-3.0)

Voice command assistant

Translation-Augmented-LibriSpeech-Corpusby alicank

Jupyter Notebook 33 Version:Current
License: No License (No License)

Java 33 Version:Current
License: Permissive (Apache-2.0)

Python 33 Version:Current
License: Permissive (Apache-2.0)

Python 33 Version:Current
License: Permissive (MIT)

Python 33 Version:Current
License: No License (No License)

Python 33 Version:Current
License: No License (No License)

Python 32 Version:Current
License: No License (No License)

Java 32 Version:Current
License: Permissive (MIT)

Python 32 Version:Current
License: No License (No License)

Python 32 Version:Current
License: No License (No License)

Python 32 Version:Current
License: No License (No License)

Python 32 Version:Current
License: No License (No License)

Python 32 Version:Current
License: Strong Copyleft (GPL-3.0)

TypeScript 32 Version:Current
License: Proprietary (Proprietary)

C 32 Version:Current
License: Proprietary (Proprietary)

Rust 32 Version:Current
License: Proprietary (Proprietary)

Shell 32 Version:Current
License: Permissive (MIT)

Jupyter Notebook 32 Version:Current
License: Permissive (BSD-3-Clause)

C# 32 Version:Current
License: Permissive (MIT)

Python 32 Version:Current
License: Permissive (MIT)

Rust 32 Version:Current
License: Permissive (MIT)

Python 32 Version:Current
License: No License (No License)

Java 31 Version:Current
License: Permissive (MIT)

Python 31 Version:Current
License: No License (No License)

Java 31 Version:Current
License: Proprietary (Proprietary)

Python 31 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 31 Version:Current
License: No License (No License)

Python 31 Version:Current
License: No License (No License)

Java 31 Version:Current
License: No License (No License)

Python 31 Version:Current
License: No License (No License)

Python 31 Version:Current
License: Permissive (MIT)

Python 31 Version:Current
License: No License (No License)

JavaScript 31 Version:Current
License: No License (No License)

Swift 31 Version:Current
License: Permissive (Apache-2.0)

Perl 31 Version:Current
License: No License (No License)

Python 31 Version:Current
License: Permissive (MIT)

Python 31 Version:Current
License: Proprietary (Proprietary)

Python 31 Version:Current
License: No License (No License)

Python 31 Version:Current
License: Permissive (MIT)

TypeScript 31 Version:Current
License: Permissive (MIT)

HTML 31 Version:Current
License: Permissive (MIT)

Python 30 Version:Current
License: No License (No License)

Java 30 Version:Current
License: No License (No License)

Java 30 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 30 Version:Current
License: Permissive (MIT)

Python 30 Version:Current
License: No License (No License)

Python 30 Version:Current
License: Permissive (MIT)

Python 30 Version:Current
License: No License (No License)

Python 30 Version:Current
License: Permissive (Apache-2.0)

Python 30 Version:Current
License: Permissive (BSD-3-Clause)