Speech Libraries - Page 22

deepgram-node-sdk by deepgram

TypeScript 46 Version: Current
License: Permissive (MIT)
Updated: 1 y ago

Official JavaScript SDK for Deepgram's automated speech recognition APIs.

translate by apaar97

Java 46 Version: Current
License: Strong Copyleft (GPL-3.0)
Updated: 2 y ago

Android app to translate text conversations, supporting 90+ languages with speech-to-text and text-to-speech features for accessibility.

end2endASR by cdyangbo

Python 45 Version: Current
License: No License (No License)
Updated: 4 y ago

Implementation of an end-to-end ASR algorithm with TensorFlow.

se_relativisticgan by deepakbaby

Python 45 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

Keras framework for speech enhancement using relativistic GANs.

Extended_VQVAE by nii-yamagishilab

Python 45 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

spokestack-android by spokestack

Java 45 Version: Current
License: Permissive (Apache-2.0)
Updated: 3 y ago

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!

deepspeech by MyrtleSoftware

Python 45 Version: Current
License: Proprietary (Proprietary)
Updated: 4 y ago

A PyTorch implementation of DeepSpeech and DeepSpeech2.

specAugment by Kyubyong

Python 45 Version: Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

Tensor2tensor experiment with SpecAugment.

Cognitive-SpeakerRecognition-Android by microsoft

Java 45 Version: Current
License: Proprietary (Proprietary)
Updated: 4 y ago

Android SDK for the Microsoft Speaker Recognition API, part of Cognitive Services.

Tacotron2 by kaituoxu

Python 45 Version: Current
License: No License (No License)
Updated: 4 y ago

A PyTorch implementation of Tacotron2, an end-to-end text-to-speech (TTS) system described in "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions".

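jarvis by edisonwong520

Python 45 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

Chinese-language Jarvis voice assistant (an enhanced Siri for the PC: automatic music playback and download, weather reports, directions and navigation, timers, search, and more).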

dialectID_e2e by swshon

Python 45 Version: Current
License: No License (No License)
Updated: 3 y ago

End-to-end dialect identification using a convolutional neural network.

mikutter by katsyoshi

Ruby 45 Version: Current
License: Permissive (MIT)
Updated: 6 y ago

"My" mikutter mirror. Please check the official repository; this repository does not accept pull requests.

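VoiceDictation by MuGuiLin

JavaScript 45 Version: Current
License: No License (No License)
Updated: 2 y ago

Xunfei (iFLYTEK) voice dictation WebAPI: converts speech (≤60 seconds) into the corresponding text so that machines can "understand" human language, in effect giving the machine "ears" and the ability to "hear".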

Template10.Validation by Windows-XAML

C# 45 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

smart_app_framework by sberdevices

Python 45 Version: Current
License: Proprietary (Proprietary)
Updated: 3 y ago

SmartApp Framework for building skills for the "Salute" family of virtual assistants in Python.

formant-analyzer by fulldecent

Swift 45 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

iOS application for finding formants in spoken sounds.

khalzam by kisasexypantera94

Go 45 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

Simple audio recognition library.

ofxSpeech by latrokles

C++ 45 Version: Current
License: No License (No License)
Updated: 6 y ago

[abandoned] Speech recognition and synthesis addon for OpenFrameworks.

vosk-rs by wzhd

Rust 45 Version: Current
License: Permissive (MIT)
Updated: 3 y ago

State-of-the-art voice recognition for Rust using vosk. View demo: https://fars.ee/F9-b.mp4

mecab-rs by tsurai

Rust 45 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Safe Rust bindings for MeCab, a part-of-speech and morphological analyzer library.

Audio-Source-Separation by ShichengChen

Jupyter Notebook 45 Version: Current
License: Permissive (MIT)
Updated: 3 y ago

WaveNet for the separation of audio sources.

Python-Sound-Tool by a-n-rose

Jupyter Notebook 45 Version: Current
License: Proprietary (Proprietary)
Updated: 3 y ago

SoundPy (alpha stage) is a research-based Python package for speech and sound. Applications include deep learning, filtering, speech enhancement, audio augmentation, feature extraction and visualization, dataset and audio file conversion, and beyond.

ctc_decoder by Slyne

C++ 45 Version: Current
License: No License (No License)
Updated: 1 y ago

A CTC decoder for both online and offline ASR models.

voicekit-examples by Tinkoff

C# 45 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

Examples of how to use Tinkoff Voicekit.

audioWhisper by Awexander

Python 45 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Listen to any audio stream on your machine and print out the transcribed or translated audio.

alqalign by xinjli

Python 45 Version: Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

Multilingual speech aligner.

chatgpt-voice-assistant by jakecyr

Python 45 Version: Current
License: No License (No License)
Updated: 2 y ago

A chatbot that takes speech-to-text input, sends the text to OpenAI's ChatGPT text generation model, and speaks the response using text-to-speech.

knn-vc by bshall

Python 45 Version: Current
License: Proprietary (Proprietary)
Updated: 1 y ago

Voice Conversion With Just Nearest Neighbors.

vc-lm by nilboy

Python 45 Version: Current
License: No License (No License)
Updated: 1 y ago

Converts any person's voice timbre into thousands of different timbres.

sms_wsj by fgnt

Python 44 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition.

FastSpeech2 by xcmyz

Python 44 Version: Current
License: No License (No License)
Updated: 4 y ago

An implementation of FastSpeech2 based on PyTorch.

pytorch-ivectors by vvestman

Python 44 Version: Current
License: No License (No License)
Updated: 4 y ago

GPU-accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An example script is provided for VoxCeleb data.

rcaudio by mhy12345

Python 44 Version: Current
License: Permissive (MIT)
Updated: 2 y ago

Real-time audio analysis library supporting acoustic feature extraction and real-time beat detection.

AcousticRakeReceiver by LCAV

Python 44 Version: Current
License: No License (No License)
Updated: 3 y ago

The acoustic rake receiver, a microphone beamformer that uses echoes to improve noise and interference suppression. Python code to reproduce all the results from "Raking the Cocktail Party" by Ivan Dokmanic, Robin Scheibler, and Martin Vetterli.

AdaIN-VC by cyhuang-tw

Python 44 Version: Current
License: No License (No License)
Updated: 3 y ago

An unofficial implementation of the paper "One-shot Voice Conversion by Separating Speaker and Content Representations with Instance Normalization".

myG2P by ye-kyaw-thu

Perl 44 Version: Current
License: No License (No License)
Updated: 2 y ago

Myanmar (Burmese) language grapheme-to-phoneme (myG2P) conversion dictionary for speech recognition (ASR) and speech synthesis (TTS).

speaker_recognition_GMM_UBM by scelesticsiva

Jupyter Notebook 44 Version: Current
License: No License (No License)
Updated: 3 y ago

A GMM-UBM speaker recognition system for use in an Android application that helps monitor patients with schizophrenia.

translate-Red-Deat-Redemption-2 by IndiMops

Python 44 Version: Current
License: No License (No License)
Updated: 2 y ago

Ukrainian localization for the game Red Dead Redemption 2. Feel like a cowboy, 100 percent.

speech-rest-api by askrella

Python 44 Version: Current
License: No License (No License)
Updated: 2 y ago

Transcription and TTS REST API (OpenAI Whisper, Speechbrain).

nltk-maxent-pos-tagger by arne-cl

Python 43 Version: Current
License: No License (No License)
Updated: 4 y ago

Maximum-entropy-based part-of-speech tagger for NLTK.

tts by DeepHorizons

Python 43 Version: Current
License: No License (No License)
Updated: 4 y ago

A simple Python TTS wrapper.

AiVoice by candlewill

Python 43 Version: Current
License: No License (No License)
Updated: 4 y ago

Deep CNN networks for speech synthesis.

Goodness-of-Pronunciation by sweekarsud

Python 43 Version: Current
License: No License (No License)
Updated: 4 y ago

asr by biemster

Python 43 Version: Current
License: No License (No License)
Updated: 4 y ago

Android offline speech recognition running natively on a PC.

deepSpeech2 by yao-matrix

Python 43 Version: Current
License: Permissive (BSD-3-Clause)
Updated: 3 y ago

End-to-end speech recognition using TensorFlow.

AVSR-Deep-Speech by pandeydivesh15

Python 43 Version: Current
License: Strong Copyleft (GPL-2.0)
Updated: 4 y ago

Google Summer of Code 2017 project: development of a speech recognition module for Red Hen Lab.

ASAM by jacoxu

Python 43 Version: Current
License: No License (No License)
Updated: 3 y ago

Code and dataset for the paper "Modeling Attention and Memory for Auditory Selection in a Cocktail Party Environment" (AAAI 2018).

PNCC by supikiti

Python 43 Version: Current
License: Permissive (MIT)
Updated: 4 y ago

An implementation of Power Normalized Cepstral Coefficients (PNCC).

vq-vae by Kyubyong

Python 43 Version: Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

A TensorFlow implementation of VQ-VAE speaker conversion.


