Speech Libraries - Page 25

wavencoderby shangeth

Python 36 Version:Current
License: Permissive (MIT)

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

Support

Quality

Security

License

Reuse

pywsj0-mixby mpariente

Python 36 Version:Current
License: Permissive (MIT)

wsj0-{2, 3, 4, 5} mix generation scripts, in Python.

Support

Quality

Security

License

Reuse

Go 36 Version:Current
License: Permissive (MIT)

Go wrapper for Kitt-AI's snowboy audio detection library.

Support

Quality

Security

License

Reuse

wattsby almazan

C 36 Version:Current
License: Permissive (MIT)

Word Spotting and Recognition with Embedded Attributes

Support

Quality

Security

License

Reuse

TypeScript 36 Version:Current
License: Permissive (MIT)

ChatGPT web application, use OpenAI official API. ChatGPT 网页应用，支持多对话、海量提示词、PWA、ASR、TTS

Support

Quality

Security

License

Reuse

Python 35 Version:Current
License: Permissive (Apache-2.0)

A python IO interface for data accessing in kaldi

Support

Quality

Security

License

Reuse

lstm_ctcby mobvoi

Python 35 Version:Current
License: Proprietary (Proprietary)

LSTM CTC End2End Speech Recognition.

Support

Quality

Security

License

Reuse

Python 35 Version:Current
License: Permissive (MIT)

CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion

Support

Quality

Security

License

Reuse

Python 35 Version:Current
License: No License (No License)

Bidirectional dynamic RNN + CTC for phoneme recognition

Support

Quality

Security

License

Reuse

Python 35 Version:Current
License: No License (No License)

Voice conversion (VC) investigation using three variants of VAE

Support

Quality

Security

License

Reuse

QPPWGby bigpon

Python 35 Version:Current
License: Permissive (MIT)

Quasi-Periodic Parallel WaveGAN Pytorch implementation

Support

Quality

Security

License

Reuse

ProMoby timmahrt

Python 35 Version:Current
License: Permissive (MIT)

Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.

Support

Quality

Security

License

Reuse

Python 35 Version:Current
License: Permissive (MIT)

self-supervised domain adaptation

Support

Quality

Security

License

Reuse

vGraphby fanyun-sun

Python 35 Version:Current
License: No License (No License)

Code for NeurIPS paper "vGraph: A Generative Model for Joint CommunityDetection and Node Representation Learning"

Support

Quality

Security

License

Reuse

yoruba-textby Niger-Volta-LTI

Python 35 Version:Current
License: Strong Copyleft (GPL-3.0)

Yorùbá language training text for NLP, ASR and TTS tasks

Support

Quality

Security

License

Reuse

Deep-Encoder-Decoder-Conv-TasNetby JusperLee

Python 35 Version:Current
License: No License (No License)

A PyTorch implementation of " AN EMPIRICAL STUDY OF CONV-TASNET "

Support

Quality

Security

License

Reuse

PHP 35 Version:Current
License: Strong Copyleft (AGPL-3.0)

how to use the Google Cloud Speech API to transcribe audio/video files.

Support

Quality

Security

License

Reuse

TTS-dataset-toolsby youmebangbang

Python 35 Version:Current
License: Permissive (MIT)

Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to perform diarization and transcription or aeneas to force align text to audio.

Support

Quality

Security

License

Reuse

salutejsby sberdevices

TypeScript 35 Version:Current
License: Proprietary (Proprietary)

SmartApp Framework для создания навыков семейства Виртуальных Ассистентов "Салют" на языке JavaScript

Support

Quality

Security

License

Reuse

THE-2020-PERSONALIZED-VOICE-TRIGGER-CHALLENGE-BASELINE-SYSTEMby lenovo-voice

Shell 35 Version:Current
License: No License (No License)

Support

Quality

Security

License

Reuse

Sinsy-Remixby hyperzlib

C++ 35 Version:Current
License: Proprietary (Proprietary)

The HMM-Based Singing Voice Syntheis System Remix "Sinsy-r"

Support

Quality

Security

License

Reuse

ctc_beam_search_lmby Sundy1219

C++ 35 Version:Current
License: No License (No License)

CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统

Support

Quality

Security

License

Reuse

sndby dskinner

Go 35 Version:Current
License: Permissive (BSD-3-Clause)

Package snd provides methods and types for sound processing and synthesis.

Support

Quality

Security

License

Reuse

Voice-conversion-evaluationby tzuhsien

Python 35 Version:Current
License: No License (No License)

An evaluation toolkit for voice conversion models.

Support

Quality

Security

License

Reuse

REPET-Matlabby zafarrafii

Jupyter Notebook 35 Version:Current
License: No License (No License)

REPeating Pattern Extraction Technique (REPET) in Matlab for audio source separation: original REPET, REPET extended, adaptive REPET, REPET-SIM, REPET-SIM online

Support

Quality

Security

License

Reuse

ssl_speech_restorationby Takaaki-Saeki

Python 35 Version:Current
License: Permissive (MIT)

SelfRemaster: SSL Speech Restoration

Support

Quality

Security

License

Reuse

speech-to-intent-datasetby skit-ai

Python 35 Version:Current
License: Proprietary (Proprietary)

Dataset Release for Intent Classification from Speech

Support

Quality

Security

License

Reuse

Attention-Is-All-You-Need-In-Speech-Separationby Zhongyang-debug

Python 35 Version:Current
License: No License (No License)

Speech Separation

Support

Quality

Security

License

Reuse

Python 34 Version:Current
License: No License (No License)

A Chainer implementation of Fast WaveNet(mel-spectrogram vocoder).

Support

Quality

Security

License

Reuse

Python 34 Version:Current
License: Permissive (MIT)

Rev AI Python SDK

Support

Quality

Security

License

Reuse

Python 34 Version:Current
License: Permissive (MIT)

An unofficial implementation of https://arxiv.org/abs/2005.05106

Support

Quality

Security

License

Reuse

ivona-nodeby tmanderson

JavaScript 34 Version:Current
License: No License (No License)

Ivona Cloud (via Amazon services) client library for Node

Support

Quality

Security

License

Reuse

gcloud_speech_voice_recorderby taekb

JavaScript 34 Version:Current
License: Permissive (Apache-2.0)

Flask-based web application that records sound (as PCM/WAV) and converts speech to text via Google Cloud Speech API using HTML, JavaScript, and Python

Support

Quality

Security

License

Reuse

voice-activated-microbitby edgeimpulse

C 34 Version:Current
License: Permissive (MIT)

Bleep, bloop, I'm a computer that responds to your voice

Support

Quality

Security

License

Reuse

spectby lennes

HTML 34 Version:Current
License: Strong Copyleft (GPL-3.0)

SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/

Support

Quality

Security

License

Reuse

athenaby LianjiaTech

C++ 34 Version:Current
License: Permissive (Apache-2.0)

An open-source implementation of sequence-to-sequence based speech processing engine

Support

Quality

Security

License

Reuse

2022-DL-Audio-Courseby severilov

Jupyter Notebook 34 Version:Current
License: No License (No License)

Deep Learning Audio Course, 2022

Support

Quality

Security

License

Reuse

C# 34 Version:Current
License: No License (No License)

Automatic Speech Recognition in Unity using Vosk library

Support

Quality

Security

License

Reuse

Python 33 Version:Current
License: Permissive (Apache-2.0)

Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model

Support

Quality

Security

License

Reuse

melosynthby justinsalamon

Python 33 Version:Current
License: Strong Copyleft (GPL-3.0)

Synthesize a continuous pitch sequence

Support

Quality

Security

License

Reuse

morse-speak-demoby googlecreativelab

JavaScript 33 Version:Current
License: Permissive (Apache-2.0)

Text-to-Speech (TTS) demo web app that converts written text into spoken words via Morse code

Support

Quality

Security

License

Reuse

ASRT_SpeechClient_WPFby nl8590687

C# 33 Version:Current
License: Permissive (Apache-2.0)

An Windows WPF client software for ASRT speech recognition system. 一个可用于ASRT语音识别系统的Windows WPF版客户端软件

Support

Quality

Security

License

Reuse

ConferencingSpeech2021by ConferencingSpeech

Python 33 Version:Current
License: Permissive (Apache-2.0)

Conferencing Speech Challenge

Support

Quality

Security

License

Reuse

voce-browserby trabdlkarim

Python 33 Version:Current
License: Strong Copyleft (GPL-3.0)

Voice Controlled Chromium Web Browser

Support

Quality

Security

License

Reuse

JavaScript 33 Version:Current
License: Strong Copyleft (AGPL-3.0)

An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech

Support

Quality

Security

License

Reuse

flow-synthby nwoeanhinnogaehr

Rust 33 Version:Current
License: Permissive (MIT)

UNMAINTAINED A modular digital audio workstation for synthesis, sequencing, live coding, visuals, etc

Support

Quality

Security

License

Reuse

wavutilsby smallmuou

Shell 33 Version:Current
License: No License (No License)

wavutils is a tool set that process wav file

Support

Quality

Security

License

Reuse

Shell 33 Version:Current
License: No License (No License)

Multilingual Grapheme to Phoneme

Support

Quality

Security

License

Reuse

gochromaby go-fingerprint

Go 33 Version:Current
License: Permissive (MIT)

Go bindings and high-level API to acoustic fingerprinting library chromaprint

Support

Quality

Security

License

Reuse

RaspiAsteriskGoogleby rgrokett

Perl 33 Version:Current
License: Strong Copyleft (GPL-3.0)

Integrating Asterisk with Google Assistant Voice Service on a Raspberry Pi Zero using AGI

Support

Quality

Security

License

Reuse

wavencoderby shangeth

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

Python

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

pywsj0-mixby mpariente

wsj0-{2, 3, 4, 5} mix generation scripts, in Python.

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

go-snowboyby brentnd

Go wrapper for Kitt-AI's snowboy audio detection library.

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

wattsby almazan

Word Spotting and Recognition with Embedded Attributes

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

chatgpt-webby liuw5367

ChatGPT web application, use OpenAI official API. ChatGPT 网页应用，支持多对话、海量提示词、PWA、ASR、TTS

TypeScript

Updated: 1 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

kaldi-python-ioby funcwj

A python IO interface for data accessing in kaldi

Python

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

lstm_ctcby mobvoi

LSTM CTC End2End Speech Recognition.

Python

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

CycleGAN-VC2by onejiin

CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion

Python

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

phoneme_ctcby tbornt

Bidirectional dynamic RNN + CTC for phoneme recognition

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

voice-conversionby vsimkus

Voice conversion (VC) investigation using three variants of VAE

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

QPPWGby bigpon

Quasi-Periodic Parallel WaveGAN Pytorch implementation

Python

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

ProMoby timmahrt

Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.

Python

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

self-supervised-daby Jiaolong

self-supervised domain adaptation

Python

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

vGraphby fanyun-sun

Code for NeurIPS paper "vGraph: A Generative Model for Joint CommunityDetection and Node Representation Learning"

Python

Updated: 3 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

yoruba-textby Niger-Volta-LTI

Yorùbá language training text for NLP, ASR and TTS tasks

Python

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

Deep-Encoder-Decoder-Conv-TasNetby JusperLee

A PyTorch implementation of " AN EMPIRICAL STUDY OF CONV-TASNET "

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

speech_to_textby m-nathani

how to use the Google Cloud Speech API to transcribe audio/video files.

PHP

Updated: 4 y ago

License: Strong Copyleft (AGPL-3.0)

Support

Quality

Security

License

Reuse

TTS-dataset-toolsby youmebangbang

Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to perform diarization and transcription or aeneas to force align text to audio.

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

salutejsby sberdevices

SmartApp Framework для создания навыков семейства Виртуальных Ассистентов "Салют" на языке JavaScript

TypeScript

Updated: 3 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

THE-2020-PERSONALIZED-VOICE-TRIGGER-CHALLENGE-BASELINE-SYSTEMby lenovo-voice

Shell

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

Sinsy-Remixby hyperzlib

The HMM-Based Singing Voice Syntheis System Remix "Sinsy-r"

C++

Updated: 4 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

ctc_beam_search_lmby Sundy1219

CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统

C++

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

sndby dskinner

Package snd provides methods and types for sound processing and synthesis.

Updated: 4 y ago

License: Permissive (BSD-3-Clause)

Support

Quality

Security

License

Reuse

Voice-conversion-evaluationby tzuhsien

An evaluation toolkit for voice conversion models.

Python

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

REPET-Matlabby zafarrafii

REPeating Pattern Extraction Technique (REPET) in Matlab for audio source separation: original REPET, REPET extended, adaptive REPET, REPET-SIM, REPET-SIM online

Jupyter Notebook

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

ssl_speech_restorationby Takaaki-Saeki

SelfRemaster: SSL Speech Restoration

Python

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

speech-to-intent-datasetby skit-ai

Dataset Release for Intent Classification from Speech

Python

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

Attention-Is-All-You-Need-In-Speech-Separationby Zhongyang-debug

Speech Separation

Python

Updated: 1 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

chainer-Fast-WaveNetby dhgrs

A Chainer implementation of Fast WaveNet(mel-spectrogram vocoder).

Python

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

revai-python-sdkby revdotcom

Rev AI Python SDK

Python

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

multiband_melganby AppleHolic

An unofficial implementation of https://arxiv.org/abs/2005.05106

Python

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

ivona-nodeby tmanderson

Ivona Cloud (via Amazon services) client library for Node

JavaScript

Updated: 5 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

gcloud_speech_voice_recorderby taekb

Flask-based web application that records sound (as PCM/WAV) and converts speech to text via Google Cloud Speech API using HTML, JavaScript, and Python

JavaScript

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

voice-activated-microbitby edgeimpulse

Bleep, bloop, I'm a computer that responds to your voice

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

spectby lennes

SpeCT - Speech Corpus Toolkit for Praat. Documentation: https://lennes.github.io/spect/

HTML

Updated: 3 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

athenaby LianjiaTech

An open-source implementation of sequence-to-sequence based speech processing engine

C++

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

2022-DL-Audio-Courseby severilov

Deep Learning Audio Course, 2022

Jupyter Notebook

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

vosk-unity-asrby alphacep

Automatic Speech Recognition in Unity using Vosk library

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

kaldi-adapt-lmby gooofy

Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model

Python

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

melosynthby justinsalamon

Synthesize a continuous pitch sequence

Python

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

morse-speak-demoby googlecreativelab

Text-to-Speech (TTS) demo web app that converts written text into spoken words via Morse code

JavaScript

Updated: 3 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

ASRT_SpeechClient_WPFby nl8590687

An Windows WPF client software for ASRT speech recognition system. 一个可用于ASRT语音识别系统的Windows WPF版客户端软件

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

ConferencingSpeech2021by ConferencingSpeech

Conferencing Speech Challenge

Python

Updated: 3 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

voce-browserby trabdlkarim

Voice Controlled Chromium Web Browser

Python

Updated: 3 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

scriptionby smlum

An editor for speech-to-text transcripts such as AWS Transcribe and Mozilla DeepSpeech

JavaScript

Updated: 3 y ago

License: Strong Copyleft (AGPL-3.0)

Support

Quality

Security

License

Reuse

flow-synthby nwoeanhinnogaehr

*UNMAINTAINED* A modular digital audio workstation for synthesis, sequencing, live coding, visuals, etc

Rust

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

wavutilsby smallmuou

wavutils is a tool set that process wav file

Shell

Updated: 4 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

multilingual-g2pby jcsilva

Multilingual Grapheme to Phoneme

Shell

Updated: 3 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

gochromaby go-fingerprint

Go bindings and high-level API to acoustic fingerprinting library chromaprint

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

RaspiAsteriskGoogleby rgrokett

Integrating Asterisk with Google Assistant Voice Service on a Raspberry Pi Zero using AGI

Perl

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

Open Weaver – Develop Applications Faster with Open Source

Terms
Privacy policy

Speech Libraries - Page 25

wavencoderby shangeth

Python 36 Version:Current License: Permissive (MIT)

WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.

pywsj0-mixby mpariente

Python 36 Version:Current License: Permissive (MIT)

wsj0-{2, 3, 4, 5} mix generation scripts, in Python.

go-snowboyby brentnd

Go 36 Version:Current License: Permissive (MIT)

Go wrapper for Kitt-AI's snowboy audio detection library.

wattsby almazan

C 36 Version:Current License: Permissive (MIT)

Word Spotting and Recognition with Embedded Attributes

chatgpt-webby liuw5367

TypeScript 36 Version:Current License: Permissive (MIT)

ChatGPT web application, use OpenAI official API. ChatGPT 网页应用，支持多对话、海量提示词、PWA、ASR、TTS

kaldi-python-ioby funcwj

Python 35 Version:Current License: Permissive (Apache-2.0)

A python IO interface for data accessing in kaldi

lstm_ctcby mobvoi

Python 35 Version:Current License: Proprietary (Proprietary)

LSTM CTC End2End Speech Recognition.

CycleGAN-VC2by onejiin

Python 35 Version:Current License: Permissive (MIT)

CycleGAN-VC2: Improved CycleGAN-based Non-parallel Voice Conversion

phoneme_ctcby tbornt

Python 35 Version:Current License: No License (No License)

Bidirectional dynamic RNN + CTC for phoneme recognition

voice-conversionby vsimkus

Python 35 Version:Current License: No License (No License)

Voice conversion (VC) investigation using three variants of VAE

QPPWGby bigpon

Python 35 Version:Current License: Permissive (MIT)

Quasi-Periodic Parallel WaveGAN Pytorch implementation

ProMoby timmahrt

Python 35 Version:Current License: Permissive (MIT)

Prososdy Morph: A python library for manipulating pitch and duration in an algorithmic way, for resynthesizing speech.

self-supervised-daby Jiaolong

Python 35 Version:Current License: Permissive (MIT)

self-supervised domain adaptation

vGraphby fanyun-sun

Python 35 Version:Current License: No License (No License)

Code for NeurIPS paper "vGraph: A Generative Model for Joint CommunityDetection and Node Representation Learning"

yoruba-textby Niger-Volta-LTI

Python 35 Version:Current License: Strong Copyleft (GPL-3.0)

Yorùbá language training text for NLP, ASR and TTS tasks

Deep-Encoder-Decoder-Conv-TasNetby JusperLee

Python 35 Version:Current License: No License (No License)

A PyTorch implementation of " AN EMPIRICAL STUDY OF CONV-TASNET "

speech_to_textby m-nathani

PHP 35 Version:Current License: Strong Copyleft (AGPL-3.0)

how to use the Google Cloud Speech API to transcribe audio/video files.

TTS-dataset-toolsby youmebangbang

Python 35 Version:Current License: Permissive (MIT)

Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to perform diarization and transcription or aeneas to force align text to audio.

salutejsby sberdevices

TypeScript 35 Version:Current License: Proprietary (Proprietary)

SmartApp Framework для создания навыков семейства Виртуальных Ассистентов "Салют" на языке JavaScript

THE-2020-PERSONALIZED-VOICE-TRIGGER-CHALLENGE-BASELINE-SYSTEMby lenovo-voice

Shell 35 Version:Current License: No License (No License)

Sinsy-Remixby hyperzlib

C++ 35 Version:Current License: Proprietary (Proprietary)

The HMM-Based Singing Voice Syntheis System Remix "Sinsy-r"

ctc_beam_search_lmby Sundy1219

C++ 35 Version:Current License: No License (No License)

CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统

sndby dskinner

Go 35 Version:Current License: Permissive (BSD-3-Clause)

Package snd provides methods and types for sound processing and synthesis.

Voice-conversion-evaluationby tzuhsien

Python 35 Version:Current License: No License (No License)

An evaluation toolkit for voice conversion models.

REPET-Matlabby zafarrafii

Jupyter Notebook 35 Version:Current License: No License (No License)

REPeating Pattern Extraction Technique (REPET) in Matlab for audio source separation: original REPET, REPET extended, adaptive REPET, REPET-SIM, REPET-SIM online

ssl_speech_restorationby Takaaki-Saeki

Python 35 Version:Current License: Permissive (MIT)

SelfRemaster: SSL Speech Restoration

speech-to-intent-datasetby skit-ai

Python 35 Version:Current License: Proprietary (Proprietary)

Python 36 Version:Current
License: Permissive (MIT)

Python 36 Version:Current
License: Permissive (MIT)

Go 36 Version:Current
License: Permissive (MIT)

C 36 Version:Current
License: Permissive (MIT)

TypeScript 36 Version:Current
License: Permissive (MIT)

Python 35 Version:Current
License: Permissive (Apache-2.0)

Python 35 Version:Current
License: Proprietary (Proprietary)

Python 35 Version:Current
License: Permissive (MIT)

Python 35 Version:Current
License: No License (No License)

Python 35 Version:Current
License: No License (No License)

Python 35 Version:Current
License: Permissive (MIT)

Python 35 Version:Current
License: Permissive (MIT)

Python 35 Version:Current
License: Permissive (MIT)

Python 35 Version:Current
License: No License (No License)

Python 35 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 35 Version:Current
License: No License (No License)

PHP 35 Version:Current
License: Strong Copyleft (AGPL-3.0)

Python 35 Version:Current
License: Permissive (MIT)

TypeScript 35 Version:Current
License: Proprietary (Proprietary)

Shell 35 Version:Current
License: No License (No License)

C++ 35 Version:Current
License: Proprietary (Proprietary)

C++ 35 Version:Current
License: No License (No License)

Go 35 Version:Current
License: Permissive (BSD-3-Clause)

Python 35 Version:Current
License: No License (No License)

Jupyter Notebook 35 Version:Current
License: No License (No License)

Python 35 Version:Current
License: Permissive (MIT)

Python 35 Version:Current
License: Proprietary (Proprietary)

Python 35 Version:Current
License: No License (No License)

Python 34 Version:Current
License: No License (No License)

Python 34 Version:Current
License: Permissive (MIT)

Python 34 Version:Current
License: Permissive (MIT)

JavaScript 34 Version:Current
License: No License (No License)

JavaScript 34 Version:Current
License: Permissive (Apache-2.0)

C 34 Version:Current
License: Permissive (MIT)

HTML 34 Version:Current
License: Strong Copyleft (GPL-3.0)

C++ 34 Version:Current
License: Permissive (Apache-2.0)

Jupyter Notebook 34 Version:Current
License: No License (No License)

C# 34 Version:Current
License: No License (No License)

Python 33 Version:Current
License: Permissive (Apache-2.0)

Python 33 Version:Current
License: Strong Copyleft (GPL-3.0)

JavaScript 33 Version:Current
License: Permissive (Apache-2.0)

C# 33 Version:Current
License: Permissive (Apache-2.0)

Python 33 Version:Current
License: Permissive (Apache-2.0)

Python 33 Version:Current
License: Strong Copyleft (GPL-3.0)

JavaScript 33 Version:Current
License: Strong Copyleft (AGPL-3.0)

Rust 33 Version:Current
License: Permissive (MIT)

UNMAINTAINED A modular digital audio workstation for synthesis, sequencing, live coding, visuals, etc

Shell 33 Version:Current
License: No License (No License)

Shell 33 Version:Current
License: No License (No License)

Go 33 Version:Current
License: Permissive (MIT)

Perl 33 Version:Current
License: Strong Copyleft (GPL-3.0)