Speech Libraries - Page 5

JavaScript 460 Version:Current
License: Permissive (MIT)

🗣 A flexible GUI for Speech Recognition

Support

Quality

Security

License

Reuse

Rust 458 Version:Current
License: Proprietary (Proprietary)

GStreamer bindings for Rust - This repository moved to https://gitlab.freedesktop.org/gstreamer/gstreamer-rs

Support

Quality

Security

License

Reuse

tone-analyzer-nodejsby watson-developer-cloud

CSS 454 Version:Current
License: Permissive (Apache-2.0)

Sample Node.js Application for the IBM Tone Analyzer Service

Support

Quality

Security

License

Reuse

uSpeechby arjo129

C++ 453 Version:Current
License: Permissive (MIT)

Speech recognition toolkit for the arduino

Support

Quality

Security

License

Reuse

xVA-Synthby DanRuta

JavaScript 452 Version:Current
License: Strong Copyleft (GPL-3.0)

Machine learning based speech synthesis Electron app, with voices from specific characters from video games

Support

Quality

Security

License

Reuse

Java 450 Version:Current
License: Permissive (Apache-2.0)

"Google Now" style animation for Speech Recognizer.

Support

Quality

Security

License

Reuse

react-speech-recognitionby JamesBrill

JavaScript 448 Version:Current
License: Permissive (MIT)

💬Speech recognition for your React app

Support

Quality

Security

License

Reuse

MMTby yxgeee

Python 442 Version:Current
License: Permissive (MIT)

[ICLR-2020] Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification.

Support

Quality

Security

License

Reuse

project_news_alan_aiby adrianhajdin

JavaScript 440 Version:Current
License: No License (No License)

In this video, we're going to build a Conversational Voice Controlled React News Application using Alan AI. Alan AI is a revolutionary speech recognition software that allows you to add voice capabilities to your applications.

Support

Quality

Security

License

Reuse

sudo-promptby jorangreef

JavaScript 438 Version:Current
License: Permissive (MIT)

Run a command using sudo, prompting the user with an OS dialog if necessary.

Support

Quality

Security

License

Reuse

audioreadby beetbox

Python 436 Version:Current
License: Permissive (MIT)

cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python

Support

Quality

Security

License

Reuse

Java 434 Version:Current
License: Permissive (Apache-2.0)

Android speech recognition and text to speech made easy

Support

Quality

Security

License

Reuse

Python 428 Version:Current
License: Permissive (Apache-2.0)

一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目，CPU上的实时率(RTF)小于0.1

Support

Quality

Security

License

Reuse

bumblebeeby jaxcore

JavaScript 427 Version:Current
License: Permissive (MIT)

Jaxcore Bumblebee - a JavaScript voice application framework

Support

Quality

Security

License

Reuse

Palaverby JamezQ

Python 425 Version:Current
License: Strong Copyleft (GPL-3.0)

Linux Speech Recognition

Support

Quality

Security

License

Reuse

FullSubNetby Audio-WestlakeU

Python 425 Version:Current
License: Permissive (MIT)

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Support

Quality

Security

License

Reuse

Python 424 Version:Current
License: Permissive (MIT)

Config for talon for Mac, Windows and Linux. Very much in progress.

Support

Quality

Security

License

Reuse

dialogby sqweek

Go 420 Version:Current
License: Permissive (ISC)

Simple cross-platform dialog API for go-lang

Support

Quality

Security

License

Reuse

elevenlabs-pythonby elevenlabs

Python 419 Version:Current
License: No License (No License)

The official Python API for ElevenLabs text-to-speech.

Support

Quality

Security

License

Reuse

lora-svcby PlayVoice

Python 418 Version:Current
License: Permissive (MIT)

singing voice change based on whisper, and lora for singing voice clone

Support

Quality

Security

License

Reuse

adaptive_voice_conversionby jjery2243542

Python 414 Version:Current
License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

voxpopuliby facebookresearch

Python 407 Version:Current
License: Proprietary (Proprietary)

A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation

Support

Quality

Security

License

Reuse

css10by Kyubyong

HTML 407 Version:Current
License: Permissive (Apache-2.0)

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

Support

Quality

Security

License

Reuse

Python 406 Version:Current
License: Permissive (MIT)

Your personal voice assistant

Support

Quality

Security

License

Reuse

Python 405 Version:Current
License: Permissive (MIT)

Different implementations of "Weighted Prediction Error" for speech dereverberation

Support

Quality

Security

License

Reuse

Python 405 Version:Current
License: Strong Copyleft (GPL-3.0)

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Support

Quality

Security

License

Reuse

Python 397 Version:Current
License: Weak Copyleft (LGPL-3.0)

Open tools and data for cloudless automatic speech recognition

Support

Quality

Security

License

Reuse

Voice-Converter-CycleGANby leimao

Python 396 Version:Current
License: Permissive (MIT)

Voice Converter Using CycleGAN and Non-Parallel Data

Support

Quality

Security

License

Reuse

PESQby ludlows

C 395 Version:Current
License: Permissive (MIT)

PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)

Support

Quality

Security

License

Reuse

Phonetisaurusby AdolfVonKleist

Shell 393 Version:Current
License: Permissive (BSD-3-Clause)

Phonetisaurus G2P

Support

Quality

Security

License

Reuse

NISQAby gabrielmittag

Python 392 Version:Current
License: Permissive (MIT)

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Support

Quality

Security

License

Reuse

Neural-Voice-Cloning-With-Few-Samplesby SforAiDl

Python 391 Version:Current
License: Permissive (MIT)

This repository has implementation for "Neural Voice Cloning With Few Samples"

Support

Quality

Security

License

Reuse

musigby sfluor

Go 390 Version:Current
License: Permissive (MIT)

A shazam like tool to store songs fingerprints and retrieve them

Support

Quality

Security

License

Reuse

Speech-Backbonesby huawei-noah

Jupyter Notebook 388 Version:Current
License: No License (No License)

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Support

Quality

Security

License

Reuse

Python 382 Version:Current
License: Proprietary (Proprietary)

Library to build speech synthesis systems designed for easy and fast prototyping.

Support

Quality

Security

License

Reuse

picovoiceby Picovoice

Python 377 Version:Current
License: Permissive (Apache-2.0)

On-device voice assistant platform powered by deep learning

Support

Quality

Security

License

Reuse

MASRby yeyupiaoling

Python 372 Version:Current
License: Permissive (Apache-2.0)

Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2模型，支持多种数据增强方法。

Support

Quality

Security

License

Reuse

esp-skainetby espressif

C 372 Version:Current
License: Proprietary (Proprietary)

Espressif intelligent voice assistant

Support

Quality

Security

License

Reuse

paseby santi-pdp

Python 365 Version:Current
License: Permissive (MIT)

Problem Agnostic Speech Encoder

Support

Quality

Security

License

Reuse

Python 364 Version:Current
License: Weak Copyleft (LGPL-3.0)

ARCHIVED! - Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS) and Windows Speech Recognition (WSR)

Support

Quality

Security

License

Reuse

Python 364 Version:Current
License: Permissive (MIT)

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Support

Quality

Security

License

Reuse

pocketsphinx-pythonby bambocher

Python 363 Version:Current
License: Proprietary (Proprietary)

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Support

Quality

Security

License

Reuse

timeby golang

Go 363 Version:Current
License: Permissive (BSD-3-Clause)

[mirror] Go supplementary time packages

Support

Quality

Security

License

Reuse

C 363 Version:Current
License: Permissive (Apache-2.0)

Server for the Echoprint audio fingerprint system

Support

Quality

Security

License

Reuse

dlaby markovka17

Jupyter Notebook 362 Version:Current
License: Permissive (MIT)

Deep learning for audio processing

Support

Quality

Security

License

Reuse

Python 361 Version:Current
License: Permissive (Apache-2.0)

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Support

Quality

Security

License

Reuse

C++ 361 Version:Current
License: Proprietary (Proprietary)

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

Support

Quality

Security

License

Reuse

nnsvsby r9y9

Python 360 Version:Current
License: Permissive (MIT)

Neural network-based singing voice synthesis library for research

Support

Quality

Security

License

Reuse

Python 360 Version:Current
License: No License (No License)

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets

Support

Quality

Security

License

Reuse

leopardby Picovoice

Python 359 Version:Current
License: Permissive (Apache-2.0)

On-device speech-to-text engine powered by deep learning

Support

Quality

Security

License

Reuse

SpeechKITTby TalAter

🗣 A flexible GUI for Speech Recognition

JavaScript

460

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

gstreamer-rsby sdroege

GStreamer bindings for Rust - This repository moved to https://gitlab.freedesktop.org/gstreamer/gstreamer-rs

Rust

458

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

tone-analyzer-nodejsby watson-developer-cloud

Sample Node.js Application for the IBM Tone Analyzer Service

CSS

454

Updated: 3 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

uSpeechby arjo129

Speech recognition toolkit for the arduino

C++

453

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

xVA-Synthby DanRuta

Machine learning based speech synthesis Electron app, with voices from specific characters from video games

JavaScript

452

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

SpeechRecognitionViewby zagum

"Google Now" style animation for Speech Recognizer.

Java

450

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

react-speech-recognitionby JamesBrill

💬Speech recognition for your React app

JavaScript

448

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

MMTby yxgeee

[ICLR-2020] Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification.

Python

442

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

project_news_alan_aiby adrianhajdin

In this video, we're going to build a Conversational Voice Controlled React News Application using Alan AI. Alan AI is a revolutionary speech recognition software that allows you to add voice capabilities to your applications.

JavaScript

440

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

sudo-promptby jorangreef

Run a command using sudo, prompting the user with an OS dialog if necessary.

JavaScript

438

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

audioreadby beetbox

cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python

Python

436

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

android-speechby gotev

Android speech recognition and text to speech made easy

Java

434

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

TensorflowASRby Z-yq

一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目，CPU上的实时率(RTF)小于0.1

Python

428

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

bumblebeeby jaxcore

Jaxcore Bumblebee - a JavaScript voice application framework

JavaScript

427

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Palaverby JamezQ

Linux Speech Recognition

Python

425

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

FullSubNetby Audio-WestlakeU

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

Python

425

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

knausj_talonby knausj85

Config for talon for Mac, Windows and Linux. Very much in progress.

Python

424

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

dialogby sqweek

Simple cross-platform dialog API for go-lang

420

Updated: 2 y ago

License: Permissive (ISC)

Support

Quality

Security

License

Reuse

elevenlabs-pythonby elevenlabs

The official Python API for ElevenLabs text-to-speech.

Python

419

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

lora-svcby PlayVoice

singing voice change based on whisper, and lora for singing voice clone

Python

418

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

adaptive_voice_conversionby jjery2243542

Python

414

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

voxpopuliby facebookresearch

A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation

Python

407

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

css10by Kyubyong

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

HTML

407

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

hey-athena-clientby rcbyron

Your personal voice assistant

Python

406

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

nara_wpeby fgnt

Different implementations of "Weighted Prediction Error" for speech dereverberation

Python

405

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

allosaurusby xinjli

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

Python

405

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

Support

Quality

Security

License

Reuse

zamia-speechby gooofy

Open tools and data for cloudless automatic speech recognition

Python

397

Updated: 4 y ago

License: Weak Copyleft (LGPL-3.0)

Support

Quality

Security

License

Reuse

Voice-Converter-CycleGANby leimao

Voice Converter Using CycleGAN and Non-Parallel Data

Python

396

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

PESQby ludlows

PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)

395

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Phonetisaurusby AdolfVonKleist

Phonetisaurus G2P

Shell

393

Updated: 2 y ago

License: Permissive (BSD-3-Clause)

Support

Quality

Security

License

Reuse

NISQAby gabrielmittag

NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment

Python

392

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Neural-Voice-Cloning-With-Few-Samplesby SforAiDl

This repository has implementation for "Neural Voice Cloning With Few Samples"

Python

391

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

musigby sfluor

A shazam like tool to store songs fingerprints and retrieve them

390

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

Speech-Backbonesby huawei-noah

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Jupyter Notebook

388

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

nnmnkwiiby r9y9

Library to build speech synthesis systems designed for easy and fast prototyping.

Python

382

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

picovoiceby Picovoice

On-device voice assistant platform powered by deep learning

Python

377

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

MASRby yeyupiaoling

Pytorch实现的流式与非流式的自动语音识别框架，同时兼容在线和离线识别，目前支持Conformer、Squeezeformer、DeepSpeech2模型，支持多种数据增强方法。

Python

372

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

esp-skainetby espressif

Espressif intelligent voice assistant

372

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

paseby santi-pdp

Problem Agnostic Speech Encoder

Python

365

Updated: 4 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

dragonflyby t4ngo

ARCHIVED! - Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS) and Windows Speech Recognition (WSR)

Python

364

Updated: 4 y ago

License: Weak Copyleft (LGPL-3.0)

Support

Quality

Security

License

Reuse

StarGANv2-VCby yl4579

StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion

Python

364

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

pocketsphinx-pythonby bambocher

Python interface to CMU Sphinxbase and Pocketsphinx libraries

Python

363

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

timeby golang

[mirror] Go supplementary time packages

363

Updated: 2 y ago

License: Permissive (BSD-3-Clause)

Support

Quality

Security

License

Reuse

echoprint-serverby spotify

Server for the Echoprint audio fingerprint system

363

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

dlaby markovka17

Deep learning for audio processing

Jupyter Notebook

362

Updated: 2 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

vosk-serverby alphacep

WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries

Python

361

Updated: 4 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

speech-alignerby open-speech

speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription

C++

361

Updated: 2 y ago

License: Proprietary (Proprietary)

Support

Quality

Security

License

Reuse

nnsvsby r9y9

Neural network-based singing voice synthesis library for research

Python

360

Updated: 3 y ago

License: Permissive (MIT)

Support

Quality

Security

License

Reuse

mandarin-ttsby ranchlai

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets

Python

360

Updated: 2 y ago

License: No License (No License)

Support

Quality

Security

License

Reuse

leopardby Picovoice

On-device speech-to-text engine powered by deep learning

Python

359

Updated: 2 y ago

License: Permissive (Apache-2.0)

Support

Quality

Security

License

Reuse

Open Weaver – Develop Applications Faster with Open Source

Terms
Privacy policy

Speech Libraries - Page 5

SpeechKITTby TalAter

JavaScript 460 Version:Current License: Permissive (MIT)

🗣 A flexible GUI for Speech Recognition

gstreamer-rsby sdroege

Rust 458 Version:Current License: Proprietary (Proprietary)

GStreamer bindings for Rust - This repository moved to https://gitlab.freedesktop.org/gstreamer/gstreamer-rs

tone-analyzer-nodejsby watson-developer-cloud

CSS 454 Version:Current License: Permissive (Apache-2.0)

Sample Node.js Application for the IBM Tone Analyzer Service

uSpeechby arjo129

C++ 453 Version:Current License: Permissive (MIT)

Speech recognition toolkit for the arduino

xVA-Synthby DanRuta

JavaScript 452 Version:Current License: Strong Copyleft (GPL-3.0)

Machine learning based speech synthesis Electron app, with voices from specific characters from video games

SpeechRecognitionViewby zagum

Java 450 Version:Current License: Permissive (Apache-2.0)

"Google Now" style animation for Speech Recognizer.

react-speech-recognitionby JamesBrill

JavaScript 448 Version:Current License: Permissive (MIT)

💬Speech recognition for your React app

MMTby yxgeee

Python 442 Version:Current License: Permissive (MIT)

[ICLR-2020] Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification.

project_news_alan_aiby adrianhajdin

JavaScript 440 Version:Current License: No License (No License)

In this video, we're going to build a Conversational Voice Controlled React News Application using Alan AI. Alan AI is a revolutionary speech recognition software that allows you to add voice capabilities to your applications.

sudo-promptby jorangreef

JavaScript 438 Version:Current License: Permissive (MIT)

Run a command using sudo, prompting the user with an OS dialog if necessary.

audioreadby beetbox

Python 436 Version:Current License: Permissive (MIT)

cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding for Python

android-speechby gotev

Java 434 Version:Current License: Permissive (Apache-2.0)

Android speech recognition and text to speech made easy

TensorflowASRby Z-yq

Python 428 Version:Current License: Permissive (Apache-2.0)

一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目，CPU上的实时率(RTF)小于0.1

bumblebeeby jaxcore

JavaScript 427 Version:Current License: Permissive (MIT)

Jaxcore Bumblebee - a JavaScript voice application framework

Palaverby JamezQ

Python 425 Version:Current License: Strong Copyleft (GPL-3.0)

Linux Speech Recognition

FullSubNetby Audio-WestlakeU

Python 425 Version:Current License: Permissive (MIT)

PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."

knausj_talonby knausj85

Python 424 Version:Current License: Permissive (MIT)

Config for talon for Mac, Windows and Linux. Very much in progress.

dialogby sqweek

Go 420 Version:Current License: Permissive (ISC)

Simple cross-platform dialog API for go-lang

elevenlabs-pythonby elevenlabs

Python 419 Version:Current License: No License (No License)

The official Python API for ElevenLabs text-to-speech.

lora-svcby PlayVoice

Python 418 Version:Current License: Permissive (MIT)

singing voice change based on whisper, and lora for singing voice clone

adaptive_voice_conversionby jjery2243542

Python 414 Version:Current License: Permissive (Apache-2.0)

voxpopuliby facebookresearch

Python 407 Version:Current License: Proprietary (Proprietary)

A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation

css10by Kyubyong

HTML 407 Version:Current License: Permissive (Apache-2.0)

CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages

hey-athena-clientby rcbyron

Python 406 Version:Current License: Permissive (MIT)

Your personal voice assistant

nara_wpeby fgnt

Python 405 Version:Current License: Permissive (MIT)

Different implementations of "Weighted Prediction Error" for speech dereverberation

allosaurusby xinjli

Python 405 Version:Current License: Strong Copyleft (GPL-3.0)

Allosaurus is a pretrained universal phone recognizer for more than 2000 languages

zamia-speechby gooofy

Python 397 Version:Current License: Weak Copyleft (LGPL-3.0)

JavaScript 460 Version:Current
License: Permissive (MIT)

Rust 458 Version:Current
License: Proprietary (Proprietary)

CSS 454 Version:Current
License: Permissive (Apache-2.0)

C++ 453 Version:Current
License: Permissive (MIT)

JavaScript 452 Version:Current
License: Strong Copyleft (GPL-3.0)

Java 450 Version:Current
License: Permissive (Apache-2.0)

JavaScript 448 Version:Current
License: Permissive (MIT)

Python 442 Version:Current
License: Permissive (MIT)

JavaScript 440 Version:Current
License: No License (No License)

JavaScript 438 Version:Current
License: Permissive (MIT)

Python 436 Version:Current
License: Permissive (MIT)

Java 434 Version:Current
License: Permissive (Apache-2.0)

Python 428 Version:Current
License: Permissive (Apache-2.0)

JavaScript 427 Version:Current
License: Permissive (MIT)

Python 425 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 425 Version:Current
License: Permissive (MIT)

Python 424 Version:Current
License: Permissive (MIT)

Go 420 Version:Current
License: Permissive (ISC)

Python 419 Version:Current
License: No License (No License)

Python 418 Version:Current
License: Permissive (MIT)

Python 414 Version:Current
License: Permissive (Apache-2.0)

Python 407 Version:Current
License: Proprietary (Proprietary)

HTML 407 Version:Current
License: Permissive (Apache-2.0)

Python 406 Version:Current
License: Permissive (MIT)

Python 405 Version:Current
License: Permissive (MIT)

Python 405 Version:Current
License: Strong Copyleft (GPL-3.0)

Python 397 Version:Current
License: Weak Copyleft (LGPL-3.0)

Python 396 Version:Current
License: Permissive (MIT)

C 395 Version:Current
License: Permissive (MIT)

Shell 393 Version:Current
License: Permissive (BSD-3-Clause)

Python 392 Version:Current
License: Permissive (MIT)

Python 391 Version:Current
License: Permissive (MIT)

Go 390 Version:Current
License: Permissive (MIT)

Jupyter Notebook 388 Version:Current
License: No License (No License)

Python 382 Version:Current
License: Proprietary (Proprietary)

Python 377 Version:Current
License: Permissive (Apache-2.0)

Python 372 Version:Current
License: Permissive (Apache-2.0)

C 372 Version:Current
License: Proprietary (Proprietary)

Python 365 Version:Current
License: Permissive (MIT)

Python 364 Version:Current
License: Weak Copyleft (LGPL-3.0)

Python 364 Version:Current
License: Permissive (MIT)

Python 363 Version:Current
License: Proprietary (Proprietary)

Go 363 Version:Current
License: Permissive (BSD-3-Clause)

C 363 Version:Current
License: Permissive (Apache-2.0)

Jupyter Notebook 362 Version:Current
License: Permissive (MIT)

Python 361 Version:Current
License: Permissive (Apache-2.0)

C++ 361 Version:Current
License: Proprietary (Proprietary)

Python 360 Version:Current
License: Permissive (MIT)

Python 360 Version:Current
License: No License (No License)

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets

Python 359 Version:Current
License: Permissive (Apache-2.0)