Speech Libraries - Page 19

brouhaha-vad by marianne-m

Jupyter Notebook 59 Version:Current
License: Permissive (MIT)

Predicts the level of noise and reverberation on your audio files

AuxiliaryASR by yl4579

Python 59 Version:Current
License: Permissive (MIT)

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

speech-mfcc by education-service

Java 58 Version:Current
License: No License (No License)

Speech feature extraction and recognition based on MFCC

Speech_Feature_Extraction by pchao6

Python 58 Version:Current
License: No License (No License)

Feature extraction of speech signal is the initial stage of any speech recognition system.

Lip_Reading_in_the_Wild_AVSR by ajinkyaT

Python 58 Version:Current
License: No License (No License)

Audio-Visual Speech Recognition using Deep Learning

fastwer by kahne

Python 58 Version:Current
License: Permissive (MIT)

A PyPI package for fast word/character error rate (WER/CER) calculation

lm_build by srvk

Shell 58 Version:Current
License: No License (No License)

Adapting your own Language Model for Kaldi

libcharsetdetect by batterseapower

C++ 58 Version:Current
License: No License (No License)

A dependency-free C interface to the Mozilla Universal Character Set Detector

glate by keshavbhatt

C++ 58 Version:Current
License: Permissive (MIT)

Open Source Google Translator and TTS App for Linux Desktop

ElundusCoreApp by SietseT

JavaScript 58 Version:Current
License: Strong Copyleft (GPL-3.0)

Desktop application to convert text-to-speech to preview Twitch donations.

PromptingWhisper by jasonppy

Python 58 Version:Current
License: No License (No License)

Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation

Transformer-Transducer by okkteam

Python 57 Version:Current
License: No License (No License)

A pytorch_lightning reimplementation of the Transducer module from ESPnet.

JELVIS by kiahamedi

Python 57 Version:Current
License: Strong Copyleft (GPL-3.0)

Intelligent audio assistant like Iron Man Jarvis

Speech_Emotion_Recognition_DNN-ELM by eesungkim

Python 57 Version:Current
License: No License (No License)

Implementation of Speech Emotion Recognition using DNN-ELM

vue-speech-streaming by aofdev

JavaScript 57 Version:Current
License: Permissive (MIT)

Vue 2 streaming speech recognition (speech-to-text) with Google Cloud Speech

Cognitive-SpeakerRecognition-Windows by microsoft

C# 57 Version:Current
License: Proprietary (Proprietary)

Windows SDK for the Microsoft Speaker Recognition API, part of Cognitive Services

syn-speech by SynHub

C# 57 Version:Current
License: Proprietary (Proprietary)

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Audio-Denoising by AP-Atul

Python 57 Version:Current
License: Permissive (MIT)

Noise removal/reduction for audio files in Python. De-noising uses wavelets, with thresholding done by the VisuShrink technique.

docker-kaldi-android by jcsilva

Shell 57 Version:Current
License: No License (No License)

Dockerfile for compiling Kaldi for Android.

cognitive-services-speech-sdk-go by microsoft

Go 57 Version:Current
License: Permissive (MIT)

Go bindings for the Microsoft Cognitive Services Speech SDK

AdaVocoder by yuan1615

Python 57 Version:Current
License: Permissive (MIT)

Adaptive Vocoder for Custom Voice

viet-asr by dangvansam98

Python 57 Version:Current
License: No License (No License)

VietASR - Vietnamese Automatic Speech Recognition

Context-aware-ZSR by ruotianluo

Python 56 Version:Current
License: Permissive (MIT)

Official code for paper Context-aware Zero-shot Recognition (https://arxiv.org/abs/1904.09320 to appear at AAAI2020)

jovo-cli by jovotech

TypeScript 56 Version:Current
License: Permissive (Apache-2.0)

🛠 Command Line Interface for the Jovo Framework: Makes voice experience deployment a breeze, including features like local development and staging.

tdmelodic by PKSHATechnology-Research

Python 56 Version:Current
License: Permissive (BSD-3-Clause)

A Japanese accent dictionary generator

SHIRO by Sleepwalking

C 56 Version:Current
License: Strong Copyleft (GPL-3.0)

Phoneme-to-speech alignment toolkit based on liblrhsmm

tts-util-app by Danesprite

Kotlin 56 Version:Current
License: Permissive (Apache-2.0)

TTS Util — Text-to-speech utility Android app for synthesising text into audible speech

AliMeeting by yufan-aslp

Python 56 Version:Current
License: No License (No License)

The project is associated with the recently launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) and provides participants with baseline systems for speech recognition and speaker diarization in conference scenarios.

deepgram-python-sdk by deepgram

Python 56 Version:Current
License: Permissive (MIT)

Official Python SDK for Deepgram's automated speech recognition APIs.

SVCC23_FastSVC by lesterphillip

Python 56 Version:Current
License: No License (No License)

Singing Voice Conversion Challenge 2023 Starter Kit: FastSVC Reimplementation

whisper_streaming by ufal

Python 56 Version:Current
License: Permissive (MIT)

Whisper realtime streaming for long speech-to-text transcription and translation

logmmse by rajivpoddar

Python 55 Version:Current
License: Permissive (MIT)

LogMMSE speech enhancement/noise reduction

java-google-speech-api by goxr3plus

Java 55 Version:Current
License: Strong Copyleft (GPL-3.0)

🙊 Speech Recognition, Text To Speech, Google Translate

python_kaldi_features by ZitengWang

Python 55 Version:Current
License: Permissive (MIT)

Python code to extract MFCC and FBANK speech features for Kaldi

webspeech by ranacseruet

JavaScript 55 Version:Current
License: No License (No License)

HTML5 Speech Recognition API Wrapper Library

aimybox-android-sdk by just-ai

Kotlin 55 Version:Current
License: Permissive (Apache-2.0)

Voice assistant SDK for Android

iflytek_awaken_asr by HaoQChen

C 55 Version:Current
License: No License (No License)

Uses iFLYTEK's technology to implement wake-up and command recognition

ParametricSpeaker by NiklasFauth

C++ 55 Version:Current
License: Permissive (Apache-2.0)

Design files and software for my parametric ultrasonic speaker.

android_device_moto_jordan-common by Quarx2k

C 55 Version:Current
License: No License (No License)

common repo for MB520/MB525/MB526/

juicer by idiap

C++ 55 Version:Current
License: Proprietary (Proprietary)

Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).

brn by Brain-up

Kotlin 55 Version:Current
License: Permissive (CC0-1.0)

The idea of this project is to design and build a web application (in cooperation with scientists) containing a series of special audio training exercises that help people with central auditory skill deficits train themselves to listen better.

SoundStorm-pytorch by rishikksh20

Python 55 Version:Current
License: Permissive (MIT)

Google's SoundStorm: Efficient Parallel Audio Generation

Transformer-TTS by xcmyz

Python 54 Version:Current
License: No License (No License)

TTS model based on Transformer.

bangla-tts by zabir-nabil

Python 54 Version:Current
License: No License (No License)

Bangla text to speech, Multilingual (Bangla, English) real-time ([almost] in a GPU) speech synthesis library

ASR by shiyuzh2007

Python 54 Version:Current
License: Permissive (Apache-2.0)

gespeaker by muflone

Python 54 Version:Current
License: No License (No License)

A text to speech GTK+ front-end for eSpeak and mbrola to play a text in many languages with settings for voice, pitch, volume and speed.

speech by unk1911

JavaScript 54 Version:Current
License: No License (No License)

text-to-speech synthesis

simple-speaker-embedding by RF5

Jupyter Notebook 54 Version:Current
License: Proprietary (Proprietary)

A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.

quasar-speech-api by patrickmonteiro

JavaScript 54 Version:Current
License: Permissive (MIT)

🎤 🔉 A single-page application (SPA) project built with Quasar Framework 1.0 + Speech API to capture audio and convert it to text, or to use text as the basis for the application to emit audio.

voices by mklement0

Shell 54 Version:Current
License: No License (No License)

macOS CLI for changing the default TTS (text-to-speech) voice and printing information about and speaking text with multiple voices.

brouhaha-vad by marianne-m

Predicts the level of noise and reverberation on your audio files

Jupyter Notebook

Updated: 2 y ago

License: Permissive (MIT)

AuxiliaryASR by yl4579

Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)

Python

Updated: 2 y ago

License: Permissive (MIT)

speech-mfcc by education-service

Speech feature extraction and recognition based on MFCC

Java

Updated: 4 y ago

License: No License (No License)

Speech_Feature_Extraction by pchao6

Feature extraction of speech signal is the initial stage of any speech recognition system.

Python

Updated: 4 y ago

License: No License (No License)

Lip_Reading_in_the_Wild_AVSR by ajinkyaT

Audio-Visual Speech Recognition using Deep Learning

Python

Updated: 4 y ago

License: No License (No License)

fastwer by kahne

A PyPI package for fast word/character error rate (WER/CER) calculation

Python

Updated: 2 y ago

License: Permissive (MIT)

lm_build by srvk

Adapting your own Language Model for Kaldi

Shell

Updated: 4 y ago

License: No License (No License)

libcharsetdetect by batterseapower

A dependency-free C interface to the Mozilla Universal Character Set Detector

C++

Updated: 3 y ago

License: No License (No License)

glate by keshavbhatt

Open Source Google Translator and TTS App for Linux Desktop

C++

Updated: 2 y ago

License: Permissive (MIT)

ElundusCoreApp by SietseT

Desktop application to convert text-to-speech to preview Twitch donations.

JavaScript

Updated: 3 y ago

License: Strong Copyleft (GPL-3.0)

PromptingWhisper by jasonppy

Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation

Python

Updated: 1 y ago

License: No License (No License)

Transformer-Transducer by okkteam

A pytorch_lightning reimplementation of the Transducer module from ESPnet.

Python

Updated: 4 y ago

License: No License (No License)

JELVIS by kiahamedi

Intelligent audio assistant like Iron Man Jarvis

Python

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

Speech_Emotion_Recognition_DNN-ELM by eesungkim

Implementation of Speech Emotion Recognition using DNN-ELM

Python

Updated: 4 y ago

License: No License (No License)

vue-speech-streaming by aofdev

Vue 2 streaming speech recognition (speech-to-text) with Google Cloud Speech

JavaScript

Updated: 4 y ago

License: Permissive (MIT)

Cognitive-SpeakerRecognition-Windows by microsoft

Windows SDK for the Microsoft Speaker Recognition API, part of Cognitive Services

Updated: 4 y ago

License: Proprietary (Proprietary)

syn-speech by SynHub

Syn.Speech is a flexible speaker independent continuous speech recognition engine for Mono and .NET framework

Updated: 4 y ago

License: Proprietary (Proprietary)

Audio-Denoising by AP-Atul

Noise removal/reduction for audio files in Python. De-noising uses wavelets, with thresholding done by the VisuShrink technique.

Python

Updated: 1 y ago

License: Permissive (MIT)

docker-kaldi-android by jcsilva

Dockerfile for compiling Kaldi for Android.

Shell

Updated: 4 y ago

License: No License (No License)

cognitive-services-speech-sdk-go by microsoft

Go bindings for the Microsoft Cognitive Services Speech SDK

Updated: 1 y ago

License: Permissive (MIT)

AdaVocoder by yuan1615

Adaptive Vocoder for Custom Voice

Python

Updated: 2 y ago

License: Permissive (MIT)

viet-asr by dangvansam98

VietASR - Vietnamese Automatic Speech Recognition

Python

Updated: 2 y ago

License: No License (No License)

Context-aware-ZSR by ruotianluo

Official code for paper Context-aware Zero-shot Recognition (https://arxiv.org/abs/1904.09320 to appear at AAAI2020)

Python

Updated: 4 y ago

License: Permissive (MIT)

jovo-cli by jovotech

🛠 Command Line Interface for the Jovo Framework: Makes voice experience deployment a breeze, including features like local development and staging.

TypeScript

Updated: 2 y ago

License: Permissive (Apache-2.0)

tdmelodic by PKSHATechnology-Research

A Japanese accent dictionary generator

Python

Updated: 3 y ago

License: Permissive (BSD-3-Clause)

SHIRO by Sleepwalking

Phoneme-to-speech alignment toolkit based on liblrhsmm

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

tts-util-app by Danesprite

TTS Util — Text-to-speech utility Android app for synthesising text into audible speech

Kotlin

Updated: 2 y ago

License: Permissive (Apache-2.0)

AliMeeting by yufan-aslp

The project is associated with the recently launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) and provides participants with baseline systems for speech recognition and speaker diarization in conference scenarios.

Python

Updated: 3 y ago

License: No License (No License)

deepgram-python-sdk by deepgram

Official Python SDK for Deepgram's automated speech recognition APIs.

Python

Updated: 1 y ago

License: Permissive (MIT)

SVCC23_FastSVC by lesterphillip

Singing Voice Conversion Challenge 2023 Starter Kit: FastSVC Reimplementation

Python

Updated: 2 y ago

License: No License (No License)

whisper_streaming by ufal

Whisper realtime streaming for long speech-to-text transcription and translation

Python

Updated: 2 y ago

License: Permissive (MIT)

logmmse by rajivpoddar

LogMMSE speech enhancement/noise reduction

Python

Updated: 4 y ago

License: Permissive (MIT)

java-google-speech-api by goxr3plus

🙊 Speech Recognition, Text To Speech, Google Translate

Java

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

python_kaldi_features by ZitengWang

Python code to extract MFCC and FBANK speech features for Kaldi

Python

Updated: 3 y ago

License: Permissive (MIT)

webspeech by ranacseruet

HTML5 Speech Recognition API Wrapper Library

JavaScript

Updated: 4 y ago

License: No License (No License)

aimybox-android-sdk by just-ai

Voice assistant SDK for Android

Kotlin

Updated: 3 y ago

License: Permissive (Apache-2.0)

iflytek_awaken_asr by HaoQChen

Uses iFLYTEK's technology to implement wake-up and command recognition

Updated: 3 y ago

License: No License (No License)

ParametricSpeaker by NiklasFauth

Design files and software for my parametric ultrasonic speaker.

C++

Updated: 4 y ago

License: Permissive (Apache-2.0)

android_device_moto_jordan-common by Quarx2k

common repo for MB520/MB525/MB526/

Updated: 5 y ago

License: No License (No License)

juicer by idiap

Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).

C++

Updated: 4 y ago

License: Proprietary (Proprietary)

brn by Brain-up

The idea of this project is to design and build a web application (in cooperation with scientists) containing a series of special audio training exercises that help people with central auditory skill deficits train themselves to listen better.

Kotlin

Updated: 2 y ago

License: Permissive (CC0-1.0)

SoundStorm-pytorch by rishikksh20

Google's SoundStorm: Efficient Parallel Audio Generation

Python

Updated: 1 y ago

License: Permissive (MIT)

Transformer-TTS by xcmyz

TTS model based on Transformer.

Python

Updated: 4 y ago

License: No License (No License)

bangla-tts by zabir-nabil

Bangla text to speech, Multilingual (Bangla, English) real-time ([almost] in a GPU) speech synthesis library

Python

Updated: 3 y ago

License: No License (No License)

ASR by shiyuzh2007

Python

Updated: 4 y ago

License: Permissive (Apache-2.0)

gespeaker by muflone

A text to speech GTK+ front-end for eSpeak and mbrola to play a text in many languages with settings for voice, pitch, volume and speed.

Python

Updated: 4 y ago

License: No License (No License)

speech by unk1911

text-to-speech synthesis

JavaScript

Updated: 4 y ago

License: No License (No License)

simple-speaker-embedding by RF5

A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.

Jupyter Notebook

Updated: 2 y ago

License: Proprietary (Proprietary)

quasar-speech-api by patrickmonteiro

🎤 🔉 A single-page application (SPA) project built with Quasar Framework 1.0 + Speech API to capture audio and convert it to text, or to use text as the basis for the application to emit audio.

JavaScript

Updated: 3 y ago

License: Permissive (MIT)

voices by mklement0

macOS CLI for changing the default TTS (text-to-speech) voice and printing information about and speaking text with multiple voices.

Shell

Updated: 2 y ago

License: No License (No License)

