DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
C++ Updated: 11 mo ago License: Weak Copyleft
kaldi-asr/kaldi is the official location of the Kaldi project.
Shell Updated: 10 mo ago License: Non-SPDX
A PyTorch-based Speech Toolkit
Python Updated: 3 mo ago License: Permissive
Jupyter Interactive Notebook
Jupyter Notebook Updated: 3 mo ago License: Non-SPDX
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Tensors and Dynamic neural networks in Python with strong GPU acceleration
C++ Updated: 3 mo ago License: Non-SPDX
SoundFile is an audio library based on libsndfile, CFFI, and NumPy
Python Updated: 5 mo ago License: Permissive
A python package to analyze and compare voices with deep learning
Python Updated: 10 mo ago License: Permissive