Speech Libraries - Page 11

CCAligner by saurabhshri

🔮 Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.

C++

150

Updated: 2 y ago

License: No License (No License)

iSTFTNet-pytorch by rishikksh20

iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform

Python

149

Updated: 2 y ago

License: Permissive (Apache-2.0)

Speech-Translate by Dadangdut33

A realtime speech transcription and translation application using Whisper OpenAI and free translation API. Interface made using Tkinter. Code written fully in Python.

Python

149

Updated: 2 y ago

License: Permissive (MIT)

ngcc-validation by angular

Angular Ivy library compatibility validation project

TypeScript

148

Updated: 2 y ago

License: No License (No License)

speech2text by shenasa-ai

A Deep-Learning-Based Persian Speech Recognition System

Jupyter Notebook

148

Updated: 2 y ago

License: Permissive (MIT)

Panako by JorenSix

The Panako acoustic fingerprinting system.

Java

147

Updated: 2 y ago

License: Strong Copyleft (AGPL-3.0)

tagger by yanshao9798

A Joint Chinese segmentation and POS tagger based on bidirectional GRU-CRF

Python

147

Updated: 4 y ago

License: No License (No License)

Amplituda by lincollincol

Audio processing library, which provides waveform data

C

147

Updated: 2 y ago

License: Permissive (Apache-2.0)

SER-datasets by SuperKogito

A collection of datasets for the purpose of emotion recognition/detection in speech.

HTML

147

Updated: 2 y ago

License: Permissive (MIT)

leon-cli by leon-ai

⌨️ Command-line interface (CLI) for a better use of Leon, your open-source personal assistant. GNU/Linux, macOS and Windows supported.

TypeScript

147

Updated: 2 y ago

License: Permissive (MIT)

voice-engine by voice-engine

building blocks to create voice interface applications

Python

146

Updated: 4 y ago

License: Strong Copyleft (GPL-3.0)

VideoSync by allisonnicoledeal

Automatically synchronize crowd-sourced concert videos

Python

146

Updated: 4 y ago

License: No License (No License)

UnityWav by deadlyfingers

WAV utility for saving and loading wav files in Unity

C#

146

Updated: 4 y ago

License: Permissive (MIT)

SpeechRecognizerButton by alexruperez

UIButton subclass with push to talk recording, speech recognition and Siri-style waveform view.

Swift

146

Updated: 4 y ago

License: Permissive (MIT)

speech_separation by bill9800

Include some core functions and model to handle speech separation

Python

144

Updated: 2 y ago

License: Permissive (MIT)

sova-asr by sovaai

SOVA ASR (Automatic Speech Recognition)

Python

144

Updated: 2 y ago

License: Permissive (Apache-2.0)

SpeechToTextDemo by appcoda

A Speech-to-text Demo App

Swift

144

Updated: 4 y ago

License: Permissive (MIT)

awesome-ai-services by sekwiatkowski

An overview of the AI-as-a-service landscape

Java

142

Updated: 2 y ago

License: No License (No License)

go-astideepspeech by asticode

Golang bindings for Mozilla's DeepSpeech speech-to-text library

Go

142

Updated: 4 y ago

License: Permissive (MIT)

Comprehensive-Transformer-TTS by keonlee9420

A Non-Autoregressive Transformer based TTS, supporting a family of SOTA transformers with supervised and unsupervised duration modelings. This project grows with the research community, aiming to achieve the ultimate TTS.

Python

142

Updated: 3 y ago

License: Permissive (MIT)

pitchtron by hash2430

TTS for pitch-accented language. Korean dialect DB.

Python

141

Updated: 2 y ago

License: Proprietary (Proprietary)

hubert by bshall

HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion

Python

141

Updated: 2 y ago

License: Permissive (MIT)

griffin_lim by bkvogel

Implementation of the Griffin and Lim algorithm to recover an audio signal from a magnitude-only spectrogram.

Python

140

Updated: 4 y ago

License: Permissive (BSD-3-Clause)

pikachu2 by chenghuige

WeChat Big Data 2021 1st place, QQ Browser 2021 3rd place, MIND news recommendation 2020 1st place, NAIC2020 AI + remote sensing imagery 2nd place

Jupyter Notebook

140

Updated: 2 y ago

License: No License (No License)

dual-path-RNNs-DPRNNs-based-speech-separation by ShiZiqiang

A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation".

Python

139

Updated: 2 y ago

License: No License (No License)

transducer by awni

A Fast Sequence Transducer Implementation with PyTorch Bindings

Python

139

Updated: 4 y ago

License: Permissive (Apache-2.0)

Praat_Scripts by feelins

Some basic praat scripts.

Python

139

Updated: 2 y ago

License: No License (No License)

Flite-TTS-Engine-for-Android by happyalu

Port of the Festival-lite (Flite TTS) speech-synthesis engine to Android

Java

138

Updated: 4 y ago

License: Proprietary (Proprietary)

pb_bss by fgnt

Collection of EM algorithms for blind source separation of audio signals

Python

138

Updated: 4 y ago

License: Permissive (MIT)

deep_avsr by smeetrs

A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.

Python

138

Updated: 2 y ago

License: Permissive (MIT)

SoundSourceSeparation by sekiguchi92

The code for multi-channel source separation and dereverberation such as FastMNMF1, FastMNMF2, and AR-FastMNMF2.

Python

137

Updated: 2 y ago

License: Proprietary (Proprietary)

AndroidMaryTTS by AndroidMaryTTS

Android MARY TTS - an open-source, offline HMM-Based text-to-speech synthesis system based on MaryTTS

Java

137

Updated: 4 y ago

License: No License (No License)

pytorch-asr by jinserk

ASR with PyTorch

Python

137

Updated: 2 y ago

License: Strong Copyleft (GPL-3.0)

hifigan-denoiser by rishikksh20

HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks

Python

137

Updated: 2 y ago

License: Permissive (Apache-2.0)

isabella by chrisvfritz

A voice-computing assistant built in Ruby.

Ruby

136

Updated: 4 y ago

License: No License (No License)

audio_recognition by JiahuiYu

Audio fingerprinting and recognition in C++

C++

136

Updated: 4 y ago

License: No License (No License)

PiTranslate by dconroy

Raspberry Pi Translation Tool

Python

135

Updated: 4 y ago

License: No License (No License)

ubicoustics by FIGLAB

Accompanying repository for Ubicoustics: Plug-and-Play Acoustic Activity Recognition

Python

135

Updated: 4 y ago

License: Proprietary (Proprietary)

A-Convolutional-Recurrent-Neural-Network-for-Real-Time-Speech-Enhancement by haoxiangsnr

A minimum unofficial implementation of the "A Convolutional Recurrent Neural Network for Real-Time Speech Enhancement" (CRN) using PyTorch

Python

134

Updated: 4 y ago

License: No License (No License)

tensorflow-wavenet by Deeperjia

speech recognition based on tensorflow 1.0.0

Python

134

Updated: 4 y ago

License: No License (No License)

SpeechCmdRecognition by douglas125

A neural attention model for speech command recognition

Jupyter Notebook

134

Updated: 3 y ago

License: Permissive (MIT)

AdaSpeech by rishikksh20

AdaSpeech: Adaptive Text to Speech for Custom Voice

Jupyter Notebook

134

Updated: 2 y ago

License: Permissive (Apache-2.0)

Speech-Recognition-Unity by LightBuzz

Speech recognition in Unity3D.

C#

133

Updated: 2 y ago

License: Permissive (MIT)

santoku by hughjonesd

A versatile cutting tool for R

JavaScript

133

Updated: 2 y ago

License: Proprietary (Proprietary)

GPT-Automator by chidiwilliams

Your voice-controlled Mac assistant

Python

133

Updated: 2 y ago

License: No License (No License)

ttsmms by wannaphong

TTS with The Massively Multilingual Speech (MMS) project

Python

133

Updated: 2 y ago

License: Permissive (MIT)

angle by pannous

⦠ Angle: new speakable syntax for python 💡

Python

132

Updated: 2 y ago

License: No License (No License)

elpis by CoEDL

🙊 software for creating speech recognition models.

Python

132

Updated: 2 y ago

License: Permissive (Apache-2.0)

Looking-to-Listen-at-the-Cocktail-Party by JusperLee

Executable code based on Google articles

Python

132

Updated: 3 y ago

License: Permissive (MIT)

aws_transcribe_to_docx by kibaffo33

Produce Word Document, CSV or SQLite transcriptions using the automatic speech recognition from AWS Transcribe.

Python

132

Updated: 3 y ago

License: Permissive (MIT)

Open Weaver – Develop Applications Faster with Open Source

