8 cool open-source projects to explore the power of voice technology in AI

by shweta10 · Updated: Nov 25, 2023

Explore the power of voice technology in AI with these 8 cool open-source projects.

TTS by mozilla

Jupyter Notebook · 7519 stars · Version: v0.0.9 · License: Weak Copyleft (MPL-2.0)

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

espeak-ng by espeak-ng

C · 2099 stars · Version: 1.51 · License: Strong Copyleft (GPL-3.0)

eSpeak NG is an open-source speech synthesizer that supports more than a hundred languages and accents.

vall-e by enhuiz

Python · 2443 stars · Version: Current · License: Permissive (MIT)

An unofficial PyTorch implementation of the audio LM VALL-E

TensorFlowTTS by TensorSpeech

Python · 3375 stars · Version: v1.8 · License: Permissive (Apache-2.0)

😝 TensorFlowTTS: real-time, state-of-the-art speech synthesis for TensorFlow 2 (supports English, French, Korean, Chinese, and German, and is easy to adapt to other languages)

DiffSinger by MoonInTheRiver

Python · 3372 stars · Version: pretrain-model · License: Permissive (MIT)

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

MockingBird by babysor

Python · 29415 stars · Version: v0.0.1 · License: Others (Non-SPDX)

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time

leon by leon-ai

TypeScript · 12924 stars · Version: nodejs-bridge_v1.0.0 · License: Permissive (MIT)

🧠 Leon is your open-source personal assistant.

NeMo by NVIDIA

Python · 7027 stars · Version: v1.19.0 · License: Permissive (Apache-2.0)

NeMo: a toolkit for conversational AI
