
Ai Hulks App Kit


by ddmasterdon | Updated: Nov 2, 2021

Solution Kit

SPEAKER COUNTING

Speaker counting enhances understanding through automatic speech recognition. It is beneficial for real-world applications such as call-center transcription and meeting transcription analytics. Speaker diarization is a developing field of study, with new approaches published frequently.

The Problem

Few studies have addressed estimating a large number of speakers. Diarization becomes extremely difficult when the number of speakers is large. Providing the number of speakers to the diarization system can be advantageous.

Complete Solution Architecture

- Machine learning model: predicts the number of speakers and the timestamps of each speaker.
- Web app: frontend through which the user accesses the feature.
- Middleware Flask API: connects the frontend and the ML model.

We have built a web app that lets a user interact with and benefit from our machine learning model. Because the model and the web app are built on different platforms, we used a REST API as middleware to connect the frontend and the model.
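
The middleware role described above can be illustrated with a minimal Flask sketch. This is not the kit's actual API: the `/predict` route, the `audio` form field, the `predict_speakers` helper, and the response keys are all hypothetical names chosen for illustration, and the model call is replaced by a fixed placeholder result.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)


def predict_speakers(audio_bytes):
    """Hypothetical stand-in for the ML model.

    A real implementation would run preprocessing, embedding extraction,
    and speaker counting. This placeholder returns a fixed, empty result.
    """
    return {"num_speakers": 0, "segments": []}


@app.route("/predict", methods=["POST"])
def predict():
    # Assumption: the frontend uploads the recording as a multipart
    # form field named "audio".
    upload = request.files.get("audio")
    if upload is None:
        return jsonify({"error": "missing 'audio' file field"}), 400
    return jsonify(predict_speakers(upload.read())), 200
```

With this shape, the frontend only needs to know the route and the JSON keys, so the model can change behind the API without any change to the web app.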

ML Model Solution Process

Our Web UI is built with Node as the backend and VueJs as the frontend. The ML model pipeline has three stages:

1. Preprocessing: Denoising -> Speech separation
2. Embedding Extraction: YAMNet sound classification model
3. Speaker Counting: Machine learning model selection -> Model training -> Model prediction
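
The data flow through the three stages can be sketched as a chain of functions. Every stage below is a placeholder, not the kit's implementation: `denoise` and `separate_speech` pass the signal through unchanged, `embed_frames` stands in for YAMNet by returning raw fixed-length frames, and `count_speakers` stands in for the trained model by counting distinct frames.

```python
from typing import Callable, List, Sequence, Tuple

Waveform = List[float]
Embedding = List[float]


def denoise(waveform: Waveform) -> Waveform:
    # Placeholder: a real stage would suppress background noise.
    return list(waveform)


def separate_speech(waveform: Waveform) -> Waveform:
    # Placeholder: a real stage would isolate speech from other sources.
    return list(waveform)


def embed_frames(waveform: Waveform, frame_len: int = 4) -> List[Embedding]:
    # Placeholder for an embedding model: non-overlapping raw frames.
    last_start = len(waveform) - frame_len + 1
    return [list(waveform[i:i + frame_len]) for i in range(0, last_start, frame_len)]


def count_speakers(embeddings: Sequence[Embedding]) -> int:
    # Placeholder for the trained model: number of distinct embeddings.
    return len({tuple(e) for e in embeddings})


def run_pipeline(
    waveform: Waveform,
    preprocess: Sequence[Callable[[Waveform], Waveform]],
    embed: Callable[[Waveform], List[Embedding]],
    count: Callable[[Sequence[Embedding]], int],
) -> Tuple[int, List[Embedding]]:
    # Stage 1: apply the preprocessing steps in order.
    for stage in preprocess:
        waveform = stage(waveform)
    # Stage 2: turn the cleaned signal into embeddings.
    embeddings = embed(waveform)
    # Stage 3: estimate the speaker count from the embeddings.
    return count(embeddings), embeddings
```

Keeping each stage behind a plain function signature makes it possible to swap a placeholder for a real model without touching the rest of the chain.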

Data Preprocessing

Technologies used for preprocessing the audio data.

numpy by numpy

Python | Stars: 23036 | Version: v1.24.2
License: Permissive (BSD-3-Clause)

The fundamental package for scientific computing with Python.


csv by thephpleague

PHP | Stars: 3077 | Version: 9.9.0
License: Permissive (MIT)

CSV data manipulation made easy in PHP


scipy by scipy

Python | Stars: 10995 | Version: v1.10.1
License: Permissive (BSD-3-Clause)

SciPy library main repository


matplotlib by matplotlib

Python | Stars: 17111 | Version: v3.7.1
License: No License

matplotlib: plotting with Python


pandas by pandas-dev

Python | Stars: 37439 | Version: v2.0.0rc1
License: Permissive (BSD-3-Clause)

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more


Audio Preprocessing

These additional libraries are used to process the audio that is fed into the classifier model.

py-webrtcvad by wiseman

C | Stars: 1608 | Version: Current
License: Others (Non-SPDX)

Python interface to the WebRTC Voice Activity Detector
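
As an illustration of how audio might be prepared for a voice activity detector of this kind, the sketch below splits raw 16-bit mono PCM into fixed-length frames. It rests on the input constraints documented for py-webrtcvad (frames of 10, 20, or 30 ms at 8000, 16000, 32000, or 48000 Hz). The helper name `split_frames` is hypothetical and not part of any library.

```python
from typing import List

# Frame lengths (ms) and sample rates (Hz) that the WebRTC VAD accepts.
VALID_FRAME_MS = (10, 20, 30)
VALID_SAMPLE_RATES = (8000, 16000, 32000, 48000)
BYTES_PER_SAMPLE = 2  # 16-bit PCM


def split_frames(pcm: bytes, sample_rate: int, frame_ms: int = 30) -> List[bytes]:
    """Split mono 16-bit PCM into equal frames, dropping any trailing remainder."""
    if sample_rate not in VALID_SAMPLE_RATES:
        raise ValueError(f"unsupported sample rate: {sample_rate}")
    if frame_ms not in VALID_FRAME_MS:
        raise ValueError(f"unsupported frame length: {frame_ms} ms")
    frame_bytes = sample_rate * frame_ms // 1000 * BYTES_PER_SAMPLE
    last_start = len(pcm) - frame_bytes + 1
    return [pcm[i:i + frame_bytes] for i in range(0, last_start, frame_bytes)]
```

Each resulting frame could then be passed to `webrtcvad.Vad(2).is_speech(frame, sample_rate)` to keep only the voiced portions before embedding extraction.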


Resemblyzer by resemble-ai

Python | Stars: 2197 | Version: 0.1.1-dev
License: Permissive (Apache-2.0)

A python package to analyze and compare voices with deep learning

python-soundfile by bastibe

Python | Stars: 528 | Version: 0.12.1
License: Permissive (BSD-3-Clause)

SoundFile is an audio library based on libsndfile, CFFI, and NumPy

SoundFile by bastibe

Python | Stars: 316 | Version: 0.10.3post1
License: Permissive (BSD-3-Clause)

SoundFile is an audio library based on libsndfile, CFFI, and NumPy


pydub by jiaaro

Python | Stars: 6964 | Version: v0.25.1
License: Permissive (MIT)

Manipulate audio with a simple and easy high level interface


Model Training

These libraries are used to create the two classifier models, which are then combined into one.
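
The source does not say how the two classifiers are combined. One common option, shown here purely as an assumption, is soft voting: take a weighted average of the class probabilities from both models and pick the highest-scoring class. The function name `soft_vote` and the `weight_a` parameter are hypothetical.

```python
from typing import List, Sequence, Tuple


def soft_vote(
    probs_a: Sequence[float],
    probs_b: Sequence[float],
    weight_a: float = 0.5,
) -> Tuple[int, List[float]]:
    """Blend two models' class probabilities and return (best class index, blended probabilities)."""
    if len(probs_a) != len(probs_b):
        raise ValueError("both models must score the same set of classes")
    if not probs_a:
        raise ValueError("at least one class is required")
    # Weighted average per class (assumed combination rule, not the kit's).
    blended = [weight_a * a + (1.0 - weight_a) * b for a, b in zip(probs_a, probs_b)]
    best = max(range(len(blended)), key=blended.__getitem__)
    return best, blended
```

Here each class index would correspond to one candidate speaker count, and `weight_a` lets the stronger model dominate if validation shows one is more reliable.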

tensorflow by tensorflow

C++ | Stars: 172599 | Version: v2.12.0
License: Permissive (Apache-2.0)

An Open Source Machine Learning Framework for Everyone


hub by tensorflow

Python | Stars: 3284 | Version: v0.13.0
License: Permissive (Apache-2.0)

A library for transfer learning by reusing parts of TensorFlow models.


librosa by librosa

Python | Stars: 5772 | Version: 0.10.0.post2
License: Permissive (ISC)

Python library for audio and music analysis

                                                                                                                                                                                                                                                  Support
                                                                                                                                                                                                                                                    Quality
                                                                                                                                                                                                                                                      Security
                                                                                                                                                                                                                                                        License
                                                                                                                                                                                                                                                          Reuse


tqdm by tqdm

Python | Stars: 24341 | Version: v4.65.0
License: Others (Non-SPDX)

A Fast, Extensible Progress Bar for Python and CLI


Deployment Information

https://github.com/aihulks69/Speaker-dirization
