Speech summarization help us in generating a gist of a speech by solving the problem of transcribing and summarization. Speech summarizer can also be used to comprehend Podcasts on variety of topics.
Below are the steps involved in building a speech summarizer. The speech summarizer takes an audio file as an input and generates text or audio as an output.
Speech Summarizer created using this kit are added in this section. The entire solution is available as a package to download from the source code repository.
Click on the button below to download the solution and follow the deployment instructions to begin set-up. This 1-click kit has all the required dependencies and resources you may need to build your Speech Summarizer App.
VSCode and Jupyter Notebook are used for development and debugging. Jupyter Notebook is a web based interactive environment often used for experiments, whereas VSCode is used to get a typical experience of IDE for developers. Jupyter Notebook is used for our development.
For extensive analysis and exploration of data, and to deal with arrays, these libraries are used. They are also used for performing scientific computation and data manipulation.
Libraries in this group are used for analysis and processing of unstructured natural language. The data, as in its original form aren't used as it has to go through processing pipeline to become suitable for applying machine learning techniques and algorithms.
Transcribing libraries help in converting speech to text.
Machine learning libraries and frameworks here are helpful in generating state-of-the-art summarization.
Web frameworks help build serving solution as REST APIs. The resources involved for servicing request can be handled by containerising and hosting on hyperscalers.