Speech Summarizer Using Python Transformers

by kandikits Updated: Aug 22, 2023

1-Click Kit

1-Click Kit installer

Speech summarization help us in generating a gist of a speech by solving the problem of transcribing and summarization. Speech summarizer can also be used to comprehend Podcasts on variety of topics.

Below are the steps involved in building a speech summarizer. The speech summarizer takes an audio file as an input and generates text or audio as an output.

Transform audio to meet the following spec a. '.wav' file format b. 16KHz sample rate c. Mono channel
Transcribe transformed audio file
Process transcribed text if necessary
Summarize transcribed text using pre-trained state-of-the-art models
Generate audio out of summarized text.

Deployment Information

Speech Summarizer created using this kit are added in this section. The entire solution is available as a package to download from the source code repository.

Download, extract and double-click the kit installer file to install the kit. Do ensure to extract the zip file before running it.
After successful installation of the kit, press 'Y' to run the kit and execute cells in the notebook.
To run the kit manually, press 'N' and locate the zip file 'speech-summarizer.zip'
Extract the zip file and navigate to the directory 'speech-summarizer'
Open command prompt in the extracted directory 'speech-summarizer' and run the command 'jupyter notebook'
Locate and open the 'Speech Summarizer.ipynb' notebook from the Jupyter Notebook browser window.
Execute cells in the notebook

Click on the button below to download the solution and follow the deployment instructions to begin set-up. This 1-click kit has all the required dependencies and resources you may need to build your Speech Summarizer App.

1-Click Kit installer

For a detailed tutorial on installing & executing the solution as well as learning resources including training & certification opportunities, please visit the OpenWeaver Community

Development Environment

VSCode and Jupyter Notebook are used for development and debugging. Jupyter Notebook is a web based interactive environment often used for experiments, whereas VSCode is used to get a typical experience of IDE for developers. Jupyter Notebook is used for our development.

notebookby jupyter

Jupyter Notebook

10204

Version:v7.0.0b4

License: Permissive (BSD-3-Clause)

Jupyter Interactive Notebook

Support

Quality

Security

License

Reuse

notebookby jupyter

Jupyter Notebook 10204 Version:v7.0.0b4 License: Permissive (BSD-3-Clause)

Jupyter Interactive Notebook

Support

Quality

Security

License

Reuse

vscodeby microsoft

TypeScript

147328

Version:1.79.2

License: Permissive (MIT)

Visual Studio Code

Support

Quality

Security

License

Reuse

vscodeby microsoft

TypeScript 147328 Version:1.79.2 License: Permissive (MIT)

Visual Studio Code

Support

Quality

Security

License

Reuse

Exploratory Data Analysis

For extensive analysis and exploration of data, and to deal with arrays, these libraries are used. They are also used for performing scientific computation and data manipulation.

pandasby pandas-dev

Python

38689

Version:v2.0.2

License: Permissive (BSD-3-Clause)

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Support

Quality

Security

License

Reuse

pandasby pandas-dev

Python 38689 Version:v2.0.2 License: Permissive (BSD-3-Clause)

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Support

Quality

Security

License

Reuse

numpyby numpy

Python

23755

Version:v1.25.0rc1

License: Permissive (BSD-3-Clause)

The fundamental package for scientific computing with Python.

Support

Quality

Security

License

Reuse

numpyby numpy

Python 23755 Version:v1.25.0rc1 License: Permissive (BSD-3-Clause)

The fundamental package for scientific computing with Python.

Support

Quality

Security

License

Reuse

Text mining

Libraries in this group are used for analysis and processing of unstructured natural language. The data, as in its original form aren't used as it has to go through processing pipeline to become suitable for applying machine learning techniques and algorithms.

Transcribing

Transcribing libraries help in converting speech to text.

Machine Learning

Machine learning libraries and frameworks here are helpful in generating state-of-the-art summarization.

scikit-learnby scikit-learn

Python

54584

Version:1.2.2

License: Permissive (BSD-3-Clause)

scikit-learn: machine learning in Python

Support

Quality

Security

License

Reuse

scikit-learnby scikit-learn

Python 54584 Version:1.2.2 License: Permissive (BSD-3-Clause)

scikit-learn: machine learning in Python

Support

Quality

Security

License

Reuse

Request servicing via REST API

Web frameworks help build serving solution as REST APIs. The resources involved for servicing request can be handled by containerising and hosting on hyperscalers.

Kit Solution Source

speech-summarizerby kandi1clickkits

Jupyter Notebook

Version:v1.0.0

License: Permissive (Apache-2.0)

Transcribes and summarizes speech or audio

Support

Quality

Security

License

Reuse

speech-summarizerby kandi1clickkits

Jupyter Notebook 0 Version:v1.0.0 License: Permissive (Apache-2.0)

Transcribes and summarizes speech or audio

Support

Quality

Security

License

Reuse

Support

If you need help using this kit, you may reach us at the OpenWeaver Community.

See similar Kits and Libraries

Open Weaver – Develop Applications Faster with Open Source

Terms
Privacy policy