Speech-Transformer | PyTorch implementation of Speech Transformer | Speech library
kandi X-RAY | Speech-Transformer Summary
kandi X-RAY | Speech-Transformer Summary
A PyTorch implementation of Speech Transformer [1], an end-to-end automatic speech recognition with Transformer network, which directly converts acoustic features to character sequence using a single nueral network.
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
- Train the model
- Run one epoch
- Calculate the loss of the loss
- Calculate the Calculation of the loss
- Compute the decoder
- Preprocess padded input
- Get key - pad mask for key query
- Pad a list
- Compute the layer
- Decodes a model
- Build LFR features from inputs
- Load a model from a package
- Recognize beam
- Extracts the sos_id and eos_id
- Load a model
- Reset checkpoint
- Recognize the beam
- Recognize a beam
- Extracts the sos id and eos id
- Load a model from a file
Speech-Transformer Key Features
Speech-Transformer Examples and Code Snippets
Community Discussions
Trending Discussions on Speech-Transformer
QUESTION
github: https://github.com/sephiroce/tfsr/tree/exprimental
I'm trying to reproduce recognition accuracies described in the speech transformer paper [1]. The attention penalty is a technique I could not fully understand. This is the description of the attention penalty in the paper.
"In addition, we encouraged the model attending to closer positions by adding bigger penalty on the attention weights of more distant position-pairs."
I understood as it means adding smaller negative values for more away from the diagonal on scaled attention logits (before masking) except for the first multi-head attention in decoders.
This is a code snippet for computing attention weights.
...ANSWER
Answered 2020-Jan-13 at 10:33I think you understand it well. They probably did a stripe around the diagonal, something like:
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install Speech-Transformer
PyTorch 0.4.1+
Kaldi (just for feature extraction)
pip install -r requirements.txt
cd tools; make KALDI=/path/to/kaldi
If you want to run egs/aishell/run.sh, download aishell dataset for free.
You can change parameter by $ bash run.sh --parameter_name parameter_value, egs, $ bash run.sh --stage 3. See parameter name in egs/aishell/run.sh before . utils/parse_options.sh.
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page