common-voice | Common Voice is part of Mozilla's initiative to help teach machines how real people speak | Speech library
kandi X-RAY | common-voice Summary
This is the web app for Mozilla Common Voice, a platform for collecting speech donations in order to create public domain datasets for training voice recognition-related tools.
Community Discussions
Trending Discussions on common-voice
QUESTION
I am trying to download some large files from https://commonvoice.mozilla.org/en/datasets to a Linux server using wget. The raw links are not provided directly; one has to enter an email address, and the browser then downloads the files.
My Chrome browser started downloading from this link, denoted URL:
ANSWER
Answered 2021-Jan-15 at 15:33
The URL in question is incomplete: the X-Amz-SignedHeaders parameter is missing.
To get a working URL do the following:
- Enter a valid email address
- Agree to the terms using the checkboxes
- Do not click the Download button. Instead, right-click it, choose "Copy link address", and use that URL in your wget command (be sure to escape each ampersand & by adding a backslash in front of it, or wrap the whole URL in single quotes).
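The quoting step is easy to get wrong by hand, so here is a minimal Python sketch that builds the wget command with the URL quoted for a POSIX shell. The helper name build_wget_command, the output file name, and the example URL are hypothetical and only serve as illustration; the real presigned URL is longer and must be copied from the browser as described above.

```python
import shlex


def build_wget_command(url: str, output_name: str) -> str:
    """Return a wget command line with the URL quoted for a POSIX shell."""
    # shlex.quote wraps the URL in single quotes, so '&' and '?' are passed
    # to wget literally instead of being interpreted by the shell.
    parts = ["wget", "-c", "-O", shlex.quote(output_name), shlex.quote(url)]
    return " ".join(parts)


# Hypothetical, shortened URL for illustration only (not a real download link).
example_url = (
    "https://example-bucket.s3.amazonaws.com/cv-corpus/en.tar.gz"
    "?X-Amz-Expires=43200&X-Amz-SignedHeaders=host"
)

print(build_wget_command(example_url, "en.tar.gz"))
```

Here -c lets wget resume a partially downloaded file and -O sets the local file name. Single-quoting the whole URL has the same effect as putting a backslash in front of every ampersand.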
What you're seeing is a presigned URL from Amazon S3. This is essentially a temporary download link for a single object (roughly, a file) in an S3 bucket that appears to belong to Mozilla.
The keyword here is temporary. If you use that link before it expires, you can download the data using wget without problems.
You can estimate the expiry time by adding the value of X-Amz-Expires (given in seconds) to X-Amz-Date. In your case the URL was valid for 43200 / 3600 = 12 hours, starting at the specified date. Don't bother changing these values: the whole URL is cryptographically signed, so a modified URL will be rejected ;-)
So the way you can do that is:
- Log in to the website
- Copy the download links
- Download within roughly 12 hours using wget
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported