edgar-10k-mda | Download and extract MDA section from edgar 10k forms | Crawler library
kandi X-RAY | edgar-10k-mda Summary
Download and extract MDA section from edgar 10k forms
Top functions reviewed by kandi - BETA
- Download all the forms
- Download a file
- Reads all urls from a combined CSV file
- Write content to file
- Combine index files to csv
- Parse a line to record
- Parse html files
- Parse mda files
- Download the indices of a year and quarter
- Creates argument parser
Community Discussions
Trending Discussions on edgar-10k-mda
QUESTION
I want to extract the product description from 10-K reports for my master's thesis (I am new to programming and have a finance background). The product description sits between "ITEM 1" and "ITEM 2" in each report. So far I have downloaded all the 10-Ks as .txt files, removed the HTML tags, and converted the text to uppercase. My problem now is selecting the text I need and saving it to another directory. I tried doing the selection on my own, but the results were unsatisfactory. Currently I am using code written by "iammrhelo" on GitHub. His code selects "ITEM 7" to "ITEM 8"; with a bit of tweaking, I made it search for what I need. Link to his code: https://github.com/iammrhelo/edgar-10k-mda
My problem now is that his parsing does not work for all 10-Ks. It works for selecting the product description in this 10-K:
picture: 10k that the code is able to parse
picture: 10k that the code is NOT able to parse
To give a little context, I need to find the right patterns for the code to look for. The patterns it currently looks for are in the list item1_begins. The code I am using to select the text is the following:
...ANSWER
Answered 2018-Dec-24 at 06:39
If I am understanding you correctly, you need to grab the data in between the ITEMs and place it into a list.
What you can do is use a regular expression: https://docs.python.org/3.4/library/re.html. Regular expressions are very powerful for parsing text, and I see that the re module is imported in that script but not used.
If you want to create a list of the data in between items, you could do something like:
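A minimal sketch of that idea follows. It is not the answerer's original snippet, and it assumes the filing has already been stripped of HTML as described in the question. The names ITEM_1_TO_2, extract_item1_sections, and longest_section are hypothetical, as is the heuristic of keeping the longest match to skip the short table-of-contents entry that usually also contains "ITEM 1" and "ITEM 2".

```python
import re

# Hypothetical pattern: capture everything after an "ITEM 1" heading up to
# the next "ITEM 2" heading. The \b after the digit keeps "ITEM 1A" and
# "ITEM 10" from being treated as the start of the section.
ITEM_1_TO_2 = re.compile(
    r"\bITEM\s+1\b[.:]?(.*?)(?=\bITEM\s+2\b)",
    re.DOTALL | re.IGNORECASE,
)


def extract_item1_sections(text):
    """Return every stretch of text found between ITEM 1 and ITEM 2."""
    return [section.strip() for section in ITEM_1_TO_2.findall(text)]


def longest_section(text):
    """Return the longest ITEM 1 candidate, or None if nothing matched.

    Assumption: the table of contents produces a short match, so the
    longest candidate is the actual section body.
    """
    sections = extract_item1_sections(text)
    return max(sections, key=len) if sections else None


if __name__ == "__main__":
    sample = (
        "TABLE OF CONTENTS\n"
        "ITEM 1. BUSINESS 3\n"
        "ITEM 2. PROPERTIES 15\n"
        "PART I\n"
        "ITEM 1. BUSINESS\n"
        "WE DESIGN AND SELL WIDGETS.\n"
        "ITEM 2. PROPERTIES\n"
        "WE LEASE AN OFFICE.\n"
    )
    print(longest_section(sample))
```

Filings vary widely in how they write the headings (for example "ITEM 1 -" or "ITEM 1 BUSINESS" split across lines), so the pattern will likely need adjusting for the reports it misses, and any subsections such as ITEM 1A that appear before ITEM 2 are included in the match.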
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported