REST API for scraping dynamic websites using Node.js, headless Chrome and Cheerio.
Support
Quality
Security
License
Reuse
Web scraping the popular job listing site "Glassdoor" with Python and BeautifulSoup. Implemented from scratch.
Support
Quality
Security
License
Reuse
:soccer: Instantly find :trophy:EURO 2016 live-streams & highlights, now a Web App!
Support
Quality
Security
License
Reuse
p
python_web_scrapingby telunyang
Jupyter Notebook 
54
Version:Current
License: No License (No License)
Web scraping using python, requests and selenium
Support
Quality
Security
License
Reuse
A scraper to scrape the NBA API and compile a play by play file
Support
Quality
Security
License
Reuse
An apartments.com scraper using beautifulsoup4 and python
Support
Quality
Security
License
Reuse
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
Support
Quality
Security
License
Reuse
News extraction and scraping. Article Parsing
Support
Quality
Security
License
Reuse
Monitor instagram user account and automatically post new images to discord channel via a webhook. Working 2021!
Support
Quality
Security
License
Reuse
This is a complete profile scraper that returns a JSON file.
Support
Quality
Security
License
Reuse
Service to scrape a web page easily without knowing their HTML structure.
Support
Quality
Security
License
Reuse
Download all Snapmaps content from a specific location.
Support
Quality
Security
License
Reuse
Extract user info from their reddit comments and activity.
Support
Quality
Security
License
Reuse
A repo where I will put my scripts to analyze the data that Google collects about me.
Support
Quality
Security
License
Reuse
图片爬取下载工具,极速爬取下载 站酷https://www.zcool.com.cn/, CNU 视觉 http://www.cnu.cc/ 设计师/用户 上传的 图片/照片/插画。
Support
Quality
Security
License
Reuse
The scraper/parser that produces data for TheyWorkForYou, PublicWhip, etc
Support
Quality
Security
License
Reuse
Support
Quality
Security
License
Reuse
n
node-website-scraper-phantomby website-scraper
JavaScript 
51
Version:Current
License: Permissive (MIT)
Plugin for website-scraper which returns html for dynamic websites using PhantomJS.
Support
Quality
Security
License
Reuse
A web scraping framework for .NET
Support
Quality
Security
License
Reuse
Instagram power tool
Support
Quality
Security
License
Reuse
Web Scraping Framework
Support
Quality
Security
License
Reuse
c
california-coronavirus-scrapersby datadesk
Jupyter Notebook 
51
Version:Current
License: Permissive (MIT)
The open-source web scrapers that feed the Los Angeles Times California coronavirus tracker.
Support
Quality
Security
License
Reuse
Scrape the Twitter frontend API without any authentication and restriction.
Support
Quality
Security
License
Reuse
Scrapes information from Gogoanime to get Anime, Episode & Video information & urls.
Support
Quality
Security
License
Reuse
Contains scraper in R for grabbing NBA Sport Tracking Data
Support
Quality
Security
License
Reuse
Scraping HN for TaggerNews
Support
Quality
Security
License
Reuse
Scraper & GraphQL API untuk data Perguruan Tinggi di Indonesia berdasarkan dari website Kementrian RISTEKDIKTI.
Support
Quality
Security
License
Reuse
An intelligent web service to automatically detect web content and extract information from it.
Support
Quality
Security
License
Reuse
I'm trying to finish the scraplat as a scraper platform
Support
Quality
Security
License
Reuse
Python script to download messages from a Facebook page to a CSV file
Support
Quality
Security
License
Reuse
Python web crawler / scraper for WG-Gesucht. Crawls the WG-Gesucht site for new apartment listings and send a message to the poster, based off your saved filters and saved text
Support
Quality
Security
License
Reuse
Extract emails from a given website
Support
Quality
Security
License
Reuse
scrapers for building your own image databases
Support
Quality
Security
License
Reuse
Scrape employee names from search engine LinkedIn profiles. Convert employee names to a specified username format.
Support
Quality
Security
License
Reuse
Public web scraping scripts for the University of Toronto.
Support
Quality
Security
License
Reuse
A data structure for fast Unicode character metadata lookup, ported from ICU
Support
Quality
Security
License
Reuse
Uscrapper is an OSINT tool that allows users to extract various personal information from a website. It leverages web scraping techniques and regular expressions to extract email addresses, social media links, author names, geolocations, phone numbers, and usernames from both hyperlinked and non-hyperlinked sources on the webpage.
Support
Quality
Security
License
Reuse
Selenium based web scraper to generate passwords list
Support
Quality
Security
License
Reuse
Node.js module for scraping images from the web.
Support
Quality
Security
License
Reuse
DMM影片内容刮削器
Support
Quality
Security
License
Reuse
Automatically exported from code.google.com/p/lingoes-extractor
Support
Quality
Security
License
Reuse
A Google/Bing Scraping tool for LinkedIn
Support
Quality
Security
License
Reuse
🎼 Expandable lyrics-scraping API for Java
Support
Quality
Security
License
Reuse
Fucking Search Engines Scraper - python library to scrap url's from search engines
Support
Quality
Security
License
Reuse
a
angel.co-companies-list-scrapingby iamtodor
Python 
47
Version:Current
License: No License (No License)
Support
Quality
Security
License
Reuse
Scrape highlights from kindle.amazon.com
Support
Quality
Security
License
Reuse
instaclient is a Python library for accessing Instagram's features. With this library, you can create Instagram Bots with ease and simplicity. The InstaClient takes advantage of the selenium library to execute tasks which are not allowed in the Instagram Graph API (such as sending DMs, scraping user's followers).
Support
Quality
Security
License
Reuse
Read an Amazon wishlist programmatically with Python
Support
Quality
Security
License
Reuse
This is a anti-scraping cracker for extracting apply information of one of Taiwan jobs recruiting website.
Support
Quality
Security
License
Reuse
R
Reddit-Image-Scraper-1.0by 2hands10fingers
Python 
46
Version:Current
License: No License (No License)
Scrapes/downloads a selected subreddit's posted images by a specified date range on http://reddit.com
Support
Quality
Security
License
Reuse
s
scraping-serviceby weld-io
REST API for scraping dynamic websites using Node.js, headless Chrome and Cheerio.
JavaScript
54
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
g
glassdoor-scraperby kelvinxuande
Web scraping the popular job listing site "Glassdoor" with Python and BeautifulSoup. Implemented from scratch.
Python
54
Updated: 2 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
E
Euro2016_TerminalAppby jctissier
:soccer: Instantly find :trophy:EURO 2016 live-streams & highlights, now a Web App!
HTML
54
Updated: 5 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
p
python_web_scrapingby telunyang
Web scraping using python, requests and selenium
Jupyter Notebook
54
Updated: 2 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
n
nba_scraperby mcbarlowe
A scraper to scrape the NBA API and compile a play by play file
Python
53
Updated: 4 y ago
License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
a
apartments-scraperby adinutzyc21
An apartments.com scraper using beautifulsoup4 and python
Python
53
Updated: 4 y ago
License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
d
diffbot-php-clientby Swader
[Deprecated - Maintenance mode - use APIs directly please!] The official Diffbot client library
PHP
53
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
n
newspaperjsby flickz
News extraction and scraping. Article Parsing
HTML
53
Updated: 3 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
I
Instagram-to-discordby fernandod1
Monitor instagram user account and automatically post new images to discord channel via a webhook. Working 2021!
Python
53
Updated: 4 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
l
linkedin-profile-scraperby toxtli
This is a complete profile scraper that returns a JSON file.
Python
52
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
l
laravel-intelligent-scraperby softonic
Service to scrape a web page easily without knowing their HTML structure.
PHP
52
Updated: 4 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
s
snapmap-archiverby king-millez
Download all Snapmaps content from a specific location.
Python
52
Updated: 2 y ago
License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
s
sherlockby orionmelt
Extract user info from their reddit comments and activity.
Python
51
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
g
google-data-analysesby matthieuheitz
A repo where I will put my scripts to analyze the data that Google collects about me.
Python
51
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
scraperby lonsty
图片爬取下载工具,极速爬取下载 站酷https://www.zcool.com.cn/, CNU 视觉 http://www.cnu.cc/ 设计师/用户 上传的 图片/照片/插画。
Python
51
Updated: 3 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
p
parlparseby mysociety
The scraper/parser that produces data for TheyWorkForYou, PublicWhip, etc
Python
51
Updated: 3 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
w
web-scraping-courseby rafikahmed
Python
51
Updated: 4 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
n
node-website-scraper-phantomby website-scraper
Plugin for website-scraper which returns html for dynamic websites using PhantomJS.
JavaScript
51
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
N
NScrapeby darrylwhitmore
A web scraping framework for .NET
C#
51
Updated: 3 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
i
Support
Quality
Security
License
Reuse
c
Support
Quality
Security
License
Reuse
c
california-coronavirus-scrapersby datadesk
The open-source web scrapers that feed the Los Angeles Times California coronavirus tracker.
Jupyter Notebook
51
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
t
tweet_scrapperby 5hirish
Scrape the Twitter frontend API without any authentication and restriction.
Python
50
Updated: 4 y ago
License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
n
node-anime-scraperby roflmuffin
Scrapes information from Gogoanime to get Anime, Episode & Video information & urls.
JavaScript
50
Updated: 4 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
N
NBAdataby Fossj117
Contains scraper in R for grabbing NBA Sport Tracking Data
R
50
Updated: 5 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
s
scrape_hnby dodger487
Scraping HN for TaggerNews
Shell
50
Updated: 4 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
k
kampus-scraperby gadingnst
Scraper & GraphQL API untuk data Perguruan Tinggi di Indonesia berdasarkan dari website Kementrian RISTEKDIKTI.
JavaScript
50
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
w
webspotby crawlab-team
An intelligent web service to automatically detect web content and extract information from it.
Python
50
Updated: 2 y ago
License: Permissive (BSD-3-Clause)
Support
Quality
Security
License
Reuse
s
scraplatby VillanCh
I'm trying to finish the scraplat as a scraper platform
Python
49
Updated: 4 y ago
License: Permissive (WTFPL)
Support
Quality
Security
License
Reuse
f
fb-page-chat-downloadby eisenjulian
Python script to download messages from a Facebook page to a CSV file
Python
49
Updated: 4 y ago
License: Strong Copyleft (AGPL-3.0)
Support
Quality
Security
License
Reuse
w
wg-gesucht-crawler-cliby grantwilliams
Python web crawler / scraper for WG-Gesucht. Crawls the WG-Gesucht site for new apartment listings and send a message to the poster, based off your saved filters and saved text
Python
49
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
e
extract-emailsby dmitriiweb
Extract emails from a given website
Python
49
Updated: 3 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
scrapersby montoyamoraga
scrapers for building your own image databases
Python
49
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
B
BridgeKeeperby 0xZDH
Scrape employee names from search engine LinkedIn profiles. Convert employee names to a specified username format.
Python
49
Updated: 4 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
u
uoft-scrapersby cobalt-uoft
Public web scraping scripts for the University of Toronto.
Python
49
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
u
unicode-trieby foliojs
A data structure for fast Unicode character metadata lookup, ported from ICU
JavaScript
49
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
U
Uscrapperby z0m31en7
Uscrapper is an OSINT tool that allows users to extract various personal information from a website. It leverages web scraping techniques and regular expressions to extract email addresses, social media links, author names, geolocations, phone numbers, and usernames from both hyperlinked and non-hyperlinked sources on the webpage.
Python
49
Updated: 2 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
w
words-scraperby dariusztytko
Selenium based web scraper to generate passwords list
Python
48
Updated: 4 y ago
License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
n
node-image-scraperby leon-vv
Node.js module for scraping images from the web.
JavaScript
48
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
b
Support
Quality
Security
License
Reuse
l
lingoes-extractorby PurlingNayuki
Automatically exported from code.google.com/p/lingoes-extractor
Java
47
Updated: 4 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
L
LeetLinkedby Sq00ky
A Google/Bing Scraping tool for LinkedIn
Python
47
Updated: 4 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
J
JLyricsby jagrosh
🎼 Expandable lyrics-scraping API for Java
Java
47
Updated: 2 y ago
License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
f
fsesby mthbernardes
Fucking Search Engines Scraper - python library to scrap url's from search engines
Python
47
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
a
angel.co-companies-list-scrapingby iamtodor
Python
47
Updated: 4 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse
k
kindle-your-highlightsby parroty
Scrape highlights from kindle.amazon.com
Ruby
47
Updated: 4 y ago
License: Permissive (MIT)
Support
Quality
Security
License
Reuse
i
instaclientby davidwickerhf
instaclient is a Python library for accessing Instagram's features. With this library, you can create Instagram Bots with ease and simplicity. The InstaClient takes advantage of the selenium library to execute tasks which are not allowed in the Instagram Graph API (such as sending DMs, scraping user's followers).
Python
47
Updated: 3 y ago
License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
w
wishlistby Jaymon
Read an Amazon wishlist programmatically with Python
Python
46
Updated: 2 y ago
License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
s
scraper-fourone-jobsby kokokuo
This is a anti-scraping cracker for extracting apply information of one of Taiwan jobs recruiting website.
Python
46
Updated: 4 y ago
License: Strong Copyleft (GPL-2.0)
Support
Quality
Security
License
Reuse
R
Reddit-Image-Scraper-1.0by 2hands10fingers
Scrapes/downloads a selected subreddit's posted images by a specified date range on http://reddit.com
Python
46
Updated: 4 y ago
License: No License (No License)
Support
Quality
Security
License
Reuse