Scraper Libraries

FILTER

LANGUAGES

All

LICENSES

All

COMPONENT TYPES

All

SUPPORT

All

SOURCES

All

SECURITY

All

INDUSTRIES

All
Click on the libraries for details

Sort by

Relevance
y

you-getby soimort

:arrow_double_down: Dumb downloader that scrapes the web

Python Updated: 3 mo ago License: Proprietary

Support
Quality
Security
License
Reuse
r

requests-htmlby psf

Pythonic HTML Parsing for Humans™

Python Updated: 4 d ago License: Permissive

Support
Quality
Security
License
Reuse
t

twintby twintproject

An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.

Python Updated: 4 mo ago License: Permissive

Support
Quality
Security
License
Reuse
n

newspaperby codelucas

News, full-text, and article metadata extraction in Python 3. Advanced docs:

Python Updated: 3 mo ago License: Proprietary

Support
Quality
Security
License
Reuse
G

Goutteby FriendsOfPHP

Goutte, a simple PHP Web Scraper

PHP Updated: 3 mo ago License: Permissive

Support
Quality
Security
License
Reuse
p

portiaby scrapinghub

Visual scraping for Scrapy

Python Updated: 6 mo ago License: Proprietary

Support
Quality
Security
License
Reuse
x

x-rayby matthewmueller

The next web scraper. See through the <html> noise.

JavaScript Updated: 6 mo ago License: Permissive

Support
Quality
Security
License
Reuse
i

instagram-scraperby arc298

Scrapes an instagram user's photos and videos

Python Updated: 5 mo ago License: Permissive

Support
Quality
Security
License
Reuse
f

ferretby MontFerret

Declarative web scraping

Go Updated: 4 d ago License: Permissive

Support
Quality
Security
License
Reuse
s

scrape-itby IonicaBizau

🔮 A Node.js scraper for humans.

JavaScript Updated: 6 mo ago License: Permissive

Support
Quality
Security
License
Reuse
p

python-gooseby grangier

Html Content / Article Extractor, web scrapping lib in Python

HTML Updated: 6 mo ago License: Permissive

Support
Quality
Security
License
Reuse
a

autoscraperby alirezamika

A Smart, Automatic, Fast and Lightweight Web Scraper for Python

Python Updated: 6 mo ago License: Permissive

Support
Quality
Security
License
Reuse
p

python-scrapingby REMitchell

Code samples from the book Web Scraping with Python http://shop.oreilly.com/product/0636920034391.do

Jupyter Notebook Updated: 4 mo ago License: No License

Support
Quality
Security
License
Reuse
i

instaloaderby instaloader

Download pictures (or videos) along with their captions and other metadata from Instagram.

Python Updated: 3 mo ago License: Permissive

Support
Quality
Security
License
Reuse
s

scrapy-splashby scrapy-plugins

Scrapy+Splash for JavaScript integration

Python Updated: 6 mo ago License: Proprietary

Support
Quality
Security
License
Reuse
O

OnlyFansby DIGITALCRIMINAL

Scrape all the media from an OnlyFans account - Updated regularly

Python Updated: 4 mo ago License: Strong Copyleft

Support
Quality
Security
License
Reuse
c

cloudflare-scrapeby Anorov

A Python module to bypass Cloudflare's anti-bot page.

Python Updated: 6 mo ago License: Permissive

Support
Quality
Security
License
Reuse
t

thalby emadehsan

Getting started with Puppeteer and Chrome Headless for Web Scraping

JavaScript Updated: 6 mo ago License: Permissive

Support
Quality
Security
License
Reuse
i

instagram-scraperby realsirjoe

scrapes medias, likes, followers, tags and all metadata. Inspired by instagram-php-scraper,bot

Python Updated: 4 d ago License: Permissive

Support
Quality
Security
License
Reuse
u

uptonby propublica

A batteries-included framework for easy web-scraping. Just add CSS! (Or do more.)

HTML Updated: 8 mo ago License: Permissive

Support
Quality
Security
License
Reuse