Crawler Libraries - Page 10

weibo_scrapy by yoyzhou

Python 154 Version:Current
License: No License (No License)
Updated: 4 y ago

WEIBO_SCRAPY is a multi-threaded Sina Weibo data extraction framework in Python.

NCrawler by esbencarlsen

C# 154 Version:Current
License: Permissive (Apache-2.0)
Updated: 2 y ago

.NET-based web crawler

adstxtcrawler by InteractiveAdvertisingBureau

Python 154 Version:Current
License: No License (No License)
Updated: 2 y ago

A reference implementation in Python of a simple crawler for Ads.txt

corpuscrawler by google

Python 153 Version:Current
License: Proprietary (Proprietary)
Updated: 3 y ago

Crawler for linguistic corpora

JavPy by TheodoreKrypton

JavaScript 153 Version:Current
License: Permissive (Apache-2.0)
Updated: 3 y ago

Enjoy driving on a Javascriptive (originally Pythonic) way to Japanese AV!

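HttpCode.Core by stulzq

C# 153 Version:Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

A simple, easy-to-use, efficient, and opinionated open-source .NET HTTP request framework. It can be used to build crawlers, make API requests, and more.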

ZhihuQuestionsSpider by StevenKin

Java 152 Version:Current
License: Permissive (MIT)
Updated: 4 y ago

:blush::blush::blush: Zhihu question crawler

cocrawler by cocrawler

Python 152 Version:Current
License: Permissive (Apache-2.0)
Updated: 3 y ago

CoCrawler is a versatile web crawler built using modern tools and concurrency.

jlitespider by luohaha

Java 151 Version:Current
License: Permissive (Apache-2.0)
Updated: 4 y ago

A lite distributed Java spider framework :-)

spider by simapple

Python 151 Version:Current
License: No License (No License)
Updated: 3 y ago

Python crawler for rolling extraction of website URLs worldwide

EasyCSRF by 0ang3el

Python 151 Version:Current
License: No License (No License)
Updated: 4 y ago

netease-music-spider by wenhaoliang

Python 150 Version:Current
License: No License (No License)
Updated: 2 y ago

netease-music-spider is a spider with which you can find a beautiful girlfriend or a handsome boyfriend.

hncrawl by mvanveen

Python 150 Version:Current
License: Permissive (MIT)
Updated: 4 y ago

A scrapy-based Hacker News crawler.

scrapy_demo by BruceDone

Python 150 Version:Current
License: No License (No License)
Updated: 2 y ago

All kinds of scrapy demos

neocrawler by ahkimkoo

JavaScript 150 Version:Current
License: Proprietary (Proprietary)
Updated: 3 y ago

Node.js crawler that includes scheduling, spider, web UI configuration, and proxy modules. Built with Node.js, Redis/SSDB, HBase, and PhantomJS. Supports CSS selector extraction rules and regex extraction rules.

python-dcdownloader by dev-techmoe

Python 148 Version:Current
License: Permissive (MIT)
Updated: 3 y ago

A fully asynchronous batch manga downloader (crawler) for 动漫之家 (dmzj), written in Python

weibo_analysis by dingmyu

HTML 147 Version:Current
License: No License (No License)
Updated: 4 y ago

A Python crawler that automatically crawls the original microblogs and pictures of a specified user, analyzes the microblogs, and displays the results as HTML charts.

rpqueue by josiahcarlson

Python 146 Version:Current
License: Weak Copyleft (LGPL-2.1)
Updated: 2 y ago

Redis Priority Queue offers a priority/timeline based queue for use with Redis

scrape-linkedin by ericfourrier

Python 146 Version:Current
License: Permissive (MIT)
Updated: 3 y ago

Scrape a public LinkedIn profile.

evine by saeeddhqan

Go 146 Version:Current
License: Strong Copyleft (GPL-3.0)
Updated: 2 y ago

Interactive CLI Web Crawler

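Zhihu_bigdata by yoghurtjia

HTML 146 Version:Current
License: No License (No License)
Updated: 4 y ago

Data analysis of 3 million Zhihu users using scrapy and pandas. scrapy first crawls the profiles of 3 million Zhihu users; pandas then filters the data to find the top Zhihu users of interest, and the results are visualized as charts.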

crawler by trandoshan-io

Go 146 Version:Current
License: Strong Copyleft (GPL-3.0)
Updated: 4 y ago

Go process used to crawl websites

COVID-19-KSH by whhsky

JavaScript 146 Version:Current
License: No License (No License)
Updated: 2 y ago

HTML + Python + Django + crawler + pyecharts: real-time epidemic updates

scrapy-zhihu-users by ansenhuang

Python 145 Version:Current
License: No License (No License)
Updated: 4 y ago

Crawls Zhihu user data with scrapy

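CrawlerProject by LMFrank

Python 145 Version:Current
License: No License (No License)
Updated: 3 y ago

Crawler projects: Lianjia (plain/scrapy), Hupu, Wikipedia, Baidu Maps API, Fangtianxia (distributed crawler), WeChat official accounts (crawled via a proxy pool)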

haul by vinta

Python 145 Version:Current
License: Permissive (MIT)
Updated: 4 y ago

An Extensible Image Crawler

scrapy_guru by michael-yin

JavaScript 145 Version:Current
License: Strong Copyleft (GPL-3.0)
Updated: 4 y ago

Everybody can be scrapy guru

soksaccounts by chenjiandongx

Python 144 Version:Current
License: No License (No License)
Updated: 4 y ago

🔥 Shadowsocks account crawler

tieba_sign by Aruelius

Python 144 Version:Current
License: Permissive (MIT)
Updated: 3 y ago

📱 Baidu Tieba multi-threaded QR-code login / automatic sign-in / automatic CAPTCHA solving

pornhubbot by levphon

Python 144 Version:Current
License: Permissive (MIT)
Updated: 2 y ago

A Python 3 crawler for the pornhub website

Python 144 Version:Current
License: No License (No License)
Updated: 4 y ago

A crawler for information on the Sina Weiqun website (WAP), including a given Weiqun's posts, replies, users, and their follow relations. Written in Python 2.7.1; data is stored in SQLite3. The relation-crawling part is customized from the GitHub project sina_reptile.

galer by dwisiswant0

Shell 144 Version:Current
License: Permissive (MIT)
Updated: 3 y ago

A fast tool to fetch URLs from HTML attributes by crawl-in.

massivedl by dimkouv

Go 144 Version:Current
License: Strong Copyleft (GPL-3.0)
Updated: 3 y ago

Download a large list of files concurrently

crawler by bplawler

Scala 144 Version:Current
License: Proprietary (Proprietary)
Updated: 4 y ago

Scala DSL for web crawling

weibosearch by tpeng

Python 143 Version:Current
License: No License (No License)
Updated: 4 y ago

A distributed Sina Weibo Search spider based on Scrapy and Redis.

douban-group-spider by kaito-kidd

Python 143 Version:Current
License: No License (No License)
Updated: 1 y ago

A crawler for Douban group posts.

qiandao by bonfy

Python 143 Version:Current
License: No License (No License)
Updated: 3 y ago

🌟⏳🌟 Sign-in (check-in) for various websites (no longer maintained)

nsfc by suqingdong

Python 143 Version:Current
License: Proprietary (Proprietary)
Updated: 2 y ago

National Natural Science Foundation of China (NSFC) lookup

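DouyuBarrage-Pro by Crawler995

TypeScript 143 Version:Current
License: No License (No License)
Updated: 2 y ago

(Latest for 2020) Second edition of a Douyu danmaku (bullet comment) capture and visual management platform. Features include danmaku capture, real-time visualization of the danmaku send rate, capture history lookup, danmaku download, custom keyword statistics, loyal-fan statistics, automatic capture of highlight moments, and word clouds of high-frequency danmaku. Take off~~~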

RecordWav by shaoshuai904

Java 141 Version:Current
License: No License (No License)
Updated: 1 y ago

Android WAV/PCM audio recorder that supports pausing and resuming recording, plus a mode that skips silent sections.

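NGCBot by ngc660sec

Python 141 Version:Current
License: Strong Copyleft (GPL-3.0)
Updated: 1 y ago

A WeChat bot based on a ✨hook mechanism. It supports 🌱scheduled security news pushes [FreeBuf, 先知 (Xianzhi), 安全客 (Anquanke), QiAnXin attack-and-defense community], 👯file extension lookup, ⚡ICP filing lookup, ⚡phone number location lookup, ⚡WHOIS lookup, 🎉horoscope lookup, ⚡weather lookup, 🌱a slacking-off calendar, ⚡微步 (ThreatBook) threat intelligence lookup, 🐛videos of pretty girls, ⚡pictures of pretty girls, and a 👯help menu. 📫 It supports a points system, 😄is highly customizable, and is easy for beginners to pick up!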

IPProxy by ZKeeer

Python 140 Version:Current
License: No License (No License)
Updated: 3 y ago

IP proxies for crawlers: scrapes proxy IPs from nine websites, then validates, cleans, stores, and updates them, and adds an interface for callers.

kuaishou-crawler by oGsLP

Python 140 Version:Current
License: Permissive (MIT)
Updated: 3 y ago

As you can see, a Kuaishou crawler

Facebook-Page-Crawler by chenjr0719

Python 140 Version:Current
License: Permissive (MIT)
Updated: 3 y ago

A Python crawler that uses the Facebook Graph API to crawl a fan page's public posts, comments, and reactions.

PornSpider by QuantumLiu

HTML 140 Version:Current
License: Strong Copyleft (GPL-3.0)
Updated: 3 y ago

A parallel web spider for the adult website PornHub.

go-crawler by liunian1004

Go 140 Version:Current
License: No License (No License)
Updated: 4 y ago

Useful crawler project for practice

images_spider by 5ime

PHP 140 Version:Current
License: No License (No License)
Updated: 2 y ago

Watermark removal for short-video photo albums and images: Kuaishou, Pipixia, Zuiyou, Xiaohongshu, Weibo

mm131 by qwertyuiop6

Python 137 Version:Current
License: No License (No License)
Updated: 3 y ago

Image crawler for the MM131 website :rotating_light:

spotlight.js by bestiejs

JavaScript 137 Version:Current
License: Permissive (MIT)
Updated: 4 y ago

An object crawler/property search library.

rubyretriever by joenorton

Ruby 137 Version:Current
License: Proprietary (Proprietary)
Updated: 4 y ago

Asynchronous Web Crawler & Scraper

