🎊 Design and implement of lightweight crawler framework.
Support
Quality
Security
License
Reuse
An R web crawler and scraper
Support
Quality
Security
License
Reuse
A lil' bookmarklet that will strip out your CSS3 rules and show you how gracefully you're degrading.
Support
Quality
Security
License
Reuse
天眼查爬虫&企查查爬虫,指定关键字爬取公司信息
Support
Quality
Security
License
Reuse
A golang utility to spider through a website searching for additional links.
Support
Quality
Security
License
Reuse
狠心开源企业级舆情新闻爬虫项目:支持任意数量爬虫一键运行、爬虫定时任务、爬虫批量删除;爬虫一键部署;爬虫监控可视化; 配置集群爬虫分配策略;👉 现成的docker一键部署文档已为大家踩坑
Support
Quality
Security
License
Reuse
Unofficial API for PornHub.com in Python
Support
Quality
Security
License
Reuse
The simple, easy to use command line web crawler.
Support
Quality
Security
License
Reuse
一个基于webmagic框架二次开发的java爬虫框架实战,已实现能爬取腾讯,搜狐,今日头条(单独集成功能)等资讯内容,配合elasticsearch框架用法,实现了自动爬虫,已投入线上生产使用。
Support
Quality
Security
License
Reuse
php 写的视频下载工具,现已支持:Youku、Miaopai、腾讯、XVideos、Pornhub、91porn、微博酷燃、bilibili、今日头条、芒果TV
Support
Quality
Security
License
Reuse
🕷 A lightning fast multithreaded network scanner framework with modules.
Support
Quality
Security
License
Reuse
information gathering via dorks
Support
Quality
Security
License
Reuse
《Python爬虫开发 从入门到实战》配套源代码。
Support
Quality
Security
License
Reuse
geetest,滑动验证码
Support
Quality
Security
License
Reuse
GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Support
Quality
Security
License
Reuse
一个知乎爬虫,登陆,获取答案,图片
Support
Quality
Security
License
Reuse
Crawlera middleware for Scrapy
Support
Quality
Security
License
Reuse
知乎爬虫/可以爬出关注关系的爬虫
Support
Quality
Security
License
Reuse
Zhihu Daily Web GoLang
Support
Quality
Security
License
Reuse
一个灵活、友好的爬虫框架
Support
Quality
Security
License
Reuse
爬蟲極簡教學(fetch, parse, search, multiprocessing, API)- PTT 為例
Support
Quality
Security
License
Reuse
A Laravel wrapper for CrawlerDetect - the web crawler detection library
Support
Quality
Security
License
Reuse
淘宝/天猫预售幸运值/京东拆快递618任务助手AutoJS脚本源码。淘宝/淘特/天猫/京东/京喜/拼多多/唯品会/苏宁易购/考拉海购/抖音/快手内部优惠券。 饿了么/美团外卖红包,高德打车/花小猪/滴滴打车/滴滴货运/滴滴代驾/滴滴加油优惠券。肯德基/麦当劳/汉堡王/必胜客/星巴克/瑞幸咖啡/喜茶/奈雪的茶优惠券红包。 优酷/爱奇艺/腾讯视频/芒果TV/哔哩哔哩/QQ音乐/网易云音乐/喜马拉雅/京东PLUS/迅雷/知网/腾讯体育/百度文库/百度网盘超级SVIP会员半价/账号解析下载共享。秒杀软件/抢购软件/抢购助手项目集合。
Support
Quality
Security
License
Reuse
blacksheepwall is a hostname reconnaissance tool
Support
Quality
Security
License
Reuse
Emby 增强/美化 插件 (适用于 Chrome 内核浏览器 / EmbyServer)
Support
Quality
Security
License
Reuse
scan book's ISBN to get the information of this book
Support
Quality
Security
License
Reuse
Crawlera middleware for Scrapy
Support
Quality
Security
License
Reuse
A PHP web crawler framework
Support
Quality
Security
License
Reuse
B
Bulk-Bing-Image-downloaderby ostrolucky
Python 277 Version:Current License: Strong Copyleft (GPL-2.0)
Download full sized images returned from bing image search
Support
Quality
Security
License
Reuse
Quick and dirty web crawling.
Support
Quality
Security
License
Reuse
爬取微博内评论,获取评论内容和图片信息
Support
Quality
Security
License
Reuse
百度贴吧爬虫(基于scrapy和mysql)
Support
Quality
Security
License
Reuse
Crawler of zhihu.com
Support
Quality
Security
License
Reuse
批量ShiroKey检测爆破工具
Support
Quality
Security
License
Reuse
厦门大学每日健康打卡程序,在网站上即可实现打卡,支持失败自动重试,支持邮件通知功能。
Support
Quality
Security
License
Reuse
🔧 🔩 🔨 收集整理了爬虫相关的工具、模拟登陆技术、代理IP、scrapy模板代码等内容。
Support
Quality
Security
License
Reuse
C# Crawler 多线程爬虫程序,支持正则表达式过滤、关键字过滤、正文内容识别等等
Support
Quality
Security
License
Reuse
Công cụ quét và phân tích từ khoá các trang báo mạng Việt Nam
Support
Quality
Security
License
Reuse
纯 PHP 开发的并行抓取工具 (Parallel web crawler written in PHP)
Support
Quality
Security
License
Reuse
大众点评爬虫、API,可以进行单独城市、单独地区、单独商铺的爬取、搜索、多类型地区搜索、信息获取、提供MongoDB数据库存储支持,可以进行点评文本解密的爬取、存储
Support
Quality
Security
License
Reuse
批量查询ip对应域名及百度权重、备案信息;ip反查域名;ip查备案信息;资产归属查询;百度权重查询
Support
Quality
Security
License
Reuse
Fierce.pl Domain Scanner
Support
Quality
Security
License
Reuse
中国场外基金数据爬取&汇总分析
Support
Quality
Security
License
Reuse
Download weibo images without logging-in
Support
Quality
Security
License
Reuse
This is a Multi-thread crawler for Tumblr.
Support
Quality
Security
License
Reuse
Extracting URLs of a specific target based on the results of "commoncrawl.org"
Support
Quality
Security
License
Reuse
这是一个作者毕业设计的爬虫,爬取58同城、赶集网、链家、安居客、我爱我家网站的房价交易数据。
Support
Quality
Security
License
Reuse
腾讯新闻、知乎话题、微博粉丝,Tumblr爬虫、斗鱼弹幕、妹子图爬虫、分布式设计等
Support
Quality
Security
License
Reuse
CheckIPTools 扫描谷歌IP以及实用IP转换小工具
Support
Quality
Security
License
Reuse
直接通过链家 API 抓取数据的极速爬虫,宇宙最快~~ 🚀
Support
Quality
Security
License
Reuse
e
elvesby biezhi
🎊 Design and implement of lightweight crawler framework.
Java 319Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
R
Rcrawlerby salimk
An R web crawler and scraper
R 318Updated: 2 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
d
deCSS3by davatron5000
A lil' bookmarklet that will strip out your CSS3 rules and show you how gracefully you're degrading.
JavaScript 317Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
c
company-crawlerby bouxinLou
天眼查爬虫&企查查爬虫,指定关键字爬取公司信息
Python 317Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
u
urlgrabby IAmStoxe
A golang utility to spider through a website searching for additional links.
Go 314Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
N
NewsCrawlby casual-silva
狠心开源企业级舆情新闻爬虫项目:支持任意数量爬虫一键运行、爬虫定时任务、爬虫批量删除;爬虫一键部署;爬虫监控可视化; 配置集群爬虫分配策略;👉 现成的docker一键部署文档已为大家踩坑
Python 314Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
p
pornhub-apiby sskender
Unofficial API for PornHub.com in Python
Python 309Updated: 1 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
spidyby rivermont
The simple, easy to use command line web crawler.
Python 309Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
j
java-spiderby hemin1003
一个基于webmagic框架二次开发的java爬虫框架实战,已实现能爬取腾讯,搜狐,今日头条(单独集成功能)等资讯内容,配合elasticsearch框架用法,实现了自动爬虫,已投入线上生产使用。
Java 301Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
p
phpVideosby Mickeyto
php 写的视频下载工具,现已支持:Youku、Miaopai、腾讯、XVideos、Pornhub、91porn、微博酷燃、bilibili、今日头条、芒果TV
PHP 301Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
p
portSpiderby xdavidhu
🕷 A lightning fast multithreaded network scanner framework with modules.
Python 300Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
snitchby Smaash
information gathering via dorks
Python 299Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
S
SourceCodeOfBookby kingname
《Python爬虫开发 从入门到实战》配套源代码。
Python 298Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
g
Support
Quality
Security
License
Reuse
g
gopaby infinitbyte
GOPA, a spider written in Golang, for Elasticsearch. DEMO: http://index.elasticsearch.cn
Go 295Updated: 2 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
p
Support
Quality
Security
License
Reuse
s
scrapy-zyte-smartproxyby scrapy-plugins
Crawlera middleware for Scrapy
Python 291Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
Z
Support
Quality
Security
License
Reuse
G
GO-ZhihuDailyby LeiHao0
Zhihu Daily Web GoLang
Go 289Updated: 4 y ago License: Strong Copyleft (GPL-2.0)
Support
Quality
Security
License
Reuse
S
Support
Quality
Security
License
Reuse
C
CrawlerTutorialby leVirve
爬蟲極簡教學(fetch, parse, search, multiprocessing, API)- PTT 為例
Python 287Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
L
Laravel-Crawler-Detectby JayBizzle
A Laravel wrapper for CrawlerDetect - the web crawler detection library
PHP 287Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
x
xbby omxmo
淘宝/天猫预售幸运值/京东拆快递618任务助手AutoJS脚本源码。淘宝/淘特/天猫/京东/京喜/拼多多/唯品会/苏宁易购/考拉海购/抖音/快手内部优惠券。 饿了么/美团外卖红包,高德打车/花小猪/滴滴打车/滴滴货运/滴滴代驾/滴滴加油优惠券。肯德基/麦当劳/汉堡王/必胜客/星巴克/瑞幸咖啡/喜茶/奈雪的茶优惠券红包。 优酷/爱奇艺/腾讯视频/芒果TV/哔哩哔哩/QQ音乐/网易云音乐/喜马拉雅/京东PLUS/迅雷/知网/腾讯体育/百度文库/百度网盘超级SVIP会员半价/账号解析下载共享。秒杀软件/抢购软件/抢购助手项目集合。
JavaScript 287Updated: 1 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
b
blacksheepwallby tomsteele
blacksheepwall is a hostname reconnaissance tool
Go 286Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
e
emby-crxby Nolovenodie
Emby 增强/美化 插件 (适用于 Chrome 内核浏览器 / EmbyServer)
JavaScript 286Updated: 1 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
S
ScanBookby JayFang1993
scan book's ISBN to get the information of this book
Java 285Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
scrapy-crawleraby scrapy-plugins
Crawlera middleware for Scrapy
Python 285Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
p
phpfetcherby fanfank
A PHP web crawler framework
HTML 279Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
B
Bulk-Bing-Image-downloaderby ostrolucky
Download full sized images returned from bing image search
Python 277Updated: 3 y ago License: Strong Copyleft (GPL-2.0)
Support
Quality
Security
License
Reuse
h
Support
Quality
Security
License
Reuse
w
weibo_spiderby python3xxx
爬取微博内评论,获取评论内容和图片信息
Python 274Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
T
Tieba_Spiderby Aqua-Dream
百度贴吧爬虫(基于scrapy和mysql)
Python 272Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
z
Support
Quality
Security
License
Reuse
s
Support
Quality
Security
License
Reuse
X
XMU-Daily-Reporterby poor-circle
厦门大学每日健康打卡程序,在网站上即可实现打卡,支持失败自动重试,支持邮件通知功能。
C++ 266Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
h
happy-spidersby HITFRobot
🔧 🔩 🔨 收集整理了爬虫相关的工具、模拟登陆技术、代理IP、scrapy模板代码等内容。
Python 265Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
S
SimpleCrawlerby lei-zhu
C# Crawler 多线程爬虫程序,支持正则表达式过滤、关键字过滤、正文内容识别等等
C# 264Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
d
docbaoby hailoc12
Công cụ quét và phân tích từ khoá các trang báo mạng Việt Nam
Python 263Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
p
pspiderby hightman
纯 PHP 开发的并行抓取工具 (Parallel web crawler written in PHP)
PHP 263Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
D
DPspiderby 01ly
大众点评爬虫、API,可以进行单独城市、单独地区、单独商铺的爬取、搜索、多类型地区搜索、信息获取、提供MongoDB数据库存储支持,可以进行点评文本解密的爬取、存储
HTML 263Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
i
ip2domainby Sma11New
批量查询ip对应域名及百度权重、备案信息;ip反查域名;ip查备案信息;资产归属查询;百度权重查询
Python 262Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
f
fierce-domain-scannerby davidpepper
Fierce.pl Domain Scanner
Perl 261Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
c
chinese-fund-crawlerby jackluson
中国场外基金数据爬取&汇总分析
Python 260Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
w
weiboPicDownloaderby nondanee
Download weibo images without logging-in
Python 259Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
T
Tumblr_Crawlerby sparrow629
This is a Multi-thread crawler for Tumblr.
Python 259Updated: 4 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
c
cc.pyby si9int
Extracting URLs of a specific target based on the results of "commoncrawl.org"
Python 258Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
F
Fang_Scrapyby lihansunbai
这是一个作者毕业设计的爬虫,爬取58同城、赶集网、链家、安居客、我爱我家网站的房价交易数据。
Python 257Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
awesome_crawlby zhangslob
腾讯新闻、知乎话题、微博粉丝,Tumblr爬虫、斗鱼弹幕、妹子图爬虫、分布式设计等
Python 255Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
c
checkiptoolsby xyuanmu
CheckIPTools 扫描谷歌IP以及实用IP转换小工具
Python 253Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
F
Fast-LianJia-Crawlerby CaoZ
直接通过链家 API 抓取数据的极速爬虫,宇宙最快~~ 🚀
Python 252Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse