Module for calling the Bilibili API
Anemone web-spider framework
👧 Crawler for photo-set galleries of female models (part 2)
gecco by xtuhcy
Easy-to-use lightweight web crawler
Java | 2444 | Updated: 2 y ago | License: Permissive (MIT)
weibo-crawler by dataabc
Sina Weibo crawler: uses Python to scrape Sina Weibo data and download Weibo images and videos
Python | 2405 | Updated: 1 y ago | License: No License (No License)
ns-emu-tools by triwinds
A tool for installing/updating NS emulators
Python | 2363 | Updated: 1 y ago | License: Strong Copyleft (AGPL-3.0)
crawler by spatie
An easy to use, powerful crawler implemented in PHP. Can execute JavaScript.
PHP | 2354 | Updated: 1 y ago | License: Permissive (MIT)
go-demo by pibigstar
Go tutorial by example, from beginner to advanced, covering basic library usage, design patterns, common interview pitfalls, utility classes, third-party integrations, and more
Go | 2126 | Updated: 2 y ago | License: Permissive (MIT)
abot by sjdirect
Cross-platform C# web crawler framework built for speed and flexibility. Please star this project! +1.
C# | 2071 | Updated: 2 y ago | License: Permissive (Apache-2.0)
geziyor by geziyor
Geziyor, blazing fast web crawling & scraping framework for Go. Supports JS rendering.
Go | 2047 | Updated: 1 y ago | License: Weak Copyleft (MPL-2.0)
gain by gaojiuli
Web crawling framework based on asyncio.
Python | 2016 | Updated: 2 y ago | License: Strong Copyleft (GPL-3.0)
DXY-COVID-19-Crawler by BlankerL
COVID-19/2019-nCoV Realtime Infection Crawler and API
Python | 2011 | Updated: 2 y ago | License: Permissive (MIT)
gospider by jaeles-project
Gospider - Fast web spider written in Go
Go | 1993 | Updated: 1 y ago | License: Permissive (MIT)
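Aria2 by itgoyo
A collection of methods for getting around Baidu Cloud download speed limits, with tutorials for Baidu-Go, Tampermonkey, and Proxyee-down. From now on your "cloud girlfriend" never drops the connection; once you have this, forget about that other one!
JavaScript | 1956 | Updated: 2 y ago | License: No License (No License)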
gocrawl by PuerkitoBio
Polite, slim and concurrent web crawler.
Go | 1938 | Updated: 3 y ago | License: Permissive (BSD-3-Clause)
SeimiCrawler by zhegexiaohuozi
A simple, agile, distributed Java crawler framework with SpringBoot support.
Java | 1904 | Updated: 1 y ago | License: Permissive (Apache-2.0)
feapder by Boris-code
🚀🚀🚀 feapder is an easy to use, powerful Python crawler framework. It has four built-in spider types (AirSpider, Spider, TaskSpider, BatchSpider) for different scenarios, and supports resumable crawling, monitoring and alerting, browser rendering, and large-scale data deduplication. The companion crawler management system feaplat provides convenient deployment and scheduling.
Python | 1839 | Updated: 1 y ago | License: Proprietary (Proprietary)
Image-Downloader by QianyanTech
Download images from Google, Bing, Baidu.
Python | 1817 | Updated: 2 y ago | License: Permissive (MIT)
go_spider by hu17889
[Crawler framework (golang)] An awesome Go concurrent crawler (spider) framework. The crawler is flexible and modular. It can easily be extended into a customized crawler, or you can use only the default crawl components.
Go | 1807 | Updated: 2 y ago | License: Weak Copyleft (MPL-2.0)
go-sniffer by 40t
🔎 Sniffs and parses MySQL, Redis, HTTP, MongoDB, and other protocols. Captures the database requests made by a project and parses them into the corresponding statements.
Go | 1798 | Updated: 1 y ago | License: Permissive (MIT)
Crawler-Detect by JayBizzle
🕷 CrawlerDetect is a PHP class for detecting bots/crawlers/spiders via the user agent
PHP | 1779 | Updated: 1 y ago | License: Permissive (MIT)
PSpider by xianhu
A simple, easy-to-use Python crawler framework. QQ discussion group: 597510560
Python | 1766 | Updated: 2 y ago | License: Permissive (BSD-2-Clause)
Image-Downloader by sczhengyabin
Download images from Google, Bing, Baidu.
Python | 1707 | Updated: 2 y ago | License: Permissive (MIT)
ruia by howie6879
Async Python 3.6+ web scraping micro-framework based on asyncio
Python | 1680 | Updated: 1 y ago | License: Permissive (Apache-2.0)
scrapoxy by fabienvauchelles
Scrapoxy hides your scraper behind a cloud. It starts a pool of proxies to send your requests. Now, you can crawl without thinking about blacklisting!
JavaScript | 1561 | Updated: 2 y ago | License: No License (No License)
Edit-This-Cookie by ETCExtensions
EditThisCookie is the famous Google Chrome/Chromium extension for editing cookies
JavaScript | 1549 | Updated: 2 y ago | License: Proprietary (Proprietary)
lightcrawler by github
Crawl a website and run it through Google Lighthouse
JavaScript | 1470 | Updated: 2 y ago | License: Permissive (ISC)
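Reptile by librauee
🏀 Python 3 web crawler practice projects (some with detailed tutorials): Maoyan, Tencent Video, Douban, Yanzhao (graduate admissions site), Weibo, Biquge novels, Baidu hot topics, Bilibili, CSDN, NetEase Cloud Reading, Ali Literature, Baidu Stocks, Toutiao, WeChat official accounts, NetEase Cloud Music, Lagou, Youdao, unsplash, Shixiseng, Autohome, League of Legends Box, Dianping, Lianjia, LPL schedule, typhoons, the Cangbaoge marketplaces for Fantasy Westward Journey and Onmyoji, weather, Nowcoder, Baidu Wenku, bedtime stories, Zhihu, Wish
Python | 1455 | Updated: 1 y ago | License: No License (No License)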
php-spider by mvdbos
A configurable and extensible PHP web spider
PHP | 1291 | Updated: 2 y ago | License: Permissive (MIT)
jd-autobuy by adyzng
Python crawler for JD.com: automatic login and online snap-up purchasing of products
Python | 1269 | Updated: 1 y ago | License: No License (No License)
WeiboSuperSpider by Python3Spiders
Weibo crawler and companion toolbox covering collection of Weibo users, topics, and comments. Also includes image download, sentiment analysis, geolocation, relationship networks, spammer/bot detection, and more. Docs: https://buyixiao.github.io/blog/weibo-super-spider.html Companion visualization site: https://buyixiao.github.io/blog/one-stop-weibo-visualization.html
Python | 1218 | Updated: 2 y ago | License: Permissive (Apache-2.0)
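QZoneExport by ShunCai
QZone export assistant: backs up QZone status posts, blog entries, private diaries, albums, videos, message board, QQ friends, favorites, shares, and recent visitors to files for easy migration and preservation
JavaScript | 1210 | Updated: 1 y ago | License: Permissive (Apache-2.0)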
Beanbun by kiddyuchina
Beanbun is a multi-process web crawler framework written in PHP, open and highly extensible, based on Workerman.
PHP | 1206 | Updated: 1 y ago | License: Permissive (MIT)
fscrawler by dadoonet
Elasticsearch File System Crawler (FS Crawler)
Java | 1197 | Updated: 2 y ago | License: Permissive (Apache-2.0)
jd-autobuy by Adyzng
Python crawler for JD.com: automatic login and online snap-up purchasing of products
Python | 1197 | Updated: 3 y ago | License: No License (No License)
core by roach-php
The complete web scraping toolkit for PHP.
PHP | 1188 | Updated: 1 y ago | License: No License (No License)
tumblr-crawler by dixudx
Easily download all the photos/videos from specified Tumblr blogs.
Python | 1146 | Updated: 1 y ago | License: No License (No License)
bilix by HFrost0
⚡️ Lightning-fast async download tool for bilibili and more
Python | 1141 | Updated: 1 y ago | License: Permissive (Apache-2.0)
OpenWPM by mozilla
A web privacy measurement framework
Python | 1137 | Updated: 3 y ago | License: Proprietary (Proprietary)
django-dynamic-scraper by holgerd77
Creating Scrapy scrapers via the Django admin interface
Python | 1121 | Updated: 1 y ago | License: Permissive (BSD-3-Clause)
scrapy-cluster by istresearch
This Scrapy project uses Redis and Kafka to create a distributed on-demand scraping cluster.
Python | 1114 | Updated: 2 y ago | License: Permissive (MIT)
BaiduyunSpider by k1995
Search engine for Baidu Cloud network-disk files, including the crawler and the website
JavaScript | 1052 | Updated: 2 y ago | License: No License (No License)
JSpider by ScrapingBoot
JSpider publishes the JS decryption method for at least one website every week. Stars welcome. WeChat contact: 13298307816
JavaScript | 1038 | Updated: 1 y ago | License: No License (No License)
instagram-profilecrawl by InstaPy
📝 Quickly crawl the information (e.g. followers, tags, etc.) of an Instagram profile.
Python | 1027 | Updated: 1 y ago | License: Permissive (MIT)
V2RayCloudSpider by QIN2DIM
🚀 Collects free, high-quality subscription links. Circumventing internet restrictions, starting from an early age!
Python | 1023 | Updated: 1 y ago | License: Strong Copyleft (GPL-3.0)
sketchy by Netflix-Skunkworks
A task-based API for taking screenshots and scraping text from websites.
JavaScript | 990 | Updated: 4 y ago | License: Permissive (Apache-2.0)
grab-site by ArchiveTeam
The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns
Python | 983 | Updated: 2 y ago | License: Proprietary (Proprietary)
spider by gsh199449
A configurable web spider with an easy-to-use web console
Java | 958 | Updated: 3 y ago | License: Strong Copyleft (GPL-3.0)
kimuraframework by vifreefly
Kimurai is a modern web scraping framework written in Ruby which works out of the box with headless Chromium/Firefox, PhantomJS, or simple HTTP requests and allows you to scrape and interact with JavaScript-rendered websites
Ruby | 952 | Updated: 2 y ago | License: Permissive (MIT)