Slider CAPTCHA; hope it helps you ❤️
Web crawler, HTTP proxy, simulated login!
Everyday little scripts; lazy people have more fun.
A dungeon crawler
A Python crawler for driving traffic to Amazon products
Find broken links in webpage
Java web data crawler package
A spider... ^.^
A simple distributed crawler framework
Real-time rental listing alerts for Ziroom (自如)
not-your-average-web-crawler by tijme
A web crawler (for bug hunting) that gathers more than you can imagine.
Python | 110 | Updated: 4 y ago | License: Permissive (MIT)
SinaWeiboSpider by wen-fei
A web spider for Sina Weibo, based on the Scrapy framework and a MongoDB database.
Python | 109 | Updated: 4 y ago | License: Permissive (MIT)
vipMusic by weitw
Python crawler that downloads some VIP music (NetEase Cloud Music, Kugou, QQ Music).
Python | 109 | Updated: 2 y ago | License: No License (No License)
blinkist-m4a-downloader by luckylittle
Grabs all of the audio files from all of the Blinkist books
Go | 109 | Updated: 2 y ago | License: Permissive (MIT)
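django-covid19 by leafcoder
Real-time API for novel coronavirus pneumonia (2019-nCoV / Covid-19) outbreak data and overall statistics for cities and provinces in China and for countries, with newly added per-state US statistics and a daily outbreak data API. The crawler tracks changes in the outbreak in real time; data comes from DXY (丁香园) and covidtracking.com. Dashboard demo: http://ncov.leafcoder.cn/demo/ Project docs: http://ncov.leafcoder.cn/docs/
Python | 108 | Updated: 3 y ago | License: Permissive (MIT)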
redis-lock by mlanett
Pessimistic locking using Redis
Ruby | 108 | Updated: 4 y ago | License: Permissive (MIT)
zhihu-spider by wuomzfx
Zhihu crawler that periodically tracks question data and pushes trending topics on a schedule.
JavaScript | 108 | Updated: 4 y ago | License: No License (No License)
yue-spider by 3394772548
Crawlers for the mobile Taobao app and the Xianyu app, for learning and testing.
Go | 108 | Updated: 3 y ago | License: No License (No License)
krawler by brianmadden
A web crawling framework written in Kotlin
Kotlin | 108 | Updated: 3 y ago | License: Permissive (MIT)
scrapy-inline-requests by rmax
A decorator to write coroutine-like spider callbacks.
Python | 107 | Updated: 2 y ago | License: Permissive (MIT)
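go-pkg-spider by suosi-inc
A relatively intelligent, general-purpose data extraction library for news websites, implemented in Golang, that requires no rule maintenance. It includes components for domain probing, web page encoding and language detection, classified link extraction, news element extraction, and news body text extraction.
Go | 107 | Updated: 1 y ago | License: Permissive (Apache-2.0)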
web_scraping_and_data_analysis by keejo125
Web crawling and data analysis: crawlers for Dangdang, Douban, Zhihu, Maoyan, WeChat official accounts, the Lenovo official site, and Toutiao.
Python | 106 | Updated: 3 y ago | License: Permissive (MIT)
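python-spiders by tinygeeker
🌈 Python web crawling in practice: Honor of Kings ultra-HD wallpapers, watermark-free Douyin videos, M3U8 streaming videos, the Zhengfang system, financial statements, pictures of attractive women and men, CSDN view counts, Taobao, JD.com, NetEase Cloud Music, Bilibili, 12306, Douyin, Biquge, and downloads of comics, novels, music, and movies.
Python | 106 | Updated: 2 y ago | License: Permissive (MIT)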
python-learning by happyjared
Those years of learning Python
Python | 105 | Updated: 4 y ago | License: No License (No License)
decoration-design-crawler by imflyn
Crawler for the home renovation websites Tubatu (土巴兔) and Guju (谷居).
Python | 105 | Updated: 4 y ago | License: No License (No License)
video_spider by CoderCharm
😛 Fetches source-video mp4 links: Toutiao app videos; 🍉 Xigua Video; 🐧 Tencent Video; 🎼 parses Douyin share short links to get watermark-free playback links.
Python | 105 | Updated: 3 y ago | License: No License (No License)
bing-image-spider by chengquan223
Crawls the daily background image from the official Bing site.
C# | 105 | Updated: 5 y ago | License: No License (No License)
DotnetCrawler by mehmetozkaya
DotnetCrawler is a straightforward, lightweight web crawling/scraping library for Entity Framework Core output, based on dotnet core. The library is designed like other strong crawler libraries such as WebMagic and Scrapy, but is extensible for your custom requirements. Medium link: https://medium.com/@mehmetozkaya/creating-custom-web-crawler-with-dotnet-core-using-entity-framework-core-ec8d23f0ca7c
C# | 105 | Updated: 3 y ago | License: No License (No License)
zhihu-crawler by cpselvis
A Zhihu user information crawler that collects useful details including username, education, profession, and follower and following counts.
Python | 104 | Updated: 4 y ago | License: Permissive (MIT)
scrapy-puppeteer by clemfromspace
Scrapy + Puppeteer
Python | 104 | Updated: 4 y ago | License: Permissive (MIT)
scripts by jhaddix
Useful stuff from around teh internetz
Python | 104 | Updated: 2 y ago | License: No License (No License)
ss-angular by polidore
AngularJS integrations to SocketStream
JavaScript | 104 | Updated: 5 y ago | License: No License (No License)
go-htmltable by nfx
Structured HTML table data extraction from URLs in Go that has almost no external dependencies
Go | 104 | Updated: 2 y ago | License: Permissive (MIT)
scrapy-spiders by dcondrey
Collection of Python scripts I have created to crawl various websites, mostly for lead generation projects to match keywords and collect email addresses and post URLs
Python | 103 | Updated: 3 y ago | License: Permissive (MIT)
PassiveScanner by jjf012
A passive scanner based on mitmproxy and Arachni
Python | 103 | Updated: 4 y ago | License: Strong Copyleft (GPL-3.0)
spidermonkey by tlhunter
DEFUNCT: PHP Web Spider from 2011
PHP | 103 | Updated: 4 y ago | License: No License (No License)
spider_news_all by hailong0707-zz
Scrapy spider for various news websites
Python | 102 | Updated: 4 y ago | License: No License (No License)
demiurge by matiasb
PyQuery-based scraping micro-framework.
Python | 102 | Updated: 4 y ago | License: Permissive (BSD-3-Clause)
scrapy-monitor by ioiogoo
scrapy-monitor: visualizes crawlers and monitors their status in real time.
Python | 102 | Updated: 3 y ago | License: Permissive (MIT)
antispider by dytttf
JavaScript | 101 | Updated: 4 y ago | License: No License (No License)
bullshit-job-titles by bullgit
the shittiest job titles you'll ever see
HTML | 101 | Updated: 4 y ago | License: No License (No License)
Investigo by tdh8316
🔎 Find usernames and download their data across social media.
Go | 101 | Updated: 4 y ago | License: Permissive (MIT)
kezplez-nx by tesnos
The "easiest" way to get all 70+ Nintendo Switch keys to use with hactool!
C | 101 | Updated: 4 y ago | License: Proprietary (Proprietary)
zyte-smartproxy-headless-proxy by zytedata
A complementary proxy to help use SPM with headless browsers
Go | 101 | Updated: 2 y ago | License: Permissive (MIT)
Zhihu_Crawler by salamer
A crawler for Zhihu
Python | 100 | Updated: 4 y ago | License: Permissive (Apache-2.0)
dl_coursera by feng-lei
A simple, fast, and reliable Coursera crawling & downloading tool
Python | 100 | Updated: 4 y ago | License: Permissive (MIT)
Crawlic by Ganapati
Web recon tool (find temporary files, parse robots.txt, search some folders, Google dorks and search domains hosted on same server)
Python | 100 | Updated: 4 y ago | License: Proprietary (Proprietary)
platform_agent by basecamp
Parse user agent to deduce the platform
Ruby | 100 | Updated: 4 y ago | License: Permissive (MIT)
beian_miit_spider by Ithrael
A crawler for querying ICP filing records from the Ministry of Industry and Information Technology (MIIT).
Python | 99 | Updated: 1 y ago | License: Strong Copyleft (GPL-2.0)
capturer by Litreily
Captures pictures from websites such as Sina, Lofter, and Huaban.
Python | 99 | Updated: 2 y ago | License: Permissive (MIT)