一个五子棋学习资料站
Support
Quality
Security
License
Reuse
B站3亿用户信息爬虫(mid号,昵称,性别,关注,粉丝,等级)
Support
Quality
Security
License
Reuse
A Simple spider that use to crawl the douban Top 100 moive name and input all list
Support
Quality
Security
License
Reuse
Cross domain getImageData work around and jQuery plugin
Support
Quality
Security
License
Reuse
百度文库爬虫 Baidu Wenku Spider 百度文库下载器
Support
Quality
Security
License
Reuse
Distributed crawler, database and web frontend for public directories indexing
Support
Quality
Security
License
Reuse
A spider/crawler edit by Node.js to download torrents of Adult videos.
Support
Quality
Security
License
Reuse
a .js scanner, built in php. designed to scrape urls and other info
Support
Quality
Security
License
Reuse
Argo is an automated general crawler for automatically obtaining website URLs . Argo 是一个自动化通用爬虫 用于自动化获取网站的URL 基于无头浏览器实现了静态和动态结合的方式来实现
Support
Quality
Security
License
Reuse
Github stargazers information gathering tool
Support
Quality
Security
License
Reuse
:star2::octocat: powered by python3( simple learning of spider) 百度文库;网易云歌曲; 豆瓣电影; GitHub; 京东; QQ空间; 天气; vip解析助手; TED文本内容; wifi破解脚本; 必应图片设置为桌面等爬取
Support
Quality
Security
License
Reuse
Immospider is a crawler for the Immoscout24 website.
Support
Quality
Security
License
Reuse
Learn the web crawler.
Support
Quality
Security
License
Reuse
Zhihu User Spider
Support
Quality
Security
License
Reuse
An adult websites crawler for PPAV .
Support
Quality
Security
License
Reuse
Leetcode Contest Ranking Searcher
Support
Quality
Security
License
Reuse
字体混淆服务
Support
Quality
Security
License
Reuse
基于Scrapy的QQ音乐爬虫(QQ Music Spider),爬取歌曲信息、歌词、精彩评论等,并且分享了QQ音乐中排名前6400名的内地和港台歌手的49万+的音乐语料
Support
Quality
Security
License
Reuse
企业信息爬虫,关键字爬取公司信息
Support
Quality
Security
License
Reuse
预约美帝签证各个签证处最早时间的爬虫
Support
Quality
Security
License
Reuse
scrapy-redis的集群版,可以借助Redis集群实现海量网站的独立去重,避免单机内存不足的尴尬
Support
Quality
Security
License
Reuse
爬取淘宝商品信息
Support
Quality
Security
License
Reuse
a web crawler
Support
Quality
Security
License
Reuse
备份豆瓣计划
Support
Quality
Security
License
Reuse
Kubernetes Redis with High Availability
Support
Quality
Security
License
Reuse
react + flask + scrapy 构建的单页应用漫画网站
Support
Quality
Security
License
Reuse
An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
Support
Quality
Security
License
Reuse
Weibo Spider Using Scrapy
Support
Quality
Security
License
Reuse
A selector-based html snapshot tool using Puppeteer or PhantomJS that sources sitemap.xml, sitemap-index, robots.txt, or arbitrary input
Support
Quality
Security
License
Reuse
i
instant-username-searchby instant-username-search
JavaScript 125 Version:Current License: Strong Copyleft (GPL-3.0)
⚡ Instantly search for the availability of your username on more than 100 social media sites.
Support
Quality
Security
License
Reuse
用爬虫爬取小说网站上所有小说,存储到数据库中,并用爬到的数据构建自己的小说网站
Support
Quality
Security
License
Reuse
A flask API for running your scrapy spiders
Support
Quality
Security
License
Reuse
Multithreading download all HD photos / pictures from someone's Sina Weibo album.
Support
Quality
Security
License
Reuse
一个基于SVM的验证码破解程序
Support
Quality
Security
License
Reuse
Pixiv插画批量下载,提供关注画师插画、收藏作品下载(单/多/动图)及API - pixiv爬虫
Support
Quality
Security
License
Reuse
pdd (拼多多) 爬虫 js 解密 anti_content 参数解密及全站抓取代码思路实现
Support
Quality
Security
License
Reuse
Distributed Ruby Web Crawler, backed up by Redis
Support
Quality
Security
License
Reuse
A crawler on taobao live barrages.
Support
Quality
Security
License
Reuse
The unix-way web crawler
Support
Quality
Security
License
Reuse
新浪微博主题爬虫
Support
Quality
Security
License
Reuse
Scrape GSoC organisations using a single script.
Support
Quality
Security
License
Reuse
Scrapy YouTube watch history spider. Because YouTube didn't have a history search.
Support
Quality
Security
License
Reuse
Some scrapy spiders useful to crawl instagram posts using public APIS (No TOKEN)
Support
Quality
Security
License
Reuse
Xenomorph Crawler, a Concise, Declarative and Observable Distributed Crawler(Node / Go / Java / Rust) For Web, RDB, OS, also can act as a Monitor(with Prometheus) or ETL for Infrastructure :dizzy: 多语言执行器,分布式爬虫
Support
Quality
Security
License
Reuse
文章采集工具 Article collection tool
Support
Quality
Security
License
Reuse
Sometimes sites make crawling hard. Selenium-crawler uses selenium automation to fix that.
Support
Quality
Security
License
Reuse
百度mp3全站爬虫
Support
Quality
Security
License
Reuse
爬虫-博客大全
Support
Quality
Security
License
Reuse
A php crawler that finds emails on the internets
Support
Quality
Security
License
Reuse
微博爬虫:每天定时爬取微博热搜榜的内容,留下互联网人的记忆。
Support
Quality
Security
License
Reuse
k
Support
Quality
Security
License
Reuse
b
bilibili-user-information-spiderby zhang0peter
B站3亿用户信息爬虫(mid号,昵称,性别,关注,粉丝,等级)
Python 136Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
d
dou_ban_spiderby Andrew-liu
A Simple spider that use to crawl the douban Top 100 moive name and input all list
Python 136Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
g
getImageDataby betamax
Cross domain getImageData work around and jQuery plugin
JavaScript 136Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
w
wksby BoyInTheSun
百度文库爬虫 Baidu Wenku Spider 百度文库下载器
Python 136Updated: 1 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
o
od-databaseby simon987
Distributed crawler, database and web frontend for public directories indexing
Python 135Updated: 1 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
i
islandBeautyby zhangjh
A spider/crawler edit by Node.js to download torrents of Adult videos.
JavaScript 134Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
J
JS-Scanby zseano
a .js scanner, built in php. designed to scrape urls and other info
CSS 134Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
A
Argoby Ciyfly
Argo is an automated general crawler for automatically obtaining website URLs . Argo 是一个自动化通用爬虫 用于自动化获取网站的URL 基于无头浏览器实现了静态和动态结合的方式来实现
Go 134Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
S
Stardoxby 0xPrateek
Github stargazers information gathering tool
Python 133Updated: 4 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
s
spiderby Winniekun
:star2::octocat: powered by python3( simple learning of spider) 百度文库;网易云歌曲; 豆瓣电影; GitHub; 京东; QQ空间; 天气; vip解析助手; TED文本内容; wifi破解脚本; 必应图片设置为桌面等爬取
Python 133Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
I
ImmoSpiderby asmaier
Immospider is a crawler for the Immoscout24 website.
Jupyter Notebook 133Updated: 2 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
S
Spidersby Tactful-biao
Learn the web crawler.
Python 132Updated: 3 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
Z
Support
Quality
Security
License
Reuse
P
PPAV-crawlerby PPAV-inc
An adult websites crawler for PPAV .
HTML 132Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
l
leetcode-ranking-searchby chiehmin
Leetcode Contest Ranking Searcher
JavaScript 131Updated: 2 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
f
Support
Quality
Security
License
Reuse
Q
QQMusicSpiderby yangjianxin1
基于Scrapy的QQ音乐爬虫(QQ Music Spider),爬取歌曲信息、歌词、精彩评论等,并且分享了QQ音乐中排名前6400名的内地和港台歌手的49万+的音乐语料
Python 130Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
w
Support
Quality
Security
License
Reuse
u
Support
Quality
Security
License
Reuse
s
scrapy_redis_clusterby thsheep
scrapy-redis的集群版,可以借助Redis集群实现海量网站的独立去重,避免单机内存不足的尴尬
Python 130Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
t
Support
Quality
Security
License
Reuse
c
Support
Quality
Security
License
Reuse
d
Support
Quality
Security
License
Reuse
k
k8s-redis-haby tarosky
Kubernetes Redis with High Availability
Shell 129Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
s
soul-mangaby fyxtc
react + flask + scrapy 构建的单页应用漫画网站
JavaScript 128Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
s
seleniumcrawlerby voliveirajr
An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
Python 127Updated: 2 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
W
Weiboby Python3WebSpider
Weibo Spider Using Scrapy
Python 126Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
h
html-snapshotsby localnerve
A selector-based html snapshot tool using Puppeteer or PhantomJS that sources sitemap.xml, sitemap-index, robots.txt, or arbitrary input
JavaScript 126Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
i
instant-username-searchby instant-username-search
⚡ Instantly search for the availability of your username on more than 100 social media sites.
JavaScript 125Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
c
copyBookby hahaha108
用爬虫爬取小说网站上所有小说,存储到数据库中,并用爬到的数据构建自己的小说网站
CSS 125Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
a
arachneby kirankoduru
A flask API for running your scrapy spiders
Python 124Updated: 4 y ago License: Proprietary (Proprietary)
Support
Quality
Security
License
Reuse
S
Sina-Weibo-Album-Downloaderby lincanbin
Multithreading download all HD photos / pictures from someone's Sina Weibo album.
Python 124Updated: 4 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
c
Support
Quality
Security
License
Reuse
P
PixiCby Coder-Sakura
Pixiv插画批量下载,提供关注画师插画、收藏作品下载(单/多/动图)及API - pixiv爬虫
Python 123Updated: 2 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
p
pinduoduoby onetwo1
pdd (拼多多) 爬虫 js 解密 anti_content 参数解密及全站抓取代码思路实现
Python 123Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
c
cloud-crawlerby CalculatedContent
Distributed Ruby Web Crawler, backed up by Redis
Ruby 123Updated: 4 y ago License: No License (No License)
Support
Quality
Security
License
Reuse
t
taobao-live-crawlerby xiaozhongliu
A crawler on taobao live barrages.
JavaScript 123Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
c
Support
Quality
Security
License
Reuse
S
Support
Quality
Security
License
Reuse
G
GSoC-Organisation-Scraperby rohithasrk
Scrape GSoC organisations using a single script.
Python 122Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
Y
Youtube-Watch-History-Scraperby zvodd
Scrapy YouTube watch history spider. Because YouTube didn't have a history search.
Python 122Updated: 4 y ago License: Permissive (Unlicense)
Support
Quality
Security
License
Reuse
i
instagram-scraperby h4t0n
Some scrapy spiders useful to crawl instagram posts using public APIS (No TOKEN)
Python 122Updated: 4 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse
s
sentinel-crawlerby wx-chevalier
Xenomorph Crawler, a Concise, Declarative and Observable Distributed Crawler(Node / Go / Java / Rust) For Web, RDB, OS, also can act as a Monitor(with Prometheus) or ETL for Infrastructure :dizzy: 多语言执行器,分布式爬虫
JavaScript 122Updated: 3 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
a
article-spiderby PeterYangs
文章采集工具 Article collection tool
Go 122Updated: 3 y ago License: Permissive (Apache-2.0)
Support
Quality
Security
License
Reuse
s
selenium-crawlerby corywalker
Sometimes sites make crawling hard. Selenium-crawler uses selenium automation to fix that.
Python 121Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
b
Support
Quality
Security
License
Reuse
l
Support
Quality
Security
License
Reuse
p
php-crawlerby hedii
A php crawler that finds emails on the internets
PHP 121Updated: 4 y ago License: Permissive (MIT)
Support
Quality
Security
License
Reuse
w
weibo_Hot_Searchby Writeup007
微博爬虫:每天定时爬取微博热搜榜的内容,留下互联网人的记忆。
Python 121Updated: 3 y ago License: Strong Copyleft (GPL-3.0)
Support
Quality
Security
License
Reuse