crawlab | Distributed web crawler admin platform for spiders | Crawler library
kandi X-RAY | crawlab Summary
kandi X-RAY | crawlab Summary
Distributed web crawler admin platform for spiders management regardless of languages and frameworks.
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of crawlab
crawlab Key Features
crawlab Examples and Code Snippets
Community Discussions
Trending Discussions on crawlab
QUESTION
i setted docker on ec2 and upload crawler scrapy code. since yesterday when i run ec2, it could collect data well. but suddenly now it doesn't work well. i guessed it is EC2 problem, because it could work well till today lunch time.
then i try to check EC2 condition, typed df -h
, it showed
ANSWER
Answered 2021-Mar-23 at 09:29It is clear that your EC2 instance has disk usage issue (disk gets full).
You successfully increased the volume size (as shown in lsblk
output) but you also need to extend the file system, as described here.
Essentially, you should run sudo growpart /dev/xvda 1
and sudo resize2fs /dev/xvda1
. Use df -h
again to verify.
In order to optimize performance, you should consider:
- Using a dedicated volume for your workload (i.e. mount a new, larger EBS volume on
/mnt/data
and configure your application to use this path for data storage) - Resize this volume and/or change type (i.e. use provisioned IOPS SSD)
Scrapy also supports S3 for storage.
You can configure scrapy logging to use a specific log file (--logfile FILE
which you can later delete) or disabled it altogether (--nolog
, not recommended).
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install crawlab
Direct Deploy (Check Internal Kernel)
Kubernetes (Multi-Node Deployment)
Please open the command line prompt and execute the command below. Make sure you have installed docker-compose in advance. Next, you can look into the docker-compose.yml (with detailed config params) and the Documentation (Chinese) for further information.
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page