gae-dataflow | periodically launch a Dataflow analytics pipeline
kandi X-RAY | gae-dataflow Summary
kandi X-RAY | gae-dataflow Summary
An example that shows how to periodically launch a Dataflow analytics pipeline from GAE Flex, that reads from Datastore.
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
- Process tweets from Cloud Datastore .
- Get Twitter API .
- Launch the dataflow pipeline .
- Create a read from the given pcoll .
- Make a query protobuf .
- Run a custom command .
- Return a list of urls .
- Start bundle .
- Hello!
- Argparse argument parser .
gae-dataflow Key Features
gae-dataflow Examples and Code Snippets
Community Discussions
Trending Discussions on gae-dataflow
QUESTION
I'm trying to build an app in Python with Google App Engine that fetches followers of specific accounts and then their tweets. I'm basing it on this template and changing it to adapt it to what I need.
The issue at the moment is that when I try to fetch followers, I get an DeadlineExceededError
due to the Twitter API waiting time.
I have found this post on how to fix the same problem and I think that in my case the best solution would be to use backends, but I noticed that they are deprecated.
Does someone know how I can achieve the same result without the deprecated module?
...ANSWER
Answered 2018-Sep-05 at 16:30Backends (modules) have been deprecated in favor of Services:
https://cloud.google.com/appengine/docs/flexible/python/an-overview-of-app-engine
For the Service you want to be able to handle requests longer than 60 seconds, set it to Manual Scaling. Then, a request can run for up to 24 hours (or until you shut it down). See:
https://cloud.google.com/appengine/docs/standard/python/how-instances-are-managed#instance_scaling
Of course, your costs may go up with long running instances and request.
QUESTION
I have been trying to run a cron job with GAE (code developed in Python), but when I trigger the job, it fails without any error message -- I can't find anything at all in the logs.
This is happening for a service for which I'm using the flexible environment.
This is the structure of my files:
my_service.yaml
looks like this:
ANSWER
Answered 2018-Oct-04 at 19:05EDIT:
def get(self):
may have some issues.
First, get
may be reserved. Second, you aren't able to send self
to that function. Change that to:
def get_service():
EDIT2:
You also need to import logging
at the top of any page that uses it. And, you have not imported Flask
and its components:
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install gae-dataflow
You can use gae-dataflow like any standard Python library. You will need to make sure that you have a development environment consisting of a Python distribution including header files, a compiler, pip, and git installed. Make sure that your pip, setuptools, and wheel are up to date. When using pip it is generally recommended to install packages in a virtual environment to avoid changes to the system.
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page