dockerflow | workflow runner that uses Dataflow | BPM library
kandi X-RAY | dockerflow Summary
kandi X-RAY | dockerflow Summary
Dockerflow makes it easy to run a multi-step workflow of Docker tasks using Google Cloud Dataflow for orchestration. Docker steps are run using the Pipelines API. You can run Dockerflow from a shell on your laptop, and the job will run in Google Cloud Platform using Dataflow's fully managed service and web UI. Dockerflow workflows can be defined in YAML files, or by writing Java code. Examples of workflows defined in YAML can be found in. Examples of workflows defined in Java can be found in. You can run a batch of workflows at once by providing a CSV file with one row per workflow to define the parameters.
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
- Create a new task
- Declares an input parameter for a path in a GCS container
- Declares an output parameter for execution
- Declares an array of input files
- Runs the tool with the given options
- Resolves a local path
- Load a workflow definition
- Deletes an element
- Performs an HTTPS DELETE request
- Replaces the variables with the default value
- Checks if a path exists
- Gather task arguments
- Adds parameters to the table
- Set the parameters for a project
- Creates a new arguments table
- Sets a global variable
- Set the task logging to use
- Set the minimum CPU cores
- Set whether this table is preemptible
- Set the memory allocation
- Run a single task
- Run a task in Docker
- Run a Dataflow on a Dataflow
- Process the element
- Create a dataflow pipeline
- Creates a new task
dockerflow Key Features
dockerflow Examples and Code Snippets
Community Discussions
Trending Discussions on dockerflow
QUESTION
I am using Dockerflow to run parallel tasks through the Google Pipelines API on Google Cloud Platform. I started a single-step task running 1389 VMs in parallel and found that 233 of the VMs were apparently doing nothing and hanging indefinitely.
I did a spot check of the serial console output and repeatedly saw the VMs running into "Getting controller config failed" errors.
When I tried logging into the VMs I received the error: "Connection Failed. We are unable to connect to the VM on port 22".
I am wondering why my VM instances are hanging, and if there is something I can do to avoid running into these issues.
I've included a snippet of the serial console output below
...ANSWER
Answered 2017-Feb-15 at 20:13there may have been a temporary networking issue. Can you give me some failed operation ids (or failed VM names)?
Have you tried again since then; can you reproduce the problem?
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install dockerflow
git clone this repository. git clone https://github.com/googlegenomics/dockerflow
Build it with Maven. cd dockerflow mvn package -DskipTests
Set up the DOCKERFLOW_HOME environment. export DOCKERFLOW_HOME="$(pwd)" export PATH="${PATH}":"${DOCKERFLOW_HOME}/bin" chmod +x bin/*
Run a sample workflow: dockerflow --project=MY-PROJECT \ --workflow-file=src/test/resources/linear-graph.yaml \ --workspace=gs://MY-BUCKET/MY-PATH \ --input BASE_DIR=gs://MY-BUCKET/MY-PATH/MY-INPUT-FILE.txt --runner=DirectPipelineRunner
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page