astronomer | Helm Charts for the Astronomer Platform, Apache Airflow as a Service on Kubernetes | Continuous Deployment library
kandi X-RAY | astronomer Summary
kandi X-RAY | astronomer Summary
This repository contains the helm charts for deploying the Astronomer Platform into a Kubernetes cluster. Astronomer is a commercial "Airflow as a Service" platform that runs on Kubernetes. Source code is made available for the benefit of our customers, if you'd like to use the platform reach out for a license.
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
- Main entry point .
astronomer Key Features
astronomer Examples and Code Snippets
Community Discussions
Trending Discussions on astronomer
QUESTION
I'm running Airflow v2.2.3 on Kubernetes.
I have a dedicated scheduler pod, a dedicated web-server pod and I'm using Kubernetes Executors.
I have ~4,000 DAGs, created dynamically in a for-loop.
I read Astronomer's post about "fast-follow" (also called "mini-scheduler") and I wonder if there's an option to disable it via configuration or something.
The thing is, that it comes with a penalty where:
- Dynamic DAGs are being calculated per each task operator, as a new fresh executor is running every time - I never saw reuse of Kubernetes executor (and if it using pod's
patch
verb for that - I guess that the dynamic DAGs would recalculated again anyway). - The time it takes to recalculate dynamic DAGs lead to a "queued" status for every task - which mean that a lot of time is spent just for recalculating the DAGs over and over again.
- Is there a configuration (or any other way) to disable the mini-scheduler?
- Am I using the KubernetesExecutor wrong? should it stay up for more than one task (and thus calculate the dags only at the beginning)
- I'm running a multi-tenant setup, where each one of my customers (businesses) run it's own DAG - but they all look the same (except input arguments + schedule time). Should I take a different approach? (other than using a dedicated DAG per customer).
Thanks
...ANSWER
Answered 2022-Jan-17 at 13:17Answering your questions:
1.
This feature introduced in PR
To disable the feature set in airflow.cfg
:
QUESTION
This is my first stack overflow question, so if I am presenting something wrong, please let me know. I am pretty new to computer programming, so I just have a small webpage where I am just implementing things that I am learning.
I made a little quiz with random trivia multiple choice questions you can take if you press a button. I am using window prompts to ask the questions and get the answers, and I have all of the questions and answers stored as objects with question/prompt and answer pairs. All of those objects are stored in an array in a variable called shortQuizPrompts. I already have the quiz working and everything, aka., It tells you after every question if you got the answer to that question right or wrong, and it gives you a grade afterwards... I also have it set up so that if you enter an answer that is not "a", "b", "c", or "d", it lets you know that it isnt a valid answer. Those sorts of things.
As of right now, you can choose how many questions long you want the quiz to be out of the 24 total questions I have so far. It just asks the questions in the order that they are stored in the array. For example, you will never be asked the last question in the array if you do not choose for the quiz to be the full 24 questions long. However, I want to make the quiz ask the questions in a random order, while also removing those questions from the array as to not ask the same question multiple times.
I have tried increasing the iterator while looping through the array to a random number from 0 to the length of however many questions they chose. Then checking to see if the iterator was larger than the length of the number of questions they chose, it would decrease the iterator until it found a question that is still in the array that it could ask...
If anyone knows how to go about doing that, it would be great. Sorry for the long question btw. I am pretty new to coding, so this is probably a simple answer, but I digress. I'm pretty sure I did everything right. Thx.
...ANSWER
Answered 2022-Jan-12 at 01:03You can shuffle the shortQuizPrompts
array before starting the quiz. Array shuffle details can be found in this answer.
QUESTION
Is there a way to design a python class that implements a specific data pipeline pattern outside of a dag in order to use this class for all data-pipelines that needs this pattern ?
Example: in order to load data from Google Cloud Storage to Big Query, the process can be to validate ingestion candidate files with data quality tests. Then attempt to load data in a raw table in Big Query then dispatching the file in archive or in a rejected folder depending on loading result.
Doing it one time is easy, what if it needs to be done 1000 times ? i am trying to figure out how to optimize engineering time.
SubDag could be considered but it shows limitations in terms of performances and is going to be deprecated anyway.
Task groups needs to be part of a dag to be implemented https://github.com/apache/airflow/blob/1be3ef635fab635f741b775c52e0da7fe0871567/airflow/utils/task_group.py#L35.
One way to achieve the expected behavior might be to generate dags, task groups and tasks from a single python file that leverage dynamic DAGing
Nevertheless, code that is used in this particular file can't be reused somewhere in the code base. It is against DRYness even though DRYness vs understandability is always a tradeoff.
...ANSWER
Answered 2021-Sep-09 at 05:41You should just create your own Operator and then use it inside your DAGs. Extend BaseOperator and use hooks to BigQuery or whatever you need.
QUESTION
I want to get the exception passed to on_failure_callback in order to check what is the error. For example, if it contains 'there are duplicates' in a certain DAG, the function won't do anything. Otherwise, it will send an email.
However, I am not able to see the format of the exception. I am using Airflow 2.1.2 in Docker and being my dag definition the following:
...ANSWER
Answered 2021-Aug-24 at 10:00You can define on_failure_callback
on the DAG and on the task level. Exceptions are only passed to the failure callback on the task level, so configure the callback either on your operator, or via default_args
on the DAG to all operators:
QUESTION
say I have two lists
list_1 = [ 'Tar', 'Arc', 'Elbow', 'State', 'Cider', 'Dusty', 'Night', 'Inch', 'Brag', 'Cat', 'Bored', 'Save', 'Angel','bla', 'Stressed', 'Dormitory', 'School master','Awesoame', 'Conversation', 'Listen', 'Astronomer', 'The eyes', 'A gentleman', 'Funeral', 'The Morse Code', 'Eleven plus two', 'Slot machines', 'Fourth of July', 'Jim Morrison', 'Damon Albarn', 'George Bush', 'Clint Eastwood', 'Ronald Reagan', 'Elvis', 'Madonna Louise Ciccone', 'Bart', 'Paris', 'San Diego', 'Denver', 'Las Vegas', 'Statue of Liberty']
and
list_B = ['Cried', 'He bugs Gore', 'They see', 'Lives', 'Joyful Fourth', 'The classroom', 'Diagnose', 'Silent', 'Taste', 'Car', 'Act', 'Nerved', 'Thing', 'A darn long era', 'Brat', 'Twelve plus one', 'Elegant man', 'Below', 'Robed', 'Study', 'Voices rant on', 'Chin', 'Here come dots', 'Real fun', 'Pairs', 'Desserts', 'Moon starer', 'Dan Abnormal', 'Old West action', 'Built to stay free', 'One cool dance musician', 'Dirty room', 'Grab', 'Salvages', 'Cash lost in me', "Mr. Mojo Risin'", 'Glean', 'Rat', 'Vase']
What I am looking for is to find the anagram pairs of list_A in list_B. Create a list of tuples of the anagrams. For one list I can do the following and generate the list of tuples, however, for two lists I need some assistance. Thanks in advance for the help!
What I have tried for one list,
...ANSWER
Answered 2021-Jul-14 at 14:47You can use a nested for loop, outer for the first list, inner for the second (also, use str.lower
to make it case-insensitive):
QUESTION
I'm trying to test a DAG I wrote in Astronomer/Airflow but I'm getting a really peculiar behavior that has me totally stuck.
Airflow's webserver comes up, and then after login as admin it redirects to http://localhost:8080/home which doesn’t appear to exist.
The specific error I get in the browser is ‘too many redirects’. I’m sure something is wrong in my configuration, but it’s not obvious (to me) what. Can you suggest steps for further diagnosis?
Thanks, Keith
...ANSWER
Answered 2021-Jul-10 at 13:10I have had similar issues when the username or the password is incorrect. I am expecting you are either entering wrong login credentials or your configuration is checking some external authentication server and it is not configured correctly.
QUESTION
I am currently working in a blog website, it is made using HTML and CSS.
In this, on the top navigation bar, when I click on the Contact button, I get redirected to the environment section, instead I should be redirected to the contacts section which is at the bottom of the website.
Link for the blog website: https://riyad-dev.github.io/Personal-Blog/
HTML code:
ANSWER
Answered 2021-Jun-19 at 05:22Your
id="environment"
content is out of this div, This ID'sdiv
closed before content!
QUESTION
I have an array like this:
...ANSWER
Answered 2021-Jun-18 at 21:16np.where
is the first thing that comes to mind, though I'm sure there are many ways of accomplishing the same thing.
QUESTION
I have a problem with React and Typescript and it will be nice if I get some help from you guys!
I'm trying to assign an onclick event to my child box component but it isn't working, it doesn't trigger any error, just plainly doesn't work.
This his is the parent:
...ANSWER
Answered 2021-Jun-06 at 09:41onClick={() => this.changeActive}
is wrong.
Use onClick={this.changeActive}
or onClick={() => this.changeActive()}
QUESTION
I am a newbie as far as both Airflow and Docker are concerned; to make things more complicated, I use Astronomer, and to make things worse, I run Airflow on Windows. (Not on a Unix subsystem - could not install Docker on Ubuntu 20.4). "astro dev start" breaks with an error, but in Docker Desktop I see, and can start, 3 Airflow-related containers. They see my DAGs just fine, but my DAGs don't see the local file system. Is thus unavoidable with the Airflow + Docker combo? (Seems like a big handicap; one can only use a file in the cloud).
...ANSWER
Answered 2021-May-19 at 09:31In general, you can declare a volume at image runtime in Docker using the -v
switch with your docker run
command to mount a local folder on your host to a mount point in your container, and you can access that point from inside the container.
If you go on to use docker-compose up
to orchestrate your containers, you can specify volumes in the docker-compose.yml
file for your containers which configures the volumes for the containers that run.
In your case, the Astronomer docs here suggest it is possible to create a custom directive in the Astronomer docker-compose.override.yml
file to mount the volumes in the Airflow containers created as part of your astro
commands for your stack which should then be visible from your DAGs.
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install astronomer
You can use astronomer like any standard Python library. You will need to make sure that you have a development environment consisting of a Python distribution including header files, a compiler, pip, and git installed. Make sure that your pip, setuptools, and wheel are up to date. When using pip it is generally recommended to install packages in a virtual environment to avoid changes to the system.
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page