Make XTable as a community managed Airflow provider #495
Comments
@gyli I like this idea; it seems like a natural way for Airflow users to sync their tables after some other step has run in their Airflow pipelines. Onehouse does not own XTable, since it is an Apache Incubating project, so I don't know how that would work with the proposed mixed governance. I also lack the Airflow experience to know what building and maintaining an operator would involve.
To provide more details and examples of Airflow providers:
Also, I can take on the implementation of the operator, though I would like to put it on hold until the discussion with the Airflow team makes some progress. As a first step, can we bring more attention to this and discuss it among the XTable maintainers?
Thanks for bringing this up, @gyli. It's a great idea to have the conversion run at the end of Airflow DAGs. Happy to help this make progress. Do you have a dev list thread or a GH issue on Airflow where you have brought this up with the Airflow maintainers? If so, the easiest path would be to chime in there and understand what needs to be done and the overall process.
This looks like the process? https://github.com/apache/airflow/blob/main/PROVIDERS.rst#accepting-new-community-providers
The above doc describes the correct process for adding a new provider. I have started a discussion here, but they need an official proposal and a vote on the Airflow devlist.
@gyli I was off. Will get on this next week. Thanks for your patience.
Awesome. I was about to send the email to their devlist, but it would be much better if you could send it. Thanks.
Feature Request / Improvement
Hi XTable maintainers,
I am planning to create an Airflow operator for XTable, and I am also trying to make it a community-managed Airflow provider.
By Airflow operator, I mean something similar to what AWS presents in this blog: a wrapper around XTable's Java command that lets users trigger it as an Airflow task, with the config written in Python code. I believe integrating XTable with Airflow would greatly help make it more popular and bring it closer to being an industry standard.
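To make the idea concrete, here is a minimal sketch of what such a wrapper might look like. It is an illustration only, not the AWS blog's code or an agreed design. The function names (`render_dataset_config`, `build_command`, `run_xtable_sync`) are hypothetical, and the config keys and the `--datasetConfig` flag are assumed to follow the dataset config format described in the XTable docs, so check them against the XTable version in use.

```python
import subprocess
import tempfile
from pathlib import Path
from typing import Callable, List, Sequence


def render_dataset_config(
    source_format: str,
    target_formats: Sequence[str],
    table_base_path: str,
    table_name: str,
) -> str:
    """Render a dataset config as YAML text (keys assumed from the XTable docs)."""
    lines = [f"sourceFormat: {source_format}", "targetFormats:"]
    lines += [f"  - {fmt}" for fmt in target_formats]
    lines += [
        "datasets:",
        "  -",
        f"    tableBasePath: {table_base_path}",
        f"    tableName: {table_name}",
    ]
    return "\n".join(lines) + "\n"


def build_command(jar_path: str, config_path: str) -> List[str]:
    """Build the java command line that runs the XTable utilities jar."""
    return ["java", "-jar", jar_path, "--datasetConfig", config_path]


def run_xtable_sync(
    jar_path: str,
    source_format: str,
    target_formats: Sequence[str],
    table_base_path: str,
    table_name: str,
    runner: Callable[..., object] = subprocess.run,
) -> None:
    """Write a temporary config file and invoke XTable through `runner`.

    `runner` defaults to subprocess.run; it is a parameter so the call can be
    replaced with a stub in tests.
    """
    with tempfile.TemporaryDirectory() as tmp_dir:
        config_path = Path(tmp_dir) / "xtable_config.yaml"
        config_path.write_text(
            render_dataset_config(
                source_format, target_formats, table_base_path, table_name
            )
        )
        # check=True makes a non-zero java exit code raise CalledProcessError,
        # which would fail the calling task.
        runner(build_command(jar_path, str(config_path)), check=True)
```

In an Airflow DAG, a function like `run_xtable_sync` could be passed as the `python_callable` of a `PythonOperator`, or called from the `execute` method of a `BaseOperator` subclass. A provider package would presumably add connection handling, templated fields, and logging on top of this.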
I've tried proposing this to Airflow directly, but adding it as a provider requires a vote. More importantly, they are also looking for support from XTable (or maybe even Onehouse?) directly, since they prefer a "mixed governance" approach. As an example, here is the discussion on the Airflow devlist about adding a new provider. Hence, I am requesting your support to bring this discussion to the table on both sides, provide more background and evidence on why XTable is helpful for data engineers (who are very likely Airflow users as well), and support such a vote on the Airflow devlist.