-
Notifications
You must be signed in to change notification settings - Fork 906
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create new spaceflight starters #2838
Comments
As a follow up on this ticket we should look at how do we deal with overlaps in spaceflight projects. Can we somehow combine them to lessen the maintenance burden? |
As a follow up to the conversation we had about #2844 and #2752, @DimedS @deepyaman would also like further discussion on the /conf set up of our starters. i.e. what/how many catalog and parameters files we supply. |
@amandakys Again I think you meant to tag @DimedS, the other Dmitrii left the team a while ago 😉 |
💯 I think most repos I've seen that have this level of duplication have automated processes to generate the different versions. |
I think we may need to consider an empty starter for PySpark. Alloy uses it to put code for the verticals in it. I'll leave @datajoely to add comment here. |
Yes you are! We can manage this otherwise if you're desperate to delete it. But today we require a starter that has the hook + |
Also worth noting is that we have a plan to revise the spaceflights data #2008 and have recently considered reducing the size (@noklam commented "Any reason that we can’t trim the dataset? As a starter that it get used in demo and testing, it takes a considerate of time to run the pipeline. For example in the Kedro bootcamp I see demoing catalog.load("shuttles") takes like 15-20 seconds and is a bit awkward for demo purpose."). So we need one set of data/single location for it. |
Wondering if this could be |
@deepyaman Do you have any examples of repos like this? I'd be interested to see how they manage the different versions. |
The revised plan of action for this is that to ship this feature in 0.19.0, we will create starters for:
In the short-term, our support for Airflow and Databricks will be through documentation. We will have to tell users to edit To that end:
|
Description
Follow up on #2758
Context
Create a new suite of starters all based on spaceflights and add them to a repo.
We'll need at least:
spaceflights-pyspark
starter #2984spaceflights-pandas-viz
starter #2985spaceflights-pyspark-viz
starter #2986Possible Additional examples
(Not to be done now #2838 (comment))
The text was updated successfully, but these errors were encountered: