
refactor submission method and add command API as default #442

Merged

merged 4 commits into main on Aug 30, 2022

Conversation

@ChenyuLInx (Contributor) commented Aug 29, 2022

resolves #424 #419

Description

Use the Command API as the default Python model submission method. Users can still create a notebook by adding submission_method: 'notebook' to the model's config.
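For illustration, opting a single model back into notebook-based submission might look like the sketch below. This is a config fragment, not a runnable model: the model name and body are hypothetical, and only the submission_method key comes from the PR description.

```python
# my_python_model.py — a hypothetical dbt Python model opting out of
# the new Command API default and back into notebook submission.
def model(dbt, session):
    dbt.config(submission_method="notebook")  # default is now the Command API
    df = dbt.ref("upstream_model")
    return df
```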

Checklist

  • I have signed the CLA
  • I have run this code in development and it appears to resolve the stated issue
  • This PR includes tests, or tests are not required/relevant for this PR
  • I have run changie new to create a changelog entry

@cla-bot added the cla:yes label Aug 29, 2022
@github-actions (Contributor)
Thank you for your pull request! We could not find a changelog entry for this change. For details on how to document a change, see the dbt-spark contributing guide.

@ChenyuLInx (Contributor Author)

@ueshin Can you also review this? I somehow can't add you as a reviewer.

@ChenyuLInx ChenyuLInx requested review from jtcohen6 and stu-k August 30, 2022 14:47
DEFAULT_POLLING_INTERVAL = 3


class BasePythonJobHelper:
Contributor:

To avoid getting these massive .py files, how do folks feel about putting the classes in separate files?

Contributor Author:

You mean the base class in one place, and the ones inheriting it in separate files? I don't really feel like a 300-line file is massive. Is there any advantage to breaking it down into three 100-line files?

Contributor:

Oh, definitely not massive as of now, but if the logic in these classes grows, this could get large quite quickly.

Contributor Author:

Makes sense! But the logic here will likely stay the same, and we are probably looking at refactoring it into a multi-adapter format in the longer term, or starting to adopt dbt-databricks for Databricks-specific submission. So I am inclined to leave it as is for now to avoid over-optimizing.

json={
"path": path,
"content": b64_encoded_content,
"language": "PYTHON",
Contributor:

Not sure it matters, but it looks like 'language' is uppercase here and lowercase elsewhere; maybe put this in a static variable?

Contributor Author:

I put the rest into a static variable, but this one needs to stay uppercase: it goes to a different API, which actually requires uppercase.
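The distinction raised in this thread can be sketched as follows. build_import_payload is a hypothetical helper, and the extra payload fields are assumptions modeled on the Databricks Workspace Import API, which takes the language value in uppercase (unlike the execution-context API used elsewhere in the file, which takes lowercase names):

```python
import base64

# Hypothetical sketch of the payload built above. The Workspace Import
# API expects `language` as "PYTHON" (uppercase), so this one value
# cannot share the lowercase static variable used for the other calls.
def build_import_payload(path: str, source_code: str) -> dict:
    b64_encoded_content = base64.b64encode(source_code.encode("utf-8")).decode("utf-8")
    return {
        "path": path,
        "content": b64_encoded_content,
        "language": "PYTHON",  # must stay uppercase for this endpoint
        "overwrite": True,
        "format": "SOURCE",
    }
```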

self.polling_interval = DEFAULT_POLLING_INTERVAL

def get_timeout(self):
timeout = self.parsed_model["config"].get("timeout", 60 * 60 * 24)
Contributor:

maybe move the timeout into a DEFAULT_TIMEOUT var?
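The suggestion could look like the sketch below. The 24-hour default matches the literal 60 * 60 * 24 in the diff; the constructor shape and the validation check are assumptions for illustration.

```python
DEFAULT_TIMEOUT = 60 * 60 * 24  # 24 hours, in seconds

class BasePythonJobHelper:
    def __init__(self, parsed_model: dict) -> None:
        self.parsed_model = parsed_model

    def get_timeout(self) -> int:
        # Fall back to the module-level default when the model config
        # does not set an explicit timeout.
        timeout = self.parsed_model["config"].get("timeout", DEFAULT_TIMEOUT)
        if timeout <= 0:
            raise ValueError("Timeout must be a positive integer")
        return timeout
```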

context.destroy(context_id)


python_submission_helpers = {
Contributor:

nit: upper case as this is a "global" var?
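The dict in question is a registry mapping submission_method strings to helper classes. A minimal sketch of that dispatch pattern: BasePythonJobHelper appears in the diff, but the subclass names, the method keys, and the get_helper lookup are assumptions for illustration.

```python
class BasePythonJobHelper:
    """Shared submission/polling logic for Python models."""

class JobClusterPythonJobHelper(BasePythonJobHelper):
    """Hypothetical helper that submits the model as a one-off job."""

class CommandApiPythonJobHelper(BasePythonJobHelper):
    """Hypothetical helper that submits through the Command API."""

# The module-level "global" registry the reviewer suggests upper-casing.
python_submission_helpers = {
    "job_cluster": JobClusterPythonJobHelper,
    "commands": CommandApiPythonJobHelper,
}

def get_helper(parsed_model: dict) -> type:
    # Default to the Command API when the model config does not
    # specify a submission_method.
    method = parsed_model.get("config", {}).get("submission_method", "commands")
    return python_submission_helpers[method]
```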

@ChenyuLInx (Contributor Author)

The incremental test failures in the integration tests are being fixed in #445.

self.polling_interval = DEFAULT_POLLING_INTERVAL

def get_timeout(self):
timeout = self.parsed_model["config"].get("timeout", DEFAULT_TIMEOUT)
Contributor:

Will we no longer use the timeout passed to submit_python_job?

Contributor Author (@ChenyuLInx, Aug 30, 2022):

I think this is probably the cleaner way to set a timeout in the end, since it pulls it from config. It's really just a placeholder now. Any thoughts or suggestions?

And we don't really want users to call submit_python_job directly; see dbt-labs/dbt-core#5596.

Contributor Author:

I am going to just merge this; feel free to open an issue if we feel another way to handle the timeout is better!
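How the polling interval and the configured timeout interact can be sketched as follows. DEFAULT_POLLING_INTERVAL = 3 matches the diff; check_status, the status strings, and the loop shape are assumptions for illustration.

```python
import time

DEFAULT_POLLING_INTERVAL = 3  # seconds between status checks

def poll_until_done(check_status, timeout, interval=DEFAULT_POLLING_INTERVAL):
    # Poll a remote command until it reaches a terminal state or the
    # configured timeout elapses.
    deadline = time.time() + timeout
    while time.time() < deadline:
        status = check_status()
        if status in ("Finished", "Error", "Cancelled"):
            return status
        time.sleep(interval)
    raise TimeoutError("Command did not finish within the configured timeout")
```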

@ChenyuLInx (Contributor Author)

Merging, since it only fails on one test, which is being fixed in #445.

@ChenyuLInx ChenyuLInx merged commit cef098f into main Aug 30, 2022
@ChenyuLInx ChenyuLInx deleted the enhancement/update-submission-method branch August 30, 2022 23:49
@ueshin (Contributor) commented Aug 31, 2022

Sorry, now I'm wondering why run_dbt(["run"]) in the test TestChangingSchemaSpark doesn't fail?

Never mind, maybe I'm missing something in dbt-databricks.


Successfully merging this pull request may close these issues.

[CT-1021] Avoid creating notebook as the default way of running python model