DBT should have a Dry-Run mode #1059

mikekaminsky · 2018-10-13T14:52:14Z

DBT should have a dry-run mode

Feature description

DBT should have a dry-run mode so that users can see all of the "commands" DBT will run for a given command without having DBT actually execute those commands against a database.

Who will this benefit?

This can be useful for both debugging as well as making sure that a given command is configured properly. This is especially useful when you're configuring a production setup and want to make sure you're not about to drop schemas or tables you didn't intend to.

It can also be useful for testing macros and other DBT configurations that are post-compilation but don't show up in the compiled SQL

mikekaminsky · 2018-10-14T02:34:11Z

So, I took a look at dbt/adapters/default/impl.py to see how hard this might be and I don't think I have enough context on why some things are implemented the way they are to take a stab at this myself without some guidance.

I can see that the add_query method is where the statements get executed (cursor.execute(sql, bindings) but there's a bunch of other stuff happening in here I don't understand (I suspect a lot of it is related to handling multiple connections).

In case it's helpful to see where my head is going, the way I've implemented this in the past is to have two functions for executing SQL. One for commands (no results returned) and one for queries (returning data) -- this way your read and write operations are split.

Then, for the dry-run mode, you can put some logic in the command-handling function that just logs the command instead of passing it to cur.execute(). Something like:


def run_command(sql):
    print(sql)
    if not DRY_RUN:
        self.cur.execute(sql)

def get_data(sql):
    print(sql)
    self.cur.execute(sql)
    result = cur.fetchall()
    return result

hui-zheng · 2019-04-25T23:58:12Z

yes, I vote to have the dry-run feature, which will be super helpful for debug and performance tuning

thalesmello · 2019-05-28T21:42:27Z

@cmcarthur What's the reasoning for this to be removed from the milestone?

drewbanin · 2019-05-28T22:06:47Z

Hi @thalesmello - this is a pretty involved change to dbt! Connor and I tried to prioritize it for the Wilt Chamberlain release (0.14.0), but we ended up deciding to cull it in favor of Archive-related functionality. This feature is definitely going to be a big improvement to dbt - I'm looking forward to re-prioritizing it!

thalesmello · 2019-05-28T22:08:03Z

Thanks for the explanation @drewbanin 👍

christopher-tin · 2019-10-01T18:08:52Z

+1 as well to adding a dry run feature such as a dbt compile --all command

darrenhaken · 2020-06-12T14:51:04Z

Any movement on this?

giovanni-girelli-sdg · 2020-08-11T08:41:58Z

Hello everyone!
Any progress on this? I think being able to see the actual sql that will be executed is quite the important feature.

alepuccetti · 2020-08-11T08:58:52Z

@giovanni-girelli-sdg To see the actual SQL that dbt will execute you can use dbt compile https://docs.getdbt.com/reference/commands/compile/.

giovanni-girelli-sdg · 2020-08-11T09:15:07Z

Hi @alepuccetti! thanks for the reply, but there you can only see the SELECT statement. What I'd like to see is the actual SQL that will run, such as a MERGE, INSERT or CREATE statement.

Edit at least it's what I can find. Is there a way to see the full materialization code?

Edit2: I had not seen the target/run folder, that's enough for debugging in DEV. I still think it would be great to be able to have a dry run mode where no sql is actually executed, but I can see how that would be much harder to do. Thanks for the great work!

bhtucker · 2020-09-01T22:05:08Z

@mikekaminsky you opened this on my birthday!

davidsr2r · 2020-12-04T00:31:43Z

+1 to this, especially if it allowed the calculation of the cost of the run, and allowed prevention of the run if the cost was too high unless a --force argument was used.

y-tee · 2021-01-14T04:11:35Z

+1

the-davidsn · 2021-05-04T18:29:49Z

+1

fclesio · 2021-05-27T14:40:13Z

+1

giovanni-girelli-sdg · 2021-05-27T19:27:46Z

The more I think about it, the less I think this is possible. There are way too many macros and materializations requiring active database querying to determine the flow - you can't keep it completely dry.

…

On Thu, May 27, 2021 at 4:40 PM Flavio Clesio ***@***.***> wrote: +1 — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#1059 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ANQZGB4JYSUUU5QJQSHCX6DTPZKV7ANCNFSM4F3KU4HQ> .

-- *Giovanni Girelli* Specialist Advanced Business Analytics SDG Consulting Italy Viale del Lavoro 33, 37135 Verona - Italy Mail : ***@***.*** - Mobile: +39 349 1045527 Website <http://www.sdggroup.com/it> | LinkedIn <https://www.linkedin.com/company/sdg-group> | Twitter <https://twitter.com/sdggroup> | YouTube <https://www.youtube.com/user/sdggroup> | Facebook <https://www.facebook.com/SDGGroup>

rabidaudio · 2021-08-27T20:02:54Z

For some reason when searching for this I found #281 but not this issue.

I'm posting about it here with the link so that a reference will show up on the old PR, in case someone else finds the old PR.

drewbanin added this to the Wilt Chamberlain milestone Nov 28, 2018

cmcarthur removed this from the Wilt Chamberlain milestone May 1, 2019

jtcohen6 mentioned this issue Jul 7, 2020

Writing Logs to Separate File in Logs Directory #2617

Closed

jtcohen6 mentioned this issue Apr 8, 2021

Incremental model representation in docs #3222

Closed

jsnb-devoted mentioned this issue Nov 3, 2021

[Feature] "Slim CI" but with LIMIT 0 #4201

Closed

1 task

jtcohen6 mentioned this issue Nov 17, 2021

[Feature] dbt compile to generate as well "run" queries #4227

Closed

1 task

dbt-labs locked and limited conversation to collaborators Dec 8, 2021

jtcohen6 converted this issue into discussion #4456 Dec 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

This issue was moved to a discussion.

DBT should have a Dry-Run mode #1059

DBT should have a Dry-Run mode #1059

mikekaminsky commented Oct 13, 2018

mikekaminsky commented Oct 14, 2018

hui-zheng commented Apr 25, 2019

thalesmello commented May 28, 2019

drewbanin commented May 28, 2019

thalesmello commented May 28, 2019

christopher-tin commented Oct 1, 2019

darrenhaken commented Jun 12, 2020

giovanni-girelli-sdg commented Aug 11, 2020

alepuccetti commented Aug 11, 2020

giovanni-girelli-sdg commented Aug 11, 2020 •

edited

Loading

bhtucker commented Sep 1, 2020

davidsr2r commented Dec 4, 2020

y-tee commented Jan 14, 2021

the-davidsn commented May 4, 2021

fclesio commented May 27, 2021

giovanni-girelli-sdg commented May 27, 2021 via email

rabidaudio commented Aug 27, 2021

This issue was moved to a discussion.

This issue was moved to a discussion.

DBT should have a Dry-Run mode #1059

DBT should have a Dry-Run mode #1059

Comments

mikekaminsky commented Oct 13, 2018