When dbt finds existing approximate matches via `adapter.get_relation()` use the same quoting when printing error message #4187

dataders · 2021-11-02T18:31:51Z

Is there an existing feature request for this?

I have searched the existing issues

Describe the Feature

A user reported this issue to me (https://github.com/firebolt-analytics/dbt-firebolt/issues/62, private repo). Their error message is copied below, and it looks as if there are two models: one that is quoted ("oz_account") and one that isn't (oz_account). In reality, I think there is a namespace overlap? What's driving me crazy is that 4/5 of current users are not having this quoting issue and only one user is, so I don't understand how this can happen when the adapter code is the same...

My suggestion would be to quote both names in the message, so users don't think there's a quoting issue going on and instead see that the real problem is that the model name is already in use.

Completed with 3 errors and 0 warnings:

Compilation Error in model account (models/adtech/staging/account.sql)
  When searching for a relation, dbt found an approximate match. Instead of guessing 
  which relation to use, dbt will move on. Please delete "oz_account", or rename it to be less ambiguous.
  Searched for: oz_account
  Found: "oz_account"
  
  > in macro materialization_table_firebolt (macros/materializations/table.sql)
  > called by model account (models/adtech/staging/account.sql)
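To make the suggestion concrete, here is a minimal sketch of printing both names with the same quoting. The helpers `quote_identifier` and `format_approximate_match` are hypothetical names made up for illustration; they are not part of dbt-core, and the real message is built elsewhere.

```python
def quote_identifier(name: str, quote_char: str = '"') -> str:
    """Return `name` wrapped in exactly one pair of quote characters.

    Hypothetical helper: any quotes already surrounding the name are
    stripped first, so quoted and unquoted inputs render the same way.
    """
    bare = name.strip(quote_char)
    return f"{quote_char}{bare}{quote_char}"


def format_approximate_match(searched: str, found: str) -> str:
    """Render the last two lines of the message with consistent quoting."""
    return (
        f"Searched for: {quote_identifier(searched)}\n"
        f"Found: {quote_identifier(found)}"
    )


print(format_approximate_match('oz_account', '"oz_account"'))
# Searched for: "oz_account"
# Found: "oz_account"
```

With both lines rendered identically, the message no longer suggests that quoting is what separates the two relations.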

Describe alternatives you've considered

It's possible this user is experiencing #3835. I'm going to have the user try the fix proposed in #4076 and see whether it resolves their problem. If so, we know what's going on.

If #4076 does not fix the user's issue, I'll have to dig deeper into adapter.get_relation() to see what's going on.

Who will this benefit?

users of dbt-firebolt!

Are you interested in contributing this feature?

Anything else?


jtcohen6 · 2021-11-17T14:28:52Z

@swanderz Thanks for the report! This certainly brings me back to an early day of dbt, when quoting, casing, and caching were all the rage.

I haven't been able to reproduce this on other adapters, as much as I mess with quoting, casing, aliasing, etc. Would you be able to create a simple reproduction case that manages to produce this error on the adapter? I'd be curious to step line by line through the matches logic. Ultimately, these methods do live on an adapter object (BaseRelation), so their logic can be overridden + reimplemented in the adapter plugin if needed.

The approximate_relation_match exception is raised in a very specific case: dbt finds a relation that's an approximate_match (case-insensitive comparison), but not an _is_exactish_match. The logic for _is_exactish_match is a bit peculiar for adapters that turn off quoting (e.g. Snowflake, Firebolt), as dbt will perform a case-insensitive comparison, but only for relations it believes it has created:

dbt-core/core/dbt/adapters/base/relation.py

Lines 32 to 33 in 719b202

    if self.dbt_created and self.quote_policy.get_part(field) is False:
        return self.path.get_lowered_part(field) == value.lower()
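For anyone following along, here is a simplified, self-contained sketch of how the two checks interact. `SketchRelation` and `classify` are hypothetical stand-ins written for this comment, not dbt's `BaseRelation`. The sketch only models the identifier, and it assumes the fallback branch is an exact, case-sensitive comparison.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SketchRelation:
    """Hypothetical stand-in for a relation; not dbt's BaseRelation."""

    identifier: str
    quote_identifier: bool = False  # adapter quote policy for identifiers
    dbt_created: bool = False       # True only if dbt believes it made this

    def is_exactish_match(self, value: str) -> bool:
        # Case-insensitive only when quoting is off AND dbt created it.
        if self.dbt_created and self.quote_identifier is False:
            return self.identifier.lower() == value.lower()
        # Assumed fallback: exact, case-sensitive comparison.
        return self.identifier == value

    def is_approximate_match(self, value: str) -> bool:
        # Always case-insensitive.
        return self.identifier.lower() == value.lower()


def classify(relation: SketchRelation, searched: str) -> str:
    """Return 'exact', 'approximate' (the error case), or 'none'."""
    if relation.is_exactish_match(searched):
        return "exact"
    if relation.is_approximate_match(searched):
        return "approximate"
    return "none"


print(classify(SketchRelation("OZ_ACCOUNT", dbt_created=True), "oz_account"))   # exact
print(classify(SketchRelation("OZ_ACCOUNT", dbt_created=False), "oz_account"))  # approximate
```

The second call is the situation that raises the error: the names differ only by case, and the relation is not marked as created by dbt.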

How does it know which relations it has created? Based on the relations returned by each materialization:

dbt-core/core/dbt/task/run.py

Lines 276 to 277 in 719b202

    for relation in self._materialization_relations(result, model):
        self.adapter.cache_added(relation.incorporate(dbt_created=True))

(I took a quick glance, and it seems that dbt-firebolt's reimplemented materializations are all doing this properly. I did notice that the test task reimplements execute, and is thereby missing the piece that adds returned relations to the cache... so perhaps it's theoretically possible to trip this error with an ambiguously aliased test + --store-failures, but I haven't been able to do it.)

I believe this is a different error from #3835. In that case, we want to raise the approximate_relation_match error, to help out the end user, but we don't because one relation has an extra set of explicit quotes. In this case, we are raising the error, so it seems like it can't be a quoting issue...

> In reality, I think there is a namespace overlap?

Given what I said about the dbt_created property above, you could be right about this!

github-actions · 2022-05-18T02:09:40Z

This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please remove the stale label or comment on the issue, or it will be closed in 7 days.

github-actions · 2022-05-25T02:14:06Z

Although we are closing this issue as stale, it's not gone forever. Issues can be reopened if there is renewed community interest; add a comment to notify the maintainers.

joemirizio · 2022-11-15T22:00:01Z

I believe we are experiencing this issue with the Netezza adapter, which has an identifier quoting policy of False. When relations are loaded into the cache and then matched against, BaseRelation._is_exactish_match fails. This happens in certain macros and in dbt server.

After debugging, we’ve found that BaseRelation._is_exactish_match returns False because it compares the table identifiers exactly rather than lowercasing both sides of the comparison. Lowercasing is ONLY done if the quoting policy is set to False (it is) AND the relation's dbt_created field is True.

When we dug into the cache_added method, we saw that the incoming relation’s dbt_created field is correctly set to True, but when the relation is retrieved from the cache, the field is False. Because dbt_created is not preserved across a cache read, the match fails.

Steps to reproduce

1. Using any adapter, set a breakpoint at https://github.com/dbt-labs/dbt-core/blob/main/core/dbt/adapters/base/impl.py#L447
2. dbt run a project with a model
3. Evaluate the following in the debugger

> relation.dbt_created
True
> [cache_relation for cache_relation in self.cache.get_relations(relation.database, relation.schema) if cache_relation.identifier == relation.identifier.lower()][0].dbt_created
False
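The consequence can be illustrated with a small sketch. It does not reproduce dbt's cache; `SketchRelation` and `LossyCache` are hypothetical, and the cache is assumed to drop `dbt_created` on write purely to mimic the behavior observed in the debugger above.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class SketchRelation:
    """Hypothetical relation; quoting is assumed to be off."""

    identifier: str
    dbt_created: bool = False

    def is_exactish_match(self, value: str) -> bool:
        if self.dbt_created:
            # Case-insensitive path, only for dbt-created relations.
            return self.identifier.lower() == value.lower()
        # Otherwise the identifiers are compared exactly.
        return self.identifier == value


class LossyCache:
    """Hypothetical cache that forgets dbt_created when storing."""

    def __init__(self) -> None:
        self._relations: dict[str, SketchRelation] = {}

    def add(self, relation: SketchRelation) -> None:
        # Assumption for illustration: the flag is reset on the stored copy.
        key = relation.identifier.lower()
        self._relations[key] = replace(relation, dbt_created=False)

    def get(self, identifier: str) -> SketchRelation:
        return self._relations[identifier.lower()]


created = SketchRelation("OZ_ACCOUNT", dbt_created=True)
cache = LossyCache()
cache.add(created)
cached = cache.get("oz_account")

print(created.is_exactish_match("oz_account"))  # True
print(cached.is_exactish_match("oz_account"))   # False
```

The same identifier matches before it goes into the cache and fails afterwards, which is enough to turn an exact match into an approximate one whenever the stored case differs from the searched case.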

dataders added enhancement New feature or request triage labels Nov 2, 2021

dataders mentioned this issue Nov 3, 2021

Remove quote before compare firebolt-db/dbt-firebolt#8

Closed

dataders mentioned this issue Nov 16, 2021

adapter.get_relation mistakes exact match for approximate match firebolt-db/dbt-firebolt#11

Closed

jtcohen6 removed the triage label Nov 17, 2021

dataders mentioned this issue Nov 17, 2021

temp workaround for false approximate match firebolt-db/dbt-firebolt#12

Merged

jtcohen6 added the Team:Adapters Issues designated for the adapter area of the code label Nov 18, 2021

github-actions bot added the stale Issues that have gone stale label May 18, 2022

github-actions bot closed this as completed May 25, 2022
