[Feature] Add unix-style fqn wildcard selector method #6599

z3z1ma · 2023-01-12T23:10:34Z

resolves #6598

Description

Adds a new selector method, called wildcard, plus tests. It uses fnmatch from the Python stdlib.
Reference here

We look at the fqn as a string:

project.resource.some.dir.my_model
project.resource.some.dir.schema_file.dbt_utils_expression_is_true_my_model_column_a

We then simply use fnmatch to determine whether it is selected.

Here's a convoluted example just to show how flexible and robust you can imagine this being:

dbt test -s 'wildcard:project.*.*.dir.schema_file.*_column_a'

Also you can see the tests as a reference for playing with it if you want to continue imagining what it opens up.
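As a rough illustration of the matching described above, here is a minimal sketch that joins each fqn into a dotted string and filters with the stdlib. The list of fqns is just the two examples from this description, and `fnmatchcase` is used (an assumption of this sketch, the PR only says fnmatch) so the result does not depend on the operating system's case rules.

```python
from fnmatch import fnmatchcase

# The two example fqns from the description, already joined with ".".
fqns = [
    "project.resource.some.dir.my_model",
    "project.resource.some.dir.schema_file.dbt_utils_expression_is_true_my_model_column_a",
]

# Pattern taken from the example command (without the `wildcard:` prefix).
pattern = "project.*.*.dir.schema_file.*_column_a"

# `*` matches any run of characters, including dots, so a single `*`
# can span more than one fqn segment.
selected = [fqn for fqn in fqns if fnmatchcase(fqn, pattern)]
print(selected)
```

Only the second fqn (the test on `column_a`) is selected; the plain model does not contain `.schema_file.` and is skipped.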

Checklist

I have read the contributing guide and understand what's expected of me
I have signed the CLA
I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
I have opened an issue to add/update docs, or docs changes are not required/relevant for this PR
I have run changie new to create a changelog entry

aranke

LGTM, thank you!

ChenyuLInx

Hey @z3z1ma, thanks for this amazing PR that provides a powerful feature! Great work thinking through which selector methods should support wildcards!

I want to make sure I understand the exact functionality you are adding here.
Of the existing selectors, the following now support wildcards (the rest are more status-based, so they don't need wildcard matching):

FQN
Tag
Source
Exposure
Metric
File
Package
TestName

You also added a new one, Wildcard.

After playing with it a bit, I have some questions and requests.

The FQN selector supports everything the Wildcard selector supports, plus special logic that makes the following possible. Is that correct? Is there anything that the Wildcard selector supports that the new FQN selector doesn't? If not, should we consider either keeping only the new FQN behavior, or removing wildcard matching from FQN and adding the new wildcard selector method, whichever feels more consistent with the other selector methods?

dbt-core/test/unit/test_graph_selector_methods.py

Lines 659 to 660 in 60fde4d

    
    assert search_manifest_using_method(
        manifest, method, 'pkg.unions') == {'union_model', 'mynamespace.union_model'}

Would you mind adding some new tests for the selectors that now support wildcard matching? For example, for test_select_tag maybe we can add something like
```
assert search_manifest_using_method(manifest, method, 'uses_eph*') == {
        'view_model', 'table_model'}
```
Only because this change is so powerful, I would like us to think a bit more about whether this fnmatch matching could cause:
1. additional nodes being selected by users' existing selection commands (@z3z1ma @aranke you two probably know the use cases better than I do)
2. performance issues in selection on large projects (@jtcohen6 should we test this on a large test project before merging?) (EDIT: Thinking again, this shouldn't cause performance issues for existing selections. It might be slow for new selections that use wildcards.)
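To make the first concern concrete, here is a small sketch of wildcard matching on tags. The `node_tags` mapping and `select_by_tag` helper are hypothetical and are not the dbt test fixtures or dbt's selector code; the point is only that a value without `*`, `?`, or `[]` behaves like an exact match under fnmatch, so existing tag selections should pick the same nodes as before.

```python
from fnmatch import fnmatchcase

# Hypothetical node -> tags mapping, for illustration only.
node_tags = {
    "view_model": ["uses_ephemeral"],
    "table_model": ["uses_ephemeral"],
    "union_model": ["unions"],
}


def select_by_tag(pattern: str) -> set:
    """Return the nodes that have at least one tag matching the pattern."""
    return {
        name
        for name, tags in node_tags.items()
        if any(fnmatchcase(tag, pattern) for tag in tags)
    }


print(select_by_tag("uses_eph*"))  # wildcard: matches by prefix
print(select_by_tag("unions"))     # no metacharacters: exact match only
print(select_by_tag("uses_eph"))   # a bare prefix without `*` selects nothing
```

On the performance point, each check is one fnmatch call per candidate value, so any extra cost should scale with the number of nodes and tags being compared.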

z3z1ma · 2023-02-28T06:45:10Z

> The FQN selector supports everything the Wildcard selector supports, plus special logic that makes the following possible. Is that correct? Is there anything that the Wildcard selector supports that the new FQN selector doesn't? If not, should we consider either keeping only the new FQN behavior, or removing wildcard matching from FQN and adding the new wildcard selector method, whichever feels more consistent with the other selector methods?

The Wildcard selector is an artifact from the initial spike in which I had not planned on updating the other selectors.
After some iteration I found I could implement the functionality without any breaking change. So consider that the Wildcard selector will be dropped since the functionality is native to FQN.

We should definitely keep the wildcard matching capabilities on FQN because, as the default selector, it is significantly more accessible to users. Furthermore, existing use of the * character is functionally unchanged by this update. This means it's a net new feature that changes nothing for users who opted to use the old FQN "selector glob". Telling users they can run models using wildcard goodies out of the box, without explicitly specifying wildcard:, is big.

| Command | Old Outcome | New Outcome |
| --- | --- | --- |
| `dbt run -s something.*` | Select all nodes from package `something` | Select all nodes from package `something` |
| `dbt run -s something.idk_*` | Select all nodes whose fqn starts with `[something, idk*, ...]` (weird right) | Select all nodes from package `something` whose next fqn segment starts with `idk_` |

Unless a user has * or ? or [] in their filenames, which should be impossible or at least an ill-advised move, we should be shored up against any possibility (that I can conjure) of this not being backwards compatible. If anything, this might be more intuitive, because I believe we have all tried globs at one point thinking they would work in the middle of an fqn segment.
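The backwards-compatibility argument above can be sketched roughly as follows. This is an illustrative approximation, not the code in this PR: `matches_fqn` and `WILDCARD_CHARS` are hypothetical names, and the sketch assumes segments are compared exactly up to the first segment containing a wildcard character, after which the remainder is glob-matched as one dotted string.

```python
from fnmatch import fnmatchcase
from typing import List

# Characters that fnmatch treats as pattern syntax.
WILDCARD_CHARS = ("*", "?", "[", "]")


def matches_fqn(fqn: List[str], selector: str) -> bool:
    """Hypothetical sketch of fqn selection with unix-style wildcards."""
    parts = selector.split(".")
    if len(parts) > len(fqn):
        return False
    for i, part in enumerate(parts):
        if any(ch in part for ch in WILDCARD_CHARS):
            # From the first wildcard segment onward, glob-match the rest.
            return fnmatchcase(".".join(fqn[i:]), ".".join(parts[i:]))
        if fqn[i] != part:
            return False
    # No wildcard: the selector is a plain prefix of the fqn.
    return True


# `something.*` still selects everything in the package.
print(matches_fqn(["something", "staging", "orders"], "something.*"))
# A wildcard in the middle of a segment now narrows the match.
print(matches_fqn(["something", "idk_orders"], "something.idk_*"))
print(matches_fqn(["something", "orders"], "something.idk_*"))
```

Under these assumptions, a trailing `.*` keeps its old meaning, while `idk_*` selects only nodes whose next segment starts with `idk_`, which lines up with the two rows of the table.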

Can elucidate more tomorrow (new comment or will edit this one), but do expect a commit that drops Wildcard and updates to the tests @ChenyuLInx

ChenyuLInx · 2023-02-28T06:52:08Z

@z3z1ma Thanks for the fast response!! This makes sense to me!! I was asking the third question only out of caution, not because I think the answer would be no.
Looking forward to your new commit!

z3z1ma · 2023-03-02T04:35:17Z

@ChenyuLInx @aranke

Mentioned updates are complete.

More tests for other methods with extended wildcard enabled
Drop of independent Wildcard selector from initial spike since it is baked into FQN

ChenyuLInx

Looks great! Thanks @z3z1ma !

Updates wildcard selection documentation (added in #3130), based on the implementation we actually ended up with:
- #2702 (comment)
- dbt-labs/dbt-core#6599

Crucially, there is no standalone `wildcard:` method, but rather support for unix-style wildcards in a number of existing methods. See Alex's comment (linked above) for details.

---------

Co-authored-by: Doug Beatty <44704949+dbeatty10@users.noreply.github.com>

resolves #3549

## What are you changing in this pull request and why?

This PR adds the (undocumented!) `resource_type` method that has been possible for a long, long time.

Reason: we discovered a selection method wasn't documented! So we did 😎

## Not done

#3549 mentions that there are no examples for the wildcard method that include `wildcard:`. This seems okay since the intent in dbt-labs/dbt-core#6599 was to use Unix-style wildcards in conjunction with all the other selection methods. So we're choosing not to add `wildcard:` examples in this PR. If needed, they can always be added later.

## 🎩

[Preview](https://deploy-preview-3548--docs-getdbt-com.netlify.app/reference/node-selection/methods#the-resource_type-method)

<img width="550" alt="image" src="https://github.com/dbt-labs/docs.getdbt.com/assets/44704949/c46f18c5-d4ea-4e9a-815d-7f682efa4ee2">

## Checklist

- [x] Review the [Content style guide](https://github.com/dbt-labs/docs.getdbt.com/blob/current/contributing/content-style-guide.md) and [About versioning](https://github.com/dbt-labs/docs.getdbt.com/blob/current/contributing/single-sourcing-content.md#adding-a-new-version) so my content adheres to these guidelines.

---------

Co-authored-by: mirnawong1 <89008547+mirnawong1@users.noreply.github.com>
Co-authored-by: Doug Beatty <44704949+dbeatty10@users.noreply.github.com>

z3z1ma requested a review from a team January 12, 2023 23:10

z3z1ma requested a review from a team as a code owner January 12, 2023 23:10

z3z1ma requested review from aranke and nathaniel-may January 12, 2023 23:10

cla-bot bot added the cla:yes label Jan 12, 2023

z3z1ma changed the title ~~[Feature] Add unix-style wildcard selector method~~ [Feature] Add unix-style fqn wildcard selector method Jan 12, 2023

z3z1ma mentioned this pull request Jan 12, 2023

Add wildcard selector method to dbt docs dbt-labs/docs.getdbt.com#2702

Closed

1 task

dbeatty10 mentioned this pull request Jan 17, 2023

[CT-1811] [Feature] Unix-style wildcard fqn selector method via fnmatch #6598

Closed

3 tasks

jtcohen6 added the Team:Execution and ready_for_review (Externally contributed PR has functional approval, ready for code review from Core engineering) labels Jan 19, 2023

dbeatty10 mentioned this pull request Feb 8, 2023

[CT-2059] [Feature] Syntax to restrict selection to the current package #6891

Closed

3 tasks

aranke approved these changes Feb 27, 2023

ChenyuLInx reviewed Feb 28, 2023

z3z1ma added 3 commits March 1, 2023 21:14

✨ add unix-style wildcard selector method

2e50628

🔖 add changie entry for contribution

3e9d9df

✨ add fnmatch capapbility to all string-matching based selectors

906821b

z3z1ma force-pushed the feature/fnmatch-selector-method branch from 60fde4d to 906821b Compare March 2, 2023 04:14

✅ add tests for other wildcard enabled selectors and drop wildcard me…

995b688

…thod

ChenyuLInx approved these changes Mar 2, 2023

aranke merged commit 24ca76e into dbt-labs:main Mar 5, 2023

acurtis-evi pushed a commit to acurtis-evi/dbt-core that referenced this pull request Mar 7, 2023

[Feature] Add unix-style fqn wildcard selector method (dbt-labs#6599)

50b40dc

resolves dbt-labs#6598

acurtis-evi mentioned this pull request Mar 8, 2023

1.4.next acurtis-evi/dbt-core#5

Draft

6 tasks

jtcohen6 mentioned this pull request Apr 14, 2023

Fix wildcard selection dbt-labs/docs.getdbt.com#3189

Merged

dbeatty10 mentioned this pull request Jun 21, 2023

Update methods.md dbt-labs/docs.getdbt.com#3548

Merged

1 task
