Pass arbitrary options to sel() #7099

benbovy · 2022-09-28T12:44:52Z

Is your feature request related to a problem?

Currently .sel() accepts two options method and tolerance. These are relevant for default (pandas) indexes but not necessarily for other, custom indexes.

It would be also useful for custom indexes to expose their own selection options, e.g.,

index query optimization like the dualtree flag of sklearn.neighbors.KDTree.query
k-nearest neighbors selection with the creation of a new "k" dimension (+ coordinate / index) with user-defined name and size.

From #3223, it would be nice if we could also pass distinct options values per index.

What would be a good API for that?

Describe the solution you'd like

Some ideas:

A. Allow passing a tuple (labels, options_dict) as indexer value

ds.sel(x=([0, 2], {"method": "nearest"}), y=3)

B. Expose an options kwarg that would accept a nested dict

ds.sel(x=[0, 2], y=3, options={"x": {"method": "nearest"}})

Option A does not look very readable. Option B is slightly better, although the nested dictionary is not great.

Any other ideas? Some sort of context manager? Some Index specific API?

Describe alternatives you've considered

The API proposed in #3223 would look great if method and tolerance were the only accepted options, but less so for arbitrary options.

Additional context

No response

The text was updated successfully, but these errors were encountered:

benbovy · 2022-09-28T12:50:25Z

Another difficulty regarding multi-coordinate indexes: ideally options should be set per index, not per coordinate.

benbovy · 2022-09-28T13:11:01Z

Or we could simply decide that .sel() should not accept arbitrary options and handle special cases, e.g., via accessors.

It would actually make sense to have something like .my_accessor.sel_k_neighbors(). Not so great to have a separate method just for an optimization option, though.

keewis · 2022-09-28T13:20:38Z

another option would be to allow passing a custom object, like

class Indexer:
    def __init__(self, indexer, **options):
        ...

ds.sel(x=Indexer([0, 2], method="nearest"))

I think we wanted to have something like that, anyways, to be able to specify other behaviors of a slice, like right-exclusive?

benbovy · 2022-09-28T14:39:10Z

Or use Indexer objects to group labels + options? This is slightly different than what you suggest:

class Dataset:

    def sel(
        self,
        indexers: Mapping[Any, Any] | Indexer | Iterable[Indexer],
        **indexers_kwargs: Any,
    ):
        ...


class Indexer:
    def __init__(self, labels=None, options=None, **label_kwargs):
        ...

Let's assume a Dataset with lat / lon coordinates both sharing the same geographic index + another time dimension coordinate, then we could write:

indexers = [
    Indexer(lon=[2, 15], lat=[45, 48], options={"foo": "bar"}),
    Indexer(time="2022-01-01"),
]

ds.sel(indexers)

This could also be used to avoid code duplication when using common selection options for different indexes.

benbovy added the enhancement label Sep 28, 2022

benbovy mentioned this issue Sep 28, 2022

Explicit indexes: next steps #6293

Open

49 tasks

benbovy mentioned this issue Nov 21, 2022

Create a to/from xarray method/function sunpy/ndcube#222

Open

benbovy mentioned this issue Jan 24, 2023

Changes to optionally return distances with .sel xarray-contrib/xoak#41

Open

benbovy mentioned this issue Jul 19, 2023

Rotation Functional Index example #8004

Open

benbovy mentioned this issue Aug 30, 2023

Should sel with slice objects care about underlying coordinate order? #1613

Open

benbovy mentioned this issue Nov 3, 2023

xarray-dggs package? pangeo-data/bids2023_codesprint#3

Open

max-sixty changed the title ~~Pass arbitray options to sel()~~ Pass arbitrary options to sel() Apr 30, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pass arbitrary options to sel() #7099

Pass arbitrary options to sel() #7099

benbovy commented Sep 28, 2022 •

edited

Loading

benbovy commented Sep 28, 2022

benbovy commented Sep 28, 2022

keewis commented Sep 28, 2022

benbovy commented Sep 28, 2022

Pass arbitrary options to sel() #7099

Pass arbitrary options to sel() #7099

Comments

benbovy commented Sep 28, 2022 • edited Loading

Is your feature request related to a problem?

Describe the solution you'd like

Describe alternatives you've considered

Additional context

benbovy commented Sep 28, 2022

benbovy commented Sep 28, 2022

keewis commented Sep 28, 2022

benbovy commented Sep 28, 2022

benbovy commented Sep 28, 2022 •

edited

Loading