[FEATURE] `get_feature_names_out` for `sklego.preprocessing` transformers. #543

`get_feature_names_out` is an important component for interpreting scikit-learn `Pipeline` objects. A `get_feature_names_out` call on a `Pipeline` only works if the method is implemented for every transformer in the pipeline; the last step (i.e. the model) is usually excluded, e.g. via `pipeline[:-1].get_feature_names_out()`.

Scikit-learn recently implemented `get_feature_names_out` for all Transformers in their 1.1 release ([Source](https://scikit-learn.org/stable/auto_examples/release_highlights/plot_release_highlights_1_1_0.html?highlight=scikit+learn+release#get-feature-names-out-available-in-all-transformers)).

I think it makes sense to also implement `get_feature_names_out` for all `scikit-lego` Transformers that are not models and are not `TrainOnly`. This leaves most objects in `sklego.preprocessing`.

- [x] `sklego.preprocessing.ColumnCapper`
- [x] `sklego.preprocessing.DictMapper`
- [x] `sklego.preprocessing.IdentityTransformer`
- [x] `sklego.preprocessing.IntervalEncoder`
- [x] ~~`sklego.preprocessing.OutlierRemover` (TrainOnly)~~
- [x] `sklego.preprocessing.PandasTypeSelector`
- [x] `sklego.preprocessing.ColumnSelector`
- [x] `sklego.preprocessing.ColumnDropper`
- [x] `sklego.preprocessing.PatsyTransformer`
- [x] `sklego.preprocessing.OrthogonalTransformer`
- [x] `sklego.preprocessing.InformationFilter`
- [x] ~~`sklego.preprocessing.RandomAdder` (TrainOnly)~~
- [x] `sklego.preprocessing.RepeatingBasisFunction`

Additionally, we should test whether `get_feature_names_out` works correctly with a `Pipeline` that contains transformers inheriting from `TrainOnlyTransformerMixin`, such as `RandomAdder`.

@koaning and I recently discussed implementing `get_feature_names_out` for `sklego.meta` and ended up implementing this method for `EstimatorTransformer` (PR #539). It does not look like objects in `sklego.decomposition` and `sklego.mixture` require an implementation of `get_feature_names_out`, because it seems they are mostly used as the last step in a pipeline or wrapped in an `EstimatorTransformer`. 

Since this is such a systematic issue, we could consider adding a requirement for people contributing to `sklego.preprocessing`: implement `get_feature_names_out` for any new preprocessor that is not a train-time-only Transformer.
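As a rough idea of what such a requirement could look like in practice, below is a minimal sketch of a transformer that implements `get_feature_names_out`. `SquaredFeatureAdder` and its `_squared` naming scheme are hypothetical and only serve as an illustration; this is not an existing `sklego` class, and the actual implementations may handle input validation differently.

```python
import numpy as np
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.utils.validation import check_is_fitted


class SquaredFeatureAdder(BaseEstimator, TransformerMixin):
    """Hypothetical transformer: appends the square of every input column."""

    def fit(self, X, y=None):
        X = np.asarray(X)
        # Remember the number of input columns so names can be validated later.
        self.n_features_in_ = X.shape[1]
        return self

    def transform(self, X):
        check_is_fitted(self, "n_features_in_")
        X = np.asarray(X)
        # Output layout: original columns first, then their squares.
        return np.hstack([X, X**2])

    def get_feature_names_out(self, input_features=None):
        check_is_fitted(self, "n_features_in_")
        if input_features is None:
            # Fall back to generic names when none are given.
            input_features = [f"x{i}" for i in range(self.n_features_in_)]
        if len(input_features) != self.n_features_in_:
            raise ValueError(
                f"Expected {self.n_features_in_} input features, "
                f"got {len(input_features)}."
            )
        # Names must follow the same column order as transform().
        names = list(input_features) + [f"{name}_squared" for name in input_features]
        return np.asarray(names, dtype=object)


adder = SquaredFeatureAdder().fit(np.array([[1.0, 2.0], [3.0, 4.0]]))
names = adder.get_feature_names_out(["a", "b"])
transformed = adder.transform(np.array([[1.0, 2.0], [3.0, 4.0]]))
```

The main design point is that the returned names line up one-to-one with the columns produced by `transform`, so a contributor test could simply compare `len(names)` against `transformed.shape[1]`.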

