Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Pre-Encoding for Correlations #488

Closed
reza1615 opened this issue May 20, 2021 · 5 comments
Closed

Pre-Encoding for Correlations #488

reza1615 opened this issue May 20, 2021 · 5 comments
Labels
enhancement New feature or request

Comments

@reza1615
Copy link

As you know Correlations just works on the numerical columns and for the categorical doesn't work.
It would be nice to have an option to one-hot (encode) all categorical columns in a batch after that user can get the Correlations.

@aschonfeld aschonfeld added the enhancement New feature or request label May 21, 2021
@aschonfeld
Copy link
Collaborator

Can you refresh my memory on how the "one-hot" (encode) works? We made column builders for this, correct?

@reza1615
Copy link
Author

reza1615 commented May 24, 2021

yes we have column builder.
Image the data has 20 categorical column. to have onehot for all of them to get correlation we should 20 times in column builder create onehot column
this request is to create 20 column's one hot in one request

@aschonfeld
Copy link
Collaborator

  • So what if I added a toggle which checks for any string or categorical column and auto convert using OneHotEncoder
  • Also what if there was a toggle to add in date columns which have been converted to millisecond timestamps

@reza1615
Copy link
Author

reza1615 commented May 25, 2021 via email

@aschonfeld
Copy link
Collaborator

added in v1.48.0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants