Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

string column cleaning in build column #340

Closed
reza1615 opened this issue Dec 11, 2020 · 5 comments
Closed

string column cleaning in build column #340

reza1615 opened this issue Dec 11, 2020 · 5 comments

Comments

@reza1615
Copy link

have an option to replace _ to space in the build column > cleaning, i.e. New_York > New York

Also for the word counts in the column description, it is much better to have a check box to ask do you want to remove _ or convert _ to space. now for the cases which they have _ it is useless. for example, see Fort_Nelsson
image

@aschonfeld
Copy link
Collaborator

Can you give me the list of "Cleaners" you want available? I can make a multi-select dropdown of ones that don't require additional parameters (meaning we can't using "Drop Stop Words" or "Update Word Case"). The screen is already cluttered enough as it is, but I'm happy to add a multi-select for cleaners which will be applied in the order they are selected.

@reza1615
Copy link
Author

reza1615 commented Dec 12, 2020

in my opinion all cleaning that we have + replace _ with space
There are different use case for each function for example to check most mentioned street name we don't need numbers or punctuation
to check most mention zipcode we don't need alphabet chars
and so on
so we can have all of them in the dropdown

@aschonfeld
Copy link
Collaborator

These are the options I can offer:

  • Replace Multi-Space w/ Single-Space
  • Remove Punctuation
  • Remove Numbers
  • Keep Only Alpha
  • Normalize Accent Characters
  • Remove Spaces
  • Drop Repeated Words
  • Add Space Between Word and Numbers
  • Remove Repeated Chars
  • Update Space Values to Empty String

@reza1615
Copy link
Author

multi space doesn't need because it doesn't effect here
please also add replace _ with space

@aschonfeld
Copy link
Collaborator

added in v1.28.0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants