Feature: %string to numerical value conversion #6

vukosim · 2016-03-06T12:28:15Z

Some datasets store percentage values as strings, e.g. '95%', '82%', etc.

It would be great if this could be handled automatically. On a pandas DataFrame this can be done with:

df = df.replace('%','',regex=True).astype('float')
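For illustration, here is that one-liner on a toy DataFrame. The column names and values are made up for this example. Note that it acts on the whole frame, so every column has to be convertible to float once the '%' is stripped.

```python
import pandas as pd

# Hypothetical data: every cell is a percentage stored as a string.
df = pd.DataFrame({
    "accuracy": ["95%", "82%"],
    "recall": ["70%", "64.5%"],
})

# Strip every '%' (regex=True makes replace() work on substrings),
# then cast all columns to float. Values stay on the 0-100 scale.
converted = df.replace("%", "", regex=True).astype("float")
print(converted)
```

If the frame also held a column of ordinary text, the `astype` call would raise a `ValueError`, so in practice the conversion likely needs to be applied per column.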

rhiever · 2016-03-06T12:44:29Z

I like this idea. Before we implement it I want to consider how this might affect other input values that aren't percentages.

For example, what if the user passes a DataFrame with class labels that are, say, ">50%" and "<=50%"? We obviously don't want to parse those into numerical percentages, nor do we want to strip the percent signs from them.

Perhaps one way to accomplish this is:

1. Check if the column is of type 'object'. If not, then it won't contain a '%' anyway.
2. Check if any entry in the column contains a '%'. If not, skip the column.
3. Make a copy of the column and apply the transformation you suggested. If it doesn't crash, then it very likely was a string encoding of a percentage. If it does crash, then it probably was some other string(s) that contained %s.
4. In the non-crashing case, apply the change to the column.
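The steps above can be sketched roughly as follows. This is only a minimal sketch, not an agreed implementation: the function name `convert_percent_columns` and the example data are hypothetical, and the dtype check also accepts pandas' dedicated string dtype in addition to 'object'.

```python
import pandas as pd
from pandas.api.types import is_object_dtype, is_string_dtype


def convert_percent_columns(df: pd.DataFrame) -> pd.DataFrame:
    """Return a copy of df with percentage-string columns cast to float."""
    result = df.copy()
    for name in result.columns:
        column = result[name]
        # Step 1: only object/string columns can contain a '%'.
        if not (is_object_dtype(column) or is_string_dtype(column)):
            continue
        # Step 2: skip columns in which no entry contains a '%'.
        as_text = column.astype(str)
        if not as_text.str.contains("%", regex=False).any():
            continue
        # Step 3: try the conversion on a copy of the column.
        try:
            candidate = as_text.str.replace("%", "", regex=False).astype(float)
        except ValueError:
            # Some entry was not a plain number once '%' was removed
            # (e.g. '>50%'), so leave the column untouched.
            continue
        # Step 4: the conversion succeeded, so apply it.
        result[name] = candidate
    return result


# Hypothetical example: 'score' holds percentages, 'label' holds class labels.
example = pd.DataFrame({
    "score": ["95%", "82%"],
    "label": [">50%", "<=50%"],
    "count": [1, 2],
})
converted = convert_percent_columns(example)
print(converted.dtypes)
```

One thing to keep in mind with this approach: Python's float parsing also accepts strings such as "nan", "inf" and "1e3", so an entry like "inf%" would be converted without an error.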

Are there any cases that such a procedure would miss and incorrectly encode?

MagnetonBora · 2016-10-09T21:35:46Z

Will try to implement this feature request.

rhiever · 2016-10-10T17:15:04Z

Looking forward to it! 👍

vukosim · 2016-10-10T17:33:51Z

Had forgotten about this, would be great 👍

rhiever added the enhancement label Mar 6, 2016
