Skip to content

Latest commit

 

History

History
77 lines (51 loc) · 2.09 KB

CHANGELOG.md

File metadata and controls

77 lines (51 loc) · 2.09 KB

v1.2.5

  • Onehot transform returns same columns always
  • Missing value and unknown handling now configurable in all relevant encoders

v1.2.4

  • Added more sophisticated missing value or unknown category handling to ordinal
  • Passing through missing value config from onehot into ordinal
  • Onehot will return an extra column when unknown categories are passed in if impute is used.
  • Added BaseNEncoder to allow for more flexible alternatives to ordinal, onehot and binary.

v1.2.3

  • Full support for numpy arrays as input, not just dataframes.

v1.2.2

  • All encoders handle missing values and are tested for their handling
  • Created a onehot encoder that follows the same conventions as the rest of the library instead of using sklearns.
  • Did some basic benchmarking for data compression and memory usage, made some performance improvements
  • Changed all docstrings to numpy style and added more documentation
  • Moved all logic methods into staticmethods of the transformer classes themselves.
  • Added more detailed checks for type and shape of input data in fit and transform
  • Support input as list of lists, alongside numpy arrays and pandas dataframes.

v1.2.1

  • Better handling for missing values in hashing encoder

v1.2.0

  • Testing enhancements
  • Hash type in hashing encoder now defaults to md5 using hashlib, but can be set to any valid hashlib hash

v1.1.2

  • Added optional parameter to return a numpy array rather than a dataframe from all transformers.

v1.1.1

  • Immediately return if cols is empty.

v1.1.0

  • Optionally pass drop_invariant to any encoder to consistently drop columns with 0 variance from the output (based on training set data in fit())
  • If None is passed as the cols param, every string column will be encoded (pandas type = object).

v1.0.5

  • Changed setup.py to not explicitly force reinstalls of other packages

v1.0.4

  • Bugfixes

v1.0.0

  • First real usable release, includes sklearn compatible encoders.

v0.0.1

  • Basic library of encoders, no automated testing.