Skip to content

Latest commit

 

History

History
118 lines (70 loc) · 5.03 KB

CHANGELOG.md

File metadata and controls

118 lines (70 loc) · 5.03 KB

Changelog

v0.11.0

-chore from @0323pin, updates deps, fixes clippy lints, updates rust version to 1.81.0.

v0.10.1

-chore Update deps from @0323pin and includes fix for clippy lint

v0.10.0

  • feat Update deps from @0323pin

v0.8.0

  • Bugfix Fixed output column reordering
  • Updated deps and added dependabot

v0.7.5

  • [Bugfix] Invalid thirdparty file fixed

v0.7.4

  • [Improvement] Updated deps to get latest gzp and thereby latest flate2, fix thirdparty.yml

v0.7.3

  • [Improvement] Updated deps to get latest gzp and thereby latest flate2

v0.7.2

  • Improvement Update from structopt to clap v3
  • [Improvement] Updated deps

v0.7.1

  • bugfix Apply unescaping to headers when selecting via -F

v0.7.0

  • Improvement Allow for passing escaped sequences in for the input and output delimiteres. i.e '\t' or '\n' can be passed in directly instead of needing to add $'\n' on the command line.
  • Improvement Add a flag -I / --use-input-delim to reuse the input delimiter if it is a literal as the output delimiter. An error will be raised if -L is not specified or if -D is specified.

v0.6.7

  • Bugfix allow arguments of options to start with a "-".

v0.6.6

  • Performance improvements. On literal byte separators hck is now faster than it previously was and is much faster on inputs that only use the first few fields in a line.

v0.6.5

  • Bugfix for files using fast-path code that have empty first lines

v0.6.4

  • Added thirdparty license in prep for conda-forge

v0.6.3

  • Change the number of compression threads used by default
  • Update gzp version

v0.6.2

  • Output BGZF output by default, which can be indexed and queried with tabix
  • Read input gzipped files a MultiGzDecoder which is more flexible

v0.6.1

  • PGO build fix

v0.6.0

  • Speed up edge case where columns may have already been consumed i.e. -f1-3,5,2-
  • Remove pigz binary from compression search
  • Add native gz decompression
  • Add native gz output compression via gzp

v0.5.4

  • Bugfix Better handling of duplicate selected fields, fixed output ordering when duplicate fields were selected. Added clarification to README regarding mixing by-index and by-header field selction / reordering.

v0.5.3

  • Bugfix, allow headers specified to be excluded to not be found

v0.5.2

  • PR24 Removed the now defunct profile guided optimization shell scripts and all references to them in favor of the justfile that was added in v0.5.0
  • Bugfix fixes incorrect handling of header line for non-stdin inputs, fixes incorrect parsing of last header fields (now strips newline before matching), fixes option parsing so that the -F and -E options wont' try to consume the positional input arguments. Huge thanks to @learnbyexample for their detailed bug report.
  • Change: An error will now be raised when a specified header is not found. This differs from the convention used by the selecion-by-index, which tries to match cut. The reasoning is that it is generally harder to type out each header field and if a header is not found you want to know about it.

v0.5.1

  • Fix the version in the binary to match the actual version

v0.5.0

  • Added --exclude|-e flag to select a set of fields to exclude. These fields may overlap with the -f flag and take precedence over fields selected by -f.
  • Added --exclude-header|-E flag to select a set of headers to exclude. These may mix and match with -e -f and F. The -r flag will cause the headers to be treated as a regex.
  • As part of the -e additional, the default behavior if now headers or fields are specified is to assume -f1-, which allow the user to do hck -e 3,8,290.
  • pigz is now a supported decompression binary, if it's not present hck defaults back to gzip.
  • Decided against adding a greedy heuristic because it actually had worse performance on the most common case of \s (but better on [[:space:]], which was odd).
    • The place where this would make sense would be searching a literal space character greedily (like awk), but that kind of goes against the way the delimiters are documented to work
    • It may be worth adding that special case at some point?
  • Moves CI to using justfile instead of pgo scripts.
  • Fixes several issues in benchmarks
    • choose was not using fastest path and had the wrong input file
    • All tools splitting by a space regex were incorrectly parsing the header line in the multichar data, header line is now fixed
  • Fixes bug with reordering of only two fields

v0.4.2

  • Fixed deb CI