Skip to content

Releases: aloneguid/parquet-dotnet

4.16.3

04 Sep 08:59
Compare
Choose a tag to compare

Delta encoding can be optionally turned off (thanks to @itayfisz for suggestion in #392).

4.16.2

22 Aug 16:20
Compare
Choose a tag to compare

Critical Bug Fix in DELTA_BINARY_PACKED Decoding: Adding first value to destination array before reading the block, by @ee-naveen in #391.

4.16.1

21 Aug 10:40
Compare
Choose a tag to compare

Critical Bug Fixes

  • Ensuring delta encoding footer blocks are complete And Handle Overflow by @ee-naveen in #387.
  • Use PLAIN encoding for columns without defined data by @spanglerco in #388.

4.16.0

17 Aug 11:41
Compare
Choose a tag to compare

New

  • Markdown documentation fully migrated to GitHub Pages. It was becoming slightly unmanageable and also recent GitHub updates made markdown files look awful. Also I kind of wanted to try Writerside by JetBrains, and publish docs with pride ;) @aloneguid
  • Class deserializer will now skip class properties and not throw an exception if they are missing in the source parquet file. Thanks to @greenlynx in #361.
  • Column statistics can be read with zero cost without reading the data. Thanks to @mirosuav in #252, #368.
  • Support for DELTA_BINARY_PACKED encoding on write. This encoding is now default when writing INT32 and INT64 columns. Most of the work done by @ee-naveen in #382.

Improvements

  • IronCompress was updated to v1.5.1 by @aloneguid.

Fixes

  • Fix precision issues writing DateTime as milliseconds by @spanglerco in #312.
  • In DataColumnWriter, RecycableMemoryStream wasn't used in a particular case, and instead MemoryStream was initialized directly. Thanks to @itayfisz in #373.
  • Bitpacked Hybrid decoder was failing on columns containing exactly one value.

4.15.0

30 Jun 10:44
Compare
Choose a tag to compare

Bugs Fixed

  • strings must be null by default (#360). Thanks @waf!

New Stuff

  • You can force optionality of a schema field using the [ParquetRequired] attribute.
  • ParquetSerializer validates class schema against actual file schema on deserialization and throws a helpful exception, like System.IO.InvalidDataException : property 'Id' is declared as Id (System.String?) but source data has it as Id (System.String) (you can spot the difference in nullability here).

4.14.0

28 Jun 13:25
Compare
Choose a tag to compare
  • Added support for reading legacy array primitives collection serialized via legacy ParquetConvert class or some other legacy system, thanks to @PablitoCBR. This work was effectively taken from his PR and integrated more natively into this library. Thank you very much!

  • Fixed deserializing parquet generated by Azure Data Explorer non-native writer by @mcbos in #357.

  • re-worked build pipeline to separate build and release stage.

  • use handcrafted release notes file and cut out last version notes with grep/head/tail on release. This is in order to improve release notes experience as autogenerated ones are often of sub-par quality.

4.13.0

20 Jun 10:07
Compare
Choose a tag to compare

What's Changed

Full Changelog: 4.12.0...4.13.0

4.12.0

22 May 07:48
Compare
Choose a tag to compare

What's Changed

New Contributors

Full Changelog: 4.11.3...4.12.0

4.11.3

18 May 10:04
Compare
Choose a tag to compare

What's Changed

  • Added missing flush, which makes my code hang, the flush was present … by @dxdjgl in #333

Full Changelog: 4.11.2...4.11.3

4.11.2

16 May 14:26
Compare
Choose a tag to compare

What's Changed

  • Added deserialization by rowGroupIndex by @mcbos in #330
  • support for serialisation of nullable guids by @aloneguid in #332

New Contributors

Full Changelog: 4.11.1...4.11.2