15.0.0 (2022-05-27)
Breaking changes:
- Change
ArrayDataBuilder::null_bit_buffer
to acceptOption<Buffer>
rather thanBuffer
#1739 [arrow] (HaoYang670) - Remove
null_count
fromArrayData::try_new()
#1721 [arrow] (HaoYang670) - Change parquet writers to use standard
std:io::Write
rather customParquetWriter
trait (#1717) (#1163) #1719 [parquet] (tustvold) - Add explicit column mask for selection in parquet:
ProjectionMask
(#1701) #1716 [parquet] (tustvold) - Add type_ids in Union datatype #1703 [parquet] [arrow] (viirya)
- Fix Parquet Reader's Arrow Schema Inference #1682 [parquet] [arrow] (tustvold)
Implemented enhancements:
- Rename the
string
kernel toconcatenate_elements
#1747 [arrow] ArrayDataBuilder::null_bit_buffer
should acceptOption<Buffer>
as input type #1737 [arrow]- Fix schema comparison for non_canonical_map when running flight test #1730 [arrow]
- Add support in aggregate kernel for
BinaryArray
#1724 [arrow] - Fix incorrect null_count in
generate_unions_case
integration test #1712 [arrow] - Keep type ids in Union datatype to follow Arrow spec and integrate with other implementations #1690 [arrow]
- Support Reading Alternative List Representations to Arrow From Parquet #1680 [parquet]
- Speed up the offsets checking #1675 [arrow]
- Separate Parquet -> Arrow Schema Conversion From ArrayBuilder #1655 [parquet]
- Add
leaf_columns
argument toArrowReader::get_record_reader_by_columns
#1653 [parquet] - Implement
string_concat
kernel #1540 [arrow] - Improve Unit Test Coverage of ArrayReaderBuilder #1484 [parquet]
Fixed bugs:
- Parquet write failure (from record batches) when data is nested two levels deep #1744 [parquet]
- IPC reader may break on projection #1735 [arrow]
- Latest nightly fails to build with feature simd #1734 [arrow]
- Trying to write parquet file in parallel results in corrupt file #1717 [parquet]
- Roundtrip failure when using DELTA_BINARY_PACKED #1708 [parquet]
ArrayData::try_new
cannot always return expected error. #1707 [arrow]- "out of order projection is not supported" after Fix Parquet Arrow Schema Inference #1701 [parquet]
- Rust is not interoperability with C++ for IPC schemas with dictionaries #1694 [arrow]
- Incorrect Repeated Field Schema Inference #1681 [parquet]
- Parquet Treats Embedded Arrow Schema as Authoritative #1663 [parquet]
- parquet_to_arrow_schema_by_columns Incorrectly Handles Nested Types #1654 [parquet]
- Inconsistent Arrow Schema When Projecting Nested Parquet File #1652 [parquet]
- StructArrayReader Cannot Handle Nested Lists #1651 [parquet]
- Bug (
substring
kernel): The null buffer is not aligned whenoffset != 0
#1639 [arrow]
Documentation updates:
- Parquet command line tool does not install "globally" #1710 [parquet]
- Improve integration test document to follow Arrow C++ repo CI #1742 [arrow] (viirya)
Merged pull requests:
- Test for list array equality with different offsets #1756 [arrow] (alamb)
- Rename
string_concat
toconcat_elements_utf8
#1754 [arrow] (alamb) - Rename the
string
kernel toconcat_elements
. #1752 [arrow] (HaoYang670) - Support writing nested lists to parquet #1746 [parquet] (tustvold)
- Pin nightly version to bypass packed_simd build error #1743 (viirya)
- Fix projection in IPC reader #1736 [arrow] (iyupeng)
cargo install
installs not globally #1732 [parquet] (kazuk)- Fix schema comparison for non_canonical_map when running flight test #1731 (viirya)
- Add
min_binary
andmax_binary
aggregate kernels #1725 [arrow] (HaoYang670) - Fix parquet benchmarks #1723 [parquet] (tustvold)
- Fix BitReader::get_batch zero extension (#1708) #1722 [parquet] (tustvold)
- Implementation string concat #1720 [arrow] (Ismail-Maj)
- Check the length of
null_bit_buffer
inArrayData::try_new()
#1714 [arrow] (HaoYang670) - Fix incorrect null_count in
generate_unions_case
integration test #1713 [arrow] (viirya) - Fix: Null buffer accounts for
offset
insubstring
kernel. #1704 [arrow] (HaoYang670) - Minor: Refine
OffsetSizeTrait
to extendnum::Integer
#1702 [arrow] (HaoYang670) - Fix StructArrayReader handling nested lists (#1651) #1700 [parquet] (tustvold)
- Speed up the offsets checking #1684 [arrow] (HaoYang670)
For older versions, see CHANGELOG-old and apache/arrow/CHANGELOG.md
* This Changelog was automatically generated by github_changelog_generator