Skip to content

[C++][Python][Dataset] Add total_byte_size metadata to RowGroupInfo #25430

@asfimport

Description

@asfimport

See #7546 (comment).

The row group information in Parquet's FileMetadata includes a "total_byte_size" field. It would be useful to expose this information on the RowGroupInfo object, similarly as we already expose "num_rows".

cc @rjzamora

Reporter: Joris Van den Bossche / @jorisvandenbossche
Assignee: Joris Van den Bossche / @jorisvandenbossche

PRs and other links:

Note: This issue was originally created as ARROW-9346. Please see the migration documentation for further details.

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions