Add support for DataFrame `sum` operation with tests #1148

zaleslaw · 2025-04-23T13:25:33Z

No description provided.

Copilot

Pull Request Overview

This PR adds support for the DataFrame sum operation along with comprehensive tests covering various summation scenarios and aggregation handlers. Key changes include:

Adding a new sum operation test in both generated tests and dedicated test data for verifying correct summation.
Extending the aggregation framework with new Sum0 and Sum1 implementations.
Changing visibility modifiers from internal to public in several core aggregator components.

Reviewed Changes

Copilot reviewed 16 out of 16 changed files in this pull request and generated 1 comment.

File	Description
plugins/kotlin-dataframe/tests-gen/org/jetbrains/kotlin/fir/dataframe/DataFrameBlackBoxCodegenTestGenerated.java	Adds a new test method for the sum operation
plugins/kotlin-dataframe/testData/box/sum.kt	Introduces test scenarios for sum over all, selective, and expression-based columns
plugins/kotlin-dataframe/src/org/jetbrains/kotlinx/dataframe/plugin/loadInterpreter.kt	Updates load interpreter to support new Sum aggregators
plugins/kotlin-dataframe/src/org/jetbrains/kotlinx/dataframe/plugin/impl/api/statistics.kt	Adds implementations for summation aggregators
core/src/test/kotlin/org/jetbrains/kotlinx/dataframe/api/statistics.kt	Expands tests for groupBy operations to include new numerical columns
core/src/main/kotlin/org/jetbrains/kotlinx/dataframe/api/sum.kt	Annotates and exposes the new sum APIs
Other core aggregator files	Adjusts visibility and typing to support public sum operation API
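
For readers unfamiliar with the three scenarios named for sum.kt (sum over all columns, selected columns, and an expression), a minimal sketch at the library level might look like the following. It is not taken from the PR: the sample data and the `sampleFrame` helper are hypothetical, and it assumes the `sum()`, `sum(vararg columns: String)` and `sumOf { }` functions of `org.jetbrains.kotlinx.dataframe.api` with the kotlinx-dataframe dependency on the classpath. The compiler-plugin test itself may exercise these differently.

```kotlin
import org.jetbrains.kotlinx.dataframe.DataFrame
import org.jetbrains.kotlinx.dataframe.api.dataFrameOf
import org.jetbrains.kotlinx.dataframe.api.sum
import org.jetbrains.kotlinx.dataframe.api.sumOf

// Hypothetical sample data for illustration; not the data used in sum.kt.
fun sampleFrame(): DataFrame<*> =
    dataFrameOf("city", "age", "weight")(
        "London", 30, 70.5,
        "Paris", 25, 60.0,
    )

fun main() {
    val df = sampleFrame()

    // Sum over all columns: assumed to return one row holding a total
    // for each numeric column (the String column "city" is not summed).
    val allSums = df.sum()
    println(allSums)

    // Sum over a selected column, addressed by name.
    val totalAge = df.sum("age")
    println(totalAge)

    // Expression-based sum: evaluate an expression per row, then add up the results.
    val doubledAge = df.sumOf { (it["age"] as Int) * 2 }
    println(doubledAge)
}
```

The string-based accessors are used here only to keep the sketch independent of generated extension properties; with the compiler plugin enabled, typed column access would be the more natural form.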

Comments suppressed due to low confidence (1)

core/src/test/kotlin/org/jetbrains/kotlinx/dataframe/api/statistics.kt:571

In the groupBy maxBy test, the column names are checked using 'res4' while the aggregation result is later accessed from 'res5'. This inconsistency might lead to erroneous test behavior; ensure the same result object is used throughout the test.

res4.columnNames() shouldBe listOf("city", "name", "age", "weight", "height", "yearsToRetirement", "workExperienceYears", "dependentsCount", "annualIncome")

zaleslaw added 4 commits April 23, 2025 15:24

Add support for DataFrame sum operation with tests

7729d03

Introduced the `sum` operation for DataFrames, supporting numerical columns aggregation. Updated relevant tests and added new test cases to verify functionality. Included schema modifications for handling numerical column operations.

Merge branch 'master' into issue-1138

4b4feb8

Make aggregator-related classes and functions public

0376fec

Converted various internal classes, interfaces, and functions related to aggregation into public entities. This change expands their visibility, enabling external usage and facilitating integration with other modules or libraries.

Enhance type conversions between KType and ConeKotlinType to ensure compatibility and correctness in sum calculations.

644685d

zaleslaw requested review from Copilot, Jolanrensen and koperagen April 23, 2025 16:34

Copilot AI reviewed Apr 23, 2025

plugins/kotlin-dataframe/testData/box/sum.kt (outdated, resolved)

zaleslaw and others added 2 commits April 23, 2025 18:39

Update plugins/kotlin-dataframe/testData/box/sum.kt

2db823f

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Refactor type conversion and column handling logic

9844b7d

koperagen reviewed Apr 24, 2025

plugins/kotlin-dataframe/src/org/jetbrains/kotlinx/dataframe/plugin/impl/api/statistics.kt (outdated, resolved)

koperagen reviewed Apr 24, 2025

...rame/tests-gen/org/jetbrains/kotlin/fir/dataframe/DataFrameBlackBoxCodegenTestGenerated.java (outdated, resolved)

koperagen reviewed Apr 24, 2025

plugins/kotlin-dataframe/testData/box/sum.kt (outdated, resolved)