feat: Allow float in interpolate_by by column #18015

agossard · 2024-08-02T13:03:06Z

Ok, splitting this out in to a simpler PR. Fixes #16794

codecov · 2024-08-02T13:33:17Z

Codecov Report

Attention: Patch coverage is 87.50000% with 1 line in your changes missing coverage. Please review.

Project coverage is 80.34%. Comparing base (d5265d3) to head (c9ff1a8).
Report is 84 commits behind head on main.

Files	Patch %	Lines
...ops/src/series/ops/interpolation/interpolate_by.rs	87.50%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #18015      +/-   ##
==========================================
- Coverage   80.49%   80.34%   -0.16%     
==========================================
  Files        1496     1496              
  Lines      196786   197684     +898     
  Branches     2817     2821       +4     
==========================================
+ Hits       158407   158820     +413     
- Misses      37858    38343     +485     
  Partials      521      521

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

MarcoGorelli

thanks @agossard 🙏 !

implementation looks good! you just need to run make pre-commit

thanks for amending the hypothesis test - could we also have one of the unit tests use a float by column?

agossard · 2024-08-03T02:17:25Z

Thanks Marco. I’ll try to fix the remaining style problem. In the unit test, just to confirm… as is, I’ve made it so the tests cover a float by column… by casting the existing example, like the other data types. Are you looking for a different piece of input data that actually has true non integer data in it?

MarcoGorelli

thanks @agossard for updating!

sorry I just meant - on top of the hypothesis test, to modify one of the unit tests (say, test_interpolate_by_trailing_nulls) to have a float by column

MarcoGorelli · 2024-08-03T08:36:15Z

py-polars/tests/unit/operations/test_interpolate_by.py

@@ -143,15 +145,24 @@ def test_interpolate_by_trailing_nulls() -> None:


 @given(data=st.data())
-def test_interpolate_vs_numpy(data: st.DataObject) -> None:
+@pytest.mark.parametrize("x_dtype", [pl.Date, pl.Float64])


this can be in given, you can use st.sampled_from

as in, x_dtype=st.sampled_from([pl.Date, pl.Float64])

MarcoGorelli

thanks @agossard , just one pending comment

MarcoGorelli · 2024-08-05T14:31:34Z

py-polars/tests/unit/operations/test_interpolate_by.py

@@ -143,15 +145,24 @@ def test_interpolate_by_trailing_nulls() -> None:


 @given(data=st.data())
-def test_interpolate_vs_numpy(data: st.DataObject) -> None:
+@pytest.mark.parametrize("x_dtype", [pl.Date, pl.Float64])


as in, x_dtype=st.sampled_from([pl.Date, pl.Float64])

MarcoGorelli

looks like there's a test failure

__________________________ test_interpolate_vs_numpy ___________________________
[gw3] darwin -- Python 3.12.4 /Users/runner/work/polars/polars/.venv/bin/python3
tests/unit/operations/test_interpolate_by.py:168: in test_interpolate_vs_numpy
    def test_interpolate_vs_numpy(data: st.DataObject, x_dtype: pl.DataType) -> None:
tests/unit/operations/test_interpolate_by.py:228: in test_interpolate_vs_numpy
    assert_series_equal(result, expected)
polars/_utils/deprecation.py:91: in wrapper
    return function(*args, **kwargs)
E   AssertionError: Series are different (nan value mismatch)
E   [left]:  [0.0, 0.0, nan, 0.0]
E   [right]: [0.0, 0.0, 0.0, 0.0]
E   Falsifying example: test_interpolate_vs_numpy(
E       data=data(...),
E       x_dtype=Float64,  # or any other generated value
E   )
E   Draw 1: shape: (5, 2)
E   ┌─────────────┬───────┐
E   │ ts          ┆ value │
E   │ ---         ┆ ---   │
E   │ f64         ┆ f64   │
E   ╞═════════════╪═══════╡
E   │ 0.0         ┆ null  │
E   │ 0.0         ┆ 0.0   │
E   │ 9.9792e291  ┆ null  │
E   │ 9.9792e291  ┆ 0.0   │
E   │ -1.7977e308 ┆ 0.0   │
E   └─────────────┴───────┘
E   Explanation:
E       These lines were always and only run by failing examples:
E           /Users/runner/work/polars/polars/py-polars/polars/series/series.py:4039
E           /Users/runner/work/polars/polars/py-polars/polars/series/series.py:696
E           /Users/runner/work/polars/polars/py-polars/polars/series/series.py:739
E           /Users/runner/work/polars/polars/py-polars/polars/series/series.py:752
E           /Users/runner/work/polars/polars/py-polars/polars/series/series.py:782
E           (and 4 more with settings.verbosity >= verbose)
E   
E   You can reproduce this example by temporarily adding @reproduce_failure('6.108.9', b'AXicY2RhwAn2LEBnMgLR/v8QAOIgACMONgMA/UgKew==') as a decorator on your test case

---------- coverage: platform darwin, python 3.12.4-final-0 ----------
Coverage XML written to file main.xml

Required test coverage of 85.0% reached. Total coverage: 90.40%
=========================== short test summary info ============================
FAILED tests/unit/operations/test_interpolate_by.py::test_interpolate_vs_numpy - AssertionError: Series are different (nan value mismatch)
[left]:  [0.0, 0.0, nan, 0.0]
[right]: [0.0, 0.0, 0.0, 0.0]
Falsifying example: test_interpolate_vs_numpy(
    data=data(...),
    x_dtype=Float64,  # or any other generated value
)
Draw 1: shape: (5, 2)
┌─────────────┬───────┐
│ ts          ┆ value │
│ ---         ┆ ---   │
│ f64         ┆ f64   │
╞═════════════╪═══════╡
│ 0.0         ┆ null  │
│ 0.0         ┆ 0.0   │
│ 9.9792e291  ┆ null  │
│ 9.9792e291  ┆ 0.0   │
│ -1.7977e308 ┆ 0.0   │
└─────────────┴───────┘
Explanation:
    These lines were always and only run by failing examples:
        /Users/runner/work/polars/polars/py-polars/polars/series/series.py:4039
        /Users/runner/work/polars/polars/py-polars/polars/series/series.py:696
        /Users/runner/work/polars/polars/py-polars/polars/series/series.py:739
        /Users/runner/work/polars/polars/py-polars/polars/series/series.py:752
        /Users/runner/work/polars/polars/py-polars/polars/series/series.py:782
        (and 4 more with settings.verbosity >= verbose)

You can reproduce this example by temporarily adding @reproduce_failure('6.108.9', b'AXicY2RhwAn2LEBnMgLR/v8QAOIgACMONgMA/UgKew==') as a decorator on your test case

It's probably OK to keep the float case out of the hypothesis test, the unit tests you've added should be enough

agossard · 2024-08-09T02:16:08Z

OK, hopefully working now. Thanks, Marco!

MarcoGorelli

nice one, thanks @agossard !

allow float in interpolate_by by column

19e3386

agossard requested review from ritchie46, c-peters, alexander-beedie, MarcoGorelli, reswqa and orlp as code owners August 2, 2024 13:03

github-actions bot added enhancement New feature or an improvement of an existing feature python Related to Python Polars rust Related to Rust Polars labels Aug 2, 2024

agossard added 2 commits August 2, 2024 15:57

Fix formatting problems

f13f211

type hint

f226b98

MarcoGorelli reviewed Aug 2, 2024

View reviewed changes

agossard added 4 commits August 2, 2024 23:29

more formatting

200d098

less wastefull hypothesis testing parameters

0e3b77d

formatting

04c7c2b

really?

5b25b45

MarcoGorelli reviewed Aug 3, 2024

View reviewed changes

agossard added 2 commits August 3, 2024 08:33

At float version of test_interpolate_by_trailing_nulls

9210a08

more formatting

bf3af7a

MarcoGorelli reviewed Aug 5, 2024

View reviewed changes

use @given instead of parametrize

c7c9ca3

MarcoGorelli reviewed Aug 6, 2024

View reviewed changes

try float bounds on hypothesis test

c9ff1a8

MarcoGorelli approved these changes Aug 9, 2024

View reviewed changes

ritchie46 merged commit 49747c1 into pola-rs:main Aug 18, 2024
27 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Allow float in interpolate_by by column #18015

feat: Allow float in interpolate_by by column #18015

agossard commented Aug 2, 2024 •

edited

Loading

codecov bot commented Aug 2, 2024 •

edited

Loading

MarcoGorelli left a comment

agossard commented Aug 3, 2024

MarcoGorelli left a comment

MarcoGorelli Aug 3, 2024 •

edited

Loading

MarcoGorelli Aug 5, 2024

MarcoGorelli left a comment

MarcoGorelli Aug 5, 2024

MarcoGorelli left a comment

agossard commented Aug 9, 2024

MarcoGorelli left a comment

feat: Allow float in interpolate_by by column #18015

feat: Allow float in interpolate_by by column #18015

Conversation

agossard commented Aug 2, 2024 • edited Loading

codecov bot commented Aug 2, 2024 • edited Loading

Codecov Report

MarcoGorelli left a comment

Choose a reason for hiding this comment

agossard commented Aug 3, 2024

MarcoGorelli left a comment

Choose a reason for hiding this comment

MarcoGorelli Aug 3, 2024 • edited Loading

Choose a reason for hiding this comment

MarcoGorelli Aug 5, 2024

Choose a reason for hiding this comment

MarcoGorelli left a comment

Choose a reason for hiding this comment

MarcoGorelli Aug 5, 2024

Choose a reason for hiding this comment

MarcoGorelli left a comment

Choose a reason for hiding this comment

agossard commented Aug 9, 2024

MarcoGorelli left a comment

Choose a reason for hiding this comment

agossard commented Aug 2, 2024 •

edited

Loading

codecov bot commented Aug 2, 2024 •

edited

Loading

MarcoGorelli Aug 3, 2024 •

edited

Loading