BUG: Ensure df.itertuples() uses plain tuples correctly #30600

simongibbons · 2020-01-01T11:53:53Z

Currently DataFrame.itertuples() has an off by one error
when it inspects whether or not it should return namedtuples
or plain tuples in it's response.

This PR addresses that bug by correcting the condition
that is used when making the check.

Closes: #28282

simongibbons · 2020-01-01T11:56:18Z

pandas/tests/frame/test_api.py

+
+        # Dataframes with >=255 columns will fallback to regular tuples
+        with pytest.raises(AttributeError):
+            result_255_columns.foo_1


Because namedtuple generates a new class when it is used, the only way of determining if something is a namedtuple or a plain one is to try and use it as such.

Hopefully you think this is sufficiently clear with the comment.

jreback · 2020-01-01T16:04:28Z

pandas/core/frame.py

@@ -1018,8 +1018,8 @@ def itertuples(self, index=True, name="Pandas"):
        # use integer indexing because of possible duplicate column names
        arrays.extend(self.iloc[:, k] for k in range(len(self.columns)))

-        # Python 3 supports at most 255 arguments to constructor
-        if name is not None and len(self.columns) + index < 256:
+        # Python versions before 3.7 support at most 255 arguments to constructor


is it possible to skip the arg len check for >= 3.7? does it work?

yes, this was simple to do.

jreback · 2020-01-01T16:04:58Z

pandas/tests/frame/test_api.py

@@ -288,6 +288,22 @@ def test_sequence_like_with_categorical(self):
        for c, col in df.items():
            str(s)

+    def test_itertuples_fallback_to_regular_tuples(self):


can you move this next to the other itertuples tests

Currently DataFrame.itertuples() has an off by one error when it inspects whether or not it should return namedtuples or plain tuples in it's response. This PR addresses that bug by correcting the condition that is used when making the check. Closes: pandas-dev#28282

1. Ensure we return named tuples in more cases (when using python >= 3.7) 2. Move test around to be with the itertuples test 3. Update docstring with the new behaviour.

simongibbons · 2020-01-01T23:17:08Z

pandas/tests/frame/test_api.py

        assert isinstance(tup3, tuple)
+        if PY37:
+            assert hasattr(tup3, "_fields")


This test has changed as we will return named tuples always for python >= 3.7 now.

WillAyd

lgtm

jreback · 2020-01-02T00:58:21Z

thanks @simongibbons very nice!

simongibbons commented Jan 1, 2020

View reviewed changes

jreback requested changes Jan 1, 2020

View reviewed changes

jreback added the Reshaping Concat, Merge/Join, Stack/Unstack, Explode label Jan 1, 2020

simongibbons force-pushed the fix-28282 branch from 730b22f to 187196a Compare January 1, 2020 22:27

simongibbons requested a review from jreback January 1, 2020 22:27

simongibbons force-pushed the fix-28282 branch from f6ddc89 to 6de8beb Compare January 1, 2020 23:15

Address comments.

64b381d

1. Ensure we return named tuples in more cases (when using python >= 3.7) 2. Move test around to be with the itertuples test 3. Update docstring with the new behaviour.

simongibbons force-pushed the fix-28282 branch from 6de8beb to 64b381d Compare January 1, 2020 23:16

simongibbons commented Jan 1, 2020

View reviewed changes

WillAyd approved these changes Jan 2, 2020

View reviewed changes

WillAyd added this to the 1.0 milestone Jan 2, 2020

jreback approved these changes Jan 2, 2020

View reviewed changes

jreback merged commit 13c9601 into pandas-dev:master Jan 2, 2020

simongibbons deleted the fix-28282 branch January 6, 2020 09:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

BUG: Ensure df.itertuples() uses plain tuples correctly #30600

BUG: Ensure df.itertuples() uses plain tuples correctly #30600

Uh oh!

simongibbons commented Jan 1, 2020 •

edited

Loading

Uh oh!

simongibbons Jan 1, 2020

Uh oh!

jreback Jan 1, 2020

Uh oh!

simongibbons Jan 1, 2020

Uh oh!

jreback Jan 1, 2020

Uh oh!

simongibbons Jan 1, 2020

Uh oh!

simongibbons Jan 1, 2020

Uh oh!

WillAyd left a comment

Uh oh!

jreback commented Jan 2, 2020

Uh oh!

Uh oh!

Uh oh!

BUG: Ensure df.itertuples() uses plain tuples correctly #30600

BUG: Ensure df.itertuples() uses plain tuples correctly #30600

Uh oh!

Conversation

simongibbons commented Jan 1, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

simongibbons Jan 1, 2020

Choose a reason for hiding this comment

Uh oh!

jreback Jan 1, 2020

Choose a reason for hiding this comment

Uh oh!

simongibbons Jan 1, 2020

Choose a reason for hiding this comment

Uh oh!

jreback Jan 1, 2020

Choose a reason for hiding this comment

Uh oh!

simongibbons Jan 1, 2020

Choose a reason for hiding this comment

Uh oh!

simongibbons Jan 1, 2020

Choose a reason for hiding this comment

Uh oh!

WillAyd left a comment

Choose a reason for hiding this comment

Uh oh!

jreback commented Jan 2, 2020

Uh oh!

Uh oh!

simongibbons commented Jan 1, 2020 •

edited

Loading