ENH: Add optional argument index to pd.melt to maintain index values #33659

Rik-de-Kort · 2020-04-19T20:49:20Z

Finishing up a stale PR idea: #28859 and #17459

Has some tests and better code.
I think it's fair to duplicate the index values and not bend over backwards to maintain uniqueness like in previous iterations.

Apologies for the mess, it was a quick job and I didn't want to spend an hour fiddling with the commits.

Finally, I deleted some `# type: ignore` comments for mypy because the commits weren't going through on my system. Is there some other fix for that? Other than that, I think it's good to go.

closes Index gets lost when DataFrame melt method is used #17440
tests added / passed
passes black pandas
passes git diff upstream/master -u -- "*.py" | flake8 --diff
whatsnew entry
Reconsider API design
Add usage examples in whatsnew, docstring, and reshaping.rst.
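
For readers skimming the thread, here is a minimal sketch of the behaviour this PR is about. It assumes a pandas version that includes the change (1.1 or later, per the milestone) and uses the `ignore_index` keyword that appears in the diff; the frame and its labels are made up for illustration.

```python
import pandas as pd

# Hypothetical two-row frame with a non-default index.
df = pd.DataFrame({"A": [1, 2], "B": [3, 4]}, index=["x", "y"])

# Default behaviour: the original index is dropped and replaced by 0..n-1.
default = pd.melt(df)

# With ignore_index=False the original index values are kept and
# duplicated, once per melted column.
kept = pd.melt(df, ignore_index=False)

print(default.index.tolist())  # [0, 1, 2, 3]
print(kept.index.tolist())     # ['x', 'y', 'x', 'y']
```

The duplicated labels are intentional: no attempt is made to keep the resulting index unique.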

Rik-de-Kort · 2020-05-24T09:02:59Z

Does the CI always have this many issues?

pandas/core/frame.py

simonjayhawkins

Thanks @Rik-de-Kort almost there. generally lgtm. just a couple more comments.

pandas/core/shared_docs.py

doc/source/user_guide/reshaping.rst

pandas/core/frame.py

jreback · 2020-06-25T14:33:21Z

pandas/core/reshape/melt.py

+    result = frame._constructor(mdata, columns=mcolumns)
+
+    if not ignore_index:
+        new_index = np.tile(frame.index, K)


both MI and Index already have a .repeat() method, I think we could add a .tile() method to make this easier. (or just use repeat)

Index(["foo", "bar"]).repeat(2) yields Index(['foo', 'foo', 'bar', 'bar'], dtype='object'), whereas np.tile(Index(["foo", "bar"]), 3) yields array(['foo', 'bar', 'foo', 'bar', 'foo', 'bar'], dtype=object). The latter corresponds to the layout used in melt, so it's far from trivial to use repeat instead of tile.

I tried having a look at implementing tile on indices, but then I would also have to do it for MultiIndex, document it, write tests, and handle argument validation, which I've never even looked at before. I think it's a big hassle that I will not undertake.
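
The repeat versus tile distinction above can be checked with a short sketch. The index labels and the small frame are illustrative only, and the last comparison assumes a pandas version where `melt` accepts `ignore_index`.

```python
import numpy as np
import pandas as pd

idx = pd.Index(["foo", "bar"])

# repeat() duplicates each element in place: foo, foo, bar, bar
repeated = idx.repeat(2)

# np.tile() repeats the whole sequence: foo, bar, foo, bar
tiled = np.tile(idx, 2)

# melt stacks the value columns one after another, so the original
# index has to cycle once per column, which is the tiled layout.
df = pd.DataFrame({"A": [1, 2], "B": [3, 4]}, index=idx)
melted = pd.melt(df, ignore_index=False)

print(repeated.tolist())      # ['foo', 'foo', 'bar', 'bar']
print(list(tiled))            # ['foo', 'bar', 'foo', 'bar']
print(melted.index.tolist())  # ['foo', 'bar', 'foo', 'bar']
```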

ok fair enough

We already have _tile_compat in pandas/core/reshape/util.py. This may allow further simplification here.

This converts to object dtype. Can you use pandas.core.reshape.util._tile_compat?

You should be able to remove the next section as well, since you won't be converting to an ndarray.
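
A rough sketch of the dtype concern raised here, under stated assumptions: `np.tile` hands back a plain ndarray, so extension dtypes on the index (a timezone-aware `DatetimeIndex` in this made-up example) are not carried over, while tiling integer positions and calling `take` keeps the index type. The private `_tile_compat` helper is assumed to work along similar lines; it is not called here, and the variable `K` simply mirrors the name in the diff.

```python
import numpy as np
import pandas as pd

# Hypothetical tz-aware index; K is the number of melted columns.
idx = pd.date_range("2020-01-01", periods=2, tz="UTC")
K = 3

# np.tile converts the index to an ndarray first, so the result is
# no longer an Index and the tz-aware dtype is not preserved.
as_array = np.tile(idx, K)

# Tiling positions and taking from the index keeps the original dtype.
positions = np.tile(np.arange(len(idx)), K)
tiled = idx.take(positions)

print(type(as_array).__name__)  # ndarray
print(tiled.dtype == idx.dtype)  # True
```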

Thanks, that simplifies the code a lot! Build is failing but that's due to a worker crashing.

Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>

jreback

lgtm. a doc comment, pls ping on green.

doc/source/user_guide/reshaping.rst

jreback · 2020-06-25T22:26:46Z

pandas/core/reshape/melt.py

+    result = frame._constructor(mdata, columns=mcolumns)
+
+    if not ignore_index:
+        new_index = np.tile(frame.index, K)


ok fair enough

Rik-de-Kort · 2020-06-28T14:41:12Z

@jreback I thought I gave you a ping, but I don't see it, so here it is!

jreback · 2020-07-09T23:34:51Z

thanks @Rik-de-Kort

Rik-de-Kort · 2020-07-10T07:44:49Z

You're welcome

Rik-de-Kort and others added 30 commits November 24, 2019 14:49

initial xlsb support

a4f2d22

Import order fix for CI pass

62564cf

Initial tests

a7a8460

style fixes

d9be281

documentation

8bf8c78

forgot place to document

cd95dce

Fixed test issue with XLRDError

7a7390d

Fix for unnamed column issue

248ac12

style fix

6ea78de

line up with upstream master

44c5439

Merge branch 'master' of https://github.com/pandas-dev/pandas

92c98cd

Fix broken xlrd test

64fa6f3

get docs to build

cb276e8

Remove warning filter

4ebcb48

Merge branch 'master' of https://github.com/Rik-de-Kort/pandas

71436a0

extended description update

00cc66b

Merge branch 'master' of https://github.com/pandas-dev/pandas

4c81853

Xlsb options instead of odf options

e85da03

Add reference in whatsnew to docs

2348c3b

Make pyxlsb show up in install.rst and show_versions

d02a5a5

Add pyxlsb to ci builds

c71e021

environment.yml update

ae3f9ea

Merge upstream master

a410e51

One update to environment.yml too many

7c9dcce

Trying to fix build

4bd8400

Merge upstream

43ab0fe

Added issue number

024492a

Updated to use .rows(sparse=False) for future compat

b424c8e

Merge branch 'master' of https://github.com/pandas-dev/pandas

571489b

xfails in test_readers.py

dad4a53

Rik-de-Kort reopened this May 24, 2020

simonjayhawkins reviewed May 25, 2020

pandas/core/frame.py (outdated, resolved)

Rik-de-Kort added 5 commits May 26, 2020 13:59

Fixed documentation

bf7d5e5

Merge branch 'master' of https://github.com/pandas-dev/pandas

9008ccc

Merge branch 'master' of https://github.com/pandas-dev/pandas

a6ec490

Merge branch 'master' of https://github.com/Rik-de-Kort/pandas

0391b7a

Fixed docs (Hopefully)

800c050

simonjayhawkins reviewed Jun 8, 2020

pandas/core/frame.py (outdated, resolved)

Rik-de-Kort added 3 commits June 25, 2020 09:26

Merge upstream master

788c28a

Hopefully fix documentation bug

e134ed2

Fix typing error

7a765a3

simonjayhawkins reviewed Jun 25, 2020

pandas/core/shared_docs.py (outdated, resolved)

doc/source/user_guide/reshaping.rst (outdated, resolved)

doc/source/user_guide/reshaping.rst (outdated, resolved)

doc/source/user_guide/reshaping.rst (outdated, resolved)

jreback requested changes Jun 25, 2020

Rik-de-Kort and others added 3 commits June 25, 2020 17:44

Apply suggestions from code review

b1cca84

Co-authored-by: Simon Hawkins <simonjayhawkins@gmail.com>

Doc review:

c66767d

Type!

7f5018f

jreback requested changes Jun 25, 2020

jreback added this to the 1.1 milestone Jun 25, 2020

Rik-de-Kort added 3 commits June 26, 2020 09:08

Added example for difference

16e9bd4

Merge branch 'master' of https://github.com/pandas-dev/pandas

df645e1

Linting failure?

bbf8465

WillAyd mentioned this pull request Jun 30, 2020

ENH: Add option to keep index in pd.melt #35069

Closed

Rik-de-Kort added 2 commits July 7, 2020 17:41

TomAugspurger suggestion

57bffd1

Trailing whitespace...

edcd123

jreback approved these changes Jul 9, 2020

jreback merged commit c8d85e2 into pandas-dev:master Jul 9, 2020
