API: should values_for_factorize and _from_factorized round-trip missing values?

(Fixtures make it much harder to give a copy/pasteable example.)

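```python
# Explicit imports instead of the star import from test_json
from pandas.tests.extension.json.array import JSONArray, JSONDtype, make_data

dtype = JSONDtype()

# Mirror the `data` fixture: regenerate until the first two elements differ in length
data = make_data()
while len(data[0]) == len(data[1]):
    data = make_data()

data = JSONArray(data)

# Round-trip through the factorize hooks
values = data._values_for_factorize()[0]
result = type(data)._from_factorized(values, data)
assert len(values) == len(result)  # <-- nope!
```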

Am I wrong in thinking this assertion should hold? If we had an `.equals` method, I'd strengthen this assertion to `assert result.equals(data)`.
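To make the question concrete without fixtures, here is a toy sketch in plain Python. It is not the pandas implementation: `ToyJSONArray`, `from_factorized_lossy`, `from_factorized_roundtrip`, and `equals` are all hypothetical names, and it assumes the length mismatch comes from missing entries (empty dicts) being dropped on reconstruction.

```python
from collections import UserDict


class ToyJSONArray:
    """Hypothetical stand-in for JSONArray; an empty dict plays the role of a missing value."""

    def __init__(self, values):
        self.data = [UserDict(v) for v in values]

    def __len__(self):
        return len(self.data)

    def values_for_factorize(self):
        # Hashable stand-ins: each dict becomes a tuple of its items,
        # so a missing value (empty dict) becomes the empty tuple ().
        return [tuple(d.items()) for d in self.data]

    @classmethod
    def from_factorized_lossy(cls, values):
        # Assumed behavior: drops the () entries, so missing values are lost.
        return cls([dict(v) for v in values if v != ()])

    @classmethod
    def from_factorized_roundtrip(cls, values):
        # Keeps the () entries and rebuilds them as empty (missing) dicts.
        return cls([dict(v) for v in values])

    def equals(self, other):
        # Hypothetical element-wise equality helper.
        return self.data == other.data


data = ToyJSONArray([{"a": 1}, {}, {"b": 2, "c": 3}])
values = data.values_for_factorize()

lossy = ToyJSONArray.from_factorized_lossy(values)
kept = ToyJSONArray.from_factorized_roundtrip(values)

print(len(values), len(lossy), len(kept))  # prints: 3 2 3
```

In this toy, only the variant that keeps the missing entries would satisfy both the length assertion and the stronger `result.equals(data)` check.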

cc @TomAugspurger @WillAyd


API: should values_for_factorize and _from_factorized round-trip missing values? #32673
