gh-81022: Supporting customization of float encoding in JSON #13233

Lee-W · 2019-05-10T01:55:25Z

Add an encode_float argument to JSONEncoder to support customizing float encoding

Issue: Supporting customization of float encoding in JSON #81022

the-knights-who-say-ni · 2019-05-10T01:55:28Z

Hello, and thanks for your contribution!

I'm a bot set up to make sure that the project can legally accept your contribution by verifying you have signed the PSF contributor agreement (CLA).

Our records indicate we have not received your CLA. For legal reasons we need you to sign this before we can look at your contribution. Please follow the steps outlined in the CPython devguide to rectify this issue.

If you have recently signed the CLA, please wait at least one business day
before our records are updated.

You can check yourself to see if the CLA has been received.

Thanks again for your contribution, we look forward to reviewing it!

mitar

I think some tests would also be useful?

Lib/json/__init__.py

Lee-W · 2019-07-17T06:31:38Z

@mitar I've added a test for the newly added encode_float argument.

cmnord · 2019-07-23T17:58:56Z

Rather than add another argument to these functions, why not un-nest floatstr from iterencode so that it can be overridden?

mitar · 2019-07-23T18:08:02Z

I think the idea is that one can also use the C code path for this? Why is it an argument for decoding and not a method?

cmnord · 2019-07-23T18:43:04Z

@mitar I don't fully understand your comment. Could you clarify what you mean?

What I meant is that it would be nice to be able to do the following:

import json
import numpy as np


class MyJSONEncoder(json.JSONEncoder):
    def floatstr(self, o, _repr=float.__repr__, _inf=json.encoder.INFINITY, _neginf=-json.encoder.INFINITY):
        if o != o:
            text = "None"
        elif o == _inf:
            text = "Infinity"
        elif o == _neginf:
            text = "-Infinity"
        else:
            return _repr(o)

        if not self.allow_nan:
            raise ValueError(
                "Out of range float values are not JSON compliant: " + repr(o)
            )

        return text


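# Hypothetical: this passes only if JSONEncoder looks up floatstr as an overridable method.
assert json.dumps(np.nan, cls=MyJSONEncoder) == "None"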

So rather than adding a new argument to many functions, we could achieve the same goal of custom float encoding this way. What do you mean by "C code-path" and "decoding and not a method"?
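For context, here is a minimal sketch of what the stock encoder does with non-finite floats in released Python, which is the behaviour a custom floatstr would replace. It uses only the documented json.dumps and its allow_nan parameter. The variable names are just for illustration.

```python
import json

# Default behaviour of the standard encoder for non-finite floats.
nan_text = json.dumps(float("nan"))
inf_text = json.dumps(float("inf"))
neg_inf_text = json.dumps(float("-inf"))

# With allow_nan=False the encoder refuses these values instead.
try:
    json.dumps(float("nan"), allow_nan=False)
    strict_error = None
except ValueError as exc:
    strict_error = str(exc)

print(nan_text, inf_text, neg_inf_text)
print(strict_error)
```

None of these outputs is valid per the JSON specification, which is why people ask for a hook to emit something else, such as null.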

mitar · 2019-07-23T18:53:10Z

In load we already have parse_float and parse_int. Why do you think it is done that way, and not simply as a method on cls?
I think the reason is that it allows a fast code path in the optimized C version of the parser.
So we should probably do a similar thing for encoding as well, no? While your suggestion works, I think the idea is that it is faster if you just have to change those two functions. We should probably measure that and see whether it truly is. Currently, the API here tries to match decoding. It might be confusing that decoding uses function callbacks while encoding uses methods.
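As a reference point, a small sketch of the existing decoding hooks mentioned above. parse_float and parse_int are documented parameters of json.loads, and each is called with the numeric literal as a string. The sample document and the choice of Decimal and str as callbacks are only illustrative.

```python
import json
from decimal import Decimal

text = '{"price": 1.10, "count": 3}'

# Each hook receives the literal's source text, so Decimal keeps "1.10" exactly.
data = json.loads(text, parse_float=Decimal, parse_int=str)

print(data)
```

The proposed encode_float argument would be the encoding-side counterpart of these callbacks.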

cmnord · 2019-07-23T20:37:14Z

@mitar thanks for clarifying, I think I understand now. 🙂

mitar · 2019-07-23T20:40:10Z

If you have time, you could try to measure how much this really helps. You could try moving float parsing to a method on a class in decoding and see whether that really slows things down. Personally, I also do not like extra functions, especially when we already have a nice class to put methods on.

Lee-W · 2019-11-10T04:44:09Z

@mitar Could you please explain in more detail what I should measure? It seems you have already explained it. Thanks 🙂

mitar · 2019-11-11T07:38:41Z

I would propose that you measure and compare two cases: a) having methods on the class, and b) having functions provided directly.

Lee-W · 2019-11-12T18:00:58Z

I tested on Python 3.8.0a4+ using the code below.
The results (total seconds for 10,000 calls each, as returned by timeit.timeit) are as follows.

Encode
Default:  0.19853970199999998
Argument:  0.31803161799999996
Class:  0.282146192

import json
import timeit


INFINITY = json.encoder.INFINITY


class MyJSONEncoder(json.JSONEncoder):
    def floatstr(self, obj, _repr=float.__repr__, _inf=INFINITY, _neginf=-INFINITY):
        if obj != obj:
            text = "None"
        elif obj == json.encoder.INFINITY:
            text = "Infinity"
        elif obj == -json.encoder.INFINITY:
            text = "-Infinity"
        else:
            return float.__repr__(obj)

        if not self.allow_nan:
            raise ValueError(
                "Out of range float values are not JSON compliant: " + repr(obj)
            )

        return text


def floatstr(obj, allow_nan=True, _repr=float.__repr__, _inf=INFINITY, _neginf=-INFINITY):
    if obj != obj:
        text = "None"
    elif obj == json.encoder.INFINITY:
        text = "Infinity"
    elif obj == -json.encoder.INFINITY:
        text = "-Infinity"
    else:
        return float.__repr__(obj)

    if not allow_nan:
        raise ValueError(
            "Out of range float values are not JSON compliant: " + repr(obj)
        )
    return text


obj = {
    'inf': float('inf'),
    '-inf': float('-inf'),
    'nan': float('nan'),
    '2': 2.23,
}

print('Encode')
print('Default: ', timeit.timeit(lambda: json.dumps(obj), number=10000))
print('Argument: ', timeit.timeit(lambda: json.dumps(obj, encode_float=floatstr), number=10000))
print('Class: ', timeit.timeit(lambda: json.dumps(obj, cls=MyJSONEncoder), number=10000))
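A single timeit run can be noisy, so here is a hedged sketch of a steadier comparison using only released Python. It does not touch the json module or the encode_float argument from this branch. It only isolates the call overhead of a plain function versus a bound method. The names format_float, Formatter and best_of are hypothetical, and no timings are shown because they vary by machine.

```python
import timeit


def format_float(value, _repr=float.__repr__):
    """Plain function: the 'callback argument' style."""
    return _repr(value)


class Formatter:
    def format_float(self, value, _repr=float.__repr__):
        """Method: the 'override on the class' style."""
        return _repr(value)


formatter = Formatter()
bound = formatter.format_float


def best_of(func, repeat=5, number=20_000):
    # Taking the minimum over several repeats reduces scheduling noise.
    return min(timeit.repeat(lambda: func(2.23), repeat=repeat, number=number))


function_time = best_of(format_float)
method_time = best_of(bound)
print(f"function: {function_time:.4f}s  method: {method_time:.4f}s")
```

This says nothing about the C accelerator path, which is where the real difference would have to be measured.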

Lee-W · 2020-01-11T01:29:42Z

@mitar Do I need to run any other experiments, or is the one above sufficient? 🙂

mitar · 2020-01-11T09:12:20Z

I will leave it to somebody else from the Python team to weigh in here.

mitar · 2020-12-15T20:46:39Z

@Lee-W Please update the PR, there is now a merge conflict.

I really like this PR. Could somebody from the Python core team review/merge this?

Lee-W · 2020-12-16T15:16:11Z

@mitar Thanks for the reminder. I've just fixed the conflict.

merwok · 2021-03-20T19:08:49Z

Lib/json/encoder.py

+                            "Out of range float values are not JSON compliant: " +
+                            repr(o))
+
+                    return text


Minor comment: why move the floatstr function definition inline here? It could stay as a regular method in the class, and then the line below would be self.encode_float = self.floatstr and still work.
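A minimal sketch of the pattern suggested here, assuming a simplified stand-in class rather than the real JSONEncoder. SketchEncoder and its constructor signature are hypothetical. Only the idea of defaulting encode_float to a regular floatstr method comes from the comment above.

```python
class SketchEncoder:
    """Hypothetical stand-in for JSONEncoder; not the real class."""

    def __init__(self, encode_float=None, allow_nan=True):
        self.allow_nan = allow_nan
        # Fall back to the regular method when no callback is supplied.
        self.encode_float = encode_float if encode_float is not None else self.floatstr

    def floatstr(self, o):
        if o != o:  # NaN is the only float that is not equal to itself
            text = "NaN"
        elif o == float("inf"):
            text = "Infinity"
        elif o == float("-inf"):
            text = "-Infinity"
        else:
            return float.__repr__(o)
        if not self.allow_nan:
            raise ValueError(
                "Out of range float values are not JSON compliant: " + repr(o)
            )
        return text


class NullNaNEncoder(SketchEncoder):
    def floatstr(self, o):
        # Subclass override: emit null for NaN, defer otherwise.
        return "null" if o != o else super().floatstr(o)


default_text = SketchEncoder().encode_float(float("nan"))
callback_text = SketchEncoder(encode_float=lambda o: "null").encode_float(float("nan"))
subclass_text = NullNaNEncoder().encode_float(float("nan"))
print(default_text, callback_text, subclass_text)
```

With this shape, both the callback argument and a subclass override work, since the attribute resolves to the bound method of whichever class is instantiated.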

mRemyLynceus · 2022-10-20T13:33:56Z

@python team: What's going on?

merwok · 2023-06-04T15:45:42Z

As I noted on the ticket, this should be discussed to reach agreement on the need and the shape of the feature first.

Lee-W · 2023-08-06T06:31:35Z

Got it. I just wanted to resolve the previous conflict. As this has not yet been agreed on, I'll close this PR.

the-knights-who-say-ni added the CLA not signed label May 10, 2019

bedevere-bot added the awaiting review label May 10, 2019

Lee-W changed the title ~~Bpo 36841: Supporting customization of float encoding in JSON~~ bpo-36841: Supporting customization of float encoding in JSON May 10, 2019

the-knights-who-say-ni added CLA signed and removed CLA not signed labels May 11, 2019

auvipy approved these changes May 31, 2019

bedevere-bot added awaiting core review and removed awaiting review labels May 31, 2019

mitar reviewed Jul 1, 2019

Lib/json/__init__.py (outdated)

Lee-W force-pushed the bpo-36841 branch from cd49378 to 9158a6f July 17, 2019 08:56

csabella requested review from ezio-melotti and rhettinger January 12, 2020 00:45

mitar approved these changes Dec 15, 2020

Lee-W force-pushed the bpo-36841 branch from 9158a6f to 8a38261 December 16, 2020 14:52

merwok reviewed Mar 20, 2021

minrk mentioned this pull request Sep 30, 2021

bpo-36841: JSONEncoder call self.default for unsupported floats #28648

Open

rhettinger removed their request for review May 3, 2022 06:20

mitar mannequin mentioned this pull request Apr 10, 2022

Supporting customization of float encoding in JSON #81022

Open

Gatsik mentioned this pull request May 22, 2022

Add config option to specify json float serialization precision FAForever/server#905

Open

ezio-melotti removed the CLA signed label Jul 13, 2022

Dzeri96 mentioned this pull request Oct 7, 2022

json.dumps() should encode float number NaN to null #84813

Closed

Dzeri96 mentioned this pull request Oct 15, 2022

Proper or custom JSON serialization of non-finite float values #98306

Open

Lee-W and others added 8 commits June 4, 2023 11:44

Add encode_float argument to json encoder

b399d19

* To enable customize float encoding

Add docstring for the newly added encode_float

04e8b7d

Fix json test case break due to additional argument

dfc77d1

Add Misc/NEWS.d for bpo-36841

542c717

Fix json test case break due to encode_float argument

4fd6e25

Add comma after the last parameter

b096c00

Add test to json.dump for newly added encode_float argument

83ff368

style(json.c): unify self usage

4b53214

Lee-W force-pushed the bpo-36841 branch from a0a30ce to e5871b6 June 4, 2023 03:44

arhadthedev reviewed Jun 4, 2023

Modules/_json.c (outdated)

fix(json.c): fix missing }

5e3b2b8

Lee-W force-pushed the bpo-36841 branch from e5871b6 to 5e3b2b8 June 4, 2023 15:13

merwok added the type-feature and DO-NOT-MERGE labels and removed the awaiting core review label Jun 4, 2023

AlexWaygood changed the title ~~bpo-36841: Supporting customization of float encoding in JSON~~ gh-81022: Supporting customization of float encoding in JSON Jun 4, 2023

Lee-W closed this Aug 6, 2023
