Implementations of formatting traits for signed integers can be ambiguous #42860

Enet4 · 2017-06-23T14:58:56Z

I was recently formatting a signed integer primitive to hexadecimal using the standard formatter API, hoping that the result would be aware of the value's sign. It turns out that the implementations of UpperHex (and relatives such as LowerHex and Binary) for signed integers simply treat these numbers as unsigned (or just a sequence of bits).

println!("{:X}", -15i32);   // prints "FFFFFFF1",   expected "-F"

I posted this concern first as an SO question. A way around this is to make a newtype with another formatting implementation.

I can make arguments on both sides whether it should (or not) behave like this, but what actually concerns me most is that there seems to be no mention of this behaviour in the documentation. It appears that formatting trait implementations do not have to abide to a value's sign, but then the fact that a negative integer is treated as an unsigned number for formatting purposes can be unexpected for some people, especially when the docs do not clarify this situation.

To sum up: should we improve the documentation regarding what makes a valid formatting trait implementation? Should we also (or just) document further their implementations for integers in particular? I am willing to collaborate with the necessary changes one we're clear about what should be improved.

The text was updated successfully, but these errors were encountered:

dtolnay · 2017-11-18T20:10:32Z

This representation dates back to #1653 which implemented UpperHex using *self as $unsigned in 6feb58e.

For what it's worth, iostream in C++ and printf in C both handle this the same way we do.

#include <iostream>

int main() {
  int32_t x = -15;
  std::cout << std::uppercase << std::hex << x << '\n';
}

#include <stdint.h>
#include <stdio.h>

int main(void) {
  int32_t x = -15;
  printf("%X\n", x);
}

I don't believe we need to change the behavior but I do agree that the expectations around implementation of UpperHex and friends needs to be documented better.

Stargateur · 2017-11-19T23:50:37Z

Your code in C invoke undefined behavior, printf() family expect an unsigned int when you use the specifier X. If you don't send the right type to printf() the behavior is undefined. Plus, you use int32_t where an uint32_t will be undefined too.

dtolnay · 2017-11-19T23:54:14Z

Thanks @Stargateur. What would be the defined way to print a signed 32-bit integer in uppercase hex?

Stargateur · 2017-11-20T00:10:23Z

There are no way,

7.8.1 Macros for format speciﬁers (C11)
...
2 The fprintf macros for signed integers are:
PRIdN PRIdLEASTN PRIdFASTN PRIdMAX PRIdPTR
PRIiN PRIiLEASTN PRIiFASTN PRIiMAX PRIiPTR
3 The fprintf macros for unsigned integers are:
PRIoN PRIoLEASTN PRIoFASTN PRIoMAX PRIoPTR
PRIuN PRIuLEASTN PRIuFASTN PRIuMAX PRIuPTR
PRIxN PRIxLEASTN PRIxFASTN PRIxMAX PRIxPTR
PRIXN PRIXLEASTN PRIXFASTN PRIXMAX PRIXPTR

Like you see, standard don't give any specifier to print an intN_t in hexadecimal format. The correct way to print it would be to cast it.

#include <stdint.h>
#include <stdio.h>
#include <inttypes.h>

int main(void) {
  int32_t x = -15;
  printf("%"PRIX32"\n", (uint32_t)x);
}

Overflow of unsigned integer are defined by C standard so the cast is not undefined. AFAIK

Phlosioneer · 2018-03-11T12:38:22Z

I think this was addressed by #46285

Enet4 · 2018-03-11T13:04:21Z

@Phlosioneer That is correct! :) Closing.

Mark-Simulacrum added the T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. label Jun 23, 2017

Mark-Simulacrum added the C-feature-request Category: A feature request, i.e: not implemented / a PR. label Jul 28, 2017

steveklabnik added the P-medium Medium priority label Nov 21, 2017

XAMPPRocky added the C-enhancement Category: An issue proposing an enhancement or a PR with one. label Jan 22, 2018

Enet4 closed this as completed Mar 11, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implementations of formatting traits for signed integers can be ambiguous #42860

Implementations of formatting traits for signed integers can be ambiguous #42860

Enet4 commented Jun 23, 2017 •

edited

Loading

dtolnay commented Nov 18, 2017

Uh oh!

Stargateur commented Nov 19, 2017

Uh oh!

dtolnay commented Nov 19, 2017

Uh oh!

Stargateur commented Nov 20, 2017 •

edited

Loading

Uh oh!

Phlosioneer commented Mar 11, 2018

Uh oh!

Enet4 commented Mar 11, 2018

Uh oh!

Implementations of formatting traits for signed integers can be ambiguous #42860

Implementations of formatting traits for signed integers can be ambiguous #42860

Comments

Enet4 commented Jun 23, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

dtolnay commented Nov 18, 2017

Uh oh!

Stargateur commented Nov 19, 2017

Uh oh!

dtolnay commented Nov 19, 2017

Uh oh!

Stargateur commented Nov 20, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Phlosioneer commented Mar 11, 2018

Uh oh!

Enet4 commented Mar 11, 2018

Uh oh!

Enet4 commented Jun 23, 2017 •

edited

Loading

Stargateur commented Nov 20, 2017 •

edited

Loading