feature/format arg_id parsing, align parsing, width parsing, skeleton classes #1232

barcharcraz · 2020-08-26T00:51:17Z

[format.string.general]/2, [format.string.std]/2,3

_Parse_arg_id implements [format.string.general]/2 parsing
_Parse_align parses alignments as specified in [format.string.std]
_Parse_width parses widths as specified in [format.string.std]
- note that it doesn't decide what to actually do with them
Most parsing functions that can be constexpr are constexpr, at least on some paths. In libfmt
pretty much all parsing functions are constexpr on all paths, but the standard does not
require much of anything in <format> to be constexpr, so it's more of a coding / debugging aid
than anything else. In particular, none of the number parsing is constexpr because it calls out to
from_chars.
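To illustrate the constexpr trade-off described above, here is a minimal sketch. The names `parsed_number`, `parse_digits`, and `parse_digits_runtime` are hypothetical and are not taken from the PR. A hand-rolled digit loop can be checked with `static_assert`, while a `std::from_chars`-based version only runs at run time in C++20.

```cpp
#include <cassert>
#include <charconv>
#include <cstddef>
#include <string_view>

// Hypothetical result type: the parsed value and how many characters were read.
struct parsed_number {
    unsigned value;
    std::size_t consumed;
};

// Hand-rolled digit loop. It is constexpr, so it can be exercised at compile time.
constexpr parsed_number parse_digits(std::string_view s) {
    unsigned value = 0;
    std::size_t i = 0;
    while (i < s.size() && s[i] >= '0' && s[i] <= '9') {
        value = value * 10 + static_cast<unsigned>(s[i] - '0'); // overflow is not checked in this sketch
        ++i;
    }
    return {value, i};
}

// Mistakes in the loop above would show up as compile errors here.
static_assert(parse_digits("42}").value == 42);
static_assert(parse_digits("42}").consumed == 2);

// The from_chars route: same result, but only usable at run time in C++20.
inline parsed_number parse_digits_runtime(std::string_view s) {
    unsigned value = 0;
    const auto result = std::from_chars(s.data(), s.data() + s.size(), value);
    return {value, static_cast<std::size_t>(result.ptr - s.data())};
}
```

The compile-time checks are the "debugging aid" part: a test that is a `static_assert` fails the build instead of failing at run time.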

Callbacks for parsing functions are specified in the concepts _Parse_spec_callbacks (for [format.string.std]/1) and _Parse_arg_id_callbacks (for the arg-id field of the grammar in [format.string.general]/1).

This PR has a sprinkling of changes all over, but future PRs should be more focused (I hope).

Tests included, however they are not complete.

miscco

This already looks quite good, I only came to basic_format_parse_context until my brain exploded.

miscco · 2020-08-26T09:50:42Z

stl/inc/format

+    } else if (*_Begin == '{') {
+        ++_Begin;
+        if(_Begin != _End) {
+            _Begin = _Parse_arg_id(_Begin, _End, _Width_adapter(_Callbacks));


I believe we should move this down. If *_Begin == '}' then _Parse_arg_id will early return and we will fail directly after this.

I don't understand the issue. If *_Begin == '}', then _Parse_arg_id will parse the ID as an auto-id and call the right handler for that, as it should.
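The behavior under discussion can be sketched as follows, assuming the width grammar from [format.string.std] (a positive integer, or a nested `{` arg-id `}`). The names `width_kind`, `width_result`, and `parse_width` are hypothetical, and unlike the PR this sketch returns a result struct instead of invoking callbacks. The point is that `{}` is a valid width: the nested arg-id is empty, so it is treated as an automatic index.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <string_view>

// Hypothetical classification of what the width field contained.
enum class width_kind { none, literal, auto_arg, manual_arg };

struct width_result {
    width_kind kind = width_kind::none;
    std::size_t value = 0;    // literal width, or argument index for manual_arg
    std::size_t consumed = 0; // characters consumed from the input
};

constexpr bool is_digit(char c) {
    return c >= '0' && c <= '9';
}

// Reads decimal digits starting at s[pos] and advances pos past them.
constexpr std::size_t parse_number(std::string_view s, std::size_t& pos) {
    std::size_t value = 0;
    while (pos < s.size() && is_digit(s[pos])) {
        value = value * 10 + static_cast<std::size_t>(s[pos] - '0'); // overflow unchecked
        ++pos;
    }
    return value;
}

constexpr width_result parse_width(std::string_view s) {
    width_result result;
    if (s.empty()) {
        return result;
    }
    std::size_t pos = 0;
    if (s[0] >= '1' && s[0] <= '9') { // literal width: a positive integer
        result.kind = width_kind::literal;
        result.value = parse_number(s, pos);
        result.consumed = pos;
        return result;
    }
    if (s[0] != '{') {
        return result; // no width present
    }
    pos = 1;
    if (pos == s.size()) {
        throw std::invalid_argument("unterminated nested width");
    }
    if (s[pos] == '}') {
        result.kind = width_kind::auto_arg; // "{}": empty arg-id means automatic indexing
    } else if (is_digit(s[pos])) {
        result.kind = width_kind::manual_arg;
        result.value = parse_number(s, pos); // this sketch does not reject leading zeros
        if (pos == s.size() || s[pos] != '}') {
            throw std::invalid_argument("expected '}' after nested arg-id");
        }
    } else {
        throw std::invalid_argument("invalid nested width");
    }
    result.consumed = pos + 1; // include the closing '}'
    return result;
}

static_assert(parse_width("{}").kind == width_kind::auto_arg);
```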

CaseyCarter

Overall: I'm concerned that there isn't test coverage here for all of the library components we're adding to <format>. Ideally we'd add components and tests at the same time, but // TODO: test coverage comments or something would be fine for this interim branch.

CaseyCarter · 2020-08-27T17:47:10Z

tests/std/tests/P0645R10_text_formatting_parsing/test.cpp

+    auto s0 = ""sv;
+    auto s2 = "*<"sv;
+    auto s3 = "*>"sv;
+    auto s4 = "*^"sv;


Missing test cases for "}", "{", and "{<".
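To make the requested cases concrete, here is a small fill-and-align parser sketch with those inputs exercised. The names `align`, `align_result`, and `parse_align` are hypothetical, and the treatment of `"{<"` (rejecting `{` as a fill character by throwing) is an assumption based on the grammar's rule that fill is any character other than `{` or `}`. The PR's `_Parse_align` may report this differently.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <string_view>

enum class align { none, left, right, center };

struct align_result {
    align alignment = align::none;
    char fill = ' ';
    std::size_t consumed = 0; // how many characters belonged to fill-and-align
};

constexpr align to_align(char c) {
    switch (c) {
    case '<':
        return align::left;
    case '>':
        return align::right;
    case '^':
        return align::center;
    default:
        return align::none;
    }
}

constexpr align_result parse_align(std::string_view s) {
    // A fill character is only present when the second character is an align character.
    if (s.size() >= 2 && to_align(s[1]) != align::none) {
        if (s[0] == '{' || s[0] == '}') {
            throw std::invalid_argument("invalid fill character"); // assumed error behavior
        }
        return {to_align(s[1]), s[0], 2};
    }
    if (!s.empty() && to_align(s[0]) != align::none) {
        return {to_align(s[0]), ' ', 1}; // align without an explicit fill
    }
    return {}; // no alignment; nothing consumed
}
```

Checking `consumed` in each case is what catches a parser that reads past the alignment or runs off the end of the input.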

stl/inc/format

tests/std/tests/P0645R10_text_formatting_parse_contexts/test.cpp

barcharcraz · 2020-09-01T00:46:47Z

This already looks quite good, I only came to basic_format_parse_context until my brain exploded.

Well, that's good, since after basic_format_parse_context the parsing bits end and we go into skeleton code that really just looks like the standard.

Edit: actually, I'd encourage you to speak up about bits you find confusing, since I found a lot of stuff confusing and want to add appropriate comments to make it easier for the next guy.

CaseyCarter

There are a few "outdated but unresolved" comments in my earlier review as well.

CaseyCarter · 2020-09-17T07:19:13Z

tests/std/tests/P0645R10_text_formatting_parsing/test.cpp

+        auto s5 = L"*\x343E"sv;
+        test_parse_helper(parse_align_fn, s5, false, view_typ::npos, {_Align::_None, L"*"sv});
+    }
+


Why are we checking the end position of the parse for any of the above cases?

I think you mean "why are we not checking", Casey. You were concerned about the parse potentially running off the end of the input.

Again, I'm happy to merge as-is and clean up later.

CaseyCarter

Couple of small things that can wait til later if you like.

CaseyCarter · 2020-09-22T19:13:33Z

stl/inc/format

+    }
+    for (;;) {
+        switch (*_Align_pt) {
+        case L'<':


comparing to L'{' here solves the problem where we narrow wide characters and mess up on wide characters where the least significant byte happens to be narrow '{'.

Switching on *_Align_pt instead of static_cast<char>(*_Align_pt) solved that problem. It's a different problem that comparing to L'<' assumes that < in every narrow encoding has the same value as L'<' and/or u8'<'.
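The narrowing problem can be reproduced in isolation. This is a sketch with hypothetical function names, assuming 8-bit `char` and an ASCII-compatible execution character set. It uses the code unit from the test string in this PR, `L'\x343E'`, whose low byte is `0x3E`, the value of narrow `'>'`.

```cpp
#include <cassert>

// Buggy variant: narrowing discards the high bits before the comparison,
// so any wide character whose low byte is 0x3C, 0x3E, or 0x5E matches.
constexpr bool is_align_char_narrowed(wchar_t c) {
    switch (static_cast<char>(c)) {
    case '<':
    case '>':
    case '^':
        return true;
    default:
        return false;
    }
}

// Fixed variant: switch on the wide character itself and compare to wide literals.
constexpr bool is_align_char(wchar_t c) {
    switch (c) {
    case L'<':
    case L'>':
    case L'^':
        return true;
    default:
        return false;
    }
}

static_assert(is_align_char_narrowed(L'\x343E')); // false positive: low byte is '>'
static_assert(!is_align_char(L'\x343E'));         // correctly rejected
static_assert(is_align_char(L'>'));
```

This only addresses the narrowing bug. The separate concern raised above, that `<` might not have the same value in every encoding, is not covered by this sketch.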

barcharcraz requested a review from a team August 26, 2020 00:51

StephanTLavavej added the cxx20 C++20 feature label Aug 26, 2020

miscco reviewed Aug 26, 2020

mnatsuhara assigned CaseyCarter Aug 26, 2020

statementreply reviewed Aug 27, 2020

miscco mentioned this pull request Aug 27, 2020

What about <format> ? #1237

Closed

CaseyCarter suggested changes Aug 27, 2020

mnatsuhara unassigned CaseyCarter Sep 2, 2020

barcharcraz added 16 commits September 8, 2020 14:40

add format header

dab5e89

add tests to build

f98520f

add format_arg_value and custom_value.

b34c42a

more format parser

ef225c1

parse align

035bc05

fill/align parsing tests.

8def328

arg_id and width

6209a57

tests for arg_id, quite basic right now

7c61ccc

correct a spelling error

cbaf074

add <format> to other required files

52f0f83

resolve some review comments

30c11a6

respond to review comments

0ced949

start conversion to string_view

e167e86

some tests for width

45a09b3

enable wchar_t

02060dc

constexprify tests.

d6a16ed

StephanTLavavej assigned barcharcraz Sep 9, 2020

tests for non-parsing is todo

14f0b50

barcharcraz force-pushed the fmt_parse branch from 330cbbc to 14f0b50 Compare September 10, 2020 20:42

barcharcraz added 2 commits September 10, 2020 13:48

forgot a bit of named args that needed removal.

85c0d34

remove a requires clause and use brief syntax.

fcd6ea0

barcharcraz added 2 commits September 10, 2020 13:55

newlines

7d95919

use concepts_matrix.

6da6a40

barcharcraz requested a review from CaseyCarter September 11, 2020 02:49

barcharcraz added 2 commits September 14, 2020 13:28

some review comments were hiding from me

6cd007d

fix wchar_t misparse bug

9c323ee

StephanTLavavej assigned CaseyCarter and unassigned barcharcraz Sep 16, 2020

CaseyCarter suggested changes Sep 17, 2020

StephanTLavavej assigned barcharcraz and unassigned CaseyCarter Sep 17, 2020

Charlie Barto and others added 6 commits September 21, 2020 16:51

Apply suggestions from code review

cd9c284

Co-authored-by: Casey Carter <cartec69@gmail.com>

Apply suggestions from code review

735d757

Co-authored-by: Casey Carter <cartec69@gmail.com>

move test coverage todo

29367f7

more review comments

af1381e

fix test string numbering

511eb2c

comment on narrowing test case

412e513

CaseyCarter approved these changes Sep 22, 2020

fix minor issues

72f6cac

barcharcraz merged commit b10d526 into microsoft:feature/format Sep 22, 2020

StephanTLavavej added the format C++20/23 format label Feb 6, 2021
