Quick fix cleanup #5

ricardo-valero · 2024-02-27T00:42:42Z

Thanks @lukewilliamboswell now I have verified commits
I'm still learning about unicode so I will investigate more to contribute
For now this are some quick fixes

some unicode ranges
typos
updated string interpolation syntax

lukewilliamboswell

Looks good, I've left a couple of comments for you. I don't see any major issues here. Thank you for cleaning this up 😃

lukewilliamboswell · 2024-02-27T08:21:27Z

package/CodePoint.roc

    if u32 <= 0x10FFFF then
        Ok (fromU32Unchecked u32)
    else
        Err InvalidCodePoint

-## Returns false if this is either a [high-surrogate code point](http://www.unicode.org/glossary/#high_surrogate_code_point)


Did you mean to remove these links? I think we should keep these.

I was thinking external links should remain unique for code maintainability. Internal links can be repeated for internal navigation, and they could ultimately lead back to the original external link.
I'm open to putting them back if you find it necessary.

lukewilliamboswell · 2024-02-27T08:25:35Z

package/CodePoint.roc

    u32 >= 0xDC00 && u32 <= 0xDFFF

 ## Zig docs: bytes the UTF-8 representation would require
 ## for the given codepoint.
-utf8Len : CodePoint -> Result U64 [InvalidCodePoint]
+utf8Len : CodePoint -> Result U8 [InvalidCodePoint]


We used U64 here as it was recently changed from Nat. I don't really know myself, but just wondering why you changed it to U8?

This was one of my doubts and should've asked before. But utf8Len and countUtf8Bytes return values in the range of 1 to 4.

Lets give it a try. It's always easy to cast if needed

lukewilliamboswell · 2024-02-27T08:27:43Z

package/Scalar.roc

@@ -18,36 +18,34 @@ interface Scalar
    ]

 ## A [Unicode scalar value](http://www.unicode.org/glossary/#unicode_scalar_value) - that is,
-## any [code point](./CodePoint#CodePoint) except for [high-surrogate](http://www.unicode.org/glossary/#high_surrogate_code_point)
-## and [low-surrogate](http://www.unicode.org/glossary/#low_surrogate_code_point) code points.


Again, I think it is better to link to the source www.unicode.org here so that future contributors can easily find the relevant details in the spec.

ricardo-valero added 4 commits February 26, 2024 18:15

format and spaces

a5d11bb

fix high surrogate range

e1162d4

fix fromU32 ranges

f75f461

string interpolation syntax update and fix typos

65134fb

lukewilliamboswell reviewed Feb 27, 2024

View reviewed changes

lukewilliamboswell approved these changes Feb 27, 2024

View reviewed changes

lukewilliamboswell merged commit 09f22e6 into roc-lang:main Feb 27, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quick fix cleanup #5

Quick fix cleanup #5

ricardo-valero commented Feb 27, 2024 •

edited

Loading

lukewilliamboswell left a comment

lukewilliamboswell Feb 27, 2024

ricardo-valero Feb 27, 2024

lukewilliamboswell Feb 27, 2024

ricardo-valero Feb 27, 2024

lukewilliamboswell Feb 27, 2024

lukewilliamboswell Feb 27, 2024

lukewilliamboswell Feb 27, 2024

Quick fix cleanup #5

Quick fix cleanup #5

Conversation

ricardo-valero commented Feb 27, 2024 • edited Loading

lukewilliamboswell left a comment

Choose a reason for hiding this comment

lukewilliamboswell Feb 27, 2024

Choose a reason for hiding this comment

ricardo-valero Feb 27, 2024

Choose a reason for hiding this comment

lukewilliamboswell Feb 27, 2024

Choose a reason for hiding this comment

ricardo-valero Feb 27, 2024

Choose a reason for hiding this comment

lukewilliamboswell Feb 27, 2024

Choose a reason for hiding this comment

lukewilliamboswell Feb 27, 2024

Choose a reason for hiding this comment

lukewilliamboswell Feb 27, 2024

Choose a reason for hiding this comment

ricardo-valero commented Feb 27, 2024 •

edited

Loading