@@ -29,7 +29,7 @@ You may also be interested in the [grammar].
2929
3030# Notation
3131
32- Rust's grammar is defined over Unicode codepoints , each conventionally denoted
32+ Rust's grammar is defined over Unicode code points , each conventionally denoted
3333` U+XXXX ` , for 4 or more hexadecimal digits ` X ` . _ Most_ of Rust's grammar is
3434confined to the ASCII range of Unicode, and is described in this document by a
3535dialect of Extended Backus-Naur Form (EBNF), specifically a dialect of EBNF
5353- Square brackets are used to group rules.
5454- ` LITERAL ` is a single printable ASCII character, or an escaped hexadecimal
5555 ASCII code of the form ` \xQQ ` , in single quotes, denoting the corresponding
56- Unicode codepoint ` U+00QQ ` .
56+ Unicode code point ` U+00QQ ` .
5757- ` IDENTIFIER ` is a nonempty string of ASCII letters and underscores.
5858- The ` repeat ` forms apply to the adjacent ` element ` , and are as follows:
5959 - ` ? ` means zero or one repetition
@@ -66,9 +66,9 @@ This EBNF dialect should hopefully be familiar to many readers.
6666
6767## Unicode productions
6868
69- A few productions in Rust's grammar permit Unicode codepoints outside the ASCII
69+ A few productions in Rust's grammar permit Unicode code points outside the ASCII
7070range. We define these productions in terms of character properties specified
71- in the Unicode standard, rather than in terms of ASCII-range codepoints . The
71+ in the Unicode standard, rather than in terms of ASCII-range code points . The
7272section [ Special Unicode Productions] ( #special-unicode-productions ) lists these
7373productions.
7474
@@ -91,10 +91,10 @@ production. See [tokens](#tokens) for more information.
9191
9292## Input format
9393
94- Rust input is interpreted as a sequence of Unicode codepoints encoded in UTF-8.
94+ Rust input is interpreted as a sequence of Unicode code points encoded in UTF-8.
9595Most Rust grammar rules are defined in terms of printable ASCII-range
96- codepoints , but a small number are defined in terms of Unicode properties or
97- explicit codepoint lists. [ ^ inputformat ]
96+ code points , but a small number are defined in terms of Unicode properties or
97+ explicit code point lists. [ ^ inputformat ]
9898
9999[ ^ inputformat ] : Substitute definitions for the special Unicode productions are
100100 provided to the grammar verifier, restricted to ASCII range, when verifying the
@@ -147,7 +147,7 @@ comments beginning with exactly one repeated asterisk in the block-open
147147sequence (` /** ` ), are interpreted as a special syntax for ` doc `
148148[ attributes] ( #attributes ) . That is, they are equivalent to writing
149149` #[doc="..."] ` around the body of the comment (this includes the comment
150- characters themselves, ie ` /// Foo ` turns into ` #[doc="/// Foo"] ` ).
150+ characters themselves, i.e. ` /// Foo ` turns into ` #[doc="/// Foo"] ` ).
151151
152152Line comments beginning with ` //! ` and block comments beginning with ` /*! ` are
153153doc comments that apply to the parent of the comment, rather than the item
@@ -333,14 +333,14 @@ Some additional _escapes_ are available in either character or non-raw string
333333literals. An escape starts with a ` U+005C ` (` \ ` ) and continues with one of the
334334following forms:
335335
336- * An _ 8-bit codepoint escape _ escape starts with ` U+0078 ` (` x ` ) and is
337- followed by exactly two _ hex digits_ . It denotes the Unicode codepoint
336+ * An _ 8-bit code point escape _ starts with ` U+0078 ` (` x ` ) and is
337+ followed by exactly two _ hex digits_ . It denotes the Unicode code point
338338 equal to the provided hex value.
339- * A _ 24-bit codepoint escape_ starts with ` U+0075 ` (` u ` ) and is followed
339+ * A _ 24-bit code point escape_ starts with ` U+0075 ` (` u ` ) and is followed
340340 by up to six _ hex digits_ surrounded by braces ` U+007B ` (` { ` ) and ` U+007D `
341- (` } ` ). It denotes the Unicode codepoint equal to the provided hex value.
341+ (` } ` ). It denotes the Unicode code point equal to the provided hex value.
342342* A _ whitespace escape_ is one of the characters ` U+006E ` (` n ` ), ` U+0072 `
343- (` r ` ), or ` U+0074 ` (` t ` ), denoting the unicode values ` U+000A ` (LF),
343+ (` r ` ), or ` U+0074 ` (` t ` ), denoting the Unicode values ` U+000A ` (LF),
344344 ` U+000D ` (CR) or ` U+0009 ` (HT) respectively.
345345* The _ backslash escape_ is the character ` U+005C ` (` \ ` ) which must be
346346 escaped in order to denote * itself* .
@@ -410,7 +410,7 @@ Some additional _escapes_ are available in either byte or non-raw byte string
410410literals. An escape starts with a ` U+005C ` (` \ ` ) and continues with one of the
411411following forms:
412412
413- * An _ byte escape_ escape starts with ` U+0078 ` (` x ` ) and is
413+ * A _ byte escape_ escape starts with ` U+0078 ` (` x ` ) and is
414414 followed by exactly two _ hex digits_ . It denotes the byte
415415 equal to the provided hex value.
416416* A _ whitespace escape_ is one of the characters ` U+006E ` (` n ` ), ` U+0072 `
@@ -700,9 +700,9 @@ in macro rules). In the transcriber, the designator is already known, and so
700700only the name of a matched nonterminal comes after the dollar sign.
701701
702702In both the matcher and transcriber, the Kleene star-like operator indicates
703- repetition. The Kleene star operator consists of ` $ ` and parens , optionally
703+ repetition. The Kleene star operator consists of ` $ ` and parenthesis , optionally
704704followed by a separator token, followed by ` * ` or ` + ` . ` * ` means zero or more
705- repetitions, ` + ` means at least one repetition. The parens are not matched or
705+ repetitions, ` + ` means at least one repetition. The parenthesis are not matched or
706706transcribed. On the matcher side, a name is bound to _ all_ of the names it
707707matches, in a structure that mimics the structure of the repetition encountered
708708on a successful match. The job of the transcriber is to sort that structure
@@ -1203,9 +1203,9 @@ the guarantee that these issues are never caused by safe code.
12031203
12041204[ noalias ] : http://llvm.org/docs/LangRef.html#noalias
12051205
1206- ##### Behaviour not considered unsafe
1206+ ##### Behavior not considered unsafe
12071207
1208- This is a list of behaviour not considered * unsafe* in Rust terms, but that may
1208+ This is a list of behavior not considered * unsafe* in Rust terms, but that may
12091209be undesired.
12101210
12111211* Deadlocks
@@ -1298,7 +1298,7 @@ specific type, but may implement several different traits, or be compatible with
12981298several different type constraints.
12991299
13001300For example, the following defines the type ` Point ` as a synonym for the type
1301- ` (u8, u8) ` , the type of pairs of unsigned 8 bit integers. :
1301+ ` (u8, u8) ` , the type of pairs of unsigned 8 bit integers:
13021302
13031303```
13041304type Point = (u8, u8);
@@ -1952,7 +1952,7 @@ type int8_t = i8;
19521952
19531953### Crate-only attributes
19541954
1955- - ` crate_name ` - specify the this crate's crate name.
1955+ - ` crate_name ` - specify the crate's crate name.
19561956- ` crate_type ` - see [ linkage] ( #linkage ) .
19571957- ` feature ` - see [ compiler features] ( #compiler-features ) .
19581958- ` no_builtins ` - disable optimizing certain code patterns to invocations of
@@ -3464,7 +3464,7 @@ is not a surrogate), represented as a 32-bit unsigned word in the 0x0000 to
34643464UTF-32 string.
34653465
34663466A value of type ` str ` is a Unicode string, represented as an array of 8-bit
3467- unsigned bytes holding a sequence of UTF-8 codepoints . Since ` str ` is of
3467+ unsigned bytes holding a sequence of UTF-8 code points . Since ` str ` is of
34683468unknown size, it is not a _ first-class_ type, but can only be instantiated
34693469through a pointer type, such as ` &str ` or ` String ` .
34703470
0 commit comments