Skip to content

Commit

Permalink
Merge pull request #627 from RalfJung/union
Browse files Browse the repository at this point in the history
unions: call out field offset issues
  • Loading branch information
Centril authored Jun 29, 2019
2 parents 7a5aab5 + f9945c0 commit bccb2c7
Show file tree
Hide file tree
Showing 2 changed files with 20 additions and 9 deletions.
20 changes: 15 additions & 5 deletions src/items/unions.md
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,8 @@ The key property of unions is that all fields of a union share common storage.
As a result writes to one field of a union can overwrite its other fields, and
size of a union is determined by the size of its largest field.

## Initialization of a union

A value of a union type can be created using the same syntax that is used for
struct types, except that it must specify exactly one field:

Expand All @@ -37,13 +39,17 @@ struct fields:
let f = u.f1;
```

## Reading and writing union fields

Unions have no notion of an "active field". Instead, every union access just
interprets the storage at the type of the field used for the access. Reading a
union field reads the bits of the union at the field's type. It is the
programmer's responsibility to make sure that the data is valid at that
type. Failing to do so results in undefined behavior. For example, reading the
value `3` at type `bool` is undefined behavior. Effectively, writing to and then
reading from a union is analogous to a [`transmute`] from the type used for
union field reads the bits of the union at the field's type. Fields might have a
non-zero offset (except when `#[repr(C)]` is used); in that case the bits
starting at the offset of the fields are read. It is the programmer's
responsibility to make sure that the data is valid at the field's type. Failing
to do so results in undefined behavior. For example, reading the value `3` at
type `bool` is undefined behavior. Effectively, writing to and then reading from
a `#[repr(C)]` union is analogous to a [`transmute`] from the type used for
writing to the type used for reading.

Consequently, all reads of union fields have to be placed in `unsafe` blocks:
Expand All @@ -70,6 +76,8 @@ u.f1 = 2;
Commonly, code using unions will provide safe wrappers around unsafe union
field accesses.

## Pattern matching on unions

Another way to access union fields is to use pattern matching. Pattern matching
on union fields uses the same syntax as struct patterns, except that the pattern
must specify exactly one field. Since pattern matching is like reading the union
Expand Down Expand Up @@ -119,6 +127,8 @@ fn is_zero(v: Value) -> bool {
}
```

## References to union fields

Since union fields share common storage, gaining write access to one field of a
union can give write access to all its remaining fields. Borrow checking rules
have to be adjusted to account for this fact. As a result, if one field of a
Expand Down
9 changes: 5 additions & 4 deletions src/types/union.md
Original file line number Diff line number Diff line change
@@ -1,15 +1,16 @@
# Union types

A *union type* is a nominal, heterogeneous C-like union, denoted by the name of
a [`union` item].
a [`union` item][item].

A union access transmutes the content of the union to the type of the accessed
Unions have no notion of an "active field". Instead, every union access
transmutes parts of the content of the union to the type of the accessed
field. Since transmutes can cause unexpected or undefined behaviour, `unsafe` is
required to read from a union field or to write to a field that doesn't
implement [`Copy`].
implement [`Copy`]. See the [item] documentation for further details.

The memory layout of a `union` is undefined by default, but the `#[repr(...)]`
attribute can be used to fix a layout.

[`Copy`]: special-types-and-traits.html#copy
[`union` item]: items/unions.html
[item]: items/unions.html

0 comments on commit bccb2c7

Please sign in to comment.