Skip to content

Commit 97cdc8e

Browse files
authored
Rollup merge of #130229 - RalfJung:ptr-offset-unsigned, r=scottmcm
ptr::add/sub: do not claim equivalence with `offset(c as isize)` In #110837, the `offset` intrinsic got changed to also allow a `usize` offset parameter. The intention is that this will do an unsigned multiplication with the size, and we have UB if that overflows -- and we also have UB if the result is larger than `usize::MAX`, i.e., if a subsequent cast to `isize` would wrap. ~~The LLVM backend sets some attributes accordingly.~~ This updates the docs for `add`/`sub` to match that intent, in preparation for adjusting codegen to exploit this UB. We use this opportunity to clarify what the exact requirements are: we compute the offset using mathematical multiplication (so it's no problem to have an `isize * usize` multiplication, we just multiply integers), and the result must fit in an `isize`. Cc `@rust-lang/opsem` `@nikic` #130239 updates Miri to detect this UB. `sub` still has some cases of UB not reflected in the underlying intrinsic semantics (and Miri does not catch): when we subtract `usize::MAX`, then after casting to `isize` that's just `-1` so we end up adding one unit without noticing any UB, but actually the offset we gave does not fit in an `isize`. Miri will currently still not complain for such cases: ```rust fn main() { let x = &[0i32; 2]; let x = x.as_ptr(); // This should be UB, we are subtracting way too much. unsafe { x.sub(usize::MAX).read() }; } ``` However, the LLVM IR we generate here also is UB-free. This is "just" library UB but not language UB. Cc `@saethlin;` might be worth adding precondition checks against overflow on `offset`/`add`/`sub`? Fixes #130211
2 parents 389a399 + bc3d072 commit 97cdc8e

File tree

3 files changed

+54
-44
lines changed

3 files changed

+54
-44
lines changed

library/core/src/intrinsics.rs

+1-2
Original file line numberDiff line numberDiff line change
@@ -1425,8 +1425,7 @@ extern "rust-intrinsic" {
14251425
///
14261426
/// If the computed offset is non-zero, then both the starting and resulting pointer must be
14271427
/// either in bounds or at the end of an allocated object. If either pointer is out
1428-
/// of bounds or arithmetic overflow occurs then any further use of the returned value will
1429-
/// result in undefined behavior.
1428+
/// of bounds or arithmetic overflow occurs then this operation is undefined behavior.
14301429
///
14311430
/// The stabilized version of this intrinsic is [`pointer::offset`].
14321431
#[must_use = "returns a new pointer rather than modifying its argument"]

library/core/src/ptr/const_ptr.rs

+26-21
Original file line numberDiff line numberDiff line change
@@ -346,7 +346,7 @@ impl<T: ?Sized> *const T {
346346
if self.is_null() { None } else { Some(unsafe { &*(self as *const MaybeUninit<T>) }) }
347347
}
348348

349-
/// Adds an offset to a pointer.
349+
/// Adds a signed offset to a pointer.
350350
///
351351
/// `count` is in units of T; e.g., a `count` of 3 represents a pointer
352352
/// offset of `3 * size_of::<T>()` bytes.
@@ -355,7 +355,8 @@ impl<T: ?Sized> *const T {
355355
///
356356
/// If any of the following conditions are violated, the result is Undefined Behavior:
357357
///
358-
/// * The computed offset, `count * size_of::<T>()` bytes, must not overflow `isize`.
358+
/// * The offset in bytes, `count * size_of::<T>()`, computed on mathematical integers (without
359+
/// "wrapping around"), must fit in an `isize`.
359360
///
360361
/// * If the computed offset is non-zero, then `self` must be derived from a pointer to some
361362
/// [allocated object], and the entire memory range between `self` and the result must be in
@@ -398,7 +399,7 @@ impl<T: ?Sized> *const T {
398399
unsafe { intrinsics::offset(self, count) }
399400
}
400401

401-
/// Calculates the offset from a pointer in bytes.
402+
/// Adds a signed offset in bytes to a pointer.
402403
///
403404
/// `count` is in units of **bytes**.
404405
///
@@ -418,7 +419,7 @@ impl<T: ?Sized> *const T {
418419
unsafe { self.cast::<u8>().offset(count).with_metadata_of(self) }
419420
}
420421

421-
/// Calculates the offset from a pointer using wrapping arithmetic.
422+
/// Adds a signed offset to a pointer using wrapping arithmetic.
422423
///
423424
/// `count` is in units of T; e.g., a `count` of 3 represents a pointer
424425
/// offset of `3 * size_of::<T>()` bytes.
@@ -480,7 +481,7 @@ impl<T: ?Sized> *const T {
480481
unsafe { intrinsics::arith_offset(self, count) }
481482
}
482483

483-
/// Calculates the offset from a pointer in bytes using wrapping arithmetic.
484+
/// Adds a signed offset in bytes to a pointer using wrapping arithmetic.
484485
///
485486
/// `count` is in units of **bytes**.
486487
///
@@ -804,7 +805,11 @@ impl<T: ?Sized> *const T {
804805
}
805806
}
806807

807-
/// Adds an offset to a pointer (convenience for `.offset(count as isize)`).
808+
/// Adds an unsigned offset to a pointer.
809+
///
810+
/// This can only move the pointer forward (or not move it). If you need to move forward or
811+
/// backward depending on the value, then you might want [`offset`](#method.offset) instead
812+
/// which takes a signed offset.
808813
///
809814
/// `count` is in units of T; e.g., a `count` of 3 represents a pointer
810815
/// offset of `3 * size_of::<T>()` bytes.
@@ -813,7 +818,8 @@ impl<T: ?Sized> *const T {
813818
///
814819
/// If any of the following conditions are violated, the result is Undefined Behavior:
815820
///
816-
/// * The computed offset, `count * size_of::<T>()` bytes, must not overflow `isize`.
821+
/// * The offset in bytes, `count * size_of::<T>()`, computed on mathematical integers (without
822+
/// "wrapping around"), must fit in an `isize`.
817823
///
818824
/// * If the computed offset is non-zero, then `self` must be derived from a pointer to some
819825
/// [allocated object], and the entire memory range between `self` and the result must be in
@@ -856,7 +862,7 @@ impl<T: ?Sized> *const T {
856862
unsafe { intrinsics::offset(self, count) }
857863
}
858864

859-
/// Calculates the offset from a pointer in bytes (convenience for `.byte_offset(count as isize)`).
865+
/// Adds an unsigned offset in bytes to a pointer.
860866
///
861867
/// `count` is in units of bytes.
862868
///
@@ -876,8 +882,11 @@ impl<T: ?Sized> *const T {
876882
unsafe { self.cast::<u8>().add(count).with_metadata_of(self) }
877883
}
878884

879-
/// Subtracts an offset from a pointer (convenience for
880-
/// `.offset((count as isize).wrapping_neg())`).
885+
/// Subtracts an unsigned offset from a pointer.
886+
///
887+
/// This can only move the pointer backward (or not move it). If you need to move forward or
888+
/// backward depending on the value, then you might want [`offset`](#method.offset) instead
889+
/// which takes a signed offset.
881890
///
882891
/// `count` is in units of T; e.g., a `count` of 3 represents a pointer
883892
/// offset of `3 * size_of::<T>()` bytes.
@@ -886,7 +895,8 @@ impl<T: ?Sized> *const T {
886895
///
887896
/// If any of the following conditions are violated, the result is Undefined Behavior:
888897
///
889-
/// * The computed offset, `count * size_of::<T>()` bytes, must not overflow `isize`.
898+
/// * The offset in bytes, `count * size_of::<T>()`, computed on mathematical integers (without
899+
/// "wrapping around"), must fit in an `isize`.
890900
///
891901
/// * If the computed offset is non-zero, then `self` must be derived from a pointer to some
892902
/// [allocated object], and the entire memory range between `self` and the result must be in
@@ -937,8 +947,7 @@ impl<T: ?Sized> *const T {
937947
}
938948
}
939949

940-
/// Calculates the offset from a pointer in bytes (convenience for
941-
/// `.byte_offset((count as isize).wrapping_neg())`).
950+
/// Subtracts an unsigned offset in bytes from a pointer.
942951
///
943952
/// `count` is in units of bytes.
944953
///
@@ -958,8 +967,7 @@ impl<T: ?Sized> *const T {
958967
unsafe { self.cast::<u8>().sub(count).with_metadata_of(self) }
959968
}
960969

961-
/// Calculates the offset from a pointer using wrapping arithmetic.
962-
/// (convenience for `.wrapping_offset(count as isize)`)
970+
/// Adds an unsigned offset to a pointer using wrapping arithmetic.
963971
///
964972
/// `count` is in units of T; e.g., a `count` of 3 represents a pointer
965973
/// offset of `3 * size_of::<T>()` bytes.
@@ -1020,8 +1028,7 @@ impl<T: ?Sized> *const T {
10201028
self.wrapping_offset(count as isize)
10211029
}
10221030

1023-
/// Calculates the offset from a pointer in bytes using wrapping arithmetic.
1024-
/// (convenience for `.wrapping_byte_offset(count as isize)`)
1031+
/// Adds an unsigned offset in bytes to a pointer using wrapping arithmetic.
10251032
///
10261033
/// `count` is in units of bytes.
10271034
///
@@ -1038,8 +1045,7 @@ impl<T: ?Sized> *const T {
10381045
self.cast::<u8>().wrapping_add(count).with_metadata_of(self)
10391046
}
10401047

1041-
/// Calculates the offset from a pointer using wrapping arithmetic.
1042-
/// (convenience for `.wrapping_offset((count as isize).wrapping_neg())`)
1048+
/// Subtracts an unsigned offset from a pointer using wrapping arithmetic.
10431049
///
10441050
/// `count` is in units of T; e.g., a `count` of 3 represents a pointer
10451051
/// offset of `3 * size_of::<T>()` bytes.
@@ -1100,8 +1106,7 @@ impl<T: ?Sized> *const T {
11001106
self.wrapping_offset((count as isize).wrapping_neg())
11011107
}
11021108

1103-
/// Calculates the offset from a pointer in bytes using wrapping arithmetic.
1104-
/// (convenience for `.wrapping_offset((count as isize).wrapping_neg())`)
1109+
/// Subtracts an unsigned offset in bytes from a pointer using wrapping arithmetic.
11051110
///
11061111
/// `count` is in units of bytes.
11071112
///

library/core/src/ptr/mut_ptr.rs

+27-21
Original file line numberDiff line numberDiff line change
@@ -344,7 +344,7 @@ impl<T: ?Sized> *mut T {
344344
if self.is_null() { None } else { Some(unsafe { &*(self as *const MaybeUninit<T>) }) }
345345
}
346346

347-
/// Adds an offset to a pointer.
347+
/// Adds a signed offset to a pointer.
348348
///
349349
/// `count` is in units of T; e.g., a `count` of 3 represents a pointer
350350
/// offset of `3 * size_of::<T>()` bytes.
@@ -353,7 +353,8 @@ impl<T: ?Sized> *mut T {
353353
///
354354
/// If any of the following conditions are violated, the result is Undefined Behavior:
355355
///
356-
/// * The computed offset, `count * size_of::<T>()` bytes, must not overflow `isize`.
356+
/// * The offset in bytes, `count * size_of::<T>()`, computed on mathematical integers (without
357+
/// "wrapping around"), must fit in an `isize`.
357358
///
358359
/// * If the computed offset is non-zero, then `self` must be derived from a pointer to some
359360
/// [allocated object], and the entire memory range between `self` and the result must be in
@@ -398,7 +399,7 @@ impl<T: ?Sized> *mut T {
398399
unsafe { intrinsics::offset(self, count) }
399400
}
400401

401-
/// Calculates the offset from a pointer in bytes.
402+
/// Adds a signed offset in bytes to a pointer.
402403
///
403404
/// `count` is in units of **bytes**.
404405
///
@@ -418,7 +419,8 @@ impl<T: ?Sized> *mut T {
418419
unsafe { self.cast::<u8>().offset(count).with_metadata_of(self) }
419420
}
420421

421-
/// Calculates the offset from a pointer using wrapping arithmetic.
422+
/// Adds a signed offset to a pointer using wrapping arithmetic.
423+
///
422424
/// `count` is in units of T; e.g., a `count` of 3 represents a pointer
423425
/// offset of `3 * size_of::<T>()` bytes.
424426
///
@@ -477,7 +479,7 @@ impl<T: ?Sized> *mut T {
477479
unsafe { intrinsics::arith_offset(self, count) as *mut T }
478480
}
479481

480-
/// Calculates the offset from a pointer in bytes using wrapping arithmetic.
482+
/// Adds a signed offset in bytes to a pointer using wrapping arithmetic.
481483
///
482484
/// `count` is in units of **bytes**.
483485
///
@@ -885,7 +887,11 @@ impl<T: ?Sized> *mut T {
885887
unsafe { (self as *const T).sub_ptr(origin) }
886888
}
887889

888-
/// Adds an offset to a pointer (convenience for `.offset(count as isize)`).
890+
/// Adds an unsigned offset to a pointer.
891+
///
892+
/// This can only move the pointer forward (or not move it). If you need to move forward or
893+
/// backward depending on the value, then you might want [`offset`](#method.offset) instead
894+
/// which takes a signed offset.
889895
///
890896
/// `count` is in units of T; e.g., a `count` of 3 represents a pointer
891897
/// offset of `3 * size_of::<T>()` bytes.
@@ -894,7 +900,8 @@ impl<T: ?Sized> *mut T {
894900
///
895901
/// If any of the following conditions are violated, the result is Undefined Behavior:
896902
///
897-
/// * The computed offset, `count * size_of::<T>()` bytes, must not overflow `isize`.
903+
/// * The offset in bytes, `count * size_of::<T>()`, computed on mathematical integers (without
904+
/// "wrapping around"), must fit in an `isize`.
898905
///
899906
/// * If the computed offset is non-zero, then `self` must be derived from a pointer to some
900907
/// [allocated object], and the entire memory range between `self` and the result must be in
@@ -937,7 +944,7 @@ impl<T: ?Sized> *mut T {
937944
unsafe { intrinsics::offset(self, count) }
938945
}
939946

940-
/// Calculates the offset from a pointer in bytes (convenience for `.byte_offset(count as isize)`).
947+
/// Adds an unsigned offset in bytes to a pointer.
941948
///
942949
/// `count` is in units of bytes.
943950
///
@@ -957,8 +964,11 @@ impl<T: ?Sized> *mut T {
957964
unsafe { self.cast::<u8>().add(count).with_metadata_of(self) }
958965
}
959966

960-
/// Subtracts an offset from a pointer (convenience for
961-
/// `.offset((count as isize).wrapping_neg())`).
967+
/// Subtracts an unsigned offset from a pointer.
968+
///
969+
/// This can only move the pointer backward (or not move it). If you need to move forward or
970+
/// backward depending on the value, then you might want [`offset`](#method.offset) instead
971+
/// which takes a signed offset.
962972
///
963973
/// `count` is in units of T; e.g., a `count` of 3 represents a pointer
964974
/// offset of `3 * size_of::<T>()` bytes.
@@ -967,7 +977,8 @@ impl<T: ?Sized> *mut T {
967977
///
968978
/// If any of the following conditions are violated, the result is Undefined Behavior:
969979
///
970-
/// * The computed offset, `count * size_of::<T>()` bytes, must not overflow `isize`.
980+
/// * The offset in bytes, `count * size_of::<T>()`, computed on mathematical integers (without
981+
/// "wrapping around"), must fit in an `isize`.
971982
///
972983
/// * If the computed offset is non-zero, then `self` must be derived from a pointer to some
973984
/// [allocated object], and the entire memory range between `self` and the result must be in
@@ -1018,8 +1029,7 @@ impl<T: ?Sized> *mut T {
10181029
}
10191030
}
10201031

1021-
/// Calculates the offset from a pointer in bytes (convenience for
1022-
/// `.byte_offset((count as isize).wrapping_neg())`).
1032+
/// Subtracts an unsigned offset in bytes from a pointer.
10231033
///
10241034
/// `count` is in units of bytes.
10251035
///
@@ -1039,8 +1049,7 @@ impl<T: ?Sized> *mut T {
10391049
unsafe { self.cast::<u8>().sub(count).with_metadata_of(self) }
10401050
}
10411051

1042-
/// Calculates the offset from a pointer using wrapping arithmetic.
1043-
/// (convenience for `.wrapping_offset(count as isize)`)
1052+
/// Adds an unsigned offset to a pointer using wrapping arithmetic.
10441053
///
10451054
/// `count` is in units of T; e.g., a `count` of 3 represents a pointer
10461055
/// offset of `3 * size_of::<T>()` bytes.
@@ -1099,8 +1108,7 @@ impl<T: ?Sized> *mut T {
10991108
self.wrapping_offset(count as isize)
11001109
}
11011110

1102-
/// Calculates the offset from a pointer in bytes using wrapping arithmetic.
1103-
/// (convenience for `.wrapping_byte_offset(count as isize)`)
1111+
/// Adds an unsigned offset in bytes to a pointer using wrapping arithmetic.
11041112
///
11051113
/// `count` is in units of bytes.
11061114
///
@@ -1117,8 +1125,7 @@ impl<T: ?Sized> *mut T {
11171125
self.cast::<u8>().wrapping_add(count).with_metadata_of(self)
11181126
}
11191127

1120-
/// Calculates the offset from a pointer using wrapping arithmetic.
1121-
/// (convenience for `.wrapping_offset((count as isize).wrapping_neg())`)
1128+
/// Subtracts an unsigned offset from a pointer using wrapping arithmetic.
11221129
///
11231130
/// `count` is in units of T; e.g., a `count` of 3 represents a pointer
11241131
/// offset of `3 * size_of::<T>()` bytes.
@@ -1177,8 +1184,7 @@ impl<T: ?Sized> *mut T {
11771184
self.wrapping_offset((count as isize).wrapping_neg())
11781185
}
11791186

1180-
/// Calculates the offset from a pointer in bytes using wrapping arithmetic.
1181-
/// (convenience for `.wrapping_offset((count as isize).wrapping_neg())`)
1187+
/// Subtracts an unsigned offset in bytes from a pointer using wrapping arithmetic.
11821188
///
11831189
/// `count` is in units of bytes.
11841190
///

0 commit comments

Comments
 (0)