represent slices with length in elements, not bytes #9885

thestinger · 2013-10-16T01:14:48Z

The goal here is to avoid requiring a division or multiplication to compare against the length. The bounds check previously used an incorrect micro-optimization to replace the division by a multiplication, but now neither is necessary for slices. Unique/managed vectors will have to do a division to get the length until they are reworked/replaced.

casting the `uint` to an `int` can result in printing high values as negative intege

Closes #9020

This allows the indexing bounds check or other comparisons against an element length to avoid a multiplication by the size.

pcwalton · 2013-10-16T02:04:13Z

+1. This is especially important on ARM where there is no hardware integer division.

alexcrichton · 2013-10-16T05:06:51Z

src/test/run-fail/vec-underrun.rs

-// except according to those terms.
-
-
-// error-pattern:index out of bounds: the len is 2 but the index is -1


Why was this test removed?

It was testing that the index was printed wrong. I switched the index to being printed as a uint instead of an int, so the test no longer has a purpose. I added a new test for a very high index that was previously handled wrong to replace these.

GitHub thinks I moved small-negative-indexing.rs and edited it, but I really removed that too and wrote a new one.

alexcrichton · 2013-10-16T05:13:16Z

It looks like the assumption about byte vs count is pretty sprawling throughout the codebase. When you were looking at these, did you notice an obvious trend which would make this pattern hard-coded in fewer places? I would expect that at least slice construction would be a pretty refactorable portion, but perhaps indexing may be as well.

I'm also unfamiliar with the reasons for why we went with byte length as opposed to element count in the first place as well, and I'd want to understand why we did it before we change it.

thestinger · 2013-10-16T15:42:40Z

@alexcrichton: I don't know why byte count was ever used for length/capacity in vectors and length in slices. It results in a division or checked multiplication + extra branch being required for correct bounds checks, and a division to convert to the commonly needed element length.

The only time we use length in bytes is the amortized cost of reallocation in push, and similar functions like reserve. It doesn't make sense to micro-optimize the branch that's rarely taken and pays the much more expensive cost of an allocation already.

thestinger · 2013-10-16T16:27:00Z

I figured out the bug. It was caused by an array of zero-size types with destructors, because I replaced our manual byte offset derived from the non-zero length with a normal type-based offset.

brson · 2013-10-16T17:20:38Z

Since this only updates slices it introduces a division to borrowing. Do you plan to update unique vectors as well?

thestinger · 2013-10-16T17:27:25Z

@brson: yes, and I also want to change the representation of vectors beyond that (see #8981)

brson · 2013-10-16T17:42:14Z

Nice work! 🍬

The goal here is to avoid requiring a division or multiplication to compare against the length. The bounds check previously used an incorrect micro-optimization to replace the division by a multiplication, but now neither is necessary *for slices*. Unique/managed vectors will have to do a division to get the length until they are reworked/replaced.

huonw · 2013-10-16T23:31:26Z

src/librustc/middle/trans/tvec.rs

@@ -539,6 +538,40 @@ pub fn get_base_and_len(bcx: @mut Block,
    }
 }

+pub fn get_base_and_len(bcx: @mut Block, llval: ValueRef, vec_ty: ty::t) -> (ValueRef, ValueRef) {
+    //!


FWIW, this will mean the doc-string of get_base_and_len is ""; the rest of the block is ignored (it's treated as a normal comment).

Yeah, I just copy-pasted it from above since I plan on removing the other function soon. I didn't notice it was missing the other ! markers.

Don't lint `string_lit_as_bytes` in match scrutinees fixes rust-lang#9885 changelog: `string_lit_as_bytes`: Don't lint in match scrutinees

thestinger added 4 commits October 15, 2013 16:23

fix bounds checking failure message

420b426

casting the `uint` to an `int` can result in printing high values as negative intege

fix overflow on bounds checks

aa93381

Closes #9020

use element count in slices, not size in bytes

e1a26ad

This allows the indexing bounds check or other comparisons against an element length to avoid a multiplication by the size.

remove executable flag from source file

1e128d7

alexcrichton reviewed Oct 16, 2013
View reviewed changes

rename base_and_len -> base_and_byte_len

ef3ec1f

introduce base_and_len fns for element length

bd7610f

bors closed this Oct 16, 2013

bors merged commit bd7610f into rust-lang:master Oct 16, 2013

huonw reviewed Oct 16, 2013
View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

represent slices with length in elements, not bytes #9885

represent slices with length in elements, not bytes #9885

thestinger commented Oct 16, 2013

pcwalton commented Oct 16, 2013

alexcrichton Oct 16, 2013

thestinger Oct 16, 2013

alexcrichton commented Oct 16, 2013

thestinger commented Oct 16, 2013

thestinger commented Oct 16, 2013

brson commented Oct 16, 2013

thestinger commented Oct 16, 2013

brson commented Oct 16, 2013

huonw Oct 16, 2013

thestinger Oct 16, 2013

		// except according to those terms.


		// error-pattern:index out of bounds: the len is 2 but the index is -1

represent slices with length in elements, not bytes #9885

represent slices with length in elements, not bytes #9885

Conversation

thestinger commented Oct 16, 2013

pcwalton commented Oct 16, 2013

alexcrichton Oct 16, 2013

Choose a reason for hiding this comment

thestinger Oct 16, 2013

Choose a reason for hiding this comment

alexcrichton commented Oct 16, 2013

thestinger commented Oct 16, 2013

thestinger commented Oct 16, 2013

brson commented Oct 16, 2013

thestinger commented Oct 16, 2013

brson commented Oct 16, 2013

huonw Oct 16, 2013

Choose a reason for hiding this comment

thestinger Oct 16, 2013

Choose a reason for hiding this comment