`Invalid` trait for space optimization of enums. #41

vadimcn · 2014-04-10T01:02:58Z

Proposing to add Invalid trait and use it for Option<T> space optimization. This is an alternative to RFC #36.

sfackler · 2014-04-10T01:12:44Z

active/0000-invalid-trait.md

+Add `Invalid` trait and use it for `Option<T>` space optimization, instead of hardcoding "special" types in the compiler.
+
+# Motivation
+


*T is nullable. &T and ~T are not.

ticki · 2016-01-10

Internally, they are (you can do NPO on them too).
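To make the null-pointer optimization (NPO) mentioned here concrete, the following is a minimal sketch in present-day Rust, where `Box<T>` has replaced the old `~T` syntax. The helper name `option_is_free` is made up for illustration. The standard library documents the size guarantee for `Option<&T>` and `Option<Box<T>>`; the raw-pointer result is what current compilers produce, not a documented guarantee.

```rust
use std::mem::size_of;

/// Hypothetical helper: true when wrapping `T` in `Option` costs no extra space,
/// i.e. the compiler found an unused bit pattern in `T` to encode `None`.
fn option_is_free<T>() -> bool {
    size_of::<Option<T>>() == size_of::<T>()
}

/// Reports the result for a reference, a `Box`, and a raw pointer, in that order.
fn npo_report() -> (bool, bool, bool) {
    (
        option_is_free::<&u8>(),       // references are never null
        option_is_free::<Box<u8>>(),   // Box (formerly ~T) is never null
        option_is_free::<*const u8>(), // raw pointers may be null, so no spare pattern
    )
}
```

Calling `npo_report()` should show that the two non-nullable pointer types get `Option` for free while the nullable raw pointer does not.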

sfackler · 2014-04-13T05:16:25Z

It's not clear to me how this would actually work. How would the Invalid trait be usable by the compiler itself as the type implementing it is being compiled? It seems like it'd either have to compile the crate once, dynamically load it, and then call the set_invalid method for all types that implement it or have some kind of interpreter that'd operate directly on the AST.

huonw · 2014-04-13T05:29:19Z

With associated items (and const-expr sizeof) something like

trait Invalid {
    static BITPATTERN: [u8, .. sizeof Self];
}

impl<T> Invalid for ~T {
    static BITPATTERN: [u8, .. 8] = [0, .. 8];
}

// ...

might work, although it seems a little peculiar.
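For comparison, here is a rough sketch of the same idea in present-day Rust, which does have associated constants. It is only an approximation of the proposal: stable Rust cannot write `[u8; size_of::<Self>()]` in a trait, so this version uses a byte slice instead, and the `Invalid` trait itself is hypothetical and not part of the standard library.

```rust
use std::mem::size_of;

/// Hypothetical trait (not in the standard library): names a byte pattern
/// that no valid value of the implementing type ever has.
trait Invalid {
    const BITPATTERN: &'static [u8];
}

impl<T> Invalid for Box<T> {
    // For sized `T`, a `Box<T>` is a non-null pointer the size of `usize`,
    // so the all-zero pattern never represents a valid box.
    const BITPATTERN: &'static [u8] = &[0; size_of::<usize>()];
}
```

Because the constant is known at compile time, nothing has to be executed to learn the pattern, which is the property the question above was probing.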

vadimcn · 2014-04-13T05:46:48Z

@sfackler, how is it different from compiler using other built-in traits, such as Deref or Drop?

sfackler · 2014-04-13T05:48:00Z

The compiler doesn't need to be able to execute an implementation of Drop at compile time. It just inserts calls to it at various points in the compiled program.

vadimcn · 2014-04-13T05:55:18Z

Invalid would work the same way. Why would it need to evaluate anything at compile time?

sfackler · 2014-04-13T05:57:27Z

struct MyStruct {
    ...
}

impl Invalid for MyStruct {
...
}

static FOO: Option<MyStruct> = None;

vadimcn · 2014-04-13T06:48:49Z

Grr, that's annoying. I suppose we'd have to prohibit statics with Invalid, just like we prohibit statics with Drop...
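The objection is that the bytes of a static are fixed at compile time, so the compiler must already know how `None` is encoded for the type. A small sketch using the built-in `NonZeroU32` type from present-day Rust (which postdates this thread) shows the situation the compiler has to handle. The static names are illustrative only.

```rust
use std::mem::size_of;
use std::num::NonZeroU32;

// Both initializers are evaluated at compile time; the compiler has to pick
// the byte representation of `None` and `Some(7)` without running any code
// at program start.
static FOO: Option<NonZeroU32> = None;
static BAR: Option<NonZeroU32> = NonZeroU32::new(7);

/// Size of the optimized option, for comparison with a plain `u32`.
fn optimized_size() -> usize {
    size_of::<Option<NonZeroU32>>()
}
```

Here the excluded value (zero) is built into the type, so no user-written method needs to run to produce it.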

ghost · 2014-04-14T15:37:37Z

I think this feature should be implemented using a private, associated compile-time constant of the Self type. This INVALID value of Self type must be a compile-time constant for obvious reasons and it mustn't become part of the public interface of the type which implements the Invalid trait because... well, for obvious reasons. Therefore we need two new features before this "Invalid trait" feature can be implemented: associated constants and private items for traits (although, I think that items inside traits should be private by default). It would look something like the following (assuming the default is still public and thus the priv keyword is needed):

trait Invalid {
    priv static INVALID: Self;
}

struct S {
    n: int
}

impl Invalid for S {
    priv static INVALID: S = S { n: 123 };
}

And, to the question of what exactly are the semantics of a "private" trait method, an associated constant or an associated type, I think the answer is that it should work like this:

The constant T::INVALID, where T is a type parameter which implements the Invalid trait, should be accessible only from the module where the Invalid trait is defined and its submodules.
The constant S::INVALID, where S is an actual type which implements Invalid, should be accessible only from the module where the trait Invalid is implemented for type S, and its submodules.

Also, I don't think that this Invalid trait feature and the automatic space optimization feature for Option<&T> and Option<~T> are mutually exclusive. We can, and I think we should, have them both.
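The associated-constant version of the proposal can be approximated in present-day Rust as follows. This is a sketch under several assumptions: the `Invalid` trait and the `Compact` wrapper are hypothetical, the privacy rules described above are not modeled (Rust has no private trait items), and the wrapper does by hand what the proposal wanted the compiler to do for `Option<T>` automatically.

```rust
/// Hypothetical trait: `INVALID` is a value the type promises never to use as real data.
trait Invalid: Sized + PartialEq {
    const INVALID: Self;
}

#[derive(Debug, PartialEq)]
struct S {
    n: i64,
}

impl Invalid for S {
    const INVALID: S = S { n: 123 };
}

/// Hand-written stand-in for a space-optimized `Option<T>`:
/// it stores only a `T` and uses `T::INVALID` to mean "no value".
struct Compact<T: Invalid>(T);

impl<T: Invalid> Compact<T> {
    fn none() -> Self {
        Compact(T::INVALID)
    }

    fn some(value: T) -> Self {
        // The sentinel itself can no longer be stored as a real value.
        assert!(value != T::INVALID, "the sentinel cannot be stored as a value");
        Compact(value)
    }

    fn get(&self) -> Option<&T> {
        if self.0 == T::INVALID {
            None
        } else {
            Some(&self.0)
        }
    }
}
```

The sketch compares whole values with `PartialEq`, whereas the proposal would compare bit patterns. That difference matters for types whose equality is not bitwise.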

pczarn · 2014-04-15T20:05:22Z

I think this feature should be implemented with a type such as Difference<T, Value> but it reminded me of Either<A, B> and we don't need either one when we have enums:

enum OptimizedPtr {
    NotNull(uint),
    invalid NotNull(0)
}

An alternative:

enum OptimizedPtr<T: RawPtr> {
    NotNull(T),
    invalid NullPtr = T::null()
}

nikomatsakis · 2014-04-21T19:20:40Z

This is an intriguing idea. I have to stew on it a bit, but I like it in principle.

dobkeratops · 2014-04-21T21:02:39Z

great idea, could this implement a -1 invalid array index type; and in turn operator[] overloaded to take a potentially invalid array index safely returning Some(T) or None
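A sketch of the index idea, written by hand in present-day Rust rather than through the proposed trait. The type `MaybeIndex` is hypothetical, and `usize::MAX` is used as the unsigned counterpart of -1. The lookup is a method rather than an `Index` overload, because the `Index` trait must return a reference and cannot return an `Option`.

```rust
/// Hypothetical index type that reserves `usize::MAX` as "no index".
#[derive(Clone, Copy, Debug, PartialEq)]
struct MaybeIndex(usize);

impl MaybeIndex {
    const INVALID: MaybeIndex = MaybeIndex(usize::MAX);

    fn new(i: usize) -> MaybeIndex {
        assert!(i != usize::MAX, "usize::MAX is reserved as the invalid index");
        MaybeIndex(i)
    }

    /// Looks the index up in `items`; yields `None` for the invalid index
    /// and for indices past the end of the slice.
    fn get<T>(self, items: &[T]) -> Option<&T> {
        if self == Self::INVALID {
            None
        } else {
            items.get(self.0)
        }
    }
}
```

With the proposed optimization, `Option<MaybeIndex>` could occupy the same space as a bare `usize`; this hand-written version gets that size by storing the sentinel directly.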

vadimcn · 2014-04-21T21:22:29Z

enum OptimizedPtr {
    NotNull(uint),
    invalid NotNull(0)
}

Somehow it doesn't feel right to actually instantiate the "invalid" bit pattern as an instance of T. Especially, if T implements Drop (would destructor be invoked on the invalid value? hmm...)

enum OptimizedPtr<T: RawPtr> {
   NotNull(T),
   invalid NullPtr = T::null()
}

Doesn't this suffer from the same problem as my proposal, in that it requires compile-time evaluation of functions? (or, alternatively, it requires "life-before-main" in order to initialize statics).

nikomatsakis · 2014-05-01T21:47:03Z

At the most recent meeting, we decided that while this idea is promising, the RFC is insufficiently fleshed out and hence we are going to close it. Most importantly, it seems like the problem of how to handle static bitpatterns needs to be worked out and specified in detail. If you have a revised proposal that handles statics, feel free to re-open. In general, while the problem this RFC addresses is important, we don't feel it's so urgent that we have to draft a design THIS SECOND, and hence we can take some time to work through the details.


The improvements to `byte_pair_merge` are:

- Changing the `parts` vector to avoid repetition of data. This vector used to store ranges for which the invariant `parts[i].end == parts[i + 1].start` holds, which makes the vector twice as big as it needs to be. Keeping this vector small improves CPU-cache efficiency.
- Using `usize::MAX` as a sentinel in lieu of `Option` for the computation of the minimum rank. This change removes branching from the loop to compute the minimum rank, generating assembly that uses conditional moves instead. Ideally, we could keep the `Option` and inform it of the sentinel much like `Option<NonZeroUsize>`. As far as I could tell, specifying custom sentinels for `Option` has an old Rust [RFC](rust-lang/rfcs#41) that has stalled, so we don't get to have nice things.
- Minimizing the number of lookups into `ranks` by looking up ranks once and iteratively updating them after each merge. This reduces the number of rank lookups from `n*m` to `n + O(m)`.
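The sentinel technique described above can be sketched as follows. This is not the code from that pull request: the function names are invented for illustration, and whether the loop compiles to conditional moves depends on the compiler and target.

```rust
use std::mem::size_of;
use std::num::NonZeroUsize;

/// Returns `(min_rank, index)` over `ranks`, where `usize::MAX` means "no rank".
/// If every entry is the sentinel, the result is `(usize::MAX, 0)`.
fn min_rank(ranks: &[usize]) -> (usize, usize) {
    let mut best = (usize::MAX, 0);
    for (i, &rank) in ranks.iter().enumerate() {
        // A plain integer comparison: no `Option` needs to be unwrapped here.
        if rank < best.0 {
            best = (rank, i);
        }
    }
    best
}

/// Sizes of `Option<NonZeroUsize>` and `Option<usize>`, in that order.
/// Only the first has a built-in excluded value (zero) to encode `None`.
fn option_sizes() -> (usize, usize) {
    (size_of::<Option<NonZeroUsize>>(), size_of::<Option<usize>>())
}
```

`Option<NonZeroUsize>` fits in one word because zero is excluded by the type, but there is no equivalent way to tell `Option<usize>` that `usize::MAX` is the excluded value, which is the gap this RFC aimed to fill.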

Invalid trait proposal.

d766f13

sfackler reviewed Apr 10, 2014

nikomatsakis closed this May 1, 2014

nikomatsakis added the postponed label May 1, 2014

pnkfelix mentioned this pull request May 21, 2014

Space-optimize Option<T> for integral enum T #84

Closed

huonw mentioned this pull request May 23, 2014

Allow for representing Option<uint> as an integer with an excluded sentinel value rust-lang/rust#14369

Closed

jdm mentioned this pull request May 30, 2014

Consider making JS<T> nullable servo/servo#2516

Closed

rust-highfive mentioned this pull request Sep 24, 2014

Invalid trait for space optimization of enums. #276

Closed

withoutboats pushed a commit to withoutboats/rfcs that referenced this pull request Jan 15, 2017

Merge pull request rust-lang#41 from SimonSapin/patch-3

16afb72

More Option -> Poll

petrochenkov removed the postponed label Feb 24, 2018

nistath mentioned this pull request Feb 13, 2023

Improve performance by 2x openai/tiktoken#31

Merged
