NegatedIndex type #1032

StefanKarpinski · 2012-07-10T22:47:26Z

Idea spawned by this discussion. The idea is that just as writing v[b], v[!b] where b is a logical indexing vector partitions v into two disjoint sets of elements, writing v[x], v[!x] should also partition v into two disjoint sets of elements when x is a single index or a range of indices. This requires a new NegatedIndex type, together with ! methods for integers and ranges and the appropriate method for indexing arrays with NegatedIndex objects.

JeffBezanson · 2012-07-11T01:22:42Z

What does it do on integer vectors?

StefanKarpinski · 2012-07-11T01:48:34Z

I'm basically thinking that the structure looks like this:

type NegatedIndex{T}
    idx::T
end

So you just keep the original object until it's time to do the actual indexing, at which point you know what you're "subtracting" the index object from, which lets you compute the actual index set and then do regular indexing with that. So ![2,5] would produce NegatedIndex([2,4]). In v[![2,5]], the indexing operations would compute [1:length(v)]-[2,5] as [1,3,5,...,length(v)] (supposing length(v) ≥ 5) and return [v[1],v[3],v[5],...,v[end]]. Make sense?

ViralBShah · 2012-07-12T09:10:31Z

We do not define ! on non-booleans. That makes it possible for us to give it a different meaning, at least for Index types. But, ~ in matlab does have a defined behaviour, and that could lead to confusion.

That said, I do like the idea, and here are some thoughts.

ComplementIndex is an alternative name. We could alternatively allow \ to be used as a unary operator, to mirror the set complement operator.

A complement operator may even be useful outside of indexing, like in loop iterations and such.

ViralBShah · 2012-07-12T09:12:29Z

Looking at the names in http://en.wikipedia.org/wiki/Complement_(set_theory), we could even just have a functional interface, using except, which returns the new Index type.

StefanKarpinski · 2012-07-12T18:10:30Z

I really think that the ! operator makes sense in this case, although I would be ok with writing except(k) as well. Actually, calling the type Except would make a fair amount of sense.

ViralBShah · 2012-07-13T02:25:07Z

ExceptIndex would be clearer for the type name.

toivoh · 2012-07-31T07:07:08Z

I think it seems a bit fishy to define v[!integer_array]:
would v[![1,2]] == v[![2,1]] == v[![1,2,1]] == v[3] == 3 for v=[1,2,3]?
The trouble seems to be to negate a sequence (array) rather than a set.
Perhaps better to allow v[int_set] and v[!int_set] for subsetting, (and keep v[!int_index])?

StefanKarpinski · 2012-07-31T17:32:45Z

It is a little weird, but I was thinking that you resolve, e.g. NegatedIndex([2,1]) when indexed into an indexing slot with range 1:n by iterating 1:n and then "emitting" the index value if and only if !contains([2,1],i). Does that make sense?
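A minimal sketch of that resolution rule in current Julia syntax. `resolve` is a hypothetical helper name, not anything in Base. Because only membership is tested, the order of the wrapped indices and any duplicates among them do not affect the result:

```julia
# Wrapper that remembers the index to exclude (type name taken from the thread).
struct NegatedIndex{T}
    idx::T
end

# Hypothetical helper: walk 1:n and emit i only if the wrapped index lacks it.
resolve(neg::NegatedIndex, n::Integer) = [i for i in 1:n if !(i in neg.idx)]

v = [1, 2, 3]
a = v[resolve(NegatedIndex([1, 2]), length(v))]
b = v[resolve(NegatedIndex([2, 1]), length(v))]
c = v[resolve(NegatedIndex([1, 2, 1]), length(v))]
# a, b and c are all the one-element vector [3]
```

Under this rule the three expressions from the question above agree with each other, although each yields a one-element vector rather than the scalar v[3].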

toivoh · 2012-08-07T10:44:47Z

I'm a bit worried about surprises that might come out of this. E.g., you might expect a[!inds] to always give you a subsequence, but what if someone passed an inds::NegatedIndex argument into your function?

The way I interpret it, !index is a set complement operation, with sets represented as strictly increasing sequences.
Perhaps one could refuse to create/use a NegatedIndex from a sequence that is not strictly increasing?
Then common cases like a[!(2:5)] would work, ! would be its own inverse, and

x, y = a[inds], a[!inds]

would always partition the elements of a.
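One way to sketch that restriction, assuming a hypothetical `StrictNegatedIndex` type whose constructor rejects anything that is not strictly increasing, and a hypothetical `kept` helper that resolves it against a length:

```julia
# Hypothetical type: a complement that only accepts strictly increasing indices.
struct StrictNegatedIndex
    idx::Vector{Int}
    function StrictNegatedIndex(idx::AbstractVector{<:Integer})
        v = collect(Int, idx)
        all(v[k] < v[k+1] for k in 1:length(v)-1) ||
            throw(ArgumentError("indices must be strictly increasing"))
        return new(v)
    end
end

# Positions of 1:n that are not excluded.
kept(neg::StrictNegatedIndex, n::Integer) = [i for i in 1:n if !(i in neg.idx)]

a = collect(11:17)
inds = [2, 3, 5]
x = a[inds]
y = a[kept(StrictNegatedIndex(inds), length(a))]
# x and y are disjoint and together contain every element of a

# Complementing twice gives back the original index set.
once = kept(StrictNegatedIndex(inds), length(a))
twice = kept(StrictNegatedIndex(once), length(a))
```

With this restriction, an input such as [2, 1] or [1, 2, 1] is an error at construction time rather than something silently reordered.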

johnmyleswhite · 2012-12-30T01:33:49Z

This seems to have fizzled out, but I still really, really want this functionality to be added. It's one of the most convenient tricks in R for describing data resampling. See the jackknife for the simplest example of a statistical method in which this notation helps.

I just discovered that one of the other DataFrame developers hacked together an implementation of this style of indexing for the columns of DataFrames simply because it was so unnatural to articulate one algorithm without it.

punkrockpolly · 2013-11-06T00:26:47Z

If I'm understanding this correctly, the negated index [!x] would strip the member whose position has the same value as the index [x], creating a vector slice with the member removed.

Can someone please explain the following:

"For example, the following creates a vector slice with the third member removed. ![2,5] would produce NegatedIndex([2,4]) ...
and return [v[1],v[3],v[5],...,v[end]]."

Why wouldn't ![2,5] produce NegatedIndex([2,5]), and v[5] be removed instead of v[4]?? Is there something I'm misunderstanding?

StefanKarpinski · 2013-11-06T00:40:03Z

Yes, that's absolutely right. I just made a bunch of mistakes jumbled together in that post :-)

punkrockpolly · 2013-11-08T17:09:28Z

I'm looking into this.
Quick question: how do I define the ! operator to reference either the original array or its size to figure out which indices are not idx in a[!idx]?

timholy · 2013-11-08T17:23:42Z

You might not have to; define ! for the type of indexes you want to support (e.g., integers, ranges, and Vector{Int}) to produce a new type like NegatedIndex (or ComplementIndex). Then define the function getindex(a, idx::NegatedIndex).

timholy · 2013-11-08T17:26:14Z

Of course, you'll also need to look at setindex! and consider how to handle multidimensional arrays. You may find that you need to add more than a couple of function definitions :-).
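A rough sketch of that shape for one-dimensional arrays only. Adding `!` methods for integers from outside Base would be type piracy, so this sketch uses a hypothetical `neg` function in place of `!`; `kept` is also a made-up helper name:

```julia
struct NegatedIndex{T}
    idx::T
end

# Hypothetical constructor function standing in for the proposed `!` methods.
neg(i::Integer) = NegatedIndex(i)
neg(r::AbstractRange{<:Integer}) = NegatedIndex(r)
neg(v::Vector{Int}) = NegatedIndex(v)

# Positions of `a` that survive the negation.
kept(a::AbstractVector, n::NegatedIndex) = [i for i in eachindex(a) if !(i in n.idx)]

# The array is available here, so no extra context has to be stored in the index.
Base.getindex(a::AbstractVector, n::NegatedIndex) = a[kept(a, n)]
Base.setindex!(a::AbstractVector, x, n::NegatedIndex) = setindex!(a, x, kept(a, n))

v = [10, 20, 30, 40, 50]
dropped = v[neg(2)]       # everything except the second element
ends = v[neg(2:4)]        # only the first and last elements
v[neg(2:4)] = [0, 0]      # overwrite the first and last elements in place
```

Multidimensional arrays are not handled here; each extra dimension would need its own length passed to the resolution step.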

JeffBezanson · 2013-11-08T19:13:35Z

We might have to change the indexing code to explicitly combine dimension sizes with indexes to allow implementing things like this (and Colon). For example instead of

for i in idx
  ...

we'd write

for i in fullindex(idx, size(A,n))
  ...
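To make the proposal concrete, here is a small sketch of what such a `fullindex` could look like. Everything in it is hypothetical: `fullindex` is only the name used above, and `pick` is a toy stand-in for the real indexing code:

```julia
struct NegatedIndex{T}
    idx::T
end

# Hypothetical `fullindex`: turn any index into a concrete iterable,
# given the length n of the dimension it indexes.
fullindex(idx, n::Integer) = idx                  # ordinary indices pass through
fullindex(::Colon, n::Integer) = 1:n              # `:` needs n to know where to stop
fullindex(neg::NegatedIndex, n::Integer) =
    [i for i in 1:n if !(i in neg.idx)]           # so does a negated index

# Toy two-dimensional indexing routine built on the loop shape above.
function pick(A::AbstractMatrix, rows, cols)
    return [A[i, j] for i in fullindex(rows, size(A, 1)),
                        j in fullindex(cols, size(A, 2))]
end

A = [1 2 3; 4 5 6; 7 8 9]
B = pick(A, NegatedIndex(2), :)          # rows 1 and 3, all columns
C = pick(A, 1:2, NegatedIndex([1, 3]))   # rows 1 and 2, column 2 only
```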

StefanKarpinski · 2013-11-08T20:52:23Z

Jeff, can you elaborate on that? I'm failing to see the need for it – not that there isn't one, I'm just not seeing why.

JeffBezanson · 2013-11-08T21:07:28Z

Because there is no way to implement iteration of something like NegatedIndex([2]) without knowing the size of the indexed dimension.

StefanKarpinski · 2013-11-09T00:47:56Z

Yes, obviously you need a context for that. The part I needed clarification of was "change the indexing code" – what indexing code are you referring to?

StefanKarpinski · 2013-11-18T16:53:22Z

After looking at some code with @punkrockpolly, I'm wondering if the best approach here isn't to have a Complement{T} type that primarily provides an in method:

immutable Complement{T}
  collection::T
end

in(x,c::Complement) = !in(x,c.collection)

Then you can also do things like this:

getindex(v::AbstractVector, c::Complement) = v[filter(i->in(i,c), 1:end)]

Of course, that doesn't cover everything, but I think the limited scope of the Complement abstraction clarifies things a bit.
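For readers on current Julia, the same idea can be sketched with `struct` in place of `immutable` and with the Base functions extended explicitly. This is only a restatement of the snippet above under those assumptions, limited to vectors:

```julia
struct Complement{T}
    collection::T
end

# x is in the complement exactly when it is not in the wrapped collection.
Base.in(x, c::Complement) = !(x in c.collection)

# Indexing keeps the positions of v that belong to the complement.
Base.getindex(v::AbstractVector, c::Complement) =
    v[[i for i in eachindex(v) if i in c]]

v = [10, 20, 30, 40]
w = v[Complement(2:3)]        # elements at positions 1 and 4

# Membership also works outside of indexing, e.g. when filtering a loop range.
odd_ones = [i for i in 1:6 if i in Complement([2, 4, 6])]
```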

@punkrockpolly

Paired on this with @punkrockpolly, see: https://github.com/punkrockpolly/Playing-with-Julia/blob/master/negatedindex.jl The premise of the Complement type is that it abstracts the complement of a collection, primarily that `x in c` <=> `!(x in c.collection)`. This commit punts on indexing for more than two dimensions because that turns out to be an incredibly invasive change to huge amounts of the multidimensional array indexing code.

ViralBShah · 2015-04-16T17:21:21Z

I am closing this, since this is something that can easily live in a package. Please reopen if you think otherwise.

nalimilan · 2015-04-16T17:36:35Z

Well, if it was to be implemented, it would make much more sense in Base than in a package, since it would affect all indexing.

Would the recent changes in array indexing make this easier to implement efficiently for any number of dimensions? @timholy @mbauman?

timholy · 2015-04-16T17:39:53Z

I'd suggest putting this into SubArray as a valid index type---eventually A[negated(1:15,3), :] would return a SubArray anyway.

mbauman · 2015-04-16T17:47:34Z

Well, #10525 would allow us to define this once and transform the complement to direct indices (or potentially even doing so without any intermediates).

One thing I realized the other day is that we almost have a negated index type already in base. It's the complement IntSet. If IntSets were not in the range 0:(typemax(Int)-1), but instead 1:typemax(Int), it'd actually simplify quite a bit of their implementation. And it'd make IntSet much more useful for indexing… especially if we start using BitArrays as their bit-storage.

nalimilan · 2015-04-16T21:02:41Z

Cool. But AFAIK the order of elements is not defined for IntSet, while for indexing it really matters.

mbauman · 2015-04-16T21:13:09Z

It's documented as being sorted, which seems sensible.

mbauman · 2015-04-16T21:24:23Z

Ah, and yes, my changes in #10331 for indexing with Colon fix Jeff's concerns above.

tpapp · 2016-04-05T06:21:06Z

Would #13157 allow "negated" indexes for array-like constructs with named indexes? (which map keys to indexes, like NamedArray, which already has its Not construct)

mbauman · 2016-04-05T16:31:43Z

Not directly, no. Both SubArray and fallback non-scalar indexing have converged to simply re-index into the stored or passed indices. The difficulty with using IntSets and complement types as indices is that they do not directly support fast indexed access. I don't want more special cases, so that means doing what we do with logical vectors: transform them to normal index vectors after checking their bounds.

Perhaps the only action item for Base is to change the Base.to_index API to accept the array and index dimension so there's enough context to transform a negated index. That would allow this to live in packages and work uniformly. It still requires some funny business, though, since a NegatedIndex must be an AbstractArray to dispatch properly, but it'd really only implement checkbounds and to_index.
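Julia 1.x has a hook of roughly this kind: `Base.to_indices(A, inds, I)` is passed the array, the axes not yet consumed, and the remaining indices. The sketch below assumes that hook and uses a hypothetical `Neg` type; it ignores bounds checking of the skipped values and trailing dimensions, so it illustrates the mechanism rather than a complete implementation:

```julia
# Hypothetical index type; with this hook it does not need to be an AbstractArray.
struct Neg{T}
    skip::T
end

# Convert a leading Neg into plain integer indices for the first remaining
# axis, then let Base convert whatever indices follow it.
function Base.to_indices(A, inds, I::Tuple{Neg, Vararg{Any}})
    keep = [i for i in inds[1] if !(i in I[1].skip)]
    return (keep, Base.to_indices(A, Base.tail(inds), Base.tail(I))...)
end

A = [1 2 3; 4 5 6; 7 8 9]
B = A[Neg(2), :]        # rows 1 and 3
C = A[:, Neg(1:2)]      # column 3 only, as a 3×1 matrix

v = [10, 20, 30, 40]
w = v[Neg([1, 4])]      # the middle two elements
```

Because the conversion happens before the regular indexing code runs, the result is an ordinary index vector, in line with the treatment of logical vectors described above.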

StefanKarpinski · 2018-08-19T18:20:18Z

See https://github.com/mbauman/InvertedIndices.jl for an implementation of this idea as a package.

nalimilan · 2018-09-23T16:21:49Z

Is there any chance this would be included in Base? People keep asking about it:
https://stackoverflow.com/questions/52378061/julias-negative-complement-indexing-like-r
https://stackoverflow.com/questions/42382210/array-range-complement

StefanKarpinski mentioned this issue Feb 8, 2013

Negated indexing JuliaData/DataFrames.jl#182

Closed

simonster mentioned this issue Nov 6, 2013

ENH: Allow use of colon operator to slice ranges by column names JuliaData/DataFrames.jl#393

Closed

punkrockpolly added a commit to punkrockpolly/Playing-with-Julia that referenced this issue Nov 7, 2013

WIP: Issue #1032 (JuliaLang/julia#1032)

4c797ac

punkrockpolly added a commit to punkrockpolly/Playing-with-Julia that referenced this issue Nov 10, 2013

WIP: JuliaLang/julia#1032

bcdd56f

Mostly works on the following index types: (Int, Range1{Int}, Range{Int}) Does not yet work for index of Array{Int}

nalimilan mentioned this issue Nov 18, 2013

Add Names type to allow selecting elements by excluding others davidavdav/NamedArrays.jl#1

Merged

JeffBezanson mentioned this issue Nov 22, 2013

Complement type, possible approach to #1032. #4892

Closed

JeffBezanson mentioned this issue Aug 10, 2014

Towards array nirvana #7941

Closed

15 tasks

ViralBShah removed the feature label Feb 14, 2015

ViralBShah closed this as completed Apr 16, 2015

mbauman mentioned this issue Apr 16, 2015

WIP: Refactor IntSets to use BitVectors #10065

Closed

2 tasks

mbauman mentioned this issue Aug 5, 2015

Deprecate IntSet complement and stored zeros #12270

Merged

mbauman mentioned this issue Sep 16, 2015

Arraypocalypse Now and Then #13157

Closed

27 tasks

JaredCrean2 mentioned this issue Apr 5, 2016

Julep: solving tricky iteration problems #15648

Closed

jsams mentioned this issue Sep 29, 2017

Speed up opportunities? JuliaData/InvertedIndices.jl#1

Closed

mbauman mentioned this issue Jul 25, 2018

Negative indexing like R #28276

Closed
