traverse Array with Maybe more quickly #142

andrewthad · 2018-04-20T15:46:04Z

WIP. I still need to handle SmallArray. I also need to make this work with Either, but the backwards-compatibility story there is more annoying because of the historical absence of ExceptT. I also want to do this for State. Probably not Writer though since it has abysmal performance anyway.

treeowl · 2018-04-22T05:11:43Z

I like this. For Either, you can either copy the basics of ExceptT from transformers or (perhaps better) just manually work it into the code for traverseArrayP.

Side note: it's most unfortunate for RULES that we don't have, e.g., Applicative (ExceptT e m) :- Applicative m, and that we have no way to match on the use of a particular instance dictionary either. So all these rules have to be pretty much monomorphic.
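To make the "pretty much monomorphic" point concrete, here is a minimal sketch of such a rule. It uses plain lists rather than primitive's Array so that it depends only on base, and the names traverseList and traverseListMaybe are hypothetical, not taken from this PR. The rule can only match when the applicative is exactly Maybe; it cannot be stated once for every transformer stack.

```haskell
{-# LANGUAGE ScopedTypeVariables #-}

-- Generic traversal: works for any Applicative, building the result
-- through the instance's (<*>).
traverseList :: Applicative f => (a -> f b) -> [a] -> f [b]
traverseList f = foldr (\x acc -> (:) <$> f x <*> acc) (pure [])
{-# NOINLINE traverseList #-}
-- NOINLINE keeps the call visible so the rule below has a chance to match.

-- Hand-written Maybe version: a plain loop with an early exit.
traverseListMaybe :: (a -> Maybe b) -> [a] -> Maybe [b]
traverseListMaybe f = go []
  where
    go acc [] = Just (reverse acc)
    go acc (x : xs) = case f x of
      Nothing -> Nothing
      Just y -> go (y : acc) xs

-- Monomorphic rewrite rule: it matches only at the Maybe instance.
{-# RULES
"traverseList/Maybe" forall (f :: a -> Maybe b).
  traverseList f = traverseListMaybe f
  #-}
```

Rules are only applied when optimization is on (-O), and -ddump-rule-firings is one way to check whether this one fired. Both functions are meant to return the same result on finite lists, so the rule should not change semantics.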

andrewthad · 2018-04-22T14:23:15Z

I've just added Either, State, and Reader and some documentation of what's going on here. To keep things simple, I think I'll just disable the Either optimization if the user builds with transformers older than 0.4. I suspect that creating copies of traverseArrayP for all three of these types that manually inline the monad instance dictionaries (eschewing the use of the monad transformers entirely) doesn't actually cause GHC to generate better code at the use site. I'll add a benchmark soon to confirm this.

Aside from that, are there other issues you see with this? Other types worth supporting?

treeowl · 2018-04-22T14:37:52Z

I don't understand why you think the manual copies (with rewrite rules to match) wouldn't help.

andrewthad · 2018-04-22T14:58:32Z

I think GHC will produce the same core.

treeowl · 2018-04-22T15:00:38Z

Quite possibly, but you won't be depending on having ExceptT in transformers.

andrewthad · 2018-04-22T19:45:35Z

Ah, it was consistent behavior with all versions of transformers that was your concern. I've changed it as you suggested. Also, I added a benchmark just to see if there was a performance difference between the two implementations. They have identical performance (Wooh GHC!).

treeowl · 2018-04-22T22:21:34Z

All these rewrite rules (including the ones I wrote) feel somewhat unsatisfactory, because they completely fall down for transformers. I'm wondering if there might be a way to catch known transformations of known base monads, using rewrite rules with fall-backs. Roughly speaking, rewrite applications of known transformers recursively until we reach a known base monad (in which case we can rewrite to something particularly efficient) or fail to do so, in which case we ultimately inline to the usual case.

By the way: what sorts of benchmark speed-ups were you able to demonstrate from the Maybe and Either e rules?

andrewthad · 2018-04-23T12:20:01Z

I've just added a benchmark to compare the two:

benchmarked Array/implementations/traverse/Either/inlined
time                 36.32 μs   (31.90 μs .. 41.36 μs)
                     0.928 R²   (0.884 R² .. 0.998 R²)
mean                 32.20 μs   (31.49 μs .. 33.72 μs)
std dev              3.534 μs   (1.644 μs .. 5.927 μs)
variance introduced by outliers: 65% (severely inflated)

benchmarked Array/implementations/traverse/Either/closure
time                 172.3 μs   (166.3 μs .. 179.7 μs)
                     0.981 R²   (0.965 R² .. 0.994 R²)
mean                 168.8 μs   (166.1 μs .. 172.5 μs)
std dev              10.94 μs   (7.326 μs .. 15.23 μs)
variance introduced by outliers: 42% (moderately inflated)

This is on a pretty noisy box, but there's about a 5x speedup. On PrimArray, it's much more pronounced:

benchmarked PrimArray/traverse/Maybe/Applicative
time                 1.215 ms   (1.191 ms .. 1.237 ms)
                     0.996 R²   (0.991 R² .. 0.999 R²)
mean                 1.181 ms   (1.171 ms .. 1.195 ms)
std dev              41.17 μs   (30.35 μs .. 68.63 μs)
variance introduced by outliers: 17% (moderately inflated)

benchmarked PrimArray/traverse/Maybe/PrimMonad
time                 5.985 μs   (5.817 μs .. 6.189 μs)
                     0.996 R²   (0.993 R² .. 0.999 R²)
mean                 5.858 μs   (5.827 μs .. 5.921 μs)
std dev              135.6 ns   (82.21 ns .. 233.3 ns)

About a 200x speedup on my noisy box. This is because, for PrimArray, the specialized traversals do zero allocations (other than the original allocation of the new array).
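A rough sketch of what a specialized Maybe traversal over an unboxed array can look like. This is an illustration only: it uses UArray and STUArray from the array package as a stand-in for PrimArray (an assumption, since the PR works on primitive's own types), and traverseMaybeU is a hypothetical name. Each result is written straight into the destination array, so no intermediate list or chain of closures is built.

```haskell
import Control.Monad.ST (ST, runST)
import Data.Array.ST (STUArray, newArray_, writeArray)
import Data.Array.Unboxed (UArray, bounds, elems, listArray, (!))
import Data.Array.Unsafe (unsafeFreeze)

-- Traverse an unboxed array with a Maybe-returning function.
-- Stops at the first Nothing; otherwise freezes the filled array.
traverseMaybeU :: (Int -> Maybe Int) -> UArray Int Int -> Maybe (UArray Int Int)
traverseMaybeU f arr = runST $ do
  let (lo, hi) = bounds arr
  marr <- newArray_ (lo, hi) :: ST s (STUArray s Int Int)
  let go i
        | i > hi = pure True
        | otherwise = case f (arr ! i) of
            Nothing -> pure False          -- abort, result is discarded
            Just y -> writeArray marr i y >> go (i + 1)
  ok <- go lo
  -- unsafeFreeze is fine here because marr is never touched again.
  if ok then Just <$> unsafeFreeze marr else pure Nothing

-- Small usage example: increment every element, failing on non-positives.
example :: Maybe [Int]
example =
  elems <$> traverseMaybeU
    (\x -> if x > 0 then Just (x + 1) else Nothing)
    (listArray (0, 3) [1, 2, 3, 4])
```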

I like your idea about recursively rewriting. I'm going to give this a try.

andrewthad · 2018-04-23T19:36:23Z

> All these rewrite rules (including the ones I wrote) feel somewhat unsatisfactory, because they completely fall down for transformers. I'm wondering if there might be a way to catch known transformations of known base monads, using rewrite rules with fall-backs.

I can't find a way to make this work. Consider what a rewrite rule for MaybeT might look like:

"traverse/MaybeT" forall (f :: a -> MaybeT m b). traverseArray f = ...

Here's the problem: in the rewrite rule, we have no way to tell GHC that m needs to have a PrimMonad constraint. GHC appears to have a syntax that tricks the user into thinking they can do this, but it doesn't actually work.

andrewthad · 2018-04-23T19:43:30Z

In fact, in the rewrite rule, GHC doesn't even know that m has an Applicative instance.

treeowl · 2018-04-23T19:43:41Z

You certainly can't do it that way. I'm not sure if you can do it at all or not, but if so it will require rewrite rules "all the way down" to a base type a rule matches on.

treeowl · 2018-04-23T19:56:12Z

The basic idea would be to step through the layers of monad transformers you recognize, recording how each one transforms a monad. Then at the bottom, find a base monad you recognize and build up the ultimate bind and return. I'm not sure this can be done, but I strongly suspect it can.

andrewthad · 2018-04-23T20:06:57Z

I'm still not able to follow how this is supposed to look. Going back to MaybeT:

"traverse/MaybeT" forall (f :: a -> MaybeT m b). traverseArray f = ...

I cannot see anything that could go on the RHS (other than traverseArray f) that is both correct and satisfies the type checker.

andrewthad · 2018-04-23T20:07:43Z

Wait, I think I might be beginning to see a way.

treeowl · 2018-04-23T20:12:45Z

> Wait, I think I might be beginning to see a way.

Me too 😉

treeowl · 2018-04-23T20:14:40Z

The race is on.

treeowl · 2018-04-23T20:50:31Z

I think we can get MaybeT, ExceptT, and StateT, but not WriterT, RWST, or ErrorT (since those have side constraints). I haven't actually tested my rules, though, so I don't know if they fire or, if they do, whether they produce decent code.

andrewthad · 2018-04-23T20:53:38Z

Have you pushed these somewhere?

treeowl · 2018-04-23T21:06:02Z

No. Thus far they're pretty much faked up. When I'm home I'll (try to) make them a bit less fake and push to show you.

treeowl · 2018-04-23T22:08:08Z

Until then, I strongly urge you to give it a good try. You may well come up with a better idea than I did, and in any case you'll certainly learn some things.

treeowl · 2018-04-24T01:21:39Z

Hrmm... I'm able to write code that at least type checks to handle (some) transformers on top of ST and IO, but I'm actually not sure how to handle transformers on top of Maybe, Either, Identity, etc. Do you think it's doable? If not for the annoying limits on instance discovery, do you think you could write rules? Maybe those could inspire. Anyway, I'll upload what I have within the hour.

cartazio · 2018-04-24T03:43:01Z

It feels like we’re bordering on features / ideas for improving rules-based optimization. What is it we wish we could say, or can currently only express via hand-scripted hermit optimizations? David: have you tried out hermit?

treeowl · 2018-04-24T03:50:20Z

@cartazio, I've only looked at hermit briefly. It's rather complicated. The basic limitation is that the instance resolution derivations drop away after type checking. So, for example, the simplifier doesn't know how GHC resolved PrimMonad (WriterT w m) and therefore doesn't know Monad m or Monoid w.

cartazio · 2018-04-24T14:45:01Z

Are there ways this could be changed in GHC? Seems like it would at least be a useful discussion, as a feature request motivated by the optimizations you want to be able to express simply. I’d like to keep primitive simple if we can. At some point, too much rules-based optimization can hinder ease of changing internals. Though we don’t have much in the way of internals going on :)

treeowl · 2018-04-26T22:50:58Z

For stacks based on ST or IO, my version seems much simpler, and involves far fewer enormous piles of code that need to go away. I suggest you just ignore those stacks altogether and try to come up with a solution for stacks based on Maybe, Either, Identity, etc. It should be easy to use both solutions at the same time.

treeowl · 2018-04-26T22:53:40Z

Separately, I'm considering a simpler general version that might work somewhat better than the one I put in before. In particular, we can go back to something somewhat list-like, but using a list that holds multiple elements per cons. I need to do some benchmarking.

andrewthad · 2018-04-27T00:04:23Z

Undoubtedly, yours is much simpler. In my most recent commit, I was mostly playing around just to see what it takes to make things like Maybe and Either accepted as the base monad. It’s helped me understand the problem better. I have another idea for how to extend your solution to accept different base monads that I am going to try soon.

cartazio · 2018-04-27T16:22:06Z

Question: why do we need the rewrite rules?

a) is it because we can better specialize the code?

b) is it because you want to be "affine/linear resource safe"?

treeowl · 2018-04-27T16:35:55Z

We want rewrite rules for speed. They're not supposed to change semantics any.

treeowl · 2018-04-27T19:11:32Z

@cartazio, the fast way to traverse an array is to

1. Create a mutable array.
2. For each element of the given array,
   a. Perform an action to get an element and
   b. Write that element to the mutable array.
3. Freeze the mutable array and return the resulting immutable array.

We'd like traverse to do that whenever it can, but it's not always possible. Even ignoring safety, note that Compose m n is not generally a PrimMonad, or even a Monad, even if m and n both are.
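The create/write/freeze steps listed above can be sketched for the simplest case, where the action already lives in ST. This is only an illustration: it uses Array and STArray from the array package instead of primitive's types, and traverseST is a hypothetical name.

```haskell
import Control.Monad.ST (ST, runST)
import Data.Array (Array, bounds, elems, listArray, (!))
import Data.Array.ST (STArray, newArray_, writeArray)
import Data.Array.Unsafe (unsafeFreeze)

-- Traverse an immutable array with an ST action.
traverseST :: (a -> ST s b) -> Array Int a -> ST s (Array Int b)
traverseST f arr = do
  let (lo, hi) = bounds arr
  -- 1. Create a mutable array of the same size.
  marr <- newArray_ (lo, hi) :: ST s (STArray s Int b)
  -- 2. For each element, run the action and write its result.
  let go i
        | i > hi = pure ()
        | otherwise = do
            y <- f (arr ! i)
            writeArray marr i y
            go (i + 1)
  go lo
  -- 3. Freeze without copying; marr is not used afterwards.
  unsafeFreeze marr

-- Usage: double every element.
example :: [Int]
example = elems (runST (traverseST (\x -> pure (x * 2)) (listArray (0, 2) [1, 2, 3])))
```

This direct approach depends on being able to run the write in the same monad as the action, which is why it does not carry over to an arbitrary Applicative.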

cartazio · 2018-04-27T21:41:22Z

I'm not terribly familiar with Compose, could you walk me through an example?

cartazio · 2018-04-27T21:41:29Z

also where does Compose live?

cartazio · 2018-04-27T21:43:34Z

hrmm

if this is the correct definition of Compose

-- | Right-to-left composition of functors.
-- The composition of applicative functors is always applicative,
-- but the composition of monads is not always a monad.
newtype Compose f g a = Compose { getCompose :: f (g a) }

wouldn't this require PrimMonad for just one of these?

cartazio · 2018-04-27T21:44:48Z

hrmmm, that makes me think that we can't have an instance for Compose, because there should only be one PrimMonad in the stack ... i think?

cartazio · 2018-04-27T21:45:35Z

(something something global approximate linearity of state token)

cartazio · 2018-04-27T21:53:40Z

i guess i'm perhaps not understanding why Compose the data type/monad enters into this, but i'll have to look at more details

treeowl · 2018-04-27T22:01:51Z

Not even compositions of pure monads are necessarily monads. As my PR shows, we can do good things for a lot of transformer stacks built on ST and IO. Andrew has been working on an idea for stacks built on Identity, Either, and Maybe (and perhaps even (a,)), but it's not ready yet. Compose is a generally useful way to construct Applicatives, which people certainly use with traverse, but I don't know if we'll be able to optimize for it or not.

treeowl · 2018-04-27T23:34:17Z

The composition law for Traversable does seem to suggest we can probably do something smart about Compose, actually. We know that

traverse (Compose . fmap g . f) = Compose . fmap (traverse g) . traverse f

In particular,

traverse (Compose . f) = Compose . fmap sequenceA . traverse f

So we can turn one traversal into two, which sounds bad but is probably actually good. For *pure* Applicatives, I conjecture that we can also turn two traversals into one efficient one. Anyway, that's the start of a thought in that direction. Still need to think it through a bit more.
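As a small check of the law quoted above, the following sketch instantiates it for lists with Maybe as the outer applicative and [] as the inner one. The functions outer, inner, and both are made up for the example; only traverse, sequenceA, and Compose come from base.

```haskell
import Data.Functor.Compose (Compose (..))

-- Outer effect: fails on non-positive inputs.
outer :: Int -> Maybe Int
outer x = if x > 0 then Just x else Nothing

-- Inner effect: a nondeterministic choice.
inner :: Int -> [Int]
inner x = [x, x * 10]

-- Left-hand side of the law: one traversal in the composed applicative.
lhs :: [Int] -> Compose Maybe [] [Int]
lhs = traverse (Compose . fmap inner . outer)

-- Right-hand side: a traversal in Maybe followed by a traversal in [].
rhs :: [Int] -> Compose Maybe [] [Int]
rhs = Compose . fmap (traverse inner) . traverse outer

-- The special case with a single function producing both layers.
both :: Int -> Maybe [Int]
both = fmap inner . outer

lhs2, rhs2 :: [Int] -> Compose Maybe [] [Int]
lhs2 = traverse (Compose . both)
rhs2 = Compose . fmap sequenceA . traverse both
```

Comparing getCompose (lhs xs) with getCompose (rhs xs) for a few inputs, including one containing a non-positive number, is a quick way to see the two sides agree.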

traverse Array with Maybe more quickly

91f599b

add rewrite rules for traversing with Either, State, and Reader

7528b97

andrewthad added 2 commits April 22, 2018 14:39

benchmark difference between different implementations of array traversal specialized to Either

ec7609b

make rewrite rule for Array traversal with Either not require ExceptT

e64e576

andrewthad added 2 commits April 23, 2018 08:11

document reason for specialized either traversal in benchmark suite

302d5b9

add benchmark to compare performance of specialized either traversal of Array to stock applicative traversal

68a0be8

composable traverse rewrites almost working

165856c
