weighted logsumexp #101

mileslucas · 2020-08-09T13:52:39Z

add weighted logsumexp function
bump version 0.9.5 => 0.9.6

I ran into a use case where I needed a weighted sum inside the logsumexp function while porting some Python code.
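
For reference, a weighted logsumexp is usually defined as below. This assumes the convention in which the weights multiply the exponentials rather than the inputs; the PR's docstring is not shown in this thread, so treat the notation as an assumption and not as the package's definition. Shifting by the maximum u is the standard trick that keeps the exponentials from overflowing.

```latex
\operatorname{logsumexp}(x, w)
  = \log \sum_i w_i \, e^{x_i}
  = u + \log \sum_i w_i \, e^{x_i - u},
\qquad u = \max_i x_i
```

With negative weights the inner sum has to stay positive for the result to be real. For the benchmark inputs x = [1, 2] and w = [-0.5, 0.5], the sum is 0.5 (e^2 - e^1), which is positive.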

There are no performance regressions in the new version. I also removed the TODO about special-casing on the value u, because without that special-casing the performance is much worse (30 ns -> 1 us) for cases where dims=:.

Here are the benchmarks I ran to confirm that the simplest case is still super fast and allocation-free:

julia> @benchmark logsumexp($([1.0, 2.0]))
BenchmarkTools.Trial:
  memory estimate:  0 bytes
  allocs estimate:  0
  --------------
  minimum time:     32.608 ns (0.00% GC)
  median time:      33.277 ns (0.00% GC)
  mean time:        34.576 ns (0.00% GC)
  maximum time:     101.738 ns (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     993

julia> @benchmark logsumexp($([1.0, 2.0]), $([-0.5, 0.5]))
BenchmarkTools.Trial:
  memory estimate:  0 bytes
  allocs estimate:  0
  --------------
  minimum time:     36.436 ns (0.00% GC)
  median time:      37.050 ns (0.00% GC)
  mean time:        38.228 ns (0.00% GC)
  maximum time:     150.541 ns (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     992

bkamins · 2020-08-09T20:04:57Z

src/basicfuns.jl

 end
 end

+function logsumexp(X::AbstractArray{T}, W; dims=:) where {T<:Real}


In the docstring you say W is an array, but there is no type restriction in the signature. I think it would be cleaner to have one, especially since in your code W and X do not have to have matching dimensions, and when they do not, zip(X, W) will not iterate over every element of X in corner cases.
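
The zip corner case mentioned above can be seen in a small sketch. The arrays here are made up for illustration and are not taken from the PR; the only behavior relied on is that Julia's zip stops as soon as its shortest input is exhausted.

```julia
# Hypothetical inputs: three values but only two weights.
X = [1.0, 2.0, 3.0]
W = [0.5, 0.5]

# zip ends with the shorter collection, so X[3] is silently skipped
# instead of raising an error.
visited = collect(zip(X, W))
n_visited = length(visited)
```

A weighted sum built on this iteration would quietly drop the last value, which is the kind of silent mismatch an explicit size check rules out.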

mileslucas · 2020-08-09

Added words and examples.

nalimilan · 2020-09-01

Can you just restrict this to ::AbstractArray? I don't see the point of allowing any other type (and we don't in StatsBase). That also allows simplifying the docs. I also don't like allowing broadcasting an array of any dimensions, as currently the weighted sum over dimensions in StatsBase does something different (it applies the weights over the dimension which is being summed over). And undefined behavior is not something we generally accept in Julia packages. So better require the two arrays to have the same size for now.

mileslucas · 2020-09-01

@nalimilan I didn't want to over-specialize because then I couldn't use a call like logsumexp([1, 2], (0.5, 0.3)). This is useful for me because my weights are static in the calculations I need, so I prefer to use a Tuple.

As for making the sizes the same, that would disallow broadcasting like
logsumexp(randn(3, 4), rand(1, 4), dims=1). Leaving out the size check keeps the function more flexible.

Perhaps your concerns could be alleviated by thinking about combining some of David's work in #97 here. The only reason I latch onto this array method when my use case will be tuples is that the non-array method currently is pretty slow for my use.
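
The broadcasting form in the example above could be sketched as follows. This is an illustration in plain Julia rather than the PR's implementation, and all variable names are hypothetical. It reduces over dims=1 with 1×4 weights that broadcast down each column.

```julia
# Illustrative sketch only; not the implementation from this PR.
X = reshape(collect(1.0:12.0), 3, 4)   # 3×4 values
W = [0.1 0.2 0.3 0.4]                  # 1×4 weights, broadcast down each column

u = maximum(X, dims=1)                 # per-column maximum, size (1, 4)
weighted = u .+ log.(sum(W .* exp.(X .- u), dims=1))

# Each column shares a single weight here, so the result is the unweighted
# column-wise logsumexp shifted by log(weight).
unweighted = u .+ log.(sum(exp.(X .- u), dims=1))
```

In this particular layout the weights are constant along the reduced dimension, so they only add log(W) to each column's result.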

nalimilan · 2020-09-02

> @nalimilan I didn't want to over-specialize because then I couldn't use a call like logsumexp([1, 2], (0.5, 0.3)). This is useful for me because my weights are static in the calculations I need so I prefer to use a Tuple.

Could you use StaticArrays instead of tuples? I tend to prefer restrictive APIs unless we have a strong reason to be more flexible, as we can easily end up allowing things that weren't intended, and which can even return incorrect results.

> As for the making the sizes the same, that would disallow broadcasting like
> logsumexp(randn(3, 4), rand(1, 4), dims=1). By not adding a size check it's more flexible.

As I said the problem is that the definition would be inconsistent with StatsBase, which we should really avoid. With StatsBase conventions, logsumexp(randn(3, 4), rand(1, 4), dims=1) would be written as just logsumexp(randn(3, 4), rand(4), dims=1), but it's a bit more complicated to implement. So again unless there's a strong reason

> Perhaps your concerns could be alleviated by thinking about combining some of David's work in #97 here. The only reason I latch onto this array method when my use case will be tuples is that the non-array method currently is pretty slow for my use.

That wouldn't really change anything AFAICT. With non-AbstractArray weights you can't check the size of the input to make sure it makes sense and that there's a one-to-one correspondence between values and weights.

mileslucas · 2020-09-10

Okay, well I don't really agree that this is better, but I've fully restricted the function to only work with abstract arrays which have exactly the same size. I've updated the tests and docstring, too.
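
A minimal sketch of what such a restricted method might look like, assuming the definition log(sum(w * exp(x))) and the dims=: case only. The name weighted_logsumexp is hypothetical (the PR adds a method to logsumexp itself), and edge cases such as empty input or an infinite maximum are not handled.

```julia
# Hypothetical helper; not the code from this PR.
function weighted_logsumexp(X::AbstractArray{<:Real}, W::AbstractArray{<:Real})
    # Require a one-to-one correspondence between values and weights.
    size(X) == size(W) ||
        throw(DimensionMismatch("X and W must have the same size"))
    u = maximum(X)  # shift by the maximum for numerical stability
    s = sum(w * exp(x - u) for (x, w) in zip(X, W))
    return u + log(s)
end

# Same inputs as the second benchmark in the PR description.
result = weighted_logsumexp([1.0, 2.0], [-0.5, 0.5])
```

With the size check in place, mismatched inputs raise a DimensionMismatch instead of being truncated by zip.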

andreasnoack · 2022-02-18T12:19:25Z

This should now be opened against https://github.com/JuliaStats/LogExpFunctions.jl since logsumexp has moved there.

mileslucas added 3 commits August 9, 2020 08:45

add weighted logsumexp function

339de72

bump version 0.9.5 => 0.9.6

7fa1da8

remove W specialization

95d50cf

mschauer approved these changes Aug 9, 2020


bkamins reviewed Aug 9, 2020


mileslucas added 3 commits August 9, 2020 15:17

expand logsumexp docstring

b43cc22

restrict type and size of weights in logsumexp

962378e

fix docstring

5f26681

devmotion mentioned this pull request Aug 31, 2021

Weighted logsumexp JuliaStats/LogExpFunctions.jl#24

Open

andreasnoack closed this Feb 18, 2022


bkamins Aug 9, 2020

mileslucas Aug 9, 2020

nalimilan Sep 1, 2020

mileslucas Sep 1, 2020 •

edited

Loading

nalimilan Sep 2, 2020

mileslucas Sep 10, 2020
