Added 3- and 5- argument LinearAlgebra.mul! and copy functions #37

matteoacrossi · 2020-03-05T10:46:17Z

No description provided.

codecov · 2020-03-05T10:48:39Z

Codecov Report

Merging #37 into master will increase coverage by 0.67%.
The diff coverage is 100%.

@@            Coverage Diff             @@
##           master      #37      +/-   ##
==========================================
+ Coverage    94.9%   95.58%   +0.67%     
==========================================
  Files           4        4              
  Lines         157      181      +24     
==========================================
+ Hits          149      173      +24     
  Misses          8        8

Impacted Files	Coverage Δ
src/blockdiagonal.jl	`83.72% <100%> (+2.63%)`	⬆️
src/linalg.jl	`98.03% <100%> (+0.6%)`	⬆️
src/base_maths.jl	`100% <0%> (ø)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 234a99c...ab7f8fe. Read the comment docs.

nickrobinson251

Thanks a lot for this!

Please can you post what the current behaviour of mul! with 3 BlockDiagonals is? Does it error, get the wrong results, or is it just not as performant as possible?
If the last one, please can you post some benchmarks indicative of the speed up from the change?

src/blockdiagonal.jl

src/linalg.jl

matteoacrossi · 2020-03-05T15:24:07Z

Currently mul! falls back to the standard LinearAlgebra one and I think it converts the BlockDiagonal matrices to regular Matrix. If the matrices are both BlockDiagonal and have the same block structure, it is much faster to perform a block-by-block multiplication:

using BlockDiagonals
using LinearAlgebra
using BenchmarkTools

# 3-Argument mul!
mul2!(C::BlockDiagonal, A::BlockDiagonal, B::BlockDiagonal) = mul2!(C, A, B, true, false)

# 5-Argument mul!
function mul2!(C::BlockDiagonal, A::BlockDiagonal, B::BlockDiagonal, α::Number, β::Number)
    BlockDiagonals.isequal_blocksizes(A, B) || throw(DimensionMismatch("A and B have different block sizes"))
    BlockDiagonals.isequal_blocksizes(C, A) || throw(DimensionMismatch("C has incompatible block sizes"))
    for i in 1:length(blocks(C))
        LinearAlgebra.mul!(C.blocks[i], A.blocks[i], B.blocks[i], α, β)
    end
    return C
end

N1, N2, N3 = 30, 40, 50
b1 = BlockDiagonal([rand(N1, N1), rand(N2, N2), rand(N3, N3)])
b2 = BlockDiagonal([rand(N1, N1), rand(N2, N2), rand(N3, N3)])
b3 = similar(b1);

This is with the current master of BlockDiagonals

@benchmark mul!(b3, b1, b2)

BenchmarkTools.Trial: 
  memory estimate:  39.06 MiB
  allocs estimate:  388823
  --------------
  minimum time:     19.184 ms (0.00% GC)
  median time:      28.789 ms (11.61% GC)
  mean time:        29.916 ms (10.99% GC)
  maximum time:     52.644 ms (11.49% GC)
  --------------
  samples:          168
  evals/sample:     1

This is with the specialized function

@benchmark mul2!(b3, b1, b2)

BenchmarkTools.Trial: 
  memory estimate:  768 bytes
  allocs estimate:  17
  --------------
  minimum time:     20.210 μs (0.00% GC)
  median time:      21.957 μs (0.00% GC)
  mean time:        27.255 μs (0.00% GC)
  maximum time:     834.732 μs (0.00% GC)
  --------------
  samples:          10000
  evals/sample:     1

matteoacrossi · 2020-03-05T15:25:11Z

But I do realize now that maybe the specialized code should fall back to the standard matrix multiplication if the block sizes don't match, but the overall matrix sizes do, instead of throwing an error.

matteoacrossi · 2020-03-06T10:52:12Z

I implemented the proposed changes. After all, the mul!(C::BlockDiagonal, A::BlockDiagonal, B::BlockDiagonal) should error if the block structures do not match. If C is not BlockDiagonal, then the method automatically falls back to regular matrix multiplication (although there well be a lot of allocations).

nickrobinson251

This looks great! Thanks a lot.

I bunch of very minor style suggestions.

Please can you bump the version in the Project.toml? Then I will release the new version as soon as this is merged :) Since this is just a performance improvement, and not breaking, we can just bump the last version number. Thanks again!

src/linalg.jl

test/linalg.jl

src/linalg.jl

src/blockdiagonal.jl

Co-Authored-By: Nick Robinson <npr251@gmail.com>

matteoacrossi added 4 commits February 3, 2020 16:33

Added 3 and 5 arg mul!

32913d8

Added copy and fill functions

1f6274b

Slightly better loop

58ae423

Removed fill!

5810f2e

nickrobinson251 self-requested a review March 5, 2020 10:59

nickrobinson251 mentioned this pull request Mar 5, 2020

speed up multiplication with BlockArrays #34

Open

nickrobinson251 reviewed Mar 5, 2020

View reviewed changes

src/blockdiagonal.jl Outdated Show resolved Hide resolved

src/blockdiagonal.jl Outdated Show resolved Hide resolved

src/linalg.jl Outdated Show resolved Hide resolved

nickrobinson251 mentioned this pull request Mar 5, 2020

Add benchmarks #19

Open

matteoacrossi added 4 commits March 6, 2020 11:12

Use module qualification rather than import

9b2e275

Removed copyto!

fae44fe

5-arg mul only for VERSION >= 1.3

e779339

Fixed mul! for VERSION < 1.3

2a9b50b

nickrobinson251 approved these changes Mar 7, 2020

View reviewed changes

matteoacrossi and others added 2 commits March 7, 2020 19:25

Apply suggestions from code review

9dcf2da

Co-Authored-By: Nick Robinson <npr251@gmail.com>

Increase version

ab7f8fe

nickrobinson251 merged commit 9fdcb28 into JuliaArrays:master Mar 7, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added 3- and 5- argument LinearAlgebra.mul! and copy functions #37

Added 3- and 5- argument LinearAlgebra.mul! and copy functions #37

matteoacrossi commented Mar 5, 2020

codecov bot commented Mar 5, 2020 •

edited

Loading

nickrobinson251 left a comment

matteoacrossi commented Mar 5, 2020

matteoacrossi commented Mar 5, 2020

matteoacrossi commented Mar 6, 2020

nickrobinson251 left a comment

Added 3- and 5- argument LinearAlgebra.mul! and copy functions #37

Added 3- and 5- argument LinearAlgebra.mul! and copy functions #37

Conversation

matteoacrossi commented Mar 5, 2020

codecov bot commented Mar 5, 2020 • edited Loading

Codecov Report

nickrobinson251 left a comment

Choose a reason for hiding this comment

matteoacrossi commented Mar 5, 2020

matteoacrossi commented Mar 5, 2020

matteoacrossi commented Mar 6, 2020

nickrobinson251 left a comment

Choose a reason for hiding this comment

codecov bot commented Mar 5, 2020 •

edited

Loading