fix onehot gpu #1441

CarloLucibello · 2020-12-27T14:11:28Z

Fix #556, fix #582, using a GPU friendly reduction.
Benchmarked with the following script (onecold2 is the new version)

using CUDA
using Flux
using Flux: onehotbatch, onecold, OneHotMatrix
using BenchmarkTools
CUDA.allowscalar(false)

onecold2(y::AbstractMatrix, labels=1:size(y,1)) =
  vec(map(x -> labels[x[1]], argmax(y; dims=1)))

onecold2(y::OneHotMatrix, labels...) = 
    map(x -> Flux.onecold(x, labels...), y.data)

function accuracy_v1a(oh, ŷ)
    mean(onecold(oh) .== onecold(ŷ))
end

function accuracy_v1b(oh, ŷ)
    mean(onecold(cpu(oh)) .== onecold(cpu(ŷ)))
end

function accuracy_v1c(y, ŷ)
    mean(cpu(y) .== onecold(cpu(ŷ)))
end

function accuracy_v2a(oh, ŷ)
    mean(onecold2(oh) .== onecold2(ŷ))
end

function accuracy_v2b(oh, ŷ)
    mean(onecold2(cpu(oh)) .== onecold2(cpu(ŷ)))
end

function accuracy_v2c(y, ŷ)
    mean(y .== onecold2(ŷ))
end

function accuracy_v3(y, ŷ)
    mean(y .== mapslices(argmax, ŷ, dims=1))
end

function accuracy_v4(oh, ŷ)
    mean(maximum(oh .* ŷ, dims=1) .== maximum(ŷ, dims=1))
end


ŷ = rand(Float32, 100, 1000)
y = rand(1:100, 1000)
oh = onehotbatch(y, 1:100)
ŷg, yg, ohg = gpu.([ŷ, y, oh]) 

println("V1A")
@btime accuracy_v1a(oh, ŷ) # 755.393 μs (9516 allocations: 248.97 KiB)
# @btime CUDA.@sync accuracy_v1a(ohg, ŷg) # Error scalar indexing

println("\nV1B")
@btime accuracy_v1b(oh, ŷ)  #   728.873 μs (9524 allocations: 249.81 KiB)
@btime CUDA.@sync accuracy_v1b(ohg, ŷg) #   1.027 ms (9542 allocations: 648.70 KiB)

println("\nV1C")
@btime accuracy_v1c(y, ŷ) # 771.765 μs (9519 allocations: 241.75 KiB)
@btime CUDA.@sync accuracy_v1c(yg, ŷg) # 1.022 ms (9537 allocations: 640.64 KiB)

println("\nV2A")
@btime accuracy_v2a(oh, ŷ) #   511.169 μs (10 allocations: 40.20 KiB)
@btime CUDA.@sync accuracy_v2a(ohg, ŷg) #   72.631 μs (261 allocations: 8.03 KiB)

println("\nV2B")
@btime accuracy_v2b(oh, ŷ) #  513.488 μs (20 allocations: 41.11 KiB)
@btime CUDA.@sync accuracy_v2b(ohg, ŷg) #   792.764 μs (38 allocations: 440.00 KiB)


println("\nV2C")
@btime accuracy_v2c(y, ŷ) # 524.095 μs (9 allocations: 32.27 KiB)
@btime CUDA.@sync accuracy_v2c(yg, ŷg)  #  64.536 μs (236 allocations: 7.38 KiB)

println("\nV3")
@btime accuracy_v3(y, ŷ)    # 1.802 ms (9510 allocations: 362.91 KiB)
# @btime CUDA.@sync accuracy_v3(yg, ŷg) # Error scalar indexing

println("\nV4")
@btime accuracy_v4(oh, ŷ) #   612.915 μs (14 allocations: 403.50 KiB)
@btime CUDA.@sync accuracy_v4(ohg, ŷg) # 85.776 μs (154 allocations: 4.23 KiB)

src/onehot.jl

DhairyaLGandhi · 2020-12-31T14:56:40Z

src/onehot.jl


-onecold(y::AbstractMatrix, labels...) =
-  dropdims(mapslices(y -> onecold(y, labels...), y, dims=1), dims=1)
+onecold(y::AbstractMatrix, labels = 1:size(y,1)) =


Why remove the splatted version?

because previously the default arg was handled by the specialization on AbstractVector, now it has to be handled here

DhairyaLGandhi · 2020-12-31T15:03:52Z

#764 fixed the performance IIRC, that issue should be closed

CarloLucibello · 2020-12-31T15:06:25Z

it didn't, as you can see from the benchmark I posted

CarloLucibello · 2021-01-01T11:16:08Z

closing in favor of #1447

CarloLucibello closed this Dec 27, 2020

CarloLucibello reopened this Dec 27, 2020

CarloLucibello requested a review from DhairyaLGandhi December 27, 2020 16:30

DhairyaLGandhi reviewed Dec 31, 2020

View reviewed changes

CarloLucibello added 4 commits December 31, 2020 17:47

fix onehot gpu

48428ba

cleanup

ba9e33b

doc example

e23c146

cleanup

d8c85bf

CarloLucibello force-pushed the cl/onecold branch from 025a577 to d8c85bf Compare December 31, 2020 16:48

CarloLucibello requested a review from DhairyaLGandhi December 31, 2020 16:48

CarloLucibello mentioned this pull request Jan 1, 2021

new onehot implementation #1447

Closed

4 tasks

CarloLucibello closed this Jan 1, 2021

CarloLucibello deleted the cl/onecold branch January 7, 2021 08:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix onehot gpu #1441

fix onehot gpu #1441

CarloLucibello commented Dec 27, 2020 •

edited

Loading

DhairyaLGandhi Dec 31, 2020

CarloLucibello Dec 31, 2020

DhairyaLGandhi commented Dec 31, 2020

CarloLucibello commented Dec 31, 2020

CarloLucibello commented Jan 1, 2021

fix onehot gpu #1441

fix onehot gpu #1441

Conversation

CarloLucibello commented Dec 27, 2020 • edited Loading

DhairyaLGandhi Dec 31, 2020

Choose a reason for hiding this comment

CarloLucibello Dec 31, 2020

Choose a reason for hiding this comment

DhairyaLGandhi commented Dec 31, 2020

CarloLucibello commented Dec 31, 2020

CarloLucibello commented Jan 1, 2021

CarloLucibello commented Dec 27, 2020 •

edited

Loading