Commit 471cc7f

add bf16 exp fallback (#776)
1 parent 68af215 commit 471cc7f

1 file changed: +5 -0 lines changed

src/tl_templates/cuda/common.h

Lines changed: 5 additions & 0 deletions
@@ -330,3 +330,8 @@ TL_DEVICE void __sync_thread_partial() {
   asm volatile("bar.sync %0, %1;" : : "r"(barrier_id), "r"(thread_count));
 }
 } // namespace tl
+
+namespace cutlass {
+TL_DEVICE
+bfloat16_t fast_exp(bfloat16_t x) { return ::hexp(x); }
+} // namespace cutlass
