PSA: --xla_gpu_simplify_all_fp_conversions is about to be on by default #9

cheshire · 2022-08-24T09:00:55Z

The change lets XLA simplify away round-trip conversions through a lower-precision type (e.g. f32->bf16->f32), and was seen to give large performance gains on some models (~10%).

As a result, some casts may be ignored and overall precision may be higher than expected, but this is already the case for the XLA compiler (e.g. elementwise ops and matmuls can already be performed in higher precision than specified in the HLO).
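To make "precision higher than expected" concrete, here is a minimal sketch in plain Python. It does not call XLA. It only emulates the f32->bf16->f32 round trip with bit manipulation, assuming round-to-nearest-even and finite inputs. The helper names (`with_casts`, `without_casts`) are hypothetical and exist only for this illustration.

```python
import struct


def _f32_bits(x: float) -> int:
    """Bit pattern of x after rounding it to IEEE-754 float32."""
    return struct.unpack("<I", struct.pack("<f", x))[0]


def _bits_f32(bits: int) -> float:
    """Interpret a 32-bit pattern as a float32 value."""
    return struct.unpack("<f", struct.pack("<I", bits & 0xFFFFFFFF))[0]


def without_casts(x: float) -> float:
    """What the program sees if the bf16 round trip is simplified away:
    the value stays at float32 precision."""
    return _bits_f32(_f32_bits(x))


def with_casts(x: float) -> float:
    """Emulated f32 -> bf16 -> f32 round trip (finite inputs only).

    bf16 keeps the top 16 bits of the float32 pattern, so the low
    16 bits are rounded off (round-to-nearest-even) and zeroed.
    """
    bits = _f32_bits(x)
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)
    rounded = ((bits + rounding_bias) >> 16) << 16
    return _bits_f32(rounded)


if __name__ == "__main__":
    x = 1.001
    print("with casts   :", with_casts(x))     # bf16 cannot represent 1.001
    print("without casts:", without_casts(x))  # float32 keeps more digits
```

Near 1.0, bf16 values are spaced 2^-7 apart, so the explicit round trip collapses 1.001 to 1.0, while the simplified version keeps the float32 value. Code that relies on a cast to deliberately discard precision would observe this difference.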

cheshire · 2022-09-28T09:12:10Z

Author

This was committed.

1 reply

nouiz Sep 28, 2022
Collaborator

URL: tensorflow/tensorflow@6a98a2c
