Add hip support #330

dacorvo · 2024-10-04T16:06:28Z

What does this PR do?

This is a rebase of #280.

This has been tested on an AMD Instinct MI250X/MI250.

Some tests with qfloat8 Linear are failing because there is a wider mismatch between the quantized and non-quantized version, but this is a good first step.

FAILED test/nn/test_qlinear.py::test_quantize_linear_float16_activations_float8[cuda-a-qfloat8-e4m3-w-qint8-fp16-bias-10-256-1] - ValueError: Alignment 0.95751953 deviates too much from 1.0 with atol=0.005, rtol=0.001
FAILED test/nn/test_qlinear.py::test_quantize_linear_float16_activations_float8[cuda-a-qfloat8-e4m3-w-qint8-fp16-bias-10-256-10] - ValueError: Alignment 0.95556641 deviates too much from 1.0 with atol=0.005, rtol=0.001
FAILED test/nn/test_qlinear.py::test_quantize_linear_float16_activations_float8[cuda-a-qfloat8-e4m3-w-qint8-fp16-no-bias-10-256-1] - ValueError: Alignment 0.95458984 deviates too much from 1.0 with atol=0.005, rtol=0.001
FAILED test/nn/test_qlinear.py::test_quantize_linear_float16_activations_float8[cuda-a-qfloat8-e4m3-w-qint8-fp16-no-bias-10-256-10] - ValueError: Alignment 0.95507812 deviates too much from 1.0 with atol=0.005, rtol=0.001
FAILED test/nn/test_qlinear.py::test_quantize_linear_float16_activations_float8[cuda-a-float8-e4m3-uz-w-qint8-fp16-bias-10-256-1] - ValueError: Alignment 0.99267578 deviates too much from 1.0 with atol=0.005, rtol=0.001
FAILED test/nn/test_qlinear.py::test_quantize_linear_float16_activations_float8[cuda-a-float8-e4m3-uz-w-qint8-fp16-bias-10-256-10] - ValueError: Alignment 0.99316406 deviates too much from 1.0 with atol=0.005, rtol=0.001
FAILED test/nn/test_qlinear.py::test_quantize_linear_float16_activations_float8[cuda-a-float8-e4m3-uz-w-qint8-fp16-no-bias-10-256-1] - ValueError: Alignment 0.99267578 deviates too much from 1.0 with atol=0.005, rtol=0.001
FAILED test/nn/test_qlinear.py::test_quantize_linear_float16_activations_float8[cuda-a-float8-e4m3-uz-w-qint8-fp16-no-bias-10-256-10] - ValueError: Alignment 0.99267578 deviates too much from 1.0 with atol=0.005, rtol=0.001

dacorvo mentioned this pull request Oct 4, 2024

feat: add HIP support #280

Closed

3 tasks

dacorvo and others added 2 commits October 4, 2024 18:10

test(fp8marlin): only test is extension is available

e02aa6d

feat: add HIP support

49cc052

dacorvo force-pushed the add_hip_support branch from dd3e1cf to 49cc052 Compare October 4, 2024 16:11

dacorvo merged commit 843b793 into main Oct 4, 2024
16 checks passed

dacorvo deleted the add_hip_support branch October 4, 2024 16:40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add hip support #330

Add hip support #330

dacorvo commented Oct 4, 2024

Add hip support #330

Add hip support #330

Conversation

dacorvo commented Oct 4, 2024

What does this PR do?