Add test for EmitReducePrecisionIR #16775

apivovarov · 2024-09-03T23:07:54Z

I noticed that the EmitReducePrecisionIR function from xla/service/elemental_ir_emitter.h is not covered by unit tests.

Given its non-trivial logic, I believe it should be thoroughly tested, particularly for corner cases.

Changes in this PR:

Declare EmitReducePrecisionIR function in xla/service/elemental_ir_emitter.h
Add EmitReducePrecisionIR_F16ToF8e5m2 test
Add EmitReducePrecisionIR_F16ToF8e4m3fn test

Related PR:

PR-16585 Add support for float8_e4m3

reedwm · 2024-09-05T01:01:05Z

Maybe such tests should be in xla/tests/reduce_precision_test.cc. That way it will be tested on backends which don't use element_ir_emitter.

apivovarov · 2024-09-05T02:06:47Z

Maybe such tests should be in xla/tests/reduce_precision_test.cc. That way it will be tested on backends which don't use element_ir_emitter.

The function EmitReducePrecisionIR is declared in xla/service/elemental_ir_emitter.h and implemented in xla/service/elemental_ir_emitter.cc.

To test the functions declared in elemental_ir_emitter.h, XLA provides a corresponding file, xla/service/elemental_ir_emitter_test.cc.

EmitReducePrecisionIR utilizes LLVM and returns an llvm::Value (LLVM IR), which is then converted to a string representation and verified in the related tests.

In contrast, xla/tests/reduce_precision_test.cc does not rely on LLVM.

Given this, I believe that xla/service/elemental_ir_emitter_test.cc is the correct location for the EmitReducePrecisionIR tests.

akuegel · 2024-09-05T07:03:55Z

Maybe such tests should be in xla/tests/reduce_precision_test.cc. That way it will be tested on backends which don't use element_ir_emitter.

The function EmitReducePrecisionIR is declared in xla/service/elemental_ir_emitter.h and implemented in xla/service/elemental_ir_emitter.cc.

To test the functions declared in elemental_ir_emitter.h, XLA provides a corresponding file, xla/service/elemental_ir_emitter_test.cc.

EmitReducePrecisionIR utilizes LLVM and returns an llvm::Value (LLVM IR), which is then converted to a string representation and verified in the related tests.

In contrast, xla/tests/reduce_precision_test.cc does not rely on LLVM.

Given this, I believe that xla/service/elemental_ir_emitter_test.cc is the correct location for the EmitReducePrecisionIR tests.

So we have different kind of tests:

End2End tests for all backends, those are in xla/tests
GPU specific End2End tests, those are in xla/service/gpu/tests
Filecheck tests, those are also in xla/service/gpu/tests

For the tests you are adding here, I think filecheck based tests that use the hlo-opt tool would be the best fit. Can you please rewrite the tests based on that? You can take a look at .hlo files in xla/service/gpu/tests to see how to use it.

apivovarov · 2024-09-05T23:56:08Z

Thank you, Adrian and Reed, for your feedback. I’d like to provide some additional context on why I’m using the GTest framework for this test and the associated code.

The EmitReducePrecisionIR function consists of 138 lines of code and handles several specific cases:

When the destination mantissa is smaller than the source mantissa (adds 6 LLVM ops).
When the destination exponent is smaller than the source exponent (adds 7 LLVM ops).
NaN handling (adds 5 LLVM ops).
Prolog/epilog setup (adds 5 LLVM ops).

The function generates a sequence of LLVM IR operations that convert an input number in f16 format to an output in a reduced-precision f16-like format, with fewer bits for the exponent and mantissa. The result is returned as an llvm::Value*.

When working on f8E4M3 type support, it wasn’t immediately clear what LLVM IR or output I would get when reducing f16 to a 4-bit exponent and 3-bit mantissa. To explore this, I created a test that generates the LLVM IR and uses specific f16 constants as inputs.

Since the input is a constant, the entire LLVM IR can be evaluated and simplified to a final result. By calling llvm::Value*->print(), the result is folded into a final f16-like LLVM IR constant, represented as specific bits in hexadecimal format.

Note that the GPU compiler has its own separate EmitReducePrecision and EmitF16ToF8e5m2 functions, which use mlir::Value instead of llvm::Value.

The EmitReducePrecisionIR function’s test:

Is not an end-to-end test.
Is not an HLO test.
Is not a GPU test.

It’s a unit test for a specific C++ function (EmitReducePrecisionIR), which uses the LLVM API internally to emit LLVM IR for the CPU compiler.

Given this, I believe the test should use GTest along with the LLVM API to properly validate the result of EmitReducePrecisionIR.

@akuegel @reedwm

akuegel · 2024-09-06T05:00:02Z

Thanks for the explanation. If it is for the CPU backend, then I think @ezhulenev might be a better reviewer.

apivovarov · 2024-09-09T18:48:59Z

Hi Eugene,

Could you please review this PR? I've included details above explaining the necessity of the test and the reasoning behind using GTest with the LLVM API.

@ezhulenev

apivovarov requested a review from reedwm September 3, 2024 23:11

apivovarov force-pushed the elemental_ir_emitter_test branch from 5cd9df3 to a63e028 Compare September 3, 2024 23:17

NaiyerRizz self-assigned this Sep 4, 2024

apivovarov force-pushed the elemental_ir_emitter_test branch from a63e028 to f94a79d Compare September 4, 2024 20:11

apivovarov requested review from hawkinsp and akuegel September 4, 2024 21:03

apivovarov force-pushed the elemental_ir_emitter_test branch from f94a79d to 7407fbb Compare September 6, 2024 00:08

akuegel requested a review from ezhulenev September 6, 2024 04:59

Add test for EmitReducePrecisionIR

57dd070

apivovarov force-pushed the elemental_ir_emitter_test branch from 7407fbb to 57dd070 Compare September 11, 2024 20:16

apivovarov requested review from ddunl and removed request for akuegel September 11, 2024 20:50

akuegel requested a review from penpornk September 12, 2024 05:31

hawkinsp removed their request for review September 19, 2024 19:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add test for EmitReducePrecisionIR #16775

Add test for EmitReducePrecisionIR #16775

apivovarov commented Sep 3, 2024 •

edited

Loading

reedwm commented Sep 5, 2024

apivovarov commented Sep 5, 2024 •

edited

Loading

akuegel commented Sep 5, 2024

apivovarov commented Sep 5, 2024

akuegel commented Sep 6, 2024

apivovarov commented Sep 9, 2024

Add test for EmitReducePrecisionIR #16775

Are you sure you want to change the base?

Add test for EmitReducePrecisionIR #16775

Conversation

apivovarov commented Sep 3, 2024 • edited Loading

reedwm commented Sep 5, 2024

apivovarov commented Sep 5, 2024 • edited Loading

akuegel commented Sep 5, 2024

apivovarov commented Sep 5, 2024

akuegel commented Sep 6, 2024

apivovarov commented Sep 9, 2024

apivovarov commented Sep 3, 2024 •

edited

Loading

apivovarov commented Sep 5, 2024 •

edited

Loading