[RyuJIT] Correct names of AVX that stand for AVX2 #13879

fiigii · 2017-09-08T21:52:02Z

In some places in RyuJIT, the name "AVX" actually stands for the AVX2 instruction set. Because we introduced System.Runtime.Intrinsics.X86 in #13576, RyuJIT should distinguish the AVX and AVX2 ISAs.

Currently, RyuJIT SIMD and floating-point codegen emit AVX instructions (VEX encoding) only when AVX2 is available, so RyuJIT uses "AVX" and "AVX2" interchangeably, which can cause confusion once the Avx intrinsics are introduced. The intrinsics in class System.Runtime.Intrinsics.X86.Avx may change this strategy, so we should distinguish AVX and AVX2 in RyuJIT (the names on the VM side are already correct).

While implementing System.Runtime.Intrinsics.X86, I found that InstructionSet_AVX has to be renamed to InstructionSet_AVX2 (a new InstructionSet_AVX will be added). I am not sure whether any other "AVX" names should be changed to "AVX2" (e.g., canUseAVX), so we can discuss that in this PR.

fiigii · 2017-09-08T21:53:02Z

/cc @russellhadley @jkotas @CarolEidt

BruceForstall

LGTM. I don't think the JIT ever generates AVX (but not AVX2) instructions currently, and certainly not for SIMD.

@CarolEidt comments?

fiigii · 2017-09-15T18:31:53Z

> I don't think the JIT ever generates AVX (but not AVX2) instructions currently, and certainly not for SIMD.

@BruceForstall I think this codegen strategy should change when we enable Intel hardware intrinsics, which allow developers to use AVX intrinsics on non-AVX2 machines (Sandy Bridge and Ivy Bridge). Under the current codegen strategy, scalar floating-point code on those machines would still use the non-VEX SSE encoding, so mixing scalar floating-point calculations with AVX intrinsics may trigger AVX-SSE transition penalties.

CarolEidt · 2017-09-15T22:34:05Z

As I mentioned in the intrinsics discussion, we will have to change our strategy. We chose to use AVX2 rather than AVX because the AVX2 support for 256-bit vectors was more complete. However, if we are going to broadly support AVX intrinsics, then I believe that we need to:

Base the encoding selection on whether we are on an AVX-capable system, and
Base the size of Vector<T> on whether we are on an AVX2-capable system.

I'm not comfortable with this change as it is, because 1) I don't think it improves anything, and 2) it just leads to further confusion.

What we need is to clarify (and separately address) the question of the encoding and available instruction set (i.e., the actual target hardware) vs. the size of Vector<T>.

fiigii · 2017-09-15T23:17:26Z

> However, if we are going to broadly support AVX intrinsics, then I believe that we need to:
>
> Base the encoding selection on whether we are on an AVX-capable system, and
> Base the size of Vector<T> on whether we are on an AVX2-capable system.

@CarolEidt I agree with this new codegen strategy. We should generate VEX-encoded instructions that operate over xmm registers once AVX is supported by the underlying hardware, but only enable 256-bit Vector<T> when AVX2 is available. I will open a new issue and make the change to encoding selection.

> I'm not comfortable with this change as it is, because 1) I don't think it improves anything, and 2) it just leads to further confusion.

We have to change something, because the current RyuJIT uses "AVX" and "AVX2" interchangeably and #14020 requires both InstructionSet_AVX and InstructionSet_AVX2.

fiigii · 2017-09-15T23:21:50Z

src/jit/simdcodegenxarch.cpp

@@ -1158,7 +1158,7 @@ void CodeGen::genSIMDIntrinsic32BitConvert(GenTreeSIMD* simdNode)
            getEmitter()->emitIns_R_R_I(INS_pinsrw, emitTypeSize(TYP_INT), tmpReg, tmpIntReg, 3);
        }
 #endif
-        if (compiler->getSIMDInstructionSet() == InstructionSet_AVX)
+        if (compiler->getSIMDInstructionSet() == InstructionSet_AVX2)
        {
            inst_RV_RV(INS_vpbroadcastd, tmpReg, tmpReg, targetType, emitActualTypeSize(targetType));
        }


@CarolEidt For example, once we distinguish InstructionSet_AVX and InstructionSet_AVX2, we will have to change this check because vpbroadcastd is an AVX2 instruction.
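To illustrate why the comparison in the diff has to change, here is a minimal sketch. It is not the actual RyuJIT code: the `Isa` enum and both helper functions are hypothetical names made up for this example. The only facts taken from the discussion are that vpbroadcastd is an AVX2 instruction and that some machines (Sandy Bridge, Ivy Bridge) support AVX without AVX2.

```cpp
// Hypothetical stand-in for the JIT's instruction-set enum after the split.
// The values are ordered so that each level implies the ones before it.
enum class Isa
{
    SSE2,
    AVX,  // AVX without AVX2 (e.g. Sandy Bridge, Ivy Bridge)
    AVX2  // provides vpbroadcastd
};

// Old-style check: only correct while the name "AVX" really means "AVX2".
constexpr bool oldCheckPicksBroadcast(Isa isa)
{
    return isa == Isa::AVX;
}

// Check after the split: vpbroadcastd is gated on AVX2.
constexpr bool canUseVpbroadcastd(Isa isa)
{
    return isa == Isa::AVX2;
}

// On an AVX-only machine the old check would select an AVX2 instruction,
// while the new check correctly rejects it.
static_assert(oldCheckPicksBroadcast(Isa::AVX), "old check fires on AVX-only");
static_assert(!canUseVpbroadcastd(Isa::AVX), "new check rejects AVX-only");
```

Once a separate AVX level exists, any check that guards an AVX2-only instruction has to name AVX2 explicitly, which is what the one-line change above does.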

@CarolEidt ping?

CarolEidt · 2017-09-18T23:13:16Z

> We have to change something because the current RyuJIT uses "AVX" and "AVX2" interchangeably and #14020 requires InstructionSet_AVX and InstructionSet_AVX2 both.

I agree that changes are needed, but I don't think this is an appropriate change. Instead, we should:

Add InstructionSet_AVX2 (instead of just replacing InstructionSet_AVX)
Leave the code as-is that determines whether/when to emit vzeroupper instructions
Modify the code that determines the size of Vector<T> to check for InstructionSet_AVX2 before setting to 32 bytes.
Ensure that we are using AVX encodings when compiler->getSIMDInstructionSet() >= InstructionSet_AVX

There may be other changes required as well.
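The split proposed above, encoding chosen by AVX and Vector<T> size chosen by AVX2, could look roughly like the following sketch. The `Isa` enum, its ordering, and the two function names are assumptions made for illustration and are not the JIT's real declarations. The vzeroupper logic is deliberately left out, since the suggestion is to leave it unchanged.

```cpp
// Hypothetical stand-in for the JIT's instruction-set levels; the names and
// the ordering are assumptions made for this sketch.
enum class Isa
{
    SSE2,
    AVX,
    AVX2
};

// Encoding selection follows the hardware: use AVX (VEX) encodings
// whenever the instruction set is at least AVX.
constexpr bool useVexEncoding(Isa isa)
{
    return isa >= Isa::AVX;
}

// The size of Vector<T> follows AVX2: 32 bytes only when AVX2 is available,
// otherwise 16 bytes.
constexpr unsigned vectorTByteSize(Isa isa)
{
    return isa == Isa::AVX2 ? 32u : 16u;
}

// An AVX-only machine gets VEX encodings but keeps a 16-byte Vector<T>.
static_assert(useVexEncoding(Isa::AVX), "AVX-only uses VEX encoding");
static_assert(vectorTByteSize(Isa::AVX) == 16u, "AVX-only keeps 16-byte Vector<T>");
```

Keeping the two questions in separate functions means an AVX-only target no longer has to be treated as either "no AVX at all" or "full AVX2".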

fiigii · 2017-09-18T23:27:10Z

@CarolEidt Got it, I will close this PR and provide a better solution.

fiigii · 2017-09-19T19:00:41Z

Moved to #14065.

Rename InstructionSet_AVX to InstructionSet_AVX2

eb2af99

dnfclas added the cla-already-signed label Sep 8, 2017

fiigii changed the title ~~[RyuJIT] Correct names of AVX that stands for AVX2~~ [RyuJIT] Correct names of AVX that stand for AVX2 Sep 8, 2017

BruceForstall approved these changes Sep 15, 2017

fiigii mentioned this pull request Sep 15, 2017

Implement "IsSupported" for all ISA classes of Intel hardware intrinsics #14020

Merged

fiigii closed this Sep 19, 2017
