i#2626 AArch64 encoder: Add isz operand and vector ADD to encoder. #3016

fhahn · 2018-05-21T09:36:15Z

This patch adds an isz operand to encode the vector element width for
non-FP vector instructions. It also adds support for vector ADD to the
encoder/decoder. Additional tests and macros should be added once the
script in the project-aarch64-generate-patterns branch gets updated.

Issue #2626

This patch adds an isz operand to encode the vector element width for non-FP vector instructions. It also adds support for vector ADD to the encoder/decoder. Additional tests and macros should be added once the script in the project-aarch64-generate-patterns branch gets updated. Issue #2626 Change-Id: I2bca21610205c3b2ba7bb67f990fe108d210001c

…ctor-add Change-Id: Ie8adf4f164d83f7bdcf85cb9e9f44b487c1de5d7

Change-Id: I7414163022cb784fdd4d7af29cb6c184e7394c45

egrimley · 2018-05-24T08:44:15Z

Does it make sense to refer to "the project-aarch64-generate-patterns branch" in the commit message? Shall I assume that the last sentence of the commit message will be something like "Additional tests and macros will be added by a later commit"?

egrimley · 2018-05-24T08:52:10Z

core/arch/aarch64/codec.c

+static inline bool
+encode_opnd_isz(uint enc, int opcode, byte *pc, opnd_t opnd, OUT uint *enc_out)
+{
+    if (opnd_get_immed_int(opnd) < ISZ_BYTE || opnd_get_immed_int(opnd) > ISZ_DOUBLE)


Since opnd_get_immed_int might be a function, I would avoid using it three times here and instead put uint bits = opnd_get_immed_int(opnd).

Also, since line 1984 wants bits to be 0, 1, 2 or 3, I'm not sure the use of ISZ_BYTE and ISZ_DOUBLE is helpful here, so I would have put if (bits > 3), but that's just my opinion and I wouldn't be surprised if some people would disagree.

Since opnd_get_immed_int might be a function, I would avoid using it three times here and instead put uint bits = opnd_get_immed_int(opnd).

Sometimes I tend to have too much faith in compilers. I've changed it.

Also, since line 1984 wants bits to be 0, 1, 2 or 3, I'm not sure the use of ISZ_BYTE and ISZ_DOUBLE is helpful here, so I would have put if (bits > 3), but that's just my opinion and I wouldn't be surprised if some people would disagree.

I wasn't sure, but yes, they don't add too much value I think. Ideally, I think there would be an enum, shared with the FP ops. I could do that as a follow up.

egrimley · 2018-05-24T08:55:17Z

core/arch/aarch64/codec.txt

@@ -131,6 +131,8 @@
 ---------?x---------x-----------  vindex_SD  # Index for vector with single or double
                                             # elements, depending on bit 22 (sz)
 ?--------xx---------------------  imm16sh    # shift for MOVK/... (immediate); checks 31
+--------xx----------------------  isz        # element size of a vector register (


If you wanted to get the comment into one line you could put something like: # element size of vector reg (8<<x bits)

egrimley · 2018-05-24T08:58:54Z

core/arch/aarch64/codec.txt

@@ -957,6 +959,10 @@ x101101011000000000101xxxxxxxxxx  cls     wx0 : wx5
 1101101011000000000011xxxxxxxxxx  rev     x0 : x5

 # Data Processing - Scalar Floating-Point and Advanced SIMD
+
+# ADD


Is there a rule for how the instructions are ordered in this file? (I think I was following the "Index by Encoding" in our internal web pages at some point...) If it's feasible, it might be good to follow some canonical ordering and mark omissions with a comment. (But perhaps it isn't feasible.)

The neon patterns should currently follow alphabetic order (as on the A64 -- SIMD and Floating-point Instructions (alphabetic order) index page of the public XML ISA spec). That's how the generator script happens to process them, but IMO that makes it easier to read. On second thought, it might be easier to extend to generator script to work by the index page in the future.

egrimley · 2018-05-24T09:26:08Z

suite/tests/api/dis-a64.txt

@@ -1561,6 +1561,16 @@ fd3fffff : str    d31, [sp,#32760]        : str    %d31 -> +0x7ff8(%sp)[8byte]
 fd481041 : ldr    d1, [x2,#4128]          : ldr    +0x1020(%x2)[8byte] -> %d1
 fd7fffff : ldr    d31, [sp,#32760]        : ldr    +0x7ff8(%sp)[8byte] -> %d31

+
+# ADD (vector)
+4e2c856a : add v10.16b, v11.16b, v12.16b : add    %q11 %q12 $0x00 -> %q10


The script dis-a64.pl contains a format specification that would attempt to make these colons line up. In fact, I think at some point dis-a64.txt could survive reformatting by dis-a64.pl. Perhaps worth thinking about getting that to work again, but not as part of this commit.

Right, I've aligned the ADD lines. It should be easy to update the generator script to align on a per-opcode basis.

…ctor-add

Change-Id: I4c017e16910f18a3b56bb1fc59df743aea908e40

fhahn · 2018-05-24T13:28:04Z

Does it make sense to refer to "the project-aarch64-generate-patterns branch" in the commit message? Shall I assume that the last sentence of the commit message will be something like "Additional tests and macros will be added by a later commit"?

Probably not. I'll make sure to update that in the final message.

egrimley

LGTM

fhahn · 2018-05-24T14:37:14Z

Thanks Edmund!

This patch adds an isz operand to encode the vector element width for non-FP vector instructions. It also adds support for vector ADD to the encoder/decoder. Additional tests and macros will be added by a later commit. Issue #2626

fhahn added 3 commits May 21, 2018 10:34

Merge branch 'master' of github.com:DynamoRIO/dynamorio into i2626-ve…

3067d3c

…ctor-add Change-Id: Ie8adf4f164d83f7bdcf85cb9e9f44b487c1de5d7

Some style changes

eb63f9c

Change-Id: I7414163022cb784fdd4d7af29cb6c184e7394c45

fhahn requested a review from egrimley May 23, 2018 14:19

egrimley reviewed May 24, 2018

View reviewed changes

fhahn added 2 commits May 24, 2018 13:57

Merge branch 'master' of github.com:DynamoRIO/dynamorio into i2626-ve…

3c57b20

…ctor-add

Address Edmund's comments.

6360288

Change-Id: I4c017e16910f18a3b56bb1fc59df743aea908e40

egrimley approved these changes May 24, 2018

View reviewed changes

fhahn merged commit 3e1182e into master May 24, 2018

fhahn deleted the i2626-vector-add branch May 24, 2018 14:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

i#2626 AArch64 encoder: Add isz operand and vector ADD to encoder. #3016

i#2626 AArch64 encoder: Add isz operand and vector ADD to encoder. #3016

fhahn commented May 21, 2018

egrimley commented May 24, 2018

egrimley May 24, 2018

fhahn May 24, 2018

egrimley May 24, 2018

fhahn May 24, 2018

egrimley May 24, 2018

fhahn May 24, 2018

egrimley May 24, 2018

fhahn May 24, 2018

fhahn commented May 24, 2018

egrimley left a comment

fhahn commented May 24, 2018

i#2626 AArch64 encoder: Add isz operand and vector ADD to encoder. #3016

i#2626 AArch64 encoder: Add isz operand and vector ADD to encoder. #3016

Conversation

fhahn commented May 21, 2018

egrimley commented May 24, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

fhahn commented May 24, 2018

egrimley left a comment

Choose a reason for hiding this comment

fhahn commented May 24, 2018