adds semantics for the x86 SSE floating-point instructions #1466

ivg · 2022-04-14T19:33:52Z

Introduction

This PR utilizes the recently added intrinsic Primus Lisp primitive and represents and represents floating-point operations as calls to intrinsic functions. The set of operations coincides with the set of floating-point operations in Core Theory and SMT-LIB. The rounding modes and floating-point format are encoded in the name of the intrinsic, e.g., here's how the ADDSD XMM0, qword ptr [RBP + -0x18] is lifted,

00000881: intrinsic:x0 := extract:63:0[extract:127:0[YMM0]]
00000884: intrinsic:x1 := mem[RBP - 0x18, el]:u64
00000887: call @intrinsic:fadd_rne_ieee754_binary with return %00000889

00000889:
0000088a: YMM0 := YMM0 &
          0xFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF00000000000000000000000000000000
          | pad:256[intrinsic:y0]

The rounding mode is represented as a parameter, which could be either a constant (the default and is equal to +rne+) or could be read from the MXCSR register).

Deprecated Features

This PR sunsets two existing features.

First, is the legacy floating-point lifter, that previously could be enabled with the --x86-with-floating-points option. While we do not plan to remove it in the foreseeable future, it is not really necessary anymore (other than for comparison and debugging). To make the option name less confusing, it is renamed to --x86-with-legacy-floating-points (as floating points are now enabled by default and there's no need to enable them explicitly). The old option has the same behavior, except that it will print a big fat warning message.

Another feature is the old capability of the BIL lifter to automatically translate unknown instructions (or instructions belonging to some class) into intrinsic calls. The new intrinsic primitive supersedes this option at the cost of some extra manual work. However, unlike the automatic translation, it enables explicit data flow and well-defined intrinsic calling conventions. If you will enable this mode, you will get knowledge base conflicts with the x86 plugin since both will try to turn x86 floating-point instructions into intrinsic calls in different ways. If you still need to use this feature of the BIL lifter, then you can pass the --x86-disable-floating-point-intrinsics option to suppress the x86 lifter. We currently don't know if we will remove the automatic translation altogether or rewrite it in a such way that it will work with the modern BAP. But in any case, it is better to explicitly denotate the missing instructions as intrinsic calls instead of relying on the automatic translation.

Bug fixes, Tweaks, and some New Small Features

This PR also contains a few bug fixes and features that were necessary to make everything work together (we now have 4 different floating-point lifters!).

The Primus.Lisp.Semantics.context allows us to constrain the context of the applicability of Primus Lisp definitions, so that we can disable the translation of the floating-point instructions into the intrinsic calls.

Several bugs were also discovered during the work on this PR.

The edge contraction procedure in the IR lifter was sometimes producing blocks in an incorrect order. The fixed version is also contracting more edges. Moreover, it is now enabled by default for all lifters (before that it was only contracting edges for IR generated by the Ghidra lifter).

The stack correction and automatic ABI were preventing the intrinsic functions from working (since they are not C). To fix this, we introduced a new intrinsic attribute to the subroutine. If a subroutine is marked as an intrinsic then it won't have C type automatically inserted and the stack correction in x86 will be disabled. In addition, the already existing synthetic attribute is used to tag synthetic subroutines.

Finally, this PR fixes some bugs in the semantics of pcode wrt the floating-point operations, so now it is fully functional and tested (using the testsuite/sse/float program).

Summary of the Available Floating-point Lifters

Basically, we now have 4 options to lift and run the floating-point programs (all passing the test for our simple toy program),

the default lifter that translates instructions into intrinsic calls:

bap sse/floats --run

the Ghidra lifter that translates pcode floating-point operations into intrinsic calls:

bap sse/floats --run --x86-backend=ghidra

the BIL ability to translate the unknown instructions into intrinsics

bap sse/floats --run \
    --bil-enable-intrinsic=:unknown \
    --primus-lisp-load=posix,llvm-x86-64-floats \
    --x86-disable-floating-point-intrinsics

the legacy floating-point lifter together with BIL floating-point emulation mode.

bap sse/floats --run \
    --x86-with-legacy-floating-points \
    --bil-enable-fp-emulation

Future Work

In the future we plan to utilize the BIL floating-point emulation mode (translation of the basic floating-point operations, such as +,-,*,/,sqrt into bitvector operations) to give concrete semantics to the corresponding intrinsic functions so that they will not have the empty denotation. Right now, the first two examples above are executable, because we defined the dynamic Primus Lisp semantics for these intrinsics. They work, however, only for ieee756 64-bit floats and use OCaml floats underneath the hood.

All synthetic subroutines are marked with the synthetic attribute, and special together with intrinsic functions are marked as intrinsic.

a little bit hacky, but will make it better in the future

the repetitive parameters were ignored

we now are passing the tests!

The Ghidra lifter generates code that is either invalid or our interpretation of it is not valid, e.g., `int3` instructioin on x86 is lifted as ``` CALL x; RETURN y; ``` where according to the documentation both CALL and RETURN should be treated initially as indirect jumps

to prevent clashing with the Primus Lisp systems

Otherwise, they both try to provide semantics that is conflicting.

This option makes it possible to still use the old BIL plugin feature that translates unknown (to the BIL lifters) instructions into intrinsic calls.

2.5.0 ===== - BinaryAnalysisPlatform/bap#1390 adds the flattening pass to the library interface - BinaryAnalysisPlatform/bap#1389 adds `insn-code` to the `Theory.Semantics` class - BinaryAnalysisPlatform/bap#1394 adds the `Bitvec.modular` function - BinaryAnalysisPlatform/bap#1395 adds LLVM 13/14 compatibility - BinaryAnalysisPlatform/bap#1408 adds support for mips64el target - BinaryAnalysisPlatform/bap#1409 adds the `--print-missing` option to print unlifed instructions - BinaryAnalysisPlatform/bap#1410 adds several new Primus Lisp primitives and new instructions - BinaryAnalysisPlatform/bap#1428 adds the monad choice interface to the knowledge base - BinaryAnalysisPlatform/bap#1429 refines the `Theory.Target.matches` and adds the `matching` function - BinaryAnalysisPlatform/bap#1434 adds arm unpredicated BL instruction - BinaryAnalysisPlatform/bap#1444 adds the x86/amd64 plt corrector component to the Primus base system - BinaryAnalysisPlatform/bap#1445 updates the `Sub.compute_liveness` function to handle SSA form - BinaryAnalysisPlatform/bap#1446 provides the new liveness analysis - BinaryAnalysisPlatform/bap#1452 implements pcode floating-point and special operators - BinaryAnalysisPlatform/bap#1457 adds optional `join` for `Knowledge.Domain.mapping` - BinaryAnalysisPlatform/bap#1461 enables v8.{1,2,3,4,5,6}a revisions for the aarch64 target - BinaryAnalysisPlatform/bap#1464 adds arbitrary-precision loopless clz and popcount to Primus Lisp - BinaryAnalysisPlatform/bap#1460 adds compatibility with Core_kernel >= 0.15 - BinaryAnalysisPlatform/bap#1466 adds semantics for the x86 SSE floating-point instructions - BinaryAnalysisPlatform/bap#1469 adds the jump destination addresses/names to the assembly output - BinaryAnalysisPlatform/bap#1458 adds more aarch64 instructions - BinaryAnalysisPlatform/bap#1473 adds an `--arm-features` command-line option - BinaryAnalysisPlatform/bap#1476 implements the naming scheme for interrupts - BinaryAnalysisPlatform/bap#1479 reifies external subroutines and intrinsics into I - BinaryAnalysisPlatform/bap#1482 enables BIR subroutines without an explicit return - BinaryAnalysisPlatform/bap#1481 enables disabling the patterns plugin - BinaryAnalysisPlatform/bap#1483 implements floating-point intrinsic subroutines - BinaryAnalysisPlatform/bap#1488 adds compatibility with OCaml 4.14 and Core v0.15 - BinaryAnalysisPlatform/bap#1489 adds some missing functionality to Primus Lisp POSIX - BinaryAnalysisPlatform/bap#1490 adds some missing C POSIX APIs - BinaryAnalysisPlatform/bap#1492 makes bit-twiddling operations easier to read and analyze - BinaryAnalysisPlatform/bap#1493 adds smart constructors and destructors to the C types library - BinaryAnalysisPlatform/bap#1491 adds semantics for the x86-64 `popq` instruction - BinaryAnalysisPlatform/bap#1497 extends the C.Abi library - BinaryAnalysisPlatform/bap#1498 adds the extended lvalue assignment to Primus Interpreter - BinaryAnalysisPlatform/bap#1499 makes BIL smart constructors smart - BinaryAnalysisPlatform/bap#1500 makes argument passing well-typed - BinaryAnalysisPlatform/bap#1503 reimplements C types printing functions - BinaryAnalysisPlatform/bap#1504 extends the demanglers library to the new targets infrastructure - BinaryAnalysisPlatform/bap#1505 rewrites x86 abi using the new infrastructure - BinaryAnalysisPlatform/bap#1511 implements some missing Thumb instructions - BinaryAnalysisPlatform/bap#1513 implements the x86_64 padd instructions - BinaryAnalysisPlatform/bap#1515 allows target overriding - BinaryAnalysisPlatform/bap#1516 adds armv8 BFM instructions - BinaryAnalysisPlatform/bap#1517 publishes Theory.Target.nicknames and extends Primus Contexts - BinaryAnalysisPlatform/bap#1519 extends Core Theory with target registration and lookup - BinaryAnalysisPlatform/bap#1520 adds the high-level calling convention specification language - BinaryAnalysisPlatform/bap#1521 reimplements x86 targets using the new infrastructure - BinaryAnalysisPlatform/bap#1522 reimplements ARM ABI and target specification - BinaryAnalysisPlatform/bap#1523 rewrites mips targets and abi - BinaryAnalysisPlatform/bap#1524 adds C data type layout - BinaryAnalysisPlatform/bap#1525 adds the pass by reference argument passing method - BinaryAnalysisPlatform/bap#1526 restructures powerpc targets and reimplements ppc32 eabi - BinaryAnalysisPlatform/bap#1529 makes the ABI processors usable programmatically - BinaryAnalysisPlatform/bap#1391 fixes ARM/Thumb `movt` semantics - BinaryAnalysisPlatform/bap#1396 fixes the path plugin loader path handling - BinaryAnalysisPlatform/bap#1414 fixes the pc value in pc-relative thumb ldr - BinaryAnalysisPlatform/bap#1420 fixes the low-level Disasm_expert.Basic.create function - BinaryAnalysisPlatform/bap#1421 fixes the core-theory plugin semantics tags - BinaryAnalysisPlatform/bap#1426 fixes arm predication - BinaryAnalysisPlatform/bap#1438 reads correctly unqualified system names - BinaryAnalysisPlatform/bap#1439 fixes a bug in the KB update function, adds new functions - BinaryAnalysisPlatform/bap#1448 fixes an accidental dependency on the bap-traces internal module - BinaryAnalysisPlatform/bap#1449 fixes unconditional pop with return in thumb - BinaryAnalysisPlatform/bap#1455 fixes register assignments in p-code semantics - BinaryAnalysisPlatform/bap#1462 fixes the `cast-signed` Primus Lisp primitive - BinaryAnalysisPlatform/bap#1463 fixes the arithmetic modulus in Primus Lisp primitives - BinaryAnalysisPlatform/bap#1465 fixes handling of `jmp term`s in the flatten pass - BinaryAnalysisPlatform/bap#1467 fixes a sporadic internal error in the cache garbage collector - BinaryAnalysisPlatform/bap#1468 fixes the relocation symbolizer incorrect handling of intrinsics - BinaryAnalysisPlatform/bap#1458 fixes aarch64 bitmask immediate encoding - BinaryAnalysisPlatform/bap#1486 fixes type unification on binary operation application - BinaryAnalysisPlatform/bap#1485 fixes little-endian MIPS disassembling - BinaryAnalysisPlatform/bap#1494 fixes the encoding of the comparison operators - BinaryAnalysisPlatform/bap#1496 fixes registers allocation in the abi specification DSL - BinaryAnalysisPlatform/bap#1502 fixes the bitvector order function - BinaryAnalysisPlatform/bap#1528 fixes armv4t name that was missing the arm prefix - BinaryAnalysisPlatform/bap#1393 improves the Primus Lisp documentation generator - BinaryAnalysisPlatform/bap#1397 fixes the macOS CI build - BinaryAnalysisPlatform/bap#1399 updates the url of the testing repo to use the encrypted version - BinaryAnalysisPlatform/bap#1432 updates the docker image - BinaryAnalysisPlatform/bap#1435 selects specific llvm components for linking - BinaryAnalysisPlatform/bap#1447 updates to the git+https in the dockerfiles - BinaryAnalysisPlatform/bap#1470 corrects linking of Unix library in configure - BinaryAnalysisPlatform/bap#1478 fixes the opam/opam dev-repo protocol which broke the release action - BinaryAnalysisPlatform/bap#1480 adds an automation to build a docker image for the latest release - BinaryAnalysisPlatform/bap#1514 adds the mmap dependency - BinaryAnalysisPlatform/bap#1386 adds missing ARM target ABI information - BinaryAnalysisPlatform/bap#1388 adds aliasing information for x86 - BinaryAnalysisPlatform/bap#1392 adds an option to directly use ogre files as a loader - BinaryAnalysisPlatform/bap#1398 provides the assembly string as a promise (removes #undefined) - BinaryAnalysisPlatform/bap#1400 improves the computation of the instruction properties - BinaryAnalysisPlatform/bap#1401 improves the KB.Value merge operation - BinaryAnalysisPlatform/bap#1402 moves promises and theories into the core-theory plugin - BinaryAnalysisPlatform/bap#1403 moves knowledge base rules from the library to the plugin - BinaryAnalysisPlatform/bap#1404 improves the peformance of the byte patterns matcher (1/3) - BinaryAnalysisPlatform/bap#1405 improves the performance of bitvectors (2/3) - BinaryAnalysisPlatform/bap#1411 [optimization] do not store empty objects in the knowledge base - BinaryAnalysisPlatform/bap#1412 updates the KB version number and adds a few more microoptimizations - BinaryAnalysisPlatform/bap#1413 updates bap to latest OCaml, switches to newer bitstrings - BinaryAnalysisPlatform/bap#1415 switches to patricia trees in the KB implementation - BinaryAnalysisPlatform/bap#1416 Reimplements x86 bitscan and popcnt - BinaryAnalysisPlatform/bap#1418 uses the builtin clz function from base, instead of the custom one - BinaryAnalysisPlatform/bap#1417 relaxes the speculative disassembler constraints - BinaryAnalysisPlatform/bap#1419 allows bapbuild to work when bap and other defaults are not present - BinaryAnalysisPlatform/bap#1422 relaxes interpreters to allow ill-typed operations - BinaryAnalysisPlatform/bap#1425 applies ARM modified immediate (MIC) decoding in more places - BinaryAnalysisPlatform/bap#1423 reimplements clz using the branchless/loopless algorithm - BinaryAnalysisPlatform/bap#1427 removes unnecessary units from the knowledge base - BinaryAnalysisPlatform/bap#1430 refines and extends target definitions - BinaryAnalysisPlatform/bap#1431 partially upgrades byteweight to work with the modern bap - BinaryAnalysisPlatform/bap#1441 uses Allen's Interval Algebra in the KB.Value merge implementation - BinaryAnalysisPlatform/bap#1442 wraps proposals into with_empty and adds more guards - BinaryAnalysisPlatform/bap#1443 adds subinstruction contraction to improve the ghidra lifter output - BinaryAnalysisPlatform/bap#1433 adds mode events to traces - BinaryAnalysisPlatform/bap#1450 hushes bil lifters - BinaryAnalysisPlatform/bap#1451 removes falls-through from unconditional branches in IR reification - BinaryAnalysisPlatform/bap#1454 improves the setw function used - BinaryAnalysisPlatform/bap#1456 removes Thumb2 branches from the legacy ARM lifter - BinaryAnalysisPlatform/bap#1471 uses function starts as the entires when building the symtab - BinaryAnalysisPlatform/bap#1472 improves disassembler performance - BinaryAnalysisPlatform/bap#1475 unifies name generation for IR subroutines - BinaryAnalysisPlatform/bap#1477 removes the special Primus Lisp primitive - BinaryAnalysisPlatform/bap#1484 disables byteweight - BinaryAnalysisPlatform/bap#1487 reduces memory footprint - BinaryAnalysisPlatform/bap#1501 makes all C data type sizes a multitude of their alignment - BinaryAnalysisPlatform/bap#1506 optimizes encoding computation for x86 - BinaryAnalysisPlatform/bap#1510 adds an example on how to create a monad transformer stack (#1354) - BinaryAnalysisPlatform/bap#1518 uses signed casts for promoting arguments - BinaryAnalysisPlatform/bap#1530 turns x86 endbr instructions into nops - BinaryAnalysisPlatform/bap#1531 adds patterns to recognize certain x86 endbr as function starts - BinaryAnalysisPlatform/bap#1532 improves the main subroutine discovery within glibc runtime - BinaryAnalysisPlatform/bap#1535 prevents knowledge conflicts on mangled names

ivg added 19 commits April 13, 2022 14:20

marks synthetic subroutines with proper attributes

d922cc2

All synthetic subroutines are marked with the synthetic attribute, and special together with intrinsic functions are marked as intrinsic.

prevents C ABI pass from clobbering registers on intrinsic calls

820d22a

a little bit hacky, but will make it better in the future

do not correct the stack pointer on calls to intrinsic functions

c84b5d3

adds the MXCSR register to the x86-64 environment

c12dcc2

fixes a bug in the intrinisic primtive

07a0c25

the repetitive parameters were ignored

fixes the IR contraction procedure again and enables unconditionally

3813229

adds the semantics of the basic SSE floating-point operations

2b19171

adds the implementation of some ieee754 intrinsics

f5637d9

relaxes the contraction predicate

b48f8e5

fixes the pcode floating-point lifter

381afd6

we now are passing the tests!

makes the word-width primitive equal in both versions of Primus

9d082bf

adds the ieee754-is-nan primitive

a3608e5

adds more SSE instruction, implements branchless comparison

6f88230

renames the system feature in Lisp context to the target-system

b78ae26

to prevent clashing with the Primus Lisp systems

adds the Lisp.Semantics.context to control the Lisp definitions

04f8df0

disables x86 FP intrinsic semantics when the legacy lifter is used

47cec64

Otherwise, they both try to provide semantics that is conflicting.

adds the --x86-disable-floating-point-intrinsics as an escape hatch

dddcf04

This option makes it possible to still use the old BIL plugin feature that translates unknown (to the BIL lifters) instructions into intrinsic calls.

updates the testsuite revision

1b95ae4

ivg merged commit a360700 into BinaryAnalysisPlatform:master Apr 15, 2022

ivg mentioned this pull request Jul 12, 2022

releases BAP 2.5.0 ocaml/opam-repository#21781

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

adds semantics for the x86 SSE floating-point instructions #1466

adds semantics for the x86 SSE floating-point instructions #1466

ivg commented Apr 14, 2022 •

edited

Loading

adds semantics for the x86 SSE floating-point instructions #1466

adds semantics for the x86 SSE floating-point instructions #1466

Conversation

ivg commented Apr 14, 2022 • edited Loading

Introduction

Deprecated Features

Bug fixes, Tweaks, and some New Small Features

Summary of the Available Floating-point Lifters

Future Work

ivg commented Apr 14, 2022 •

edited

Loading