
uiCA (uops.info Code Analyzer)

uiCA is a simulator that predicts the throughput of basic blocks on recent Intel microarchitectures. It also provides insights into how the code is executed.

uiCA is based on data from uops.info, combined with a detailed pipeline model. Like related tools, it assumes that all memory accesses result in cache hits.

Details on uiCA's pipeline model, as well as a comparison with similar tools, can be found in our paper "Accurate Throughput Prediction of Basic Blocks on Recent Intel Microarchitectures".

Web Interface

An online version of uiCA is available at uiCA.uops.info.

Installation

Ubuntu

  • Prerequisites:

    sudo apt-get install gcc python3 python3-pip
    pip3 install plotly
    
  • Installation:

    git clone https://github.com/andreas-abel/uiCA.git
    cd uiCA
    ./setup.sh
    
  • Update:

    git pull
    ./setup.sh
    

Windows

  • Prerequisites:

  • Installation:

    git clone https://github.com/andreas-abel/uiCA.git
    cd uiCA
    .\setup.cmd
    
  • Update:

    git pull
    .\setup.cmd
    

Example Usage

echo ".intel_syntax noprefix; l: add rax, rbx; add rbx, rax; dec r15; jnz l" > test.asm
as test.asm -o test.o
./uiCA.py test.o -arch SKL
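
The -trace and -graph options described under Command-Line Options can be applied to the same object file. The following is a minimal sketch rather than part of uiCA: uica_reports is a hypothetical helper, the UICA variable and the output file names are assumptions, and uiCA is invoked once per report so that nothing is assumed about combining the two options.

```shell
# Hypothetical helper (not part of uiCA): write a trace table and an event
# graph for one object file. UICA defaults to ./uiCA.py, i.e. the helper is
# assumed to be called from the repository root.
uica_reports() {
    obj="$1"            # object file, e.g. test.o
    arch="${2:-SKL}"    # microarchitecture passed to -arch (assumed default: SKL)
    base="${obj%.*}"    # strip the extension: test.o -> test

    # One run per report; the second run only starts if the first succeeded.
    "${UICA:-./uiCA.py}" "$obj" -arch "$arch" -trace "${base}_trace.html" &&
    "${UICA:-./uiCA.py}" "$obj" -arch "$arch" -graph "${base}_graph.html"
}
```

With the test.o from above, calling `uica_reports test.o SKL` would be expected to leave test_trace.html and test_graph.html next to the object file.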

Command-Line Options

The following parameters are optional. Parameter names may be abbreviated if the abbreviation is unique (e.g., -ar may be used instead of -arch).

Option                  Description
-arch                   The microarchitecture of the simulated CPU. Available microarchitectures: SNB, IVB, HSW, BDW, SKL, SKX, KBL, CFL, CLX, ICL, TGL, RKL. Alternatively, you can use all to get an overview of the throughputs for all supported microarchitectures. [Default: all]
-iacaMarkers            Analyze only the code that is between the IACA_START and IACA_END markers of Intel's IACA tool.
-raw                    Analyze a file that directly contains the machine code of the benchmark, but no headers or other data.
-trace <filename.html>  Generate an HTML file that contains a table with a cycle-by-cycle view of how the instructions are executed.
-graph <filename.html>  Generate an HTML file that contains a graph with various performance-related events.
-alignmentOffset        Alignment offset (relative to a 64-Byte cache line). The option all provides an overview of the throughputs for all possible alignment offsets. [Default: 0]
-TPonly                 Output only the throughput prediction.
-simpleFrontEnd         Simulate a simple front end that is only limited by the issue width.
-noMicroFusion          Simulate a CPU variant that does not support micro-fusion.
-noMacroFusion          Simulate a CPU variant that does not support macro-fusion.
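
The -iacaMarkers option requires the code of interest to be enclosed by IACA's start and end markers. As a hedged sketch, the block below writes the loop from Example Usage wrapped in the marker sequences defined in Intel's iacaMarks.h (mov ebx, 111 for the start and mov ebx, 222 for the end, each followed by the bytes 0x64, 0x67, 0x90). The helper write_marked_asm and the file name used in the note below are illustrative assumptions, not part of uiCA.

```shell
# Hypothetical helper: write the example loop, enclosed by IACA markers,
# to the file given as the first argument.
# Lines 2-3 of the generated file form the start marker (IACA_START),
# the last two lines form the end marker (IACA_END).
write_marked_asm() {
    cat > "$1" <<'EOF'
.intel_syntax noprefix
mov ebx, 111
.byte 0x64, 0x67, 0x90
l: add rax, rbx
add rbx, rax
dec r15
jnz l
mov ebx, 222
.byte 0x64, 0x67, 0x90
EOF
}
```

After `write_marked_asm marked.asm`, assembling with `as marked.asm -o marked.o` and running `./uiCA.py marked.o -iacaMarkers -arch SKL` should restrict the analysis to the four loop instructions between the markers.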
