Rapid Data Analysis (RDA)

Rapid Data Analysis (RDA) is a versatile framework designed to facilitate the analysis of experimental data. It provides built-in functions, intuitive plotting capabilities, and advanced tools for statistical and power analysis. RDA is ideal for quickly exploring your data after experiments, and it also supports custom power models and Correlation Power Analysis (CPA) attacks.

RDA builds on pandas, NumPy, Seaborn, SciPy, and other packages for easy data visualization. It also includes optimized GPU implementations for large-scale CPA attacks, which keeps performance high even on extensive datasets.

Key Features

- Simple and efficient plotting functions for rapid data visualization.
- Support for statistical outlier removal through a pipeline-based approach.
- Both generic and GPU-accelerated CPA implementations for power analysis and CPA attacks.
- Customizable power models for detailed, flexible analysis.

CPA Support

RDA offers two variants of Correlation Power Analysis (CPA):

CPU-based Generic CPA
This variant allows for arbitrary, user-defined models and runs on the CPU. While highly flexible, it is constrained by the number of samples and model complexity, making it ideal for smaller datasets or simpler analyses.
GPU-accelerated CPA
This variant is a highly optimized CPA implementation using the CuPy library, requiring an NVIDIA GPU with CUDA support. It supports predefined models such as Hamming Distance, Hamming Weight, and Hamming Weight of S-box outputs, which can be combined to create complex models. With GPU acceleration, RDA efficiently processes large datasets:
- 48-bit data with 50 million samples can be analyzed in about 3 hours.
- 40-bit data with 50 million samples takes around 1 minute.
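
The predefined models named above can be illustrated with a small NumPy sketch. This is not RDA's implementation: the helper names `hw`, `hd`, and `combine` are hypothetical, and stacking components as columns is only one assumed way of combining models.

```python
import numpy as np

# Lookup table: Hamming weight of every 8-bit value.
HW8 = np.array([bin(i).count("1") for i in range(256)], dtype=np.uint8)

def hw(values):
    """Hamming weight of each byte in `values` (hypothetical helper)."""
    return HW8[np.asarray(values, dtype=np.uint8)]

def hd(a, b):
    """Hamming distance between corresponding bytes of `a` and `b`."""
    return hw(np.asarray(a, dtype=np.uint8) ^ np.asarray(b, dtype=np.uint8))

def combine(*components):
    """Stack model components column-wise, one column per component."""
    return np.column_stack([np.asarray(c, dtype=float) for c in components])

values = np.array([0x00, 0x0F, 0xFF], dtype=np.uint8)
previous = np.array([0xFF, 0xF0, 0xFF], dtype=np.uint8)

# Each row is one sample; columns are hw(value) and hd(value, previous).
design = combine(hw(values), hd(values, previous))
print(design)
```

A matrix like `design` could then be fed to a regression or correlation step, with one coefficient per model component.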

Installation

To avoid package conflicts, it is recommended to use a virtual environment, such as venv.

Installation Steps:

Install the required dependencies:
```
pip install -r requirements.txt
```
Install RDA in editable mode:
```
pip install -e .
```

Requirements

CuPy may require the CUDA Toolkit on some systems. Follow the installation instructions at cupy.dev.

Usage Examples

Example Dataset

Assume we are analyzing a cache hit/miss histogram saved in a NumPy .npy file, structured as follows:

| Timing | Label |
| --- | --- |
| 121 | Hit |
| 312 | Miss |
| 111 | Hit |

Here, Timing represents the measured time, and Label indicates the outcome (Hit or Miss).

Plot a Histogram

To generate a simple histogram:

rda - i file.npy - g Label - plot_hist Timing

This will create a histogram of the Timing data, grouped by the Label values.
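
For readers who want to see what grouping by Label means numerically, here is a minimal NumPy sketch of a per-label histogram. It does not use RDA. The first three samples come from the example table; the remaining ones and the bin edges are invented for illustration.

```python
import numpy as np

# First three rows match the example table; the rest are made-up samples.
timing = np.array([121, 312, 111, 118, 305, 298, 125])
label = np.array(["Hit", "Miss", "Hit", "Hit", "Miss", "Miss", "Hit"])

# Shared bin edges so the per-label histograms are directly comparable.
edges = np.linspace(100, 350, 6)  # five bins of width 50

# One histogram (array of bin counts) per distinct label.
histograms = {
    str(cls): np.histogram(timing[label == cls], bins=edges)[0]
    for cls in np.unique(label)
}

for cls, counts in histograms.items():
    print(cls, counts)
```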

Remove Outliers

RDA follows a pipeline-based approach, where commands are executed sequentially.

Remove 1% of Outliers (Overall)

To remove 1% of all samples as outliers:

rda - i file.npy - per Timing 0.5 99.5 - g Label - plot_hist Timing

Remove Outliers by Class

To remove 1% of outliers for each class separately:

rda - i file.npy - g Label - per Timing 0.5 99.5 - plot_hist Timing

This approach still removes about 1% of the total samples, but the percentile thresholds are computed separately for each class, which is essential when class distributions differ.
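
The effect of command order can be reproduced outside RDA with a short NumPy sketch. The simulated timings and the helper `percentile_mask` are assumptions made for illustration; only the 0.5 and 99.5 percentile bounds are taken from the commands above.

```python
import numpy as np

rng = np.random.default_rng(0)
# Simulated data: fast "Hit" timings and slower, wider "Miss" timings.
timing = np.concatenate([rng.normal(120, 5, 10_000), rng.normal(310, 20, 10_000)])
label = np.array(["Hit"] * 10_000 + ["Miss"] * 10_000)

def percentile_mask(x, lo=0.5, hi=99.5):
    """True for samples inside the [lo, hi] percentile range of x."""
    low, high = np.percentile(x, [lo, hi])
    return (x >= low) & (x <= high)

# Filter first, group later: one pair of thresholds for all samples.
keep_overall = percentile_mask(timing)

# Group first, filter later: thresholds are computed per class.
keep_by_class = np.zeros(timing.shape, dtype=bool)
for cls in np.unique(label):
    members = label == cls
    keep_by_class[members] = percentile_mask(timing[members])

# The fastest "Miss" sample survives the overall filter but not the per-class one.
miss_idx = np.flatnonzero(label == "Miss")
fastest_miss = miss_idx[np.argmin(timing[miss_idx])]
print(keep_overall[fastest_miss], keep_by_class[fastest_miss])
```

With the overall filter, only the lower tail of the Hit class and the upper tail of the Miss class are trimmed. The per-class filter trims both tails of each class.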

Power Analysis

RDA also supports power analysis. The following example uses the built-in simulation facility to simulate a textbook CPA attack on an AES S-box. Assuming the leakage depends on Value and Guess, the command analyzes the relationship between the model and the simulated data:

rda - power_sim_aes_sbox 1000 - power_init --no_diff - g Exp - power_models "hwsbox(v000, g000)" - power_fit Power - print

This will output results like the following:

> print_data n=None index=None
            rho     rho_l     rho_u    pv_rho  r2_score    N    type x y  hw(sbox(v000^g000))                model
Exp
test   0.104796  0.030934  0.177521  0.005514  0.010840  700  unknown                 1.131573  hw(sbox(v000^g000))
train  0.152331  0.039777  0.261069  0.008220  0.022043  300  unknown                 1.501876  hw(sbox(v000^g000))

Explanation of Fields

| Field | Description |
| --- | --- |
| `rho` | Pearson correlation coefficient of the model and data |
| `rho_l` | Lower bound of the Pearson correlation coefficient |
| `rho_u` | Upper bound of the Pearson correlation coefficient |
| `pv_rho` | p-value of the Pearson correlation coefficient estimate |
| `hw(sbox(v000^g000))` | Linear regression coefficient for the model component |
| `r2_score` | R² score of the linear regression |
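
To make these fields concrete, the sketch below computes comparable quantities with NumPy for a single model component. It is not RDA's code. The function `fit_summary`, the simulated data, and the use of a 95% Fisher z-transform interval for the bounds are assumptions, since the README does not state how RDA derives `rho_l` and `rho_u`. The p-value is omitted because it needs a t-distribution CDF, which NumPy does not provide.

```python
import math
import numpy as np

def fit_summary(model, power, z_crit=1.96):
    """Correlation and single-component linear fit between model and power."""
    model = np.asarray(model, dtype=float)
    power = np.asarray(power, dtype=float)
    n = model.size

    rho = float(np.corrcoef(model, power)[0, 1])

    # Fisher z-transform interval (assumed 95% level, z_crit = 1.96).
    z = math.atanh(rho)
    half_width = z_crit / math.sqrt(n - 3)
    rho_l, rho_u = math.tanh(z - half_width), math.tanh(z + half_width)

    # Least-squares line: power ~ coefficient * model + intercept.
    coefficient, intercept = np.polyfit(model, power, 1)
    residuals = power - (coefficient * model + intercept)
    ss_res = float(np.sum(residuals ** 2))
    ss_tot = float(np.sum((power - power.mean()) ** 2))

    return {
        "rho": rho,
        "rho_l": rho_l,
        "rho_u": rho_u,
        "coefficient": float(coefficient),
        "r2_score": 1.0 - ss_res / ss_tot,
        "N": n,
    }

rng = np.random.default_rng(0)
# Simulated leakage: Hamming weight of a uniform byte plus Gaussian noise.
model = rng.binomial(8, 0.5, size=1000)
power = 1.5 * model + rng.normal(0.0, 10.0, size=model.size)

summary = fit_summary(model, power)
print(summary)
```

In this sketch the fit and the score use the same samples, so `r2_score` equals `rho` squared. That need not hold when the score is evaluated on different data than the fit.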

Performing a CPA Attack

To run a CPA attack where the secret Value is unknown (used only for ground truth validation):

rda - power_sim_aes_sbox 1000 - power_init --no_diff - g Exp - power_models "hwsbox(v000, g000)" "hd(v000,g000)" - pw_cpa Power --step 10

This command runs the CPA attack iteratively and plots the rank of the correct key candidate. The plot shows that the incorrect Hamming distance model does not converge to the correct key candidate.
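
The idea of tracking the rank of the correct key as more samples are added can be sketched in plain NumPy. This is a textbook CPA on simulated data, not RDA's CPU or GPU implementation. The function names, the noise level, the key byte `0x2B`, and the step size of 200 are all assumptions chosen for illustration.

```python
import numpy as np

def aes_sbox():
    """Build the AES S-box from its definition (GF(2^8) inverse + affine map)."""
    exp, log = [0] * 255, [0] * 256
    x = 1
    for i in range(255):
        exp[i], log[x] = x, i
        x2 = x << 1                      # multiply by 2 ...
        if x2 & 0x100:
            x2 ^= 0x11B                  # ... reduce by the AES polynomial
        x = x2 ^ x                       # x * 3, and 3 generates the field
    sbox = []
    for a in range(256):
        inv = 0 if a == 0 else exp[(255 - log[a]) % 255]
        s = r = inv
        for _ in range(4):               # affine transform via rotations
            r = ((r << 1) | (r >> 7)) & 0xFF
            s ^= r
        sbox.append(s ^ 0x63)
    return np.array(sbox, dtype=np.uint8)

SBOX = aes_sbox()
HW8 = np.array([bin(i).count("1") for i in range(256)], dtype=float)

def cpa_correlations(values, power):
    """Pearson correlation of power with hw(sbox(v ^ g)) for all 256 guesses."""
    guesses = np.arange(256, dtype=np.uint8)
    hyp = HW8[SBOX[values[:, None] ^ guesses[None, :]]]   # shape (n, 256)
    hyp_c = hyp - hyp.mean(axis=0)
    pow_c = power - power.mean()
    num = hyp_c.T @ pow_c
    den = np.sqrt((hyp_c ** 2).sum(axis=0) * (pow_c ** 2).sum())
    return num / den

def key_rank(correlations, true_key):
    """Rank 0 means the correct key has the largest absolute correlation."""
    order = np.argsort(-np.abs(correlations))
    return int(np.where(order == true_key)[0][0])

rng = np.random.default_rng(1)
true_key = 0x2B                          # used only to simulate and to validate
values = rng.integers(0, 256, size=2000, dtype=np.uint8)
power = HW8[SBOX[values ^ true_key]] + rng.normal(0.0, 2.0, size=values.size)

ranks = []
for n in range(200, values.size + 1, 200):
    ranks.append(key_rank(cpa_correlations(values[:n], power[:n]), true_key))
print(ranks)
```

A model that does not match the simulated leakage would be expected to leave the correct key at an arbitrary rank instead of settling at 0.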

Visually Validating the Power Model

You can also plot the power consumption against the power model to check visually whether the model explains the measured power:

rda - power_sim_aes_sbox 100000 - power_init --no_diff - power_eval_model "hwsbox(v000, g000)" - g Exp - idx "hwsbox(v000, g000)" - plot_line Power
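
One way to read such a plot is as the mean power for each value the model predicts. The sketch below computes those means with NumPy on simulated data. It does not reproduce RDA's `idx` or `plot_line` steps, and the leakage coefficient and noise level are assumptions.

```python
import numpy as np

rng = np.random.default_rng(2)
# The Hamming weight of a uniform byte follows Binomial(8, 0.5).
model = rng.binomial(8, 0.5, size=100_000)
# Assumed leakage: power grows linearly with the model, plus Gaussian noise.
power = 1.0 * model + rng.normal(0.0, 3.0, size=model.size)

# Mean power for each model value 0..8.
counts = np.bincount(model, minlength=9)
sums = np.bincount(model, weights=power, minlength=9)
mean_power = sums / counts

for value, mean in enumerate(mean_power):
    print(value, round(float(mean), 3))
```

If the model fits, the means rise steadily with the model value. A flat or erratic curve suggests the model does not describe the leakage.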

Disclaimer

This code is provided "as-is," without any warranties or guarantees regarding its correctness, performance, or suitability for any particular purpose. Use this code at your own risk. The authors and contributors are not responsible for any direct or indirect damages or issues that arise from the use of this code. No support or maintenance is implied.

Acknowledgments

Special thanks to Gaëtan Cassiers for the countless discussions on optimizing the performance of the CPA implementation.
