Skip to content

Latest commit

 

History

History
21 lines (10 loc) · 2.26 KB

readme.md

File metadata and controls

21 lines (10 loc) · 2.26 KB

SteadySketch

Introduction

In this paper, we study steady flows in data streams, which refers to those flows whose arrival rate is always non-zero and around a fixed value for several consecutive windows. To find steady flows in real time, we propose a novel sketch, SteadySketch, aiming to accurately report steady flows in limited space. To the best of our knowledge, this is the first work to find steady flows in data streams. In SteadySketch, we propose the reborn technique to reduce the space by 75%. Our theoretical proofs show that the impact caused by the reborn technique is minimal. Experimental results show that we improve the Precision Rate (PR) by 81.1% and reduce the Average Relative Error (ARE) to 660.8 times compared with the strawman solution. Finally, we provide a concrete case: Cache prefetch, which proves that SteadySketch can significantly improve the Cache hit ratio.

About the source code

There are three directories here, one is for finding the temporary steady items, the other is for finding the persistent steady items, the last one is implement in cache.

The source code in each directory is implemented by C++, including the SteadySketch algorithm , strawman solution and the groundtruth solution.

You should use your own dataset and move it to the root directory.

How to run

If you have already cloned the repository, you need to get the datasets and move it to the root directory. Then you need to change the path of the dataset in main.cpp. Finally, you just need to compile and run main.cpp. In addition, you can change the parameters in either main.cpp or parm.h.

Memory allocation: 1) For the experiments of finding temporary steady flows, the memory size starts from 50KB, then: SteadyFilter occupies 10KB, and it has 10239 buckets, with 8 bits per bucket; RollingSketch occupies 40KB, and it has 40950 buckets, with 8 bits per bucket. 2) For the experiments of finding persistent steady flows, the memory size starts from 300KB, then: SteadyFilter occupies 30KB, and it has 30690 buckets, with 8 bits per bucket; RollingSketch occupies 120KB, it has 122878 buckets, with 8 bits per bucket; and Stage 2 has 9600 cells, with 128 bits per cell, i.e., flow ID (64 bits), start time (32 bits), and recent time (32 bits).