
Conversation

@igchor igchor commented Jul 24, 2024

The latency tracker is useful for measuring and optimizing specific parts of code that are hard to profile using external tracers.

Sample output:

<LEVEL_ZERO>[INFO]: [command_list_cache_t::getRegularCommandList] average latency: 0ns
<LEVEL_ZERO>[INFO]: [command_list_cache_t::getRegularCommandList] number of samples: 0
<LEVEL_ZERO>[INFO]: [command_list_cache_t::getImmediateCommandList] average latency: 15895ns
<LEVEL_ZERO>[INFO]: [command_list_cache_t::getImmediateCommandList] number of samples: 200
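
For context, here is a minimal sketch of how such a tracker can be wired up: an RAII helper times a scope and feeds the elapsed nanoseconds into a per-call-site tracker that keeps a running sum and count, printing the average on destruction. The names (latency_tracker, scoped_latency) and the exact reporting format are illustrative assumptions, not necessarily this PR's actual code.

#include <atomic>
#include <chrono>
#include <cstdint>
#include <cstdio>

// Accumulates a running sum and sample count; prints the average on destruction.
struct latency_tracker {
  explicit latency_tracker(const char *n) : name(n) {}
  ~latency_tracker() {
    uint64_t n = count.load();
    std::fprintf(stderr, "[%s] average latency: %lluns\n", name,
                 static_cast<unsigned long long>(n ? sum.load() / n : 0));
    std::fprintf(stderr, "[%s] number of samples: %llu\n", name,
                 static_cast<unsigned long long>(n));
  }
  void trackValue(uint64_t ns) {
    sum.fetch_add(ns, std::memory_order_relaxed);
    count.fetch_add(1, std::memory_order_relaxed);
  }

  const char *name;
  std::atomic<uint64_t> sum{0};
  std::atomic<uint64_t> count{0};
};

// RAII helper: measures the time spent in the enclosing scope and records it.
struct scoped_latency {
  explicit scoped_latency(latency_tracker &t)
      : tracker(t), start(std::chrono::steady_clock::now()) {}
  ~scoped_latency() {
    auto ns = std::chrono::duration_cast<std::chrono::nanoseconds>(
                  std::chrono::steady_clock::now() - start)
                  .count();
    tracker.trackValue(static_cast<uint64_t>(ns));
  }

  latency_tracker &tracker;
  std::chrono::steady_clock::time_point start;
};

// Usage: one static tracker per measured call site, one scoped timer per call.
void getImmediateCommandList() {
  static latency_tracker tracker("command_list_cache_t::getImmediateCommandList");
  scoped_latency measure(tracker);
  // ... the work being measured ...
}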

@igchor igchor requested a review from a team as a code owner July 24, 2024 18:47
@github-actions github-actions bot added the level-zero L0 adapter specific issues label Jul 24, 2024
@pbalcer pbalcer left a comment

I wrote something like this before, when I was investigating latency spikes in enqueue, but I took a different approach: I made a structure that formed a hierarchy of trackers and, on destruction of the top-most object (a per-thread global), printed a (hacky, one-off) histogram of the collected data.

Something like:

static per_thread FooTracker foo; // this also had some options to, e.g., only collect data if the top-most operation took more than N ns

void foo() {
    TRACKER(&foo); // this took the function name and line number
    {
        TRACKER_SCOPE(&foo, "op1");
        // some expensive operation
    }
    {
        TRACKER_SCOPE(&foo); // or an anonymous scope
    }
}

with the global tracker object usable across module boundaries, so it could track the latency of operations in a deep call stack.

I never cleaned up that code (I can try to dig it out from somewhere among my hundreds of UR branches :P), and the histogram implementation was awful (I had to tune the buckets by hand), but it gave me a better overall picture of the latency of an operation, rather than just an average.


private:
  const char *name;
  double avg_{0};
Contributor

why is this a double and not just an integer value?

Contributor Author

I would still need to cast it to double for the calculations in trackValue, I think. I'm not sure it really matters. But I have changed the estimate() return value to uint64_t.

Contributor

> I would still need to cast it to double for the calculations in trackValue, I think. I'm not sure it really matters.

Well, I just prefer not using floating-point math if we don't have to, especially for something as simple as sum += value; ++cnt; avg = sum / cnt;. But yeah, I don't think it matters all that much.
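
For illustration, the integer-only version of that bookkeeping could look like the sketch below; the member names are illustrative assumptions, not the PR's actual code.

#include <cstdint>

// A minimal sketch, assuming latencies are tracked in whole nanoseconds.
struct rolling_latency {
  uint64_t sum{0}; // total latency observed so far, in ns
  uint64_t cnt{0}; // number of samples

  void trackValue(uint64_t ns) {
    sum += ns;
    ++cnt;
  }

  // Integer division is enough here; sub-nanosecond precision is not meaningful.
  uint64_t estimate() const { return cnt ? sum / cnt : 0; }
};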

@igchor igchor force-pushed the latency_tracker_v2 branch from 0a537da to d2f8523 on July 25, 2024 16:32

igchor commented Jul 25, 2024

> I wrote something like this before, when I was investigating latency spikes in enqueue, but I took a different approach. [...] it gave me a better overall picture of the latency of an operation, rather than just an average.

That makes sense. I think it would be good to have a histogram as well in the future. In CacheLib we used one implemented on top of folly, but that's a huge dependency and I'm not sure what its overhead is. For this rolling average, the overhead is really not noticeable for the things we are measuring.
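
As a possible future direction only (not code from this PR or from #1912), a compact histogram with power-of-two buckets would avoid both an external dependency and hand-tuned bucket boundaries; a rough, single-threaded sketch:

#include <array>
#include <cstddef>
#include <cstdint>

// Rough sketch only. Bucket i counts samples in [2^i, 2^(i+1)) ns
// (zero falls into bucket 0), so 64 buckets cover the whole uint64_t
// range with no manual tuning. Thread safety is omitted for brevity.
struct latency_histogram {
  std::array<uint64_t, 64> buckets{};

  static std::size_t bucketIndex(uint64_t ns) {
    std::size_t idx = 0;
    while (ns >>= 1)
      ++idx;
    return idx;
  }

  void trackValue(uint64_t ns) { ++buckets[bucketIndex(ns)]; }
};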


pbalcer commented Jul 25, 2024

> That makes sense. I think it would be good to have a histogram as well in the future. In CacheLib we used one implemented on top of folly, but that's a huge dependency and I'm not sure what its overhead is.

Yeah, I'd rather we find a good, small, compact histogram library. Otherwise, we can always put this behind an ifdef in CMake.

> For this rolling average, the overhead is really not noticeable for the things we are measuring.

Yeah, let's go with this simple implementation for now.


igchor commented Jul 31, 2024

> Yeah, I'd rather we find a good, small, compact histogram library. Otherwise, we can always put this behind an ifdef in CMake.
>
> Yeah, let's go with this simple implementation for now.

Please take a look at: #1912

@igchor igchor closed this Aug 1, 2024