You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently the dashboard Host Utilization graph assumes that there is at most one task per core, and does not display disk usage. But in fact, in many simulations there can be more tasks than cores on a host, and more than one transfer at a time per disk. Here are some thoughts on how to display these situations, starting with the disk, and then showing how a host could be done almost similarly.
Disk: We can easily identify the time intervals during which the number of ongoing disk operations is constant. Say that between times t1 and t2 there are n ongoing operations. Then we simply show n rectangles splitting the disk bandwidth (i.e., the height of the pink rectangle in the display) into n identical rectangle, each of them with 1/n-th of the full height of the disk bandwidth. The one drawback is the situation in which, say, from t1 to t2 we have 2 operations, and one of them finishes at time t2. From t2 to t3 the other transfer continues. As described above, the display will look exactly the same if both these operations finish at time t2 and a third transfer starts at time t2. A much fancier display would remove some vertical lines at time t2 to show that one transfer first proceeds at 1/2 the bandwidth and then proceeds at full bandwidth (in other words, the transfer is not shown as two rectangles, but instead as some L-shaped polygon). Creating this fancier display in arbitrary cases is likely NP-hard though...
Network Link: No issue because we do not differentiate between individual network flows (we don't have that information)
Host: The idea here is to do the same as for the disk if the number of tasks is greater than the number cores, and what we currently do if it's smaller. This makes sense in terms of what the simulation actually does. Say we have a 4-core host with 5 tasks. Each task really gets 4/5 of a core. So we should show them as 5 identical rectangles of height 4/5. Here again there could be a fancier display to show tasks as polygons, but it's likely NP-hard...
The text was updated successfully, but these errors were encountered:
@henricasanova, do we currently have an example (i.e., JSON file) in which multiple I/O operations happens simultaneously in the same disk? I was thinking on how to display such data and it can really be tricky. It would be great if you could generate an example JSON file so I could put more thinking into that. Thanks!
This is the same issue as plotting multiple compute operations on a single core basically. I can't think of an example. I'll make up a simulator to generate that. Should be pretty easy.
Currently the dashboard Host Utilization graph assumes that there is at most one task per core, and does not display disk usage. But in fact, in many simulations there can be more tasks than cores on a host, and more than one transfer at a time per disk. Here are some thoughts on how to display these situations, starting with the disk, and then showing how a host could be done almost similarly.
Disk: We can easily identify the time intervals during which the number of ongoing disk operations is constant. Say that between times t1 and t2 there are n ongoing operations. Then we simply show n rectangles splitting the disk bandwidth (i.e., the height of the pink rectangle in the display) into n identical rectangle, each of them with 1/n-th of the full height of the disk bandwidth. The one drawback is the situation in which, say, from t1 to t2 we have 2 operations, and one of them finishes at time t2. From t2 to t3 the other transfer continues. As described above, the display will look exactly the same if both these operations finish at time t2 and a third transfer starts at time t2. A much fancier display would remove some vertical lines at time t2 to show that one transfer first proceeds at 1/2 the bandwidth and then proceeds at full bandwidth (in other words, the transfer is not shown as two rectangles, but instead as some L-shaped polygon). Creating this fancier display in arbitrary cases is likely NP-hard though...
Network Link: No issue because we do not differentiate between individual network flows (we don't have that information)
Host: The idea here is to do the same as for the disk if the number of tasks is greater than the number cores, and what we currently do if it's smaller. This makes sense in terms of what the simulation actually does. Say we have a 4-core host with 5 tasks. Each task really gets 4/5 of a core. So we should show them as 5 identical rectangles of height 4/5. Here again there could be a fancier display to show tasks as polygons, but it's likely NP-hard...
The text was updated successfully, but these errors were encountered: