SpillBuffer.spilled_total appears to return incorrect results #6783

Closed
hendrikmakait opened this issue Jul 25, 2022 · 5 comments · Fixed by #6789
Labels: bug (Something is broken)

Comments

@hendrikmakait (Member)

When spilling data to disk, the SpillBuffer appears to return the incorrect size of the data on disk. For example, when spilling an ~8 MB random matrix onto disk, the data file created by the SpillBuffer is also ~8 MB in size, yet the SpillBuffer only returns ~1 MB.

Reproducer:

from __future__ import annotations

import os
import tempfile

import numpy as np

from distributed.spill import SpillBuffer


def test_spill_size():
    tmpdir = tempfile.mkdtemp()
    buf = SpillBuffer(spill_directory=tmpdir, target=0, max_spill=False)
    data = np.random.random((1024, 1024))
    buf["data"] = data
    spill_size = os.stat(os.path.join(tmpdir, "data")).st_size
    assert buf.spilled_total.disk == spill_size

fails with

>       assert buf.spilled_total.disk == spill_size
E       assert 1048808 == 8388840
E        +  where 1048808 = SpilledSize(memory=8388608, disk=1048808).disk
E        +    where SpilledSize(memory=8388608, disk=1048808) = Buffer<<LRU: 0/0 on dict>, <zict.cache.Cache object at 0x11f2413d0>>.spilled_total
@ncclementi (Member)

A couple of comments here: it looks a bit weird to me that the on-disk size calculated as spill_size is very close to the value we are calculating as buf.spilled_total.memory; I would expect spill_size to be bigger than the reported disk value, not eight times larger. However, I wonder if this is related to how we calculate spilled_total. See:

def __add__(self, other: SpilledSize) -> SpilledSize:  # type: ignore
    return SpilledSize(self.memory + other.memory, self.disk + other.disk)
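For context, the arithmetic in that `__add__` looks sound on its own. Here is a minimal standalone sketch, assuming `SpilledSize` is a `NamedTuple` with `memory` and `disk` fields, as the repr in the failure output above suggests:

```python
from typing import NamedTuple

class SpilledSize(NamedTuple):
    """Sketch of a (memory, disk) byte-count pair, mirroring the repr above."""
    memory: int
    disk: int

    def __add__(self, other: "SpilledSize") -> "SpilledSize":  # type: ignore[override]
        # Pairwise addition: both components accumulate independently.
        return SpilledSize(self.memory + other.memory, self.disk + other.disk)

total = SpilledSize(memory=100, disk=10) + SpilledSize(memory=200, disk=30)
assert total == SpilledSize(memory=300, disk=40)
```

So if the reported disk value is wrong, the error is more likely in the sizes being summed than in the summation itself.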

@hendrikmakait (Member, Author)

The size of spill_size looks fine to me. Looking at the way we serialize things, the serialization overhead for this array appears to be 232 bytes (sys.getsizeof(serialize(data)[0])), which matches the difference exactly. There might be a clue in the fact that spill_size is consistently almost 8x the size of buf.spilled_total.disk, regardless of the size of data.
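For what it's worth, the numbers in the failing assertion are consistent with exactly that breakdown. This is a hypothetical reconstruction, assuming one 232-byte header frame (whose length is counted correctly) plus one float64 payload frame whose length is counted in elements rather than bytes:

```python
header = 232               # overhead measured via sys.getsizeof(serialize(data)[0]) above
payload = 1024 * 1024 * 8  # 1024x1024 float64 array = 8 MiB of raw data

assert header + payload == 8388840       # the on-disk spill_size from the traceback
assert header + payload // 8 == 1048808  # the reported spilled_total.disk,
                                         # i.e. the payload counted per-element
```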

@hendrikmakait (Member, Author) commented Jul 25, 2022

It looks like the calculation of the memory size is wrong:

pickled_size = sum(len(frame) for frame in pickled)

uses len, but we should probably use nbytes on memoryview objects (https://docs.python.org/3/library/stdtypes.html#memoryview.nbytes). I'll file a PR for this.
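The discrepancy is easy to reproduce in isolation: for a memoryview whose items are wider than one byte, len() counts items along the first dimension while nbytes counts bytes. A minimal stdlib-only illustration:

```python
import array

buf = array.array("d", range(1024))    # 1024 float64 ("double") values
mv = memoryview(buf)

assert len(mv) == 1024                 # len() counts elements...
assert mv.nbytes == 1024 * 8           # ...nbytes counts bytes (8 per float64)
assert len(mv.cast("B")) == mv.nbytes  # casting to unsigned bytes makes the two agree
```

That factor of 8 per float64 element matches the ~8x gap between spill_size and buf.spilled_total.disk observed above.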

@ncclementi (Member)

That line was suggested by Guido when we were working on this, so I'm sure he'll have an idea of what could be happening here. I know he is on PTO, but he would be a good person to review this. He will be back next week.

@crusaderky (Collaborator)

I was not aware that len(memoryview) != memoryview.nbytes. Good catch! No need to wait for my return.
