
Max memory limit to protect from out of memory #32

Closed
rohitjoshi opened this issue Mar 29, 2019 · 11 comments

Comments

@rohitjoshi
Contributor

rohitjoshi commented Mar 29, 2019

Hello,
Is there any way to restrict the maximum memory usage of the LRU cache? E.g., I am expecting 100M key/value pairs, but value lengths vary, and the process gets killed when usage exceeds available memory.

Is there any way to enforce configurable max memory usage even though LRU capacity is not filled?

Maybe an allocator counter could be used: https://doc.rust-lang.org/beta/std/alloc/struct.System.html
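A counter wrapped around the system allocator is one way to observe total heap usage from inside the process. A minimal sketch, assuming a custom wrapper type; the names `CountingAlloc` and `current_heap_bytes` are illustrative, not part of any crate:

```rust
use std::alloc::{GlobalAlloc, Layout, System};
use std::sync::atomic::{AtomicUsize, Ordering};

// Wraps the system allocator and keeps a running total of live heap bytes.
struct CountingAlloc;

static ALLOCATED: AtomicUsize = AtomicUsize::new(0);

unsafe impl GlobalAlloc for CountingAlloc {
    unsafe fn alloc(&self, layout: Layout) -> *mut u8 {
        let p = System.alloc(layout);
        if !p.is_null() {
            ALLOCATED.fetch_add(layout.size(), Ordering::Relaxed);
        }
        p
    }
    unsafe fn dealloc(&self, ptr: *mut u8, layout: Layout) {
        System.dealloc(ptr, layout);
        ALLOCATED.fetch_sub(layout.size(), Ordering::Relaxed);
    }
}

#[global_allocator]
static A: CountingAlloc = CountingAlloc;

// The cache (or an external trigger) could poll this to decide when to evict.
fn current_heap_bytes() -> usize {
    ALLOCATED.load(Ordering::Relaxed)
}

fn main() {
    let before = current_heap_bytes();
    let v = vec![0u8; 1024];
    println!("grew by at least {} bytes", current_heap_bytes() - before);
    drop(v);
}
```

Note this counts every allocation in the process, not just the cache's, so it is better suited to a global "evict when memory is low" policy than to a per-cache limit.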

@rohitjoshi rohitjoshi changed the title Max memory use to protect from out of memory Max memory limit to protect from out of memory Mar 29, 2019
@jeromefroe
Owner

I agree, that would be super helpful! Since the cache is generic over keys and values, my first thought is that the cache could accept a function in its constructor that takes references to a key and a value and returns their combined size in bytes:

let limit = 1 << 16; // 2^16 bytes; note `^` is XOR in Rust, not exponentiation
let mut cache: LruCache<isize, &str> = LruCache::with_max_memory(limit, |k, v| {
    // Calculate the combined size of the key and value in bytes.
    size
});

What do you think?

@rohitjoshi
Contributor Author

That's a good idea. It could take either a maximum memory size or a number of entries/capacity.

@rohitjoshi
Contributor Author

Any plans to prioritize this work?

@jeromefroe
Owner

Hey @rohitjoshi! Sorry, I don't have a clear idea of how we can implement this, so I'm not sure how long it will take, and at the moment I don't have the time to devote the attention it will likely require.

@webmaster128
Contributor

We will probably need this as well, and if this functionality is desired here, I am willing to implement it.

I propose the following API: the fn new(cap: usize) constructor remains unchanged, but the capacity is reinterpreted from an element count to "points". When adding a new element through the existing API, it has a default "cost" of 1 point. A new setter, ::put_with_cost, allows specifying the cost of an item in points. The calling application can then assign cost points based on whatever metric it likes. This might sound overly general, and I don't have a specific need for it, but it would at least allow expressing both element count and memory consumption through the same API.
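The proposal above can be sketched as a small stand-alone cache. `CostLru` and its fields are hypothetical names modeling the proposed API (a `VecDeque` stands in for the crate's intrusive LRU list), not the real crate:

```rust
use std::collections::{HashMap, VecDeque};

// Minimal sketch of the "points" proposal: capacity is a point budget,
// and each entry carries a caller-chosen cost.
struct CostLru<K: std::hash::Hash + Eq + Clone, V> {
    cap: usize,                  // budget in points
    used: usize,                 // points currently spent
    map: HashMap<K, (V, usize)>, // value plus its cost
    order: VecDeque<K>,          // front = least recently used
}

impl<K: std::hash::Hash + Eq + Clone, V> CostLru<K, V> {
    fn new(cap: usize) -> Self {
        CostLru { cap, used: 0, map: HashMap::new(), order: VecDeque::new() }
    }

    // Equivalent of the existing `put`: every entry costs 1 point.
    fn put(&mut self, k: K, v: V) {
        self.put_with_cost(k, v, 1);
    }

    fn put_with_cost(&mut self, k: K, v: V, cost: usize) {
        // Replacing an existing key refunds its old cost.
        if let Some((_, old)) = self.map.remove(&k) {
            self.used -= old;
            self.order.retain(|x| x != &k);
        }
        // Evict from the LRU end until the new entry fits the budget.
        while self.used + cost > self.cap {
            match self.order.pop_front() {
                Some(old_key) => {
                    if let Some((_, c)) = self.map.remove(&old_key) {
                        self.used -= c;
                    }
                }
                None => break, // the entry alone exceeds the budget
            }
        }
        if self.used + cost <= self.cap {
            self.used += cost;
            self.order.push_back(k.clone());
            self.map.insert(k, (v, cost));
        }
    }

    fn len(&self) -> usize {
        self.map.len()
    }
}

fn main() {
    let mut cache = CostLru::new(10);
    cache.put_with_cost("a", vec![0u8; 4], 4);
    cache.put_with_cost("b", vec![0u8; 4], 4);
    cache.put_with_cost("c", vec![0u8; 4], 4); // evicts "a" to stay within 10 points
    println!("{} entries, {} points used", cache.len(), cache.used);
}
```

With a byte-size metric, the caller passes `key_len + value_len` as the cost; with a cost of 1 for every entry, this degenerates to the existing element-count behavior.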

By the way, I don't think the crate can reasonably calculate the size of elements automatically, because values can contain all kinds of indirection to heap-allocated memory. This becomes even harder when two elements have shared ownership of some data (e.g. via Rc).

@jeromefroe
Owner

@webmaster128 I agree that it would be extremely difficult to determine, in a general way, the cost of elements inserted into the cache, so it would be preferable to expose an API based on "points" that allows the caller to set the cost of an element explicitly. I don't think I'll have any time to implement this in the near future, but I would be more than happy to review any pull requests that do! :)

@rohitjoshi
Contributor Author

An alternative I am considering is to enforce a dynamic cache size based on available memory, via an external trigger. E.g., when available memory drops below 10%, set the current length as the cache size and force eviction for new elements. This would be easier to achieve.

Another alternative: as elements are added, increment a running total by the length of each key and value.

@rohitjoshi
Contributor Author

Is it possible to have a dynamic capacity that the application can set, to force eviction before the initial capacity is reached?

e.g. Let's say the initial capacity is 100. When len() is 75, I want to freeze additional growth but still allow inserting new elements and evicting old ones. Alternatively we could resize(), but that would be an expensive operation for latency-sensitive applications.

@jeromefroe
Owner

Hi @rohitjoshi! Currently, the resize method is:

pub fn resize(&mut self, cap: usize) {
    // return early if capacity doesn't change
    if cap == self.cap {
        return;
    }

    while self.map.len() > cap {
        self.remove_last();
    }
    self.map.shrink_to_fit();

    self.cap = cap;
}

If I understand your question correctly, it sounds like you want an option to skip calling self.map.shrink_to_fit()? Is that accurate? If so, I think it would be straightforward to add that as an option.
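That option could take the form of a `resize` variant with a flag controlling the shrink step. A toy stand-in, assuming a `Cache` struct and a `resize_with_shrink` method that are illustrative only, not the crate's actual internals (a `Vec` stands in for the intrusive LRU list):

```rust
use std::collections::HashMap;

// Toy model of the cache internals: lowering `cap` evicts entries but can
// leave the map's allocation in place, so growing back later needs no
// reallocation.
struct Cache {
    cap: usize,
    map: HashMap<u32, u32>,
    order: Vec<u32>, // oldest key first
}

impl Cache {
    fn resize_with_shrink(&mut self, cap: usize, shrink: bool) {
        // Return early if the capacity doesn't change.
        if cap == self.cap {
            return;
        }
        // Evict the oldest entries until we fit the new capacity.
        while self.map.len() > cap {
            if let Some(oldest) = self.order.first().cloned() {
                self.order.remove(0);
                self.map.remove(&oldest);
            } else {
                break;
            }
        }
        if shrink {
            // The expensive step: reallocates and rehashes the map.
            self.map.shrink_to_fit();
        }
        self.cap = cap;
    }
}

fn main() {
    let mut c = Cache { cap: 8, map: HashMap::new(), order: Vec::new() };
    for k in 0..8 {
        c.map.insert(k, k);
        c.order.push(k);
    }
    c.resize_with_shrink(4, false); // evict the 4 oldest, keep the allocation
    println!("{} entries left", c.map.len());
}
```

With `shrink = false`, shrinking the cache is just a bounded number of evictions plus a field update, which matches the latency-sensitive use case above.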

@rohitjoshi
Contributor Author

@jeromefroe that's correct. shrink_to_fit() would be an expensive operation and would hold the lock for a longer duration in a multithreaded environment.
The idea is to make self.map.len() the new self.cap so there is no more memory growth (assuming new keys/values are the same size as existing ones), while still allowing inserts that evict old records.

@rohitjoshi
Contributor Author

I have worked around this by getting the current len() and resizing to that capacity, which solved my problem. I am closing this for now.
