Fast, unopinionated, minimalist cache decorator for python.
import httpx
from aquiche import alru_cache
@alru_cache(wrap_async_exit_stack=True)
async def get_client() -> httpx.AsyncClient:
return httpx.AsyncClient(base_url="https://api.data.io")
@alru_cache(expiration="10minutes")
async def get_data(id: str) -> dict:
client = await get_client()
res = await client.get(f"/data/{id}")
return res.json()
async def main() -> None:
data = await get_data("id1")
print("Received the data!!!")
This is a python module available through the pypi registry.
Before installing, download and install Python. Python 3.8 or higher is required.
Installation is done using the
pip install
command:
$ pip install aquiche
- Robust caching
- Safe to use with both async and multithreaded applications
- Works with both "sync" functions and async functions/coroutines
- No cache stampede/cascading failure - no multiple redundant calls when your cache is being hit at the same time
- Wide range of options for setting the expiration
- Negative cache support
- Focus on high performance
- Contains typings
- High test coverage
The decorator is very similar to functools.lru_cache
.
Decorator to wrap a function with a memoizing callable that saves up to the maxsize most recent calls. It can save time when an expensive or IO bound function is periodically called with the same arguments. The decorator works both with sync and async functions. It is safe to use in multithreaded and async applications.
Since a dictionary is used to cache results, the positional and keyword arguments to the function must be hashable or the key parameter has to be provided.
Distinct argument patterns may be considered to be distinct calls with separate cache entries. For example, f(a=1, b=2) and f(b=2, a=1) differ in their keyword argument order and may have two separate cache entries. If you provide the key parameter the str.format()
is used to generate the key. Args
and kwargs
are passed to the format function. Named args
are passed to the function both as args
and they are included in the kwargs
as well. The special option "single key" can be used for the functions with no arguments or with only self/cls argument. In that case the key will have the same value no matter the args
or kwargs
.
The wrapped function is instrumented with a cache_parameters()
function that returns a dataclass showing the values for all the set parameters. This is for information purposes only. Mutating the values has no effect.
To help measure the effectiveness of the cache and tune the maxsize parameter, the wrapped function is instrumented with a cache_info()
function that returns a dataclass showing hits, misses, maxsize, current_size, and last_expiration_check. This function has to be awaited when the decorator is wrapping an async function.
The decorator also provides a cache_clear()
function for clearing or invalidating the cache. The function has to be awaited when the decorator is wrapping an async function.
Expired items can be removed from the cache with the use of remove_expired()
function. The function has to be awaited when the decorator is wrapping an async function.
The original underlying function is accessible through the wrapped attribute. This is useful for introspection, bypassing the cache, or for rewrapping the function with a different cache.
The cache keeps references to the arguments and returns values until they age out of the cache or until the cache is cleared. The cache can be periodically checked for expired items.
If a method is cached, the self instance argument is included in the cache. If you are caching a method it is strongly recommended to add __eq__
and __hash__
methods to the class or set the key parameter.
The decorator can simply be used without any parameters, both @alru_cache
and @alru_cache()
are supported. If you want to modify the caching behavior below you can find the list of parameters that can be set.
The cache can be enabled or disabled. It is not checked actively during the runtime! You cannot update the value once the function is wrapped. If you want to check actively if the cache is enabled use the expiration param.
@alru_cache(enabled=True|False)
def cache_function(value: str) -> int:
return len(value)
It is possible to override the default cache key generation either with the custom template or with one of the special options. If the custom template is provided the str.format()
will be used to generate the key. Both the *args
and **kwargs
are passed to the str.format()
. Kwargs
are extended with the named args
and the defaults are applied.
If you want the cache key to always result in the same value the special option Key.SINGLE_KEY
can be used. This option is only recommended for functions where it is desired to ignore args
and kwargs
. The size of this cache will never exceed 1.
from aquiche import alru_cache, Key
@alru_cache(key="env:{environment.id}:id:{user[id]}")
async def get_username(environment: object, user: dict) -> str:
return user["username"]
@alru_cache(key=Key.SINGLE_KEY)
async def get_user_count() -> int:
return 10
The maxsize param behaves the same as in functools.lru_cache
. If set to None
the cache can grow without bound. If set, the memoizing decorator saves up to the maxsize most recent calls. It is set to None
by default.
from aquiche import alru_cache
@alru_cache(maxsize=5)
async def cache_function_limited(value: str) -> int:
return len(value)
@alru_cache(maxsize=None)
async def cache_function_unlimited(value: str) -> int:
return len(value)
It is possible to set when the function call expires. Expired functions call will automatically be called again the next time the function is being called. To save memory it is possible to set expired_items_auto_removal_period
to automatically remove items after a certain period of time. If set to None
expired items are not removed but stay in the cache. It is set to None
by default. The decorated function still needs to be called for the removal to happen - the expiration removal task is not automatically scheduled. It is recommended to rather use maxsize
to limit the memory consumption and keep the param set to None
.
Possible expiration types/values:
None
, the function call never expiresTrue|False
, if set toTrue
the value is expired and the function will be called againint|float
, based on the value it will either be treated as thetimedelta
or unix timestampdatetime
, TTL (Time To Live)/the expiration date, if it contains no timezone the UTC timezone is automatically addedtime
, the function call will expire today at this timetimedelta
, TTR (Time To Refresh/refresh interval), the function call will refresh the value eachtimedelta
perioddatetime|time|timedelta
string, the string that can be parsed todatetime|time|timedelta
, supported formats: ISO 8601, human-readable formats, uses the same (or nearly the same) resolution as pydantic- Data pointer string e.g.
$.response.data.expiry
, the pointer can point to any of the other expiration values - Function, the
CachedItem
object (for more information see the example below) will be passed as an argument to the function. It is possible to return any of the other expiration types from the function - Async function/coroutine, the
CachedItem
object (for more information see the example below) will be passed as an argument to the function. It can only be used when decorating an async function/coroutine. It is possible to return any of the other expiration types from the function
The expiration types like True, False or datetime are best used in combination with a data pointer or a function/coroutine. It is strongly advised not to set these directly in the decorator.
The expiration is set to None
by default.
from aquiche import alru_cache
from datetime import datetime, timedelta
# Datetime expiration
# The record should be removed after 1 day (the function needs to be called)
@alru_cache(expiration="2012-04-23T09:15:00Z", expired_items_auto_removal_period="1 day")
async def get_data(value: str) -> int:
return len(value)
# Datetime expiration
@alru_cache(
expiration=datetime(2017, 5, 5, 19, 27, 24),
expired_items_auto_removal_period="1 hour"
)
async def get_data(value: str) -> int:
return len(value)
# Date expiration, expires at midnight (UTC)
@alru_cache(
expiration="2012-04-56",
expired_items_auto_removal_period=timedelta(seconds=300),
)
async def get_data(value: str) -> int:
return len(value)
# Time expiration, expires the same day
# If the time does not contain timezone it defaults to UTC
@alru_cache(expiration="11:05:00Z", expired_items_auto_removal_period="10 minutes")
async def get_data(value: str) -> int:
return len(value)
"""
Another possible datetime/date/time values:
# values in seconds
'1494012444.883309' -> datetime(2017, 5, 5, 19, 27, 24, 883_309, tzinfo=timezone.utc)
1_494_012_444.883_309 -> datetime(2017, 5, 5, 19, 27, 24, 883_309, tzinfo=timezone.utc)
'1494012444' -> datetime(2017, 5, 5, 19, 27, 24, tzinfo=timezone.utc)
b'1494012444' -> datetime(2017, 5, 5, 19, 27, 24, tzinfo=timezone.utc)
1_494_012_444 -> datetime(2017, 5, 5, 19, 27, 24, tzinfo=timezone.utc)
# values in ms
'1494012444000.883309' -> datetime(2017, 5, 5, 19, 27, 24, 883, tzinfo=timezone.utc)
'-1494012444000.883309' -> datetime(1922, 8, 29, 4, 32, 35, 999117, tzinfo=timezone.utc)
1_494_012_444_000 -> datetime(2017, 5, 5, 19, 27, 24, tzinfo=timezone.utc)
'2012-04-23T09:15:00' -> datetime(2012, 4, 23, 9, 15)
'2012-4-9 4:8:16' -> datetime(2012, 4, 9, 4, 8, 16)
'2012-04-23T09:15:00Z' -> datetime(2012, 4, 23, 9, 15, 0, 0, timezone.utc)
'2012-4-9 4:8:16-0320' -> datetime(2012, 4, 9, 4, 8, 16, 0, create_tz(-200))
'2012-04-23T10:20:30.400+02:30'
-> datetime(2012, 4, 23, 10, 20, 30, 400_000, create_tz(150))
'2012-04-23T10:20:30.400+02'
-> datetime(2012, 4, 23, 10, 20, 30, 400_000, create_tz(120))
'2012-04-23T10:20:30.400-02'
-> datetime(2012, 4, 23, 10, 20, 30, 400_000, create_tz(-120))
b'2012-04-23T10:20:30.400-02'
-> datetime(2012, 4, 23, 10, 20, 30, 400_000, create_tz(-120))
datetime(2017, 5, 5) -> datetime(2017, 5, 5)
0 -> datetime(1970, 1, 1, 0, 0, 0, tzinfo=timezone.utc)
d1494012444.883309' -> date(2017, 5, 5)
d'1494012444.883309' -> date(2017, 5, 5)
d_494_012_444.883_309 -> date(2017, 5, 5)
d1494012444' -> date(2017, 5, 5)
d_494_012_444 -> date(2017, 5, 5)
d -> date(1970, 1, 1)
d2012-04-23' -> date(2012, 4, 23)
d'2012-04-23' -> date(2012, 4, 23)
d2012-4-9' -> date(2012, 4, 9)
date(2012, 4, 9) -> date(2012, 4, 9)
'09:15:00' -> time(9, 15)
'10:10' -> time(10, 10)
'10:20:30.400' -> time(10, 20, 30, 400_000)
b'10:20:30.400' -> time(10, 20, 30, 400_000)
'4:8:16' -> time(4, 8, 16)
time(4, 8, 16) -> time(4, 8, 16)
3610 -> time(1, 0, 10)
3600.5 -> time(1, 0, 0, 500000)
86400 - 1 -> time(23, 59, 59)
'11:05:00-05:30' -> time(11, 5, 0, tzinfo=create_tz(-330))
'11:05:00-0530' -> time(11, 5, 0, tzinfo=create_tz(-330))
'11:05:00Z' -> time(11, 5, 0, tzinfo=timezone.utc)
'11:05:00+00' -> time(11, 5, 0, tzinfo=timezone.utc)
'11:05-06' -> time(11, 5, 0, tzinfo=create_tz(-360))
'11:05+06' -> time(11, 5, 0, tzinfo=create_tz(360))
"""
from aquiche import alru_cache
from datetime import timedelta
# Refreshes a value every 10 minutes
@alru_cache(expiration=timedelta(minutes=10))
async def cache_function(value: str) -> int:
return len(value)
# Refreshes a value every 1 day, 42 minutes and 6 seconds
# This can be shortened to `1d42m6s`
@alru_cache(expiration="1 day 42 minutes 6 seconds")
async def cache_function(value: str) -> int:
return len(value)
# Refreshes a value every minute
@alru_cache(expiration="1m")
async def cache_function(value: str) -> int:
return len(value)
"""
Other possible refresh/timedelta values
timedelta(seconds=30), timedelta(seconds=30)
'30' -> timedelta(seconds=30)
30 -> timedelta(seconds=30)
30.1 -> timedelta(seconds=30, milliseconds=100)
9.9e-05 -> timedelta(microseconds=99)
minutes seconds
'15:30' -> timedelta(minutes=15, seconds=30)
'5:30' -> timedelta(minutes=5, seconds=30)
hours minutes seconds
'10:15:30' -> timedelta(hours=10, minutes=15, seconds=30)
'1:15:30' -> timedelta(hours=1, minutes=15, seconds=30)
'100:200:300' -> timedelta(hours=100, minutes=200, seconds=300)
days
'4 15:30' -> timedelta(days=4, minutes=15, seconds=30)
'4 10:15:30' -> timedelta(days=4, hours=10, minutes=15, seconds=30)
fractions of seconds
'15:30.1' -> timedelta(minutes=15, seconds=30, milliseconds=100)
'15:30.01' -> timedelta(minutes=15, seconds=30, milliseconds=10)
'15:30.001' -> timedelta(minutes=15, seconds=30, milliseconds=1)
'15:30.0001' -> timedelta(minutes=15, seconds=30, microseconds=100)
'15:30.00001' -> timedelta(minutes=15, seconds=30, microseconds=10)
'15:30.000001' -> timedelta(minutes=15, seconds=30, microseconds=1)
b'15:30.000001' -> timedelta(minutes=15, seconds=30, microseconds=1)
iso_8601
'P4Y', errors.DurationError
'P4M', errors.DurationError
'P4W', errors.DurationError
'P4D' -> timedelta(days=4)
'P0.5D' -> timedelta(hours=12)
'PT5H' -> timedelta(hours=5)
'PT5M' -> timedelta(minutes=5)
'PT5S' -> timedelta(seconds=5)
'PT0.000005S' -> timedelta(microseconds=5)
b'PT0.000005S' -> timedelta(microseconds=5)
'1m 10h 10s' -> timedelta(seconds=36070)
'1minute 10hour 10second' -> timedelta(seconds=36070)
'1minutes 10hours 10seconds' -> timedelta(seconds=36070)
'1 minutes 2 days 10 hours 10 seconds' -> timedelta(seconds=208870)
'1 m 2 d 10 h 10 s' -> '1 minute 2 days 10 hours 10 seconds'
-> timedelta(seconds=208870)
'1 minute 2 day 10 hours 10 seconds' -> timedelta(seconds=208870)
"""
The pointer can point to any expiration value. The data pointer can resolve either dictionaries or objects. If you need more flexibility or you are facing issues with the pointer it is recommended to use function/coroutine expiration. That way you ensure you are returning the right value.
from aquiche import alru_cache
from typing import Any
@alru_cache(expiration="$.token.expiration")
async def cache_function(value: str) -> Any:
expiry = True
if value == "user":
expiry = "30 minutes"
if value == "service":
expiry = "12 hours"
if value == "role":
expiry = "2022-01-30T00:00:00+0000"
return {"token": {"expiration": expiry}}
It is possible to use both functions and coroutines to set the expiration. If the decorated function is a sync one then it is possible to only use the sync function to set the expiration. CachedItem
object is passed to the expiration function.
from aquiche import alru_cache, CachedItem
"""
@dataclass
class CachedItem:
value: Any -> the returned value or the raised exception*
last_fetched: datetime -> when was the value last fetched
is_error: bool -> set to true if the value contains a raised exception*
*Only possible when the negative cache is enabled
"""
async def is_item_expired_async(item: CachedItem) -> bool:
return item.value > 1
def is_item_expired(item: CachedItem) -> str | datetime:
if item.value > 10:
return "10s"
if item.value > 5:
return datetime(2017, 5, 5, 19, 27, 24, tzinfo=timezone.utc)
return "1 day"
# Async function/Coroutine
# It can be used with both sync and async function expirations
@alru_cache(expiration=is_item_expired_async)
async def cache_function_async_1(value: str) -> int:
return len(value)
@alru_cache(expiration=is_item_expired)
async def cache_function_async_2(value: str) -> int:
return len(value)
# Sync function can only be used with sync function expiration
@alru_cache(expiration=is_item_expired)
def cache_function_sync(value: str) -> int:
return len(value)
# Using this function would throw an error
# @alru_cache(expiration=is_item_expired_async)
# def cache_function_sync(value: str) -> int:
# return len(value)
The param wrap_async_exit_stack
simplifies caching of the async clients. When the function is called the returned client(s) (or any other value for that matter) enters the async context. When the client expires or the cache is cleared, the context is automatically cleaned up and renewed if needed. It is recommended to manually clear the cache on the shutdown so the context is closed.
import httpx
from database_client import DatabaseClient
from aquiche import alru_cache, await_exit_stack_close_operations, clear_all
@alru_cache(wrap_async_exit_stack=True)
async def get_client() -> httpx.AsyncClient:
return httpx.AsyncClient(base_url="https://api.data.io")
client = await get_client()
# Perform the needed actions
await client.post("/raspberry_pi")
# Once done clear the cache
await get_client.clear_cache()
"""
In case of returning multiple clients a list of data pointers can be used to wrap them
If the data pointer does not point to any value from the result then the error is thrown
To prevent the error from happening you can append suffix :ignore_missing to the pointer
"""
@alru_cache(
wrap_async_exit_stack=["$.clients.database:ignore_missing", "$.clients.http"]
)
def get_clients() -> dict:
return {
"token": "p@ss123",
"clients": {
# Both clients will use the same AsyncExitStack context
"http": httpx.AsyncClient(),
"database": DatabaseClient()
}
}
"""
When the cache is cleaned the async context managers will be closed. If you want
to perform a graceful shutdown you can await the close operations. The command has an
optional timeout.
"""
async def shutdown() -> None:
await clear_all()
await await_exit_stack_close_operations()
async def shutdown_timeout() -> None:
await clear_all()
await await_exit_stack_close_operations("1minute")
It is possible to enable the negative caching. If the negative caching is enabled the errors are not raised but rather cached as if they were the actual results of the function call. The negative cache has a separate expiration which is by default set to 10 seconds. It is recommended to always set the negative cache expiration, preferably to a short duration. The negative caching is disabled by default.
from aquiche import alru_cache
# the function can now return exceptions as well
@alru_cache(negative_cache=True, negative_expiration="30seconds")
async def cache_function(value: str) -> int | Exception:
if value == "invalid":
raise Exception("Invalid data")
return len(value)
await cache_function("invalid")
result = await cache_function("invalid")
assert isinstance(error, Exception)
assert str(error) == "Invalid data"
The function calls can be retried if they fail. To retry the function call retry_count
can be set to desired number of retries. The function call is retried with exponential backoff. To set the exponential backoff use the backoff_in_seconds
param. Both retry_count
and backoff_in_seconds
are set to 0 by default.
from aquiche import alru_cache
import random
@alru_cache(negative_cache=True, retry_count=3, backoff_in_seconds=2)
async def cache_function(value: str) -> int | Exception:
if random.randint(1, 10) > 2:
raise Exception("Doom has fallen upon us")
return len(value)
The cache can be cleared either individually or all cached functions can be cleared with a clear all function. The contexts are automatically cleaned up on the cache clear if they were created with the use of wrap_async_exit_stack
param. If you are only using the decorator with the sync functions there is a sync version of the clear all function which only clears the "sync" function caches.
To remove the expired items remove_expired()
function can be called.
from aquiche import alru_cache, clear_all, clear_all_sync
@alru_cache(wrap_async_exit_stack=True)
async def cache_function_async(value: str) -> Client:
return Client()
@alru_cache(retry_count=3, backoff_in_seconds=2)
def cache_function_sync(value: str) -> int:
return len(value)
# Clears individual function caches
await cache_function_async.clear_cache()
cache_function_sync.clear_cache()
# Clears both cache_function_sync and cache_function_async's cache
await clear_all()
# Clears only cache_function_sync's cache
clear_all_sync()
# Removes expired items
await cache_function_async.remove_expired()
cache_function_sync.remove_expired()