Scrat

Persistent Caching of Expensive Function Results

Get Started

Install with pip install scrat
Initialize stash scrat init
Start saving time:

import scrat
import time

@scrat.stash()
def expensive_function(param_1):
    time.sleep(3)
    return param_1

expensive_function(1)  # <- function called
expensive_function(1)  # <- function not called the result is recovered from stash
expensive_function(2)  # <- function called again beacuse the parameters changed

Features

Seamlessly stores the results of expensive functions to disk for future reuse.
Automatically re-evaluates the function if the parameters or function code have changed, ensuring up-to-date results.
Saves any result using the pickle.
Improved storage of pandas DataFrames, Series, and Numpy arrays.
Customizable support for alternative serializers.
Flexible parameter hashing mechanism to efficiently handle any parameter type.
Command-line interface (CLI) for convenient control and management of the caching functionality.

Similar Projects

lru_cache

Great and fast memoize provided by the standard library functools, unfurtunately results are stored in memory so they can't be reused in different runs.

cachetools

Provides alternatives to lru_cache but it also works in-memory.

Joblib

Joblib is a stablished library that provides great functionality for parallelization and caching. The Memory module provides an excelent alternative to Scrat, but it does have some limitations:

Hard to avoid using pickle
Lack of options to control the cache size and policies
Lack of tools to inspect and cleanup the cache

These are the problems that scrat aims to improve, however, I'd recommend using Joblib in production since it's much more mature than Scrat at the moment.

Concepts

Scrat is a famous pre-historic squirrel with some bad luck
Stash is composed of a folder where results are saved and a database to index them
A Nut is one of the entries in the database
The Squirrel is in charge of fetching and stashing the Nuts
Serializer dumps results to files and load them back to memory
Hasher creates unique hashes for a parameter value
HashManager coordinates hashes of all arguments and functon code

Development Setup

Clone this repo
Install pyenv.
Install the python version used for development running pyenv install in the root of this repository.
Install poetry. Version 1.5.1 is recommended.
Run this command to make sure poetry uses the right python version poetry env use $(which python)
Install project and dependencies with poetry install
Run tests with poetry run pytest or activate the virtualenv with poetry shell and then run pytest

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
.github/workflows		.github/workflows
.vscode		.vscode
docs		docs
examples		examples
scrat		scrat
tests		tests
.flake8		.flake8
.gitignore		.gitignore
.python-version		.python-version
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
LICENSE		LICENSE
README.md		README.md
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scrat

Get Started

Features

Similar Projects

lru_cache

cachetools

Joblib

Concepts

Development Setup

About

Releases

Languages

License

javiber/scrat

Folders and files

Latest commit

History

Repository files navigation

Scrat

Get Started

Features

Similar Projects

lru_cache

cachetools

Joblib

Concepts

Development Setup

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Languages