HashHop Long Context Evaluation

This repository contains the code for HashHop, our long context architecture benchmark.

Installation Guide

Prerequisites

Git
Python 3.9+
Poetry

Steps

Clone the repository:

git clone git@github.com:magicproduct/hash-hop.git
cd hash-hop

Install dependencies:
```
poetry install
```

Generating Evaluation Data

The MultiHopEval.make_one function generates a MultiHopSample object which can be used for either evaluation (via the targets field) or for training models on the multihop task (via the completion field).

Usage Example

from hashhop import MultiHopEval

CHARS_PER_TOKEN = 3
datapoint = MultiHopEval.make_one(
    n_chars_problem=int(1_000_000 * CHARS_PER_TOKEN),
    num_queries=5,
    hops=2,
    hash_pair_str_length=16,
    chain_of_thought=False,
)
print(datapoint.prompt)
print(datapoint.completion)
print(datapoint.targets)

Parameters

n_chars_problem: int
- The size of the problem in characters.
num_queries: int
- The number of queries in the completion.
hops: int
- The number of hops in the reasoning chain.
hash_pair_str_length: int
- The number of characters per hash.
chain_of_thought: bool
- If True, the model is asked to produce H1 -> H2 -> H3.
- If False, the model is asked to produce H1 -> H3.

Output

prompt: str
- Contains the shuffled hash pairs.
(Used for training) completion: str
- The queries and targets in string format
(Used for evaluation) targets: Dict[str, str]
- Contains query-ground truth pairs in structured format
- If chain of thought is false, will contain {H1: H3} (e.g. 'HETyxiWTFSVUYega': 'pChfybAJRUBmdAGC')
- If chain of thought is true, will contain full chain {H1: H2 = H3} (e.g. 'KeiVcwXpnYIWLPmk': 'GmmNmICdvEErHgei = JhgvBFdYCnLVZBoy')

Citation

@misc{magic2024hashhop,
  author = {Magic},
  title = {HashHop: Long Context Evaluation},
  year = {2024},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/magicproduct/hash-hop}},
}

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.github		.github
hashhop		hashhop
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
CODEOWNERS		CODEOWNERS
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HashHop Long Context Evaluation

Installation Guide

Prerequisites

Steps

Generating Evaluation Data

Usage Example

Parameters

Output

Citation

License

About

Releases 2

Contributors 5

Languages

License

magicproduct/hash-hop

Folders and files

Latest commit

History

Repository files navigation

HashHop Long Context Evaluation

Installation Guide

Prerequisites

Steps

Generating Evaluation Data

Usage Example

Parameters

Output

Citation

License

About

Resources

License

Stars

Watchers

Forks

Releases 2

Contributors 5

Languages