This repository contains the source code for producing the augmented ARC datasets, training LLMs and testing them on the AugARC Benchmark.
AugARC provides an easy and unified benchmark for evaluating LLMs' 3-shot accuracy on reasoning tasks. In AugARC, each ARC task starts with a textual description explaining the format of the problem, and each ARC grid is represented as a 2D matrix of numbers.
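As a rough illustration, a single grid can be rendered as rows of numbers. The helper below is a minimal sketch; the exact prompt wording and serialisation used by AugARC may differ, and `grid_to_text` is a hypothetical name.

```python
def grid_to_text(grid: list[list[int]]) -> str:
    # Each cell holds a colour index (0-9); each grid row becomes one line of numbers.
    return "\n".join(" ".join(str(cell) for cell in row) for row in grid)

example_grid = [
    [0, 1, 0],
    [1, 1, 1],
    [0, 1, 0],
]
print(grid_to_text(example_grid))
# 0 1 0
# 1 1 1
# 0 1 0
```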
In AugARC, the first prediction is based on the normal ARC task, whereas the second and third are based on 90° and 270° clockwise-rotated versions of the same task. The AugARC benchmark is tailored towards the architecture of LLMs, as these models process inputs in an auto-regressive, sequential manner. By rotating the ARC tasks, LLMs are presented with a different sequence of numbers (2D matrices) that encodes the same abstract logic.
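The sketch below shows one way to produce the rotated variants, assuming grids are plain 2D lists and that every input and output grid in a task (train and test pairs alike) is rotated together; the function names and task structure are illustrative, not the repository's API.

```python
import numpy as np

def rotate_grid(grid: list[list[int]], degrees_clockwise: int) -> list[list[int]]:
    """Rotate a 2D grid clockwise by a multiple of 90 degrees."""
    # np.rot90 rotates counter-clockwise, so negate the quarter-turn count.
    return np.rot90(np.array(grid), k=-(degrees_clockwise // 90)).tolist()

def rotate_task(task: dict, degrees_clockwise: int) -> dict:
    """Apply the same rotation to every grid in an ARC task (train and test pairs)."""
    return {
        split: [
            {key: rotate_grid(grid, degrees_clockwise) for key, grid in pair.items()}
            for pair in pairs
        ]
        for split, pairs in task.items()
    }

# The three AugARC predictions then correspond to the base task,
# rotate_task(task, 90) and rotate_task(task, 270).
```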
Transformations on an ARC task to obtain its Augmented ARC variants are visualised below.
Base | 90° Rotated | 270° Rotated |
---|---|---|
![]() | ![]() | ![]() |
All the augmented ARC data is also available from:
If you use our data, please cite our paper:

AugARC: Augmented Abstraction and Reasoning Benchmark for Large Language Models. Kiril Bikov, Mikel Bober-Irizar, Soumya Banerjee. AAAI Workshop on Preparing Good Data for Generative AI: Challenges and Approaches. https://openreview.net/pdf?id=cgUTWzgvCj
Kiril Bikov and Soumya Banerjee