APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding [ICLR 2025]

[Paper] | [Project]

TL;DR

We introduce Adaptive Parallel Encoding (APE), a method for context-augmented generation that improves both efficiency and performance.

Usage

Environment Setup

conda create -yn ape python=3.10
conda activate ape

pip install -r requirements.txt
python setup.py install

Run Context-augmented Question Answering with APE

By default, the temperature and scaling factor are set to 0.9, preserving over 90% performance on few-shot tasks.

CUDA_VISIBLE_DEVICES=0 python demo_ape.py --model llama3-8b-instruct
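
This README does not spell out how the temperature and scaling factor enter the computation. As a generic illustration only, and not APE's actual implementation, the sketch below shows what a softmax temperature below 1 (such as the default 0.9) does to a set of attention scores: it sharpens the distribution toward the highest-scoring entries. The function name and the example scores are hypothetical, and the scaling factor is not modeled here.

```python
import math


def softmax_with_temperature(scores, temperature=1.0):
    """Softmax over raw scores after dividing them by a temperature.

    Hypothetical helper for illustration; it is not part of the APE code base.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [s / temperature for s in scores]
    peak = max(scaled)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Made-up query-key scores, used only to show the effect of the temperature.
scores = [2.0, 1.0, 0.5, 0.1]
baseline = softmax_with_temperature(scores, temperature=1.0)
sharpened = softmax_with_temperature(scores, temperature=0.9)

# With a temperature below 1, the largest weight grows and the rest shrink.
print("T=1.0:", [round(w, 3) for w in baseline])
print("T=0.9:", [round(w, 3) for w in sharpened])
```

For how the temperature and scaling factor are actually applied during parallel encoding, refer to the paper and the code in the ape directory.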

Experiments

To reproduce the APE results for the retrieval-augmented generation (RAG) and in-context learning (ICL) tasks in Section 5 of the paper, follow the instructions and use the code provided in the experiments directory.

TODOs

We will release the code and data in the following order. Please stay tuned!

Release core code of APE, including Llama-3, Llama-3.1, Mistral-v0.3, and Gemma-2.
Release RAG and ICL evaluation code.
Release the APE context-augmented QA demo.
Incorporate APE into an efficient inference engine.

Citation

If you find APE useful or relevant to your project or research, please cite our paper:

@inproceedings{yang2025ape,
  title={APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding},
  author={Yang, Xinyu and Chen, Tianqi and Chen, Beidi},
  booktitle={ICLR 2025},
  year={2025}
}
