TACO

Official implementation of TACO paper. Our code is built on Openmatch.

Setup

pip install -r requirements.txt
pip install -e .

Install Faiss-gpu

For A100 CUDA-11, we need to install faiss-gpu via conda or specific pip wheels

via conda

# check your cudatoolkit
conda search cudatoolkit
# install faiss-gpu 
conda install -c pytorch faiss-gpu cudatoolkit=11.3.1

via pip

# download faiss-gpu pip cuda-11 wheel
wget https://github.com/kyamagu/faiss-wheels/releases/download/v1.7.3/faiss_gpu-1.7.3-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl

# pip install
python -m pip install faiss_gpu-1.7.3-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl

Supported Datasets and Setup DATA

supported datasets
- KILT Benchmark
- MSMARCO
- ZESHEL
- BEIR
setup data scripts (kilt for example)


./scripts/kilt/shells/setup_data.sh

For ZESHEL, you need to download the raw data and bm25 candidates from here , then run the setup_data.sh script

Supported features:

standard dense retrieval
- hard negative mining: unify ANCE with Hard-nce style, the only difference is optimizer state
multi-task retrieval
- PCG
- CGD
- GradNorm
- TACO
- Naive
- split query encoder
- query adapter

example Scripts

T5-ANCE dense retrieval

# warmup with bm25 candidates first
./scripts/kilt/shells/warmup_dr.sh

# ANCE iterations
./scripts/kilt/shells/ance_dr.sh

Multi_task dense retrieval

# warmup with bm25 candidates first
./scripts/kilt/shells/warmup_mt.sh 

# ANCE iterations
./scripts/kilt/shells/ance_mt.sh [your multi_task method (naive, pcg, gn, cgd
, taco]

Name		Name	Last commit message	Last commit date
Latest commit History 314 Commits
scripts		scripts
src/taco		src/taco
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TACO

Setup

Install Faiss-gpu

Supported Datasets and Setup DATA

Supported features:

example Scripts

About

Releases

Packages

Languages

License

WenzhengZhang/TACO

Folders and files

Latest commit

History

Repository files navigation

TACO

Setup

Install Faiss-gpu

Supported Datasets and Setup DATA

Supported features:

example Scripts

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages