Aggregating seemingly different latent spaces.
Set up the development environment:

```bash
git clone git@github.com:crisostomi/latent-aggregation.git
cd latent-aggregation
conda env create -f env.yaml
conda activate la
pre-commit install
```
Run the pre-commit checks over the whole codebase to verify the setup:

```bash
pre-commit run --all-files
```
We use HuggingFace Datasets throughout the project. Assuming you already have a HF account (create one if you don't), log in via

```bash
huggingface-cli login
```

which will prompt you to either create a new token or paste an existing one.
Re-install the project in editable mode:

```bash
pip install -e '.[dev]'
```
Each experiment `exp_name` in `part_shared_part_novel`, `same_classes_disj_samples`, `totally_disjoint` has three scripts (an end-to-end sketch for one experiment follows this list):

- `prepare_data_${exp_name}.py` divides the data into tasks according to what the experiment expects;
- `run_${exp_name}.py` trains the task-specific models and uses them to embed the data for each task;
- `analyze_${exp_name}.py` obtains the results for the experiment.
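The following is a minimal end-to-end sketch of that pipeline for a single experiment, here `totally_disjoint`; it only strings together the three scripts above, and the `part_shared_part_novel` walkthrough below covers each step in detail:

```bash
# Sketch: run one experiment end to end. The script paths follow the
# src/la/scripts/ pattern used in the walkthrough below.
exp_name=totally_disjoint

# 1. Split the data into tasks as the experiment expects.
python src/la/scripts/prepare_data_${exp_name}.py

# 2. Train the task-specific models and embed the data for each task.
python src/la/scripts/run_${exp_name}.py

# 3. Obtain the results for the experiment.
python src/la/scripts/analyze_${exp_name}.py
```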
Each script has a corresponding configuration file with the same name in `conf/`.
So, to run the `part_shared_part_novel` experiment, you first have to configure it in `conf/prepare_data_part_shared_part_novel.yaml`. In this case, you have to choose values for `num_shared_classes` and `num_novel_classes_per_task`.
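As a sketch, the relevant options might look like the excerpt below; the two key names come from this README, but their exact nesting and the surrounding fields in the real `conf/prepare_data_part_shared_part_novel.yaml` may differ:

```yaml
# Hypothetical excerpt of conf/prepare_data_part_shared_part_novel.yaml.
# Only the two option names are taken from this README; the values and
# their placement are illustrative assumptions.
num_shared_classes: 5
num_novel_classes_per_task: 2
```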
Now you will prepare the data via

```bash
python src/la/scripts/prepare_data_part_shared_part_novel.py
```
This will populate the `data/${dataset_name}/part_shared_part_novel/` folder. Then you'll embed the data by running

```bash
python src/la/scripts/run_part_shared_part_novel.py
```
so that you now have the encoded data in `data/${dataset_name}/part_shared_part_novel/S${num_shared_classes}_N${num_novel_classes_per_task}`.
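If you want to inspect these latent spaces by hand, here is a minimal sketch. It assumes the encoded data is stored with HuggingFace Datasets' `save_to_disk` under the folder above (an assumption consistent with the project's use of HF Datasets; adapt the path and loading to what the run script actually writes):

```python
from datasets import load_from_disk

# Hypothetical values; substitute your dataset and the settings you configured.
dataset_name = "cifar100"
num_shared_classes = 5
num_novel_classes_per_task = 2

# Path layout taken from this README.
root = (
    f"data/{dataset_name}/part_shared_part_novel/"
    f"S{num_shared_classes}_N{num_novel_classes_per_task}"
)

# Returns a Dataset or a DatasetDict, depending on how the data was saved.
encoded = load_from_disk(root)
print(encoded)
```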
Having all the latent spaces, you can now run the actual experiment and collect the results by running

```bash
python src/la/scripts/analyze_part_shared_part_novel.py
```
The results can now be found in `results/part_shared_part_novel`.