When a model is trained, one might want to access one of its submodules in another task, e.g.
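The situation can be sketched with stand-in classes (all names here, `Model`, `Encoder`, `load_checkpoint`, are illustrative, not actual xpmir APIs): a first task trains the model and saves a checkpoint, and a second task builds a fresh configuration just to get at the `encoder` submodule.

```python
import random

class Encoder:
    def __init__(self):
        # Parameters are randomly initialized until a checkpoint is restored.
        self.weights = [random.random() for _ in range(4)]
        self.loaded = False

class Model:
    def __init__(self):
        self.encoder = Encoder()

    def load_checkpoint(self, path):
        # Stand-in for restoring trained weights from disk.
        self.encoder.weights = [0.0] * 4
        self.encoder.loaded = True

# Task 1: train the model (training itself elided) and save a checkpoint.
trained = Model()
trained.load_checkpoint("model.ckpt")

# Task 2: reuse only the sub-model. Nothing links the fresh encoder back
# to the checkpoint, so its weights are still random.
reused_encoder = Model().encoder
```
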
In that case, the encoder is random... and this is quite unexpected.
Potential solutions:
Add a mechanism in experimaestro that allows linking the sub-models to the main one: when initialized, the `encoder` will require the `model` to be loaded from the checkpoint. Caveat: as with any implicit process, this makes `xpmir` more complex and might have other side effects.

Add a mechanism that allows several tasks to be run (in that case, the output will just be the path), instead of just one, i.e.
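A minimal sketch of this second option, with hypothetical names (`run_training`, `load_model`): the task produces one path per sub-model, and any downstream task must explicitly call `load_model` on that path.

```python
import tempfile
from pathlib import Path

def run_training(workdir: Path) -> dict:
    """Hypothetical multi-output task: each output is just a path on disk."""
    outputs = {}
    for name in ("model", "encoder"):
        path = workdir / f"{name}.ckpt"
        path.write_text(f"weights of {name}")  # stand-in for real weights
        outputs[name] = path
    return outputs

def load_model(path: Path) -> str:
    # The downstream task must call load_model explicitly on the path;
    # this is the cumbersome step the caveat below refers to.
    return path.read_text()

with tempfile.TemporaryDirectory() as tmp:
    outputs = run_training(Path(tmp))
    encoder_weights = load_model(outputs["encoder"])
```
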
Important caveat: it breaks the current HuggingFace uploading/downloading mechanism, and we need to call `load_model` each time we want to use the pre-trained model (which can become cumbersome when various models are combined, since we need to track this down). For HF, the solution would be to do this when loading.

Third solution: do something a bit in between, where `model_loader` is a special `experimaestro` object (a `ConfigFactory`) that does whatever is needed and returns the "encoder" submodel.