
allow partial loading for pre trained ms2 models #226

Open · wants to merge 2 commits into base: development
Conversation

mo-sameh (Collaborator)

This PR includes two main changes:

  • Allow dynamically changing the charged_frag_types used for MS2 prediction, which in turn changes the dimensions of the MS2 model's output layer.
  • Allow partial loading of the MS2 model, so that a user can reuse a pretrained backbone with a different prediction head, e.g. after changing the fragment types. With this feature, users can finetune/train a model with more fragment types than the ones used for pretraining. In my experiments, using a pretrained backbone gave a significant performance gain over training fully from scratch.
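To illustrate the first change, here is a minimal, torch-free sketch of the idea; `HIDDEN_DIM` and `build_output_head` are hypothetical names for illustration, not the PR's actual API:

```python
# Hedged sketch: the output head's dimensions are derived from
# charged_frag_types, so swapping the list only rebuilds the head.
HIDDEN_DIM = 256  # assumed backbone output width (illustrative value)

def build_output_head(charged_frag_types):
    # Each entry (e.g. 'b_z1', 'y_z2') corresponds to one output column,
    # so the head's weight matrix has shape (n_frag_types, HIDDEN_DIM).
    out_dim = len(charged_frag_types)
    return {"weight_shape": (out_dim, HIDDEN_DIM), "bias_shape": (out_dim,)}

head = build_output_head(["b_z1", "b_z2", "y_z1", "y_z2"])
# a 4-entry fragment-type list yields a 4-row output layer
```

The backbone is untouched by this; only the final layer's shape depends on the fragment-type list, which is what makes partial loading of the remaining weights possible.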

From an alphaDia finetuning experiment where I want to predict the following fragment types: ['b','y','c','a','x','z']

  • Performance when training for 50 epochs from scratch:
Model tested on test dataset with the following metrics:

l1_loss                       : 0.0228
PCC-mean                      : 0.8015
COS-mean                      : 0.8140
SA-mean                       : 0.6054
SPC-mean                      : -0.2333
  • Performance when training for 50 epochs after loading a pretrained backbone:
Model tested on test dataset with the following metrics:

l1_loss                       : 0.0122
PCC-mean                      : 0.9462
COS-mean                      : 0.9491
SA-mean                       : 0.7961
SPC-mean                      : -0.3072

@mo-sameh mo-sameh changed the title feat: allow partial loading for pre trained ms2 models allow partial loading for pre trained ms2 models Dec 30, 2024
@@ -425,6 +425,53 @@ def _prepare_train_data_df(
# if np.all(precursor_df['nce'].values > 1):
# precursor_df['nce'] = precursor_df['nce']*self.NCE_factor

def _load_model_from_stream(self, stream: IO):

Collaborator:

There may be two cases:

  1. we save partial model params into the file, and this method allows us to load that partial model.
  2. we save the full model but only need to load partial params, by specifying param names or the first K layers.

@mo-sameh (Collaborator, Author) · Jan 11, 2025:
Hi, I’m not sure I fully understand your concern. The current interface saves the complete model weights, so I’m unclear why users would want to save only partial models. Are you suggesting we add this functionality?

As for loading partial weights, the current implementation (in this PR) should already handle this automatically: it matches parameter keys and sizes, loading the matching weights while initializing the remaining parameters from scratch. Are you suggesting we modify the interface to let users explicitly specify which layers to load?
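The key-and-size matching described above can be sketched roughly as follows. This is a torch-free stand-in that represents parameters as `{name: shape}` dicts, and `filter_loadable` is a hypothetical name, not the PR's actual function:

```python
# Hedged sketch of key/shape matching for partial weight loading:
# parameters are modeled as {name: shape} dicts instead of real tensors.
def filter_loadable(pretrained, model):
    """Split pretrained params into loadable keys plus a mismatch report."""
    loadable = {k: s for k, s in pretrained.items()
                if k in model and model[k] == s}
    size_mismatches = [k for k, s in pretrained.items()
                       if k in model and model[k] != s]
    unexpected_keys = [k for k in pretrained if k not in model]
    missing_keys = [k for k in model if k not in pretrained]
    return loadable, size_mismatches, unexpected_keys, missing_keys

# Example: backbone shapes match; the head grew from 2 to 6 fragment types.
pretrained = {"hidden.weight": (256, 256), "output.weight": (2, 256)}
model = {"hidden.weight": (256, 256),
         "output.weight": (6, 256), "output.bias": (6,)}
loadable, mismatched, unexpected, missing = filter_loadable(pretrained, model)
# only hidden.weight is loadable; the mismatched head stays freshly initialized
```

In the real code, the filtered dict would then go to `load_state_dict(..., strict=False)` so that unmatched parameters keep their fresh initialization.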

Collaborator:

No, I mean the use cases.

self.model.load_state_dict(filtered_params, strict=False)
if size_mismatches or unexpected_keys or missing_keys:
warning_msg = "Some layers might be randomly initialized due to a mismatch between the loaded weights and the model architecture. Make sure to train the model or load different weights before prediction."
warning_msg += (

Contributor:

(nit) This more concise format might be easier on the eye:

    warning_msg += "".join(
        [
            f"\nKeys with size mismatches: {size_mismatches}" if size_mismatches else "",
            f"\nUnexpected keys: {unexpected_keys}" if unexpected_keys else "",
            f"\nMissing keys: {missing_keys}" if missing_keys else "",
        ]
    )
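For illustration, the suggested snippet runs standalone with made-up values; the variable names mirror the diff, but the list contents below are invented examples, not real model output:

```python
# Standalone demo of the suggested join-based warning formatting.
# The mismatch lists are made-up examples for demonstration only.
size_mismatches = ["output_nn.weight"]
unexpected_keys = []
missing_keys = ["output_nn.bias"]

warning_msg = (
    "Some layers might be randomly initialized due to a mismatch between "
    "the loaded weights and the model architecture."
)
warning_msg += "".join(
    [
        f"\nKeys with size mismatches: {size_mismatches}" if size_mismatches else "",
        f"\nUnexpected keys: {unexpected_keys}" if unexpected_keys else "",
        f"\nMissing keys: {missing_keys}" if missing_keys else "",
    ]
)
# empty lists contribute no line at all, keeping the warning compact
```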

@@ -270,6 +270,7 @@ def __init__(
self,
mask_modloss: bool = False,
device: str = "gpu",
charged_frag_types: list[str] = None,

Contributor:

we still have `requires-python = ">=3.8.0"` .. so `Optional[List[str]]` it is ..

@jalew188 should we drop support for 3.8?


Collaborator:

We can, in the next release.
