Modified masking before pooling - Fixes issue in ONNX conversion #92
Issue:
In `class INSTRUCTOR_Transformer`, inside `def forward()`, the attention mask entries corresponding to the instruction tokens are set to 0 by iterating over the batch in a Python loop. I want to draw attention to the line `n = len(attention_mask)`: this Python int is treated as a constant during ONNX conversion, which leads to incorrect inference whenever the instruction token length changes.
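For illustration, the pattern looks roughly like the sketch below (a simplified reconstruction, not the exact code in the repository; `context_masks` stands for the per-example instruction lengths):

```python
import torch

def mask_instruction_tokens_loop(attention_mask: torch.Tensor,
                                 context_masks: torch.Tensor) -> torch.Tensor:
    """Zero out instruction-token positions with a Python-level loop."""
    n = len(attention_mask)                 # batch size as a Python int -> baked in as a constant at export
    for i in range(n):
        local_len = context_masks[i].item()  # .item() also freezes the instruction length at trace time
        attention_mask[i][:local_len] = 0
    return attention_mask
```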
Solution:
Instead of getting the instruction token length and manually iterating over the `attention_mask` to set the values to 0, I have introduced a `def prepare_input_features()` function under `class Instructor` that carries out the same task using tensor manipulations. This way, inference with the ONNX model works as expected for any instruction.
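A minimal sketch of how such a vectorized masking can look (the function name follows the PR; the argument names, dict layout, and shapes are assumptions for illustration):

```python
import torch

def prepare_input_features(input_features: dict,
                           instruction_lengths: torch.Tensor) -> dict:
    """Zero attention-mask positions belonging to the instruction tokens
    using tensor operations only, so no Python ints are frozen into the
    graph at ONNX export time.
    """
    attention_mask = input_features["attention_mask"]                   # (batch, seq_len)
    seq_len = attention_mask.shape[1]
    positions = torch.arange(seq_len, device=attention_mask.device)     # (seq_len,)
    # True where the position index is past the instruction prefix; broadcasting
    # compares (1, seq_len) against (batch, 1) to produce a (batch, seq_len) mask.
    keep = positions.unsqueeze(0) >= instruction_lengths.unsqueeze(1)
    input_features["attention_mask"] = attention_mask * keep.to(attention_mask.dtype)
    return input_features
```

Because the per-example lengths stay inside tensors and are combined by broadcasting, the exported ONNX graph handles any instruction length at inference time.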
Other changes:
There are many other diffs in this pull request; they are the result of adhering to formatting/linting standards.