Fix max token slicing in compression data transform function #672

nikita-savelyevv · 2024-07-24T12:15:32Z

Changes
Fix how max token slicing is applied inside data-aware compression transform_fn.

Reason for change
input_ids and attention_mask have the shape of (B, N), but max token slicing is applied at first instead of the second dimension.

nikita-savelyevv · 2024-07-24T12:17:37Z

In draft for now not to interfere with the current compression experiments

cc @KodiaqQ

nikita-savelyevv · 2024-08-27T14:19:03Z

Not needed after #689 was merged

Fix max token slicing

6f82e2b

github-actions bot added the category: llm_bench Label for tool/llm_bench folder label Jul 24, 2024

nikita-savelyevv requested a review from andreyanufr July 24, 2024 12:18

andreyanufr approved these changes Jul 24, 2024

View reviewed changes

nikita-savelyevv added a commit to nikita-savelyevv/openvino.genai that referenced this pull request Jul 25, 2024

Also adopt changes from openvinotoolkit#672

0b17c04

nikita-savelyevv mentioned this pull request Jul 25, 2024

Transition to default int4 compression configs from optimum-intel #689

Merged

nikita-savelyevv added a commit to nikita-savelyevv/openvino.genai that referenced this pull request Aug 19, 2024

Also adopt changes from openvinotoolkit#672

4cea1e5

nikita-savelyevv added a commit to nikita-savelyevv/openvino.genai that referenced this pull request Aug 20, 2024

Also adopt changes from openvinotoolkit#672

fa051e4

nikita-savelyevv closed this Aug 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix max token slicing in compression data transform function #672

Fix max token slicing in compression data transform function #672

nikita-savelyevv commented Jul 24, 2024 •

edited

Loading

nikita-savelyevv commented Jul 24, 2024

nikita-savelyevv commented Aug 27, 2024

Fix max token slicing in compression data transform function #672

Fix max token slicing in compression data transform function #672

Conversation

nikita-savelyevv commented Jul 24, 2024 • edited Loading

nikita-savelyevv commented Jul 24, 2024

nikita-savelyevv commented Aug 27, 2024

nikita-savelyevv commented Jul 24, 2024 •

edited

Loading