
Causal models only supported for text-generation task, not summarization task #972

Open
njbrake wants to merge 11 commits into main

Conversation

@njbrake njbrake commented Feb 21, 2025

What's changing

  • Move SamplingParameters to more closely match the GenerationConfig that HF uses
  • Handle the summarization task differently from text-generation
  • Unify how we handle new tokens, and make sure we don't allow more tokens than the model's position embeddings support
  • Warn and truncate if the input exceeds the position embeddings, rather than throwing an error
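The token-budget handling described above can be sketched roughly as follows. This is a minimal illustration, not the PR's actual code; `clamp_new_tokens` and its parameter names are hypothetical:

```python
import warnings

def clamp_new_tokens(input_length: int,
                     requested_new_tokens: int,
                     max_position_embeddings: int) -> int:
    """Clamp the generation budget so input + new tokens never exceed
    the model's position embeddings. Warn (don't error) on overflow."""
    if input_length >= max_position_embeddings:
        warnings.warn(
            f"Input length {input_length} exceeds the model's "
            f"max_position_embeddings ({max_position_embeddings}); "
            "the input will be truncated."
        )
        input_length = max_position_embeddings
    available = max_position_embeddings - input_length
    return min(requested_new_tokens, available)
```

For example, with a 1024-position model such as facebook/bart-large-cnn, a 900-token input would leave a budget of at most 124 new tokens.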


Refs #970

How to test it

Run the CI tests; also load the Lumigator UI and make sure you can still test the BART CNN model.

Additional notes for reviewers

I already...

  • Tested the changes in a working environment to ensure they work as expected
  • Added some tests for any new functionality
  • Updated the documentation (both comments in code and product documentation under /docs)
  • Checked if a (backend) DB migration step was required and included it if required

@njbrake njbrake linked an issue Feb 21, 2025 that may be closed by this pull request
@github-actions github-actions bot added sdk backend frontend api Changes which impact API/presentation layer schemas Changes to schemas (which may be public facing) labels Feb 21, 2025
@njbrake njbrake marked this pull request as draft February 21, 2025 15:14
@njbrake njbrake marked this pull request as ready for review February 21, 2025 15:33
max_length = self._pipeline.model.config.max_position_embeddings
# If the model is from the HF Hub the odds of this being wrong are low, but it's still good to check that
# the tokenizer and the model have the same max_position_embeddings
if self._pipeline.tokenizer.model_max_length != max_length:
njbrake (Contributor, Author) commented on this diff:

Because some models (especially older ones) don't have a tokenizer_config file! Looking at you, facebook/bart-large-cnn: https://huggingface.co/facebook/bart-large-cnn/discussions/71
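When tokenizer_config.json is missing, Hugging Face tokenizers fall back to a very large sentinel for `model_max_length` (transformers uses `int(1e30)`), so a consistency check has to treat that value as "unset". A hedged sketch of the fallback logic; the helper name is hypothetical, not the PR's code:

```python
# transformers' default sentinel when model_max_length is not configured
VERY_LARGE_SENTINEL = int(1e30)

def effective_max_length(tokenizer_max_length: int,
                         model_max_positions: int) -> int:
    """Prefer the model's max_position_embeddings when the tokenizer
    reports no real max length (i.e. the huge default sentinel)."""
    if tokenizer_max_length >= VERY_LARGE_SENTINEL:
        return model_max_positions
    return min(tokenizer_max_length, model_max_positions)
```

This way a model like facebook/bart-large-cnn, whose tokenizer reports the sentinel, still gets capped at its actual 1024 position embeddings.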

njbrake commented Feb 21, 2025

It was pointed out that I accidentally removed text-generation support when merging LiteLLM. I'll put that code back in #978 and then refactor this PR based on it. Thanks!

@njbrake njbrake changed the base branch from main to 122-support-hf-clms February 21, 2025 18:09
Base automatically changed from 122-support-hf-clms to main February 21, 2025 18:41
@njbrake njbrake changed the title HF Summarization Pipeline does not support Causal Models. Causal models only supported for text-generation task, not summarization task Feb 21, 2025
Development

Successfully merging this pull request may close these issues.

Unit tests using model not supported for summarization