The inference API truncates the response #487

Open
NormXU opened this issue Feb 17, 2024 · 1 comment

Comments


NormXU commented Feb 17, 2024

Issue Description

I have been hosting an Image-to-Text pipeline with this model for a while. The Inference API widget worked quite well until recently, when a developer reported that the inference widget always cuts the response short. However, when he ran the model locally with the same example, the response was complete. I tried previously successful examples myself and found that they all returned the same truncated results.
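
For context, running the same example locally looks roughly like this (a minimal sketch; the model id and image path are placeholders, and max_new_tokens=800 is only an illustrative limit):

from transformers import pipeline

# Placeholder model id; stands in for the actual image-to-text repo discussed here.
pipe = pipeline("image-to-text", model="your-username/your-image-to-text-model")

# Locally, the full caption comes back when a generous generation limit is set.
result = pipe("example.png", max_new_tokens=800)
print(result[0]["generated_text"])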

I hadn't updated the model weights or configs before trying to fix this issue. Please check the issue for more details.

These are all the commits I made after reading the issue:

  1. Following the advice mentioned in the issue, I first tried adding the following to the model card:

inference:
  parameters:
    max_length: 800

But it didn't work.

  2. Then, I guessed that the encoder config might be confusing the API, so I tried editing encoder.max_length=800. That still failed to fix the problem, so I edited it back (see the config-inspection sketch after this list).
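
For anyone reproducing this, here is a minimal sketch of how one might inspect the length-related settings the hosted pipeline could be picking up (the model id is a placeholder, and I am assuming the usual transformers config layout):

from transformers import AutoConfig, GenerationConfig

model_id = "your-username/your-image-to-text-model"  # placeholder for the actual repo

# Length setting on the top-level model config (sub-configs exist for encoder-decoder models).
config = AutoConfig.from_pretrained(model_id)
print("config.max_length:", getattr(config, "max_length", None))

# Generation defaults, if the repo ships a generation_config.json.
try:
    gen_config = GenerationConfig.from_pretrained(model_id)
    print("generation max_length:", gen_config.max_length)
    print("generation max_new_tokens:", gen_config.max_new_tokens)
except OSError:
    print("no generation_config.json in the repo")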

I suspect this is an Inference API bug that causes the truncated responses.
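
To check whether the truncation happens in the API itself rather than only in the widget, one can query the hosted endpoint directly; a minimal sketch, assuming the standard Inference API call with raw image bytes (the model id, token, and file name are placeholders):

import requests

API_URL = "https://api-inference.huggingface.co/models/your-username/your-image-to-text-model"  # placeholder
headers = {"Authorization": "Bearer hf_xxx"}  # placeholder token

with open("example.png", "rb") as f:
    image_bytes = f.read()

# If the text returned here is also cut short, the truncation is in the API,
# not only in the widget.
response = requests.post(API_URL, headers=headers, data=image_bytes)
print(response.json())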
