Get an "incorrect audio shape" assert when I process the whisper models #1

jasontitus · 2023-04-08T15:36:16Z

When trying to convert the whisper models I get an assert -

assert x.shape[1:] == self.positional_embedding.shape, "incorrect audio shape"

It doesn't seem to kill the process, but the produced models don't seem to recognize much of anything (one word of the JFK sample, for example), and are different from the models others uploaded to HF. I'm running this on a Macbook Pro M2 64GB - Mac OS 13.3.

wangchou · 2023-04-14T07:02:01Z

That means ... there is a warning on that assert line.
That assert didn't get fired.

About how to use the encoder model, please check ggml-org/whisper.cpp#566 .

/projects/w/whisper/whisper/model.py:166: TracerWarning: Converting a tensor to a Python boolean might cause the trace to be incorrect. We can't record the data
flow of Python values, so this value will be treated as a constant in the future. This means that the trace might not generalize to other inputs!
  assert x.shape[1:] == self.positional_embedding.shape, "incorrect audio shape"

wangchou closed this as completed Apr 14, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Get an "incorrect audio shape" assert when I process the whisper models #1

Get an "incorrect audio shape" assert when I process the whisper models #1

jasontitus commented Apr 8, 2023

wangchou commented Apr 14, 2023 •

edited

Loading

Get an "incorrect audio shape" assert when I process the whisper models #1

Get an "incorrect audio shape" assert when I process the whisper models #1

Comments

jasontitus commented Apr 8, 2023

wangchou commented Apr 14, 2023 • edited Loading

wangchou commented Apr 14, 2023 •

edited

Loading