diff --git a/docs/source/models/supported_models.md b/docs/source/models/supported_models.md index 20a706a0b8ec..62e545492dd9 100644 --- a/docs/source/models/supported_models.md +++ b/docs/source/models/supported_models.md @@ -1080,11 +1080,6 @@ See [this page](#generative-models) for more information on how to use generativ E Pre-computed embeddings can be inputted for this modality. + Multiple items can be inputted per text prompt for this modality. -:::{important} -Pan-and-scan image pre-processing is currently supported on V0 (but not V1). -You can enable it by passing `--mm-processor-kwargs '{"do_pan_and_scan": true}'`. -::: - :::{warning} Both V0 and V1 support `Gemma3ForConditionalGeneration` for text-only inputs. However, there are differences in how they handle text + image inputs: @@ -1104,7 +1099,7 @@ This limitation exists because the model's mixed attention pattern (bidirectiona ::: :::{note} -`h2oai/h2ovl-mississippi-2b` will be available in V1 once we support backends other than FlashAttention. +`h2oai/h2ovl-mississippi-2b` will be available in V1 once we support head size 80. ::: :::{note}