/v1/inference/embeddings input and output shape mismatch

/v1/inference/embeddings: model x [List[InterleavedContent]](https://github.com/meta-llama/llama-stack/blob/main/docs/resources/llama-stack-spec.yaml#L2771) -> [List[List[float]]](https://github.com/meta-llama/llama-stack/blob/main/docs/resources/llama-stack-spec.yaml#L2791)

the shape mismatch comes from InterleavedContent allowing for [List[InterleavedContentItem]](https://github.com/meta-llama/llama-stack/blob/main/docs/resources/llama-stack-spec.yaml#L1498).

example: [string, [text0, text1], image] -?-> [embedding of string, embedding of text0, embedding of text1, embedding of image]

i suggest aligning the shapes.

my preference is to change the input shape, and use an input of `array of string | array of InterleavedContentItem`, which keeps string (untyped) and text / image (typed) inputs separate.

a further enhancement: embedding is often done in two modes, batch and query. in batch mode many items are embedded for storage. in query mode a single item is embedded for lookup. allowing input of `string | array of string | array of InterleavedContentItem` facilitates this use case.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

/v1/inference/embeddings input and output shape mismatch #922

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

/v1/inference/embeddings input and output shape mismatch #922

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions