Support prompt caching in Google Vertex AI generators #1003

julian-risch · 2024-08-19T06:32:20Z

Is your feature request related to a problem? Please describe.
Google Vertex AI, in particular the models Gemini 1.5 Flash and Gemini 1.5 Pro support prompt caching or context caching. We should enable users to use that feature through Haystack to reduce costs and latency. https://cloud.google.com/vertex-ai/generative-ai/docs/context-cache/context-cache-overview

Describe the solution you'd like
We need to implement a way to first create a context cache and then to reference the contents of the context cache in a prompt request.

julian-risch added feature request Ideas to improve an integration integration:google-vertex P3 labels Aug 19, 2024

julian-risch mentioned this issue Aug 19, 2024

Support prompt caching in Anthropic generators #1004

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support prompt caching in Google Vertex AI generators #1003

Support prompt caching in Google Vertex AI generators #1003

julian-risch commented Aug 19, 2024

Support prompt caching in Google Vertex AI generators #1003

Support prompt caching in Google Vertex AI generators #1003

Comments

julian-risch commented Aug 19, 2024