Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create Operators for Google Cloud Vertex AI Context Caching #43008

Merged
merged 3 commits into from
Oct 15, 2024

Conversation

CYarros10
Copy link
Contributor

Use context caching to reduce the cost of requests that contain repeat content with high input token counts. Cached context items, such as a large amount of text, an audio file, or a video file, can be used in prompt requests to the Gemini API to generate output. Requests that use the same cache in the prompt also include text unique to each prompt. For example, each prompt request that composes a chat conversation might include the same context cache that references a video along with unique text that comprises each turn in the chat.

Adding Operators, hooks, docs. Updating provider.yaml.

Can be used as part of a broader Generative AI Operations pipeline.


^ Add meaningful description above
Read the Pull Request Guidelines for more information.
In case of fundamental code changes, an Airflow Improvement Proposal (AIP) is needed.
In case of a new dependency, check compliance with the ASF 3rd Party License Policy.
In case of backwards incompatible changes please leave a note in a newsfragment file, named {pr_number}.significant.rst or {issue_number}.significant.rst, in newsfragments.

@boring-cyborg boring-cyborg bot added area:providers kind:documentation provider:google Google (including GCP) related issues labels Oct 14, 2024
@potiuk
Copy link
Member

potiuk commented Oct 15, 2024

Needs rebase.

Copy link
Contributor

@MaksYermak MaksYermak left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@potiuk LGTM

@CYarros10 CYarros10 force-pushed the context-cache-operator branch from 8b28517 to 69ba217 Compare October 15, 2024 15:41
@CYarros10 CYarros10 force-pushed the context-cache-operator branch from 69ba217 to f361d02 Compare October 15, 2024 16:01
Copy link
Contributor

@shahar1 shahar1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great!

@shahar1 shahar1 merged commit f1f9201 into apache:main Oct 15, 2024
100 of 101 checks passed
R7L208 pushed a commit to R7L208/airflow that referenced this pull request Oct 17, 2024
…3008)

* Fix merge conflicts

* Fix documentation.

* Update return variables.
harjeevanmaan pushed a commit to harjeevanmaan/airflow that referenced this pull request Oct 23, 2024
…3008)

* Fix merge conflicts

* Fix documentation.

* Update return variables.
PaulKobow7536 pushed a commit to PaulKobow7536/airflow that referenced this pull request Oct 24, 2024
…3008)

* Fix merge conflicts

* Fix documentation.

* Update return variables.
ellisms pushed a commit to ellisms/airflow that referenced this pull request Nov 13, 2024
…3008)

* Fix merge conflicts

* Fix documentation.

* Update return variables.
@CYarros10 CYarros10 deleted the context-cache-operator branch December 9, 2024 14:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
area:providers kind:documentation provider:google Google (including GCP) related issues
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants