Add "gpt-4-vision-preview" as default parser #1417

Merged 3 commits into main on Mar 8, 2024
Conversation

@rholinshead (Contributor) commented Mar 7, 2024

Add "gpt-4-vision-preview" as default parser

Summary: Adding this model/parser as a default alongside the other OpenAI defaults.

Test Plan: Can use the parser in the editor


Stack created with Sapling. Best reviewed with ReviewStack.

# Update openai to 1.13.3


Summary:
Updating the openai package to the latest version (1.13.3) so that we can use image messages for gpt-4-v in the next PR.

Test Plan:
- pytest works without errors
- WIP: Testing all relevant cookbooks
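The pytest step above can be paired with a quick guard that the installed openai package meets the new minimum. A minimal sketch; the `meets_minimum` and `openai_is_current` helpers are hypothetical, not part of this PR:

```python
from importlib.metadata import PackageNotFoundError, version


def meets_minimum(installed: str, required: str) -> bool:
    """Compare dotted version strings numerically (no pre-release handling)."""
    as_tuple = lambda v: tuple(int(part) for part in v.split("."))
    return as_tuple(installed) >= as_tuple(required)


def openai_is_current(required: str = "1.13.3") -> bool:
    """Return True if the installed openai package meets the required version."""
    try:
        return meets_minimum(version("openai"), required)
    except PackageNotFoundError:
        return False
```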
Comment on lines +46 to +48
ModelParserRegistry.register_model_parser(
OpenAIVisionParser("gpt-4-vision-preview")
)

cc @tanya-rai: this would be the 3rd position in the drop-down. If you want to change it, we can easily do so in a future PR.
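Since drop-down position follows registration order, a registry only needs to preserve insertion order. A minimal sketch of that pattern; the class and method names here are illustrative, not aiconfig's actual implementation:

```python
class ModelParser:
    """Minimal stand-in for a parser bound to one model id."""

    def __init__(self, model_id: str):
        self.model_id = model_id


class ParserRegistry:
    """Keeps parsers in registration order; Python dicts preserve insertion order."""

    def __init__(self):
        self._parsers: dict[str, ModelParser] = {}

    def register(self, parser: ModelParser) -> None:
        self._parsers[parser.model_id] = parser

    def model_ids(self) -> list[str]:
        # This order would drive the editor drop-down order.
        return list(self._parsers)


registry = ParserRegistry()
for model in ("gpt-4", "gpt-3.5-turbo", "gpt-4-vision-preview"):
    registry.register(ModelParser(model))
```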

Ryan Holinshead added 2 commits March 8, 2024 13:34
# Implement OpenAIVisionParser

Summary: Using the default openai parser as the basis (and extending it), implement a parser for text-and-image openai models (e.g. gpt-4-v). The key differences are:
- vision models don't support function calling or tools, so exclude from the completion params
- serialize/deserialize need to be a bit different to construct the user messages' content with text and image urls
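The second point, serializing user messages with mixed text and image content, can be sketched as follows; the helper name and URL are illustrative, but the content shape matches what the OpenAI chat completions API expects for vision models:

```python
def build_vision_message(text: str, image_url: str) -> dict:
    """Build a user message mixing a text part and an image_url part."""
    return {
        "role": "user",
        "content": [
            {"type": "text", "text": text},
            {"type": "image_url", "image_url": {"url": image_url}},
        ],
    }


msg = build_vision_message("What is this?", "https://example.com/cat.png")
```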

One thing to note is the default max tokens for GPT-4V is quite small. Bumping it to 100 in #1419

Another thing I found while testing is that the model sometimes gets confused when you refer to the attached image as an image. For example, with the prompt "What is this image of?" it responded with "I can't provide information on any images until you upload one"; only after rerunning a handful of times did it give the expected result. Changing the prompt to "What is this?" seems to produce much better results.
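The first difference above, dropping function calling and tools, amounts to filtering those keys out of the completion params before the API call. A minimal sketch; the key names are the standard OpenAI parameter names, but the helper itself is hypothetical:

```python
# Parameters gpt-4-vision-preview does not accept, per the commit summary.
UNSUPPORTED_VISION_PARAMS = {"functions", "function_call", "tools", "tool_choice"}


def strip_unsupported_params(completion_params: dict) -> dict:
    """Return a copy of the params with function/tool keys removed."""
    return {
        key: value
        for key, value in completion_params.items()
        if key not in UNSUPPORTED_VISION_PARAMS
    }
```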

Test Plan:

https://github.com/lastmile-ai/aiconfig/assets/5060851/47e0ae90-fab7-4d4f-98e9-163cd992aeed
# Add "gpt-4-vision-preview" as default parser


Summary: Just adding this model/parser as a default alongside the other openai defaults

Test Plan: Can use the parser in the editor
@rholinshead rholinshead merged commit c22e607 into main Mar 8, 2024
2 checks passed
rholinshead added a commit that referenced this pull request Mar 8, 2024
# Default GPT-4V max_tokens to 100

Summary: The default max_tokens is really low (< 20), so bump it to 100 to make the model more usable.

Test Plan:
![Screenshot 2024-03-07 at 5 38 27 PM](https://github.com/lastmile-ai/aiconfig/assets/5060851/085c9e68-9521-450e-b2cb-699a09eead9b)

---
[//]: # (BEGIN SAPLING FOOTER)
Stack created with [Sapling](https://sapling-scm.com). Best reviewed with [ReviewStack](https://reviewstack.dev/lastmile-ai/aiconfig/pull/1419).
* __->__ #1419
* #1418
* #1417
* #1416
* #1415