Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[playground][dogfooding] initial playground dogfooding check list #5373

Closed
28 of 66 tasks
Tracked by #3435
Parker-Stafford opened this issue Nov 15, 2024 · 1 comment · Fixed by #5518
Closed
28 of 66 tasks
Tracked by #3435

[playground][dogfooding] initial playground dogfooding check list #5373

Parker-Stafford opened this issue Nov 15, 2024 · 1 comment · Fixed by #5518
Assignees

Comments

@Parker-Stafford
Copy link
Contributor

Parker-Stafford commented Nov 15, 2024

Datasets

(Tony + Xander)

  • test on large datasets
  • streaming
  • non streaming (ui side)
  • [stretch] test behind proxy with some timeout

Tools

(Roger + Parker)

  • tool switching for calls and schemas between providers

Tool calling

Does the playground allow you to simulate an actual tool call (user message with tool, assistant message with tool calls, tool message with results, proper response from llm)

  • tool calling for anthropic
  • tool calling for openai

Tool use

Does the playground allow you to add tools that can be appropriately picked out by an llm (add a tool with a relevant message)

  • tool use for anthropic
  • tool use for openai

Template application

  • mustache (datasets / normal)
  • fstring (datasets / normal)

Span replay

(Xander + Parker)

  • test going from auto instrumentor to playground
  • test going from playground to span to playground
  • input
  • output
  • model
  • invocation params
  • tools
  • tool calls

Structured output

Parker + Roger

Known issues / WIP

Playground spans

UI

Server

New Issues

Lower prio

Dogfooding issues round 2

Post-Launch

Dogfooding round 3

@Parker-Stafford
Copy link
Contributor Author

Parker-Stafford commented Nov 20, 2024

-- moved issues up

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: Done
Development

Successfully merging a pull request may close this issue.

5 participants