Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[playground][dogfooding] initial playground dogfooding check list #5373

Open
5 of 38 tasks
Tracked by #3435
Parker-Stafford opened this issue Nov 15, 2024 · 0 comments
Open
5 of 38 tasks
Tracked by #3435
Assignees

Comments

@Parker-Stafford
Copy link
Contributor

Parker-Stafford commented Nov 15, 2024

Datasets

(Tony + Xander)

  • test on large datasets
  • streaming
  • non streaming (ui side)
  • [stretch] test behind proxy with some timeout

Tools

(Roger + Parker)

  • tool switching for calls and schemas between providers

Tool calling

Does the playground allow you to simulate an actual tool call (user message with tool, assistant message with tool calls, tool message with results, proper response from llm)

  • tool calling for anthropic
  • tool calling for openai

Tool use

Does the playground allow you to add tools that can be appropriately picked out by an llm (add a tool with a relevant message)

  • tool use for anthropic
  • tool use for openai

Template application

  • mustache (datasets / normal)
  • fstring (datasets / normal)

Span replay

(Xander + Parker)

  • test going from auto instrumentor to playground
  • test going from playground to span to playground
  • input
  • output
  • model
  • invocation params
  • tools
  • tool calls

Structured output

Parker + Roger

Known issues / WIP

Playground spans

UI

Server

New Issues

Lower prio

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: 👨‍💻 In progress
Development

No branches or pull requests

5 participants