Proposal summary
Show in the UI the traces produced when using LLM-as-Judge metrics, including both general and LLM spans.
Motivation
When using LLM-as-Judge metrics in your evaluations, it is useful to trace the evaluation itself because:

- Sometimes you want to iterate on the metric itself, and you need to analyze and compare the prompts that were used to judge the LLM application.
- It lets you track the full token usage and cost of the evaluation.
@jverre I like this idea. However, we need to decide how to enable/disable it. We'll likely need flags to enable/disable Opik tracking for score functions.
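A minimal sketch of how such a flag could surface in the Python SDK, assuming a hypothetical `track_scoring` parameter on `evaluate` (not part of the current API); the dataset name and task below are placeholders:

```python
import opik
from opik.evaluation import evaluate
from opik.evaluation.metrics import Hallucination

client = opik.Opik()
dataset = client.get_dataset(name="my-dataset")  # assumes this dataset exists

def evaluation_task(item: dict) -> dict:
    # Call the LLM application under test; stubbed here for illustration.
    return {"input": item["input"], "output": "stub answer"}

evaluate(
    dataset=dataset,
    task=evaluation_task,
    scoring_metrics=[Hallucination()],
    # Hypothetical flag (does not exist today): when True, the judge
    # metric's own LLM calls would be logged as traces/spans in the UI.
    track_scoring=True,
)
```

An alternative would be a global configuration option instead of a per-call flag, so existing evaluation code picks up the behavior without changes.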