Control concurrency of evals #98

Open
filip-michalsky opened this issue Oct 7, 2024 · 0 comments
@filip-michalsky
Collaborator

Running 10+ evals on different browsers in parallel locally can become computationally challenging.

Braintrust offers a concurrency semaphore:
https://www.braintrust.dev/docs/guides/evals/run

import { Factuality, Levenshtein } from "autoevals";
import { Eval } from "braintrust";
 
Eval("Say Hi Bot", {
  data: () =>
    Array.from({ length: 100 }, (_, i) => ({
      input: `${i}`,
      expected: `${i + 1}`,
    })),
  task: (input) => {
    // input is a string, so parse it before adding; return a string to match expected
    return `${Number(input) + 1}`;
  },
  scores: [Factuality, Levenshtein],
  maxConcurrency: 5, // Run 5 tests concurrently
});
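
For Stagehand's own eval runner, a similar cap could be implemented with a small worker pool instead of launching every browser eval at once. The sketch below is illustrative only, assuming a hypothetical runEval(task) that runs one browser eval and a hypothetical evalTasks array; neither is an existing Stagehand API.

// Run the given async tasks with at most maxConcurrency in flight at once.
async function runWithConcurrency<T>(
  tasks: Array<() => Promise<T>>,
  maxConcurrency: number,
): Promise<T[]> {
  const results: T[] = new Array(tasks.length);
  let next = 0;

  // Each worker pulls the next pending task until none remain.
  async function worker(): Promise<void> {
    while (next < tasks.length) {
      const i = next++; // safe: no await between read and increment
      results[i] = await tasks[i]();
    }
  }

  // Start at most maxConcurrency workers and wait for all of them.
  await Promise.all(
    Array.from({ length: Math.min(maxConcurrency, tasks.length) }, worker),
  );
  return results;
}

// Usage (hypothetical): cap 10+ browser evals at 5 concurrent runs.
// const results = await runWithConcurrency(
//   evalTasks.map((t) => () => runEval(t)),
//   5,
// );

A maxConcurrency option like Braintrust's could then simply be threaded through to this worker-pool cap.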
filip-michalsky added the enhancement (New feature or request) and good first issue (Good for newcomers) labels on Nov 4, 2024
kamath added this to the Stagehand project on Nov 29, 2024
kamath added this to the Evaluation milestone on Nov 29, 2024
kamath moved this to Todo in Stagehand on Nov 29, 2024
kamath removed the status in Stagehand on Nov 29, 2024