The Datasets tab is where you create and manage reusable datasets.

  • Test case datasets can be used in the Playground.
  • Benchmarking lets you select a dataset, a few prompts, and a set of evaluation metrics, then run evals on every data point as a batch job. When the job completes, you get summary statistics for the results.
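
To make the benchmarking flow concrete, here is a minimal Python sketch of the idea: every prompt is run against every data point, each output is scored by every metric, and the scores are averaged per prompt. This is not the product's API. The dataset fields (`input`, `expected`), the prompt templates, the `fake_model` stand-in, and the two metrics are all hypothetical names chosen for illustration.

```python
from statistics import mean
from typing import Callable

# Hypothetical dataset: each data point pairs an input with an expected output.
dataset = [
    {"input": "2 + 2", "expected": "4"},
    {"input": "capital of France", "expected": "Paris"},
]

# Hypothetical prompt templates to compare.
prompts = {
    "terse": "Answer briefly: {input}",
    "polite": "Please answer the following question: {input}",
}


def fake_model(prompt: str) -> str:
    """Stand-in for a real model call (an assumption, so the sketch runs offline)."""
    canned = {"2 + 2": "4", "capital of France": "paris"}
    for question, answer in canned.items():
        if question in prompt:
            return answer
    return ""


def exact_match(output: str, expected: str) -> float:
    return float(output == expected)


def case_insensitive_match(output: str, expected: str) -> float:
    return float(output.lower() == expected.lower())


# Hypothetical evaluation metrics; each returns a score between 0 and 1.
metrics: dict[str, Callable[[str, str], float]] = {
    "exact_match": exact_match,
    "case_insensitive_match": case_insensitive_match,
}


def run_benchmark(dataset, prompts, metrics, model):
    """Score every prompt on every data point and return mean scores per prompt."""
    summary = {}
    for prompt_name, template in prompts.items():
        scores = {metric_name: [] for metric_name in metrics}
        for point in dataset:
            output = model(template.format(input=point["input"]))
            for metric_name, metric in metrics.items():
                scores[metric_name].append(metric(output, point["expected"]))
        # Summary statistic: the mean score of each metric for this prompt.
        summary[prompt_name] = {name: mean(vals) for name, vals in scores.items()}
    return summary


summary = run_benchmark(dataset, prompts, metrics, fake_model)
for prompt_name, stats in summary.items():
    print(prompt_name, stats)
```

In this sketch the summary is a nested mapping from prompt name to metric name to mean score, which makes it easy to compare prompts side by side. A real batch job would likely report further statistics, but the loop structure stays the same.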

Getting Started