Evaluation Quickstart
Evaluate your LLM app.
Installation
First, you’ll need a Parea API key. See Authentication to get started.
After you’ve followed those steps, you are ready to install the Parea SDK client.
Create an evaluation script
Start by creating a simple evaluation script.
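The sketch below only illustrates the moving parts of an evaluation script in plain Python: a function under test, an evaluation function that scores each output against a target, and a small dataset. It deliberately does not use the Parea SDK. The names `greet`, `exact_match`, `DATASET`, and `run_experiment` are hypothetical and chosen for illustration, and the dataset is made up.

```python
from statistics import mean

# Hypothetical stand-in for the LLM call under test.
# A real script would call your model or chain here.
def greet(name: str) -> str:
    return f"Hello {name}"

# Evaluation function: 1.0 if the output matches the target exactly, else 0.0.
def exact_match(output: str, target: str) -> float:
    return 1.0 if output.strip() == target.strip() else 0.0

# Made-up dataset: each sample holds the function's inputs plus the expected output.
DATASET = [
    {"name": "Foo", "target": "Hello Foo"},
    {"name": "Bar", "target": "Hello Bar"},
    {"name": "Baz", "target": "Hi Baz"},  # intentionally fails the exact match
]

def run_experiment(func, eval_func, data) -> dict:
    """Run func on every sample, score each output, and average the scores."""
    scores = []
    for sample in data:
        # Everything except the target is passed to the function as keyword inputs.
        inputs = {k: v for k, v in sample.items() if k != "target"}
        output = func(**inputs)
        scores.append(eval_func(output, sample["target"]))
    return {"n_samples": len(scores), "scores": scores, "avg_score": mean(scores)}

if __name__ == "__main__":
    result = run_experiment(greet, exact_match, DATASET)
    print(f"average score: {result['avg_score']:.1%}")
```

In an actual Parea experiment, the SDK would take over the role of `run_experiment` and additionally record traces, latency, and cost for each sample.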
Run experiment
After you’ve followed the above steps, you are ready to run your experiment.
View results
The executed script prints a link to the experiment overview and its traces. The overview shows high-level statistics for your experiment, including average values for metrics such as latency and cost, as well as the scores from any evaluation functions you’ve defined. Below it is a table of your logs, in which any chains are expandable. The log table supports search, filtering, and sorting.
You can create additional statistics by clicking the “Pin stat” button. If you click a log, it will open the detailed trace view. Here, you can step through each span and view inputs, outputs, messages, metadata, and other key metrics associated with a given trace.
Can you improve to 100%?
Our first experiment only achieved a score of 77.8%. Can you improve the score to 100%? If you run another experiment, you can compare the results of both, as shown in the screenshot below.
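To make the idea of comparing two experiments concrete, here is a minimal sketch in plain Python. It assumes you have per-sample scores from two runs over the same dataset in the same order. The helper `compare_experiments` and the score lists are hypothetical and are not part of the Parea SDK or taken from the experiment above.

```python
from statistics import mean

def compare_experiments(baseline: list[float], candidate: list[float]) -> dict:
    """Compare per-sample scores of two runs over the same samples."""
    if len(baseline) != len(candidate):
        raise ValueError("both experiments must score the same samples")
    pairs = list(zip(baseline, candidate))
    return {
        "baseline_avg": mean(baseline),
        "candidate_avg": mean(candidate),
        # Indices of samples whose score went up or down in the candidate run.
        "improved": [i for i, (b, c) in enumerate(pairs) if c > b],
        "regressed": [i for i, (b, c) in enumerate(pairs) if c < b],
    }

# Made-up scores for illustration only.
baseline_scores = [1.0, 0.0, 1.0, 1.0]
candidate_scores = [1.0, 1.0, 1.0, 0.0]

report = compare_experiments(baseline_scores, candidate_scores)
print(report)
```

Looking at improved and regressed samples separately is useful because two runs can have the same average score while failing on different inputs.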
What’s Next?
Dive deeper into Experiments or get started with monitoring your application.