Results
Analyze experiment results, find insights, and export data.
Filter by: Lab Run · Template · Model · Status
Top Recommendations
Run experiments in the Lab to see recommendations.
Performance by Template
| Template | Avg Score | Avg Cost | Avg Latency | Pass Rate | Samples |
|---|---|---|---|---|---|
| No data yet. Run experiments to see results. | | | | | |
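
The columns in the table above can be read as a per-template aggregation over individual runs. The following is a minimal sketch of that aggregation, not the Lab's actual implementation: the record fields (`template`, `score`, `cost`, `latency`, `passed`) and the function name `summarize_by_template` are hypothetical, and pass rate is assumed to be the fraction of runs marked as passed.

```python
from collections import defaultdict
from statistics import mean


def summarize_by_template(runs):
    """Group run records by template and compute the summary columns.

    Each run is assumed to be a dict with the hypothetical keys
    'template', 'score', 'cost', 'latency', and 'passed'.
    """
    grouped = defaultdict(list)
    for run in runs:
        grouped[run["template"]].append(run)

    summary = {}
    for template, items in grouped.items():
        summary[template] = {
            "avg_score": mean(r["score"] for r in items),
            "avg_cost": mean(r["cost"] for r in items),
            "avg_latency": mean(r["latency"] for r in items),
            # Assumption: pass rate = passed runs / total runs.
            "pass_rate": sum(1 for r in items if r["passed"]) / len(items),
            "samples": len(items),
        }
    return summary
```

An empty list of runs yields an empty summary, which corresponds to the "No data yet" state shown in the table.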
Performance by Configuration
| Model Config | Avg Score | Avg Cost | Avg Latency | Pass Rate | Samples |
|---|---|---|---|---|---|
| No data yet. Run experiments to see results. | | | | | |
Score Matrix (Template × Config)
| Config / Template | v3 | v2 | v1 |
|---|---|---|---|
| Claude 4.1 Opus | - | - | - |
| Claude 4.5 Haiku | - | - | - |
| Claude 4.5 Sonnet | - | - | - |
| Gemini 2.5 Flash | - | - | - |
| Gemini 2.5 Flash | - | - | - |
Green = High Score (4 or above) · Yellow = Medium (3 to below 4) · Red = Low (below 3)
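
The color legend amounts to a simple threshold rule. A minimal sketch follows, assuming that a score of exactly 4 counts as high and that cells without a score (shown as `-`) get no color. The function name `score_color` is hypothetical and not taken from the application.

```python
def score_color(score):
    """Map a score to a legend color using the thresholds from the legend."""
    if score is None:
        # Assumption: cells with no data ("-") are left uncolored.
        return None
    if score >= 4:
        return "green"   # High
    if score >= 3:
        return "yellow"  # Medium
    return "red"         # Low
```

Checking the high threshold first keeps the boundaries unambiguous: every score falls into exactly one band.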
Individual Run Analysis
| Template | Config | Score | Status | Cost | Latency | Response Preview | |
|---|---|---|---|---|---|---|---|
| No runs yet. Run experiments in the Lab. | | | | | | | |