- Attach once, evaluate everywhere: Add LLM or built-in code evaluators to a dataset and reuse them across Playground experiments.
- Flexible input mapping: Map evaluator inputs to dataset fields so each example is evaluated consistently.
- Built-in visibility: Each evaluator captures traces that you can inspect from the evaluator view to debug and refine it.
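
To make the input-mapping idea concrete, here is a minimal Python sketch of how a mapping defined once can feed the same evaluator consistently for every example. It is an illustration only, not the product's API: the helper names (`resolve_path`, `apply_mapping`, `exact_match`), the dotted-path syntax, and the example field layout (`inputs`, `outputs`, `run`) are all assumptions made for this sketch.

```python
from typing import Any, Callable, Mapping


def resolve_path(record: Mapping[str, Any], path: str) -> Any:
    """Follow a dotted path such as "outputs.answer" through nested dicts."""
    value: Any = record
    for key in path.split("."):
        value = value[key]
    return value


def apply_mapping(
    evaluator: Callable[..., dict],
    mapping: Mapping[str, str],
    example: Mapping[str, Any],
) -> dict:
    """Call the evaluator with arguments pulled from the example via the mapping."""
    kwargs = {param: resolve_path(example, path) for param, path in mapping.items()}
    return evaluator(**kwargs)


def exact_match(prediction: str, reference: str) -> dict:
    """Hypothetical code evaluator: score 1 when the strings match after trimming."""
    return {"key": "exact_match", "score": int(prediction.strip() == reference.strip())}


# Hypothetical mapping from evaluator parameters to example fields.
# It is defined once and reused for every example in the dataset.
MAPPING = {"prediction": "run.answer", "reference": "outputs.answer"}

# Hypothetical examples: the field names are assumptions for this sketch.
examples = [
    {"inputs": {"question": "2 + 2?"}, "outputs": {"answer": "4"}, "run": {"answer": "4 "}},
    {"inputs": {"question": "Capital of France?"}, "outputs": {"answer": "Paris"}, "run": {"answer": "Lyon"}},
]

scores = [apply_mapping(exact_match, MAPPING, ex)["score"] for ex in examples]
```

Because the mapping lives apart from the evaluator, the same evaluator function can be pointed at a dataset with different field names by changing only the mapping.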

