Build better prompts, faster

Evalit is a lightweight Python toolkit to:

Quickstart

Install, run an experiment, and find the winning prompt.

Explore the PromptManager, Experiment, and Evaluator APIs.

Create, version, and activate prompts.

Compare variants with a controlled budget.

IRT-inspired scoring using logistic regression.