Evaluation

An AgentKit app can be evaluated at multiple levels:

Routing layer: Evaluate how accurately the meta agent chooses the right action plan for a given user query
Tool layer: Evaluate each tool individually
Output layer: Evaluate the quality of the final output
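
As a rough illustration of routing-layer evaluation, the sketch below scores a router against a small hand-labelled set of queries. Everything in it is an assumption made for the example: `RoutingCase`, `routing_accuracy`, `keyword_router`, and the plan names (`sql_plan`, `chat_plan`, `pdf_plan`) are hypothetical and not part of AgentKit's API. In a real app, `route` would wrap a call to the meta agent and return the name of the action plan it selected.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class RoutingCase:
    """One labelled example: a user query and the plan we expect for it."""
    query: str
    expected_plan: str  # hypothetical plan label, not an AgentKit identifier


def routing_accuracy(route: Callable[[str], str], cases: List[RoutingCase]) -> float:
    """Return the fraction of cases where the router picks the expected plan."""
    if not cases:
        raise ValueError("need at least one test case")
    correct = sum(1 for case in cases if route(case.query) == case.expected_plan)
    return correct / len(cases)


def keyword_router(query: str) -> str:
    """Stand-in for the meta agent (assumption): a trivial keyword rule."""
    return "sql_plan" if "how many" in query.lower() else "chat_plan"


cases = [
    RoutingCase("How many orders were placed last week?", "sql_plan"),
    RoutingCase("Tell me a joke", "chat_plan"),
    RoutingCase("Summarize the attached report", "pdf_plan"),
]

accuracy = routing_accuracy(keyword_router, cases)
print(f"routing accuracy: {accuracy:.2f}")
```

The same pattern can be reused for the tool and output layers by swapping the exact-match comparison for a check suited to that layer, for example comparing a tool's result to a known answer or scoring the final response against a rubric.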

AgentKit natively integrates with LangSmith (https://docs.smith.langchain.com/), a tool for tracing your app's runs and tracking its performance.

See Optional Features for instructions to set up LangSmith.
