Tags: allenai/agent-eval

0.1.43

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
Adjust openness and tool usage values (#70)

0.1.42

LLM names take into account reasoning effort in model args (#69)

0.1.41

bump version for cost freezing take 3 (#68)

0.1.40

Cost freezing take two (#66)

0.1.39

Freeze model costs (#63)

0.1.38

Reproducibility url for multiple revisions (#65)

0.1.37

Add model name mappings for GPT-5 (#64)

0.1.36

Fix submissions url (#62)

0.1.35

Update how we figure out the task name when processing logs (#60)

0.1.34

Update readme fixes (#58)