E2E LLM evals, with less focus on metrics and more focus on binary assertions

1 gharbat 0 5/13/2025, 8:40:12 AM github.com ↗
