Back to trendsalexweil/sherlock-agent-eval: A cheat-resistant detective board-game harness for evaluating LLM agents.
Source-linked topic cluster with 1 signals across related articles, projects, models, papers, and source updates.
RDR54Developer ToolsMomentum 74Last seen Jun 22, 2026
Source mixGITHUB:github-ai-on-radar (1)
Signals