LIVE-Last scan updating-52 sources active-65 signals today-AI CODINGMinecraft AI Agent Skills Bundle for Codex and Claude Code
Research radar

arXiv preprints, filtered for developer impact.

Concise summaries of new AI papers, ranked for how likely they are to change how production systems are built. arXiv collection remains rate-limited.

Papers tracked
180
1 arXiv sources
Top radar
86
http://arxiv.org/abs/2605.30352v1
Published 24h
0
by paper date
Rate limit
1/3s
arXiv policy
PaperAuthorsarXiv IDCategoriesPublishedCodeRadarSummary
Reasoning with Sampling: Cutting at Decision PointsFelix Zhou, Anay Mehrotra, Quanquan C. Liuhttp://arxiv.org/abs/2605.30327v1cs.LG, cs.AIMay 28, 2026NoneRDR85Frontier reasoning models are produced by posttraining base language models with reinforcemen...
SpatialBench: Is Your Spatial Foundation Model an All-Round Player?Haosong Peng, Hao Li, Jiaqi Chenhttp://arxiv.org/abs/2605.27367v1cs.CVMay 26, 2026NoneRDR85While spatial foundation models have demonstrated impressive performance on standard datasets...
MobileMoE: Scaling On-Device Mixture of ExpertsYanbei Chen, Hanxian Huang, Ernie Changhttp://arxiv.org/abs/2605.27358v1cs.LG, cs.AIMay 26, 2026NoneRDR85Mixture-of-Experts (MoE) has become the de facto architecture for hundred-billion-parameter l...
LLMSurgeon: Diagnosing Data Mixture of Large Language ModelsYaxin Luo, Jiacheng Cui, Xiaohan Zhaohttp://arxiv.org/abs/2605.30348v1cs.CL, cs.AIMay 28, 2026NoneRDR84The pretraining data mixture of Large Language Models (LLMs) constitutes their "digital DNA",...
Bias Leaves a Gradient Trail: Label-Free Bias Identification via Gradient Probes on Concept DecompositionsThomas Vitry, Kieran Edgeworth, Stefan Wermterhttp://arxiv.org/abs/2605.28780v1cs.CV, cs.LGMay 27, 2026DetectedRDR84Vision classifiers can exploit spurious correlations, achieving high in-distribution accuracy...
Automated Benchmark Auditing for AI Agents and Large Language ModelsJunlin Wang, Federico Bianchi, Shang Zhuhttp://arxiv.org/abs/2605.26079v1cs.CLMay 25, 2026NoneRDR84Modern AI benchmarks operate at a complexity that outpaces traditional verification methods....
SkillOpt: Executive Strategy for Self-Evolving Agent SkillsYifan Yang, Ziyang Gong, Weiquan Huanghttp://arxiv.org/abs/2605.23904v1cs.AI, cs.CLMay 22, 2026NoneRDR84Agent skills today are hand-crafted, generated one-shot, or evolved through loosely controlle...
Decomposing Queries into Tool Calls for Long-Video Keyframe RetrievalMichal Shlapentokh-Rothman, Prachi Garg, Yu-Xiong Wanghttp://arxiv.org/abs/2605.23826v1cs.CV, cs.CLMay 22, 2026DetectedRDR84Keyframe selection is a direct way to provide verifiable visual evidence for long-video quest...
Gated DeltaNet-2: Decoupling Erase and Write in Linear AttentionAli Hatamizadeh, Yejin Choi, Jan Kautzhttp://arxiv.org/abs/2605.22791v1cs.AIMay 21, 2026DetectedRDR84Linear attention replaces the unbounded cache of softmax attention with a fixed-size recurren...
LCGuard: Latent Communication Guard for Safe KV Sharing in Multi-Agent SystemsSadia Asif, Mohammad Mohammadi Amiri, Momin Abbashttp://arxiv.org/abs/2605.22786v1cs.AI, cs.ETMay 21, 2026NoneRDR84Large language model (LLM)-based multi-agent systems increasingly rely on intermediate commun...