LIVE-Last scan updating-52 sources active-134 signals today-AGENTSPeakflo's 20x: A Self-Improving AI Agent Orchestrator for Knowledge Work
Research radar

arXiv preprints, filtered for developer impact.

Concise summaries of new AI papers, ranked for how likely they are to change how production systems are built. arXiv collection remains rate-limited.

Papers tracked
180
1 arXiv sources
Top radar
86
http://arxiv.org/abs/2605.30352v1
Published 24h
0
by paper date
Rate limit
1/3s
arXiv policy
PaperAuthorsarXiv IDCategoriesPublishedCodeRadarSummary
In-Context Reward Adaptation for Robust Preference ModelingZhenyu Sun, Zheng Xu, Ermin Weihttp://arxiv.org/abs/2605.30323v1cs.LG, cs.AIMay 28, 2026NoneRDR74Reinforcement Learning from Human Feedback (RLHF) typically relies on static reward models to...
Personal Visual Memory from Explicit and Implicit EvidenceViet Nguyen, Thao Nguyen, Vishal M. Patelhttp://arxiv.org/abs/2605.28806v1cs.CV, cs.CLMay 27, 2026NoneRDR74Long-term memory is increasingly important for personalized AI agents, yet existing benchmark...
Human Label Variation as Stable Signal: Learning Annotator-Specific Explanation Behavior via Cross-Annotator Preference OptimizationBeiduo Chen, Pingjun Hong, Ziyun Zhanghttp://arxiv.org/abs/2605.28802v1cs.CLMay 27, 2026NoneRDR74Free-text explanations extend human label variation (HLV) beyond label disagreement by reveal...
Agent Explorative Policy Optimization for Multimodal Agentic ReasoningMinki Kang, Shizhe Diao, Ryo Hachiumahttp://arxiv.org/abs/2605.28774v1cs.CLMay 27, 2026NoneRDR74Vision-language models with extended reasoning succeed on complex problems, but many real-wor...
SwarmHarness: Skill-Based Task Routing via Decentralized Incentive-Aligned AI Agent NetworksEdwin Josehttp://arxiv.org/abs/2605.28764v1cs.AI, cs.DCMay 27, 2026NoneRDR74Vast quantities of compute (GPU cycles on personal workstations, idle inference servers, and...
Natural Language Query to Configuration for Retrieval AgentsMelissa Z. Pan, Negar Arabzadeh, Mathew Jacobhttp://arxiv.org/abs/2605.27361v1cs.AI, eess.SYMay 26, 2026NoneRDR74Modern retrieval agents expose many configuration choices -- LLM, retriever, number of docume...
Towards Controllable Image Generation through Representation-Conditioned Diffusion ModelsNithesh Chandher Karthikeyan, Jonas Unger, Gabriel Eilertsenhttp://arxiv.org/abs/2605.27343v1cs.CV, cs.LGMay 26, 2026NoneRDR74Diffusion models have emerged as powerful tools for high-quality image generation and editing...
FinHarness: An Inline Lifecycle Safety Harness for Finance LLM AgentsHaoxuan Jia, Yang Liu, Bin Chonghttp://arxiv.org/abs/2605.27333v1cs.CLMay 26, 2026NoneRDR74Finance LLM agents must simultaneously block prompt-induced unauthorized actions and approve...
LLMs as Noisy Channels: A Shannon Perspective on Model Capacity and Scaling LawsXu Ouyang, Deyi Liu, Yuhang Caihttp://arxiv.org/abs/2605.23901v1cs.LG, cs.AIMay 22, 2026NoneRDR74Existing scaling laws for Large Language Models (LLMs), predominantly monotonic power laws, f...
SPACENUM: Revisiting Spatial Numerical Understanding in VLMsJianshu Zhang, Yijiang Li, Huifeixin Chenhttp://arxiv.org/abs/2605.23898v1cs.AIMay 22, 2026NoneRDR74Vision-Language Models (VLMs) are increasingly deployed in embodied environments, where they...