Research radar

arXiv preprints, filtered for developer impact.

Concise summaries of new AI papers, ranked for how likely they are to change how production systems are built. arXiv collection remains rate-limited.

Papers tracked: 180 (1 arXiv source)
Top radar: 86 (http://arxiv.org/abs/2605.30352v1)
Published 24h: 0 (by paper date)
Rate limit: 1/3s (arXiv policy)
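
The rate limit shown above can be honored on the client side with a small throttle. The following is a minimal sketch, assuming "1/3s" means at most one request every three seconds; the class name `MinIntervalThrottle` and its parameters are hypothetical and are not taken from this site's collector.

```python
import time
from typing import Callable, Optional


class MinIntervalThrottle:
    """Hypothetical helper: spaces calls at least `min_interval` seconds apart."""

    def __init__(
        self,
        min_interval: float = 3.0,  # assumed reading of "1/3s"
        clock: Callable[[], float] = time.monotonic,
        sleep: Callable[[float], None] = time.sleep,
    ) -> None:
        self.min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last: Optional[float] = None  # time the previous call was released

    def wait(self) -> float:
        """Block until the next call is allowed; return the seconds slept."""
        now = self._clock()
        delay = 0.0
        if self._last is not None:
            # Remaining part of the interval since the previous call, if any.
            delay = max(0.0, self.min_interval - (now - self._last))
            if delay > 0:
                self._sleep(delay)
        self._last = now + delay
        return delay
```

Calling `wait()` immediately before each request keeps consecutive requests at least `min_interval` seconds apart. The clock and sleep functions are injectable so the spacing logic can be checked without real delays.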
| Paper | Authors | arXiv ID | Categories | Published | Code | Radar | Summary |
| --- | --- | --- | --- | --- | --- | --- | --- |
| EdgeFlow: Edge-Map Augmented VLM-Based Flowchart Processing for Industrial Requirements Engineering | Zhifei Dou, Shabnam Hassani, Ou Wei | http://arxiv.org/abs/2605.27332v1 | cs.SE, cs.AI | May 26, 2026 | None | RDR 86 | Flowcharts are widely used in industrial requirements, but usually remain embedded as static... |
| Chartographer: Counterfactual Chart Generation for Evaluating Vision-Language Models | Yifan Jiang, Dae Yon Hwang, Jesse C. Cresswell | http://arxiv.org/abs/2605.27311v1 | cs.CL, cs.CV | May 26, 2026 | None | RDR 86 | Chart question-answering (QA) benchmarks aim to pose questions that require visual reasoning... |
| From Model Scaling to System Scaling: Scaling the Harness in Agentic AI | Shangding Gu | http://arxiv.org/abs/2605.26112v1 | cs.AI, cs.LG | May 25, 2026 | Detected | RDR 86 | This paper studies the next major bottleneck in agentic AI as system scaling, not only model... |
| Prism: A Plug-in Reproducible Infrastructure for Scalable Multimodal Continual Instruction Tuning | Jun-Tao Tang, Yu-Cheng Shi, Zhen-Hao Xie | http://arxiv.org/abs/2605.26110v1 | cs.LG, cs.CL | May 25, 2026 | Detected | RDR 86 | Multimodal Large Language Models (MLLMs) achieve versatility by reformulating diverse tasks i... |
| InstructSAM: Segment Any Instance with Any Instructions | Yuqian Yuan, Wentong Li, Zhaocheng Li | http://arxiv.org/abs/2605.26102v1 | cs.CV | May 25, 2026 | None | RDR 86 | In this paper, we introduce InstructSAM, a unified and streamlined framework designed for mul... |
| DiscoverPhysics: Benchmarking LLMs for Out-of-the-Box Scientific Thinking | Matt L. Wiemann, Lindsay M. Smith, Peter Melchior | http://arxiv.org/abs/2605.26087v1 | stat.ML, cs.LG | May 25, 2026 | None | RDR 86 | Frontier LLMs now perform strongly across a wide range of physics evaluations, but it is hard... |
| StakeBench: Evaluating Language Understanding Grounded in Market Commitment | Yunhua Pei, Jingyu Hu, Yiwei Shi | http://arxiv.org/abs/2605.26074v1 | cs.CL, cs.AI | May 25, 2026 | None | RDR 86 | Existing financial NLP benchmarks often rely on labels supplied by outside observers, measuri... |
| Rethinking Weak Supervision in Anomaly Detection: A Comprehensive Benchmark | Xu Yao, Siyuan Zhou, Wu Zhenbo | http://arxiv.org/abs/2605.26068v1 | cs.LG, cs.AI | May 25, 2026 | Detected | RDR 86 | Weakly supervised anomaly detection (WSAD) has developed in three primary directions: incompl... |
| YoCausal: How Far is Video Generation from World Model? A Causality Perspective | You-Zhe Xie, Yu-Hsuan Li, Jie-Ying Lee | http://arxiv.org/abs/2605.30346v1 | cs.CV | May 28, 2026 | None | RDR 85 | As video diffusion models (VDMs) advance toward world models, a key question arises: do they... |
| SchGen: PCB Schematic Generation with Semantic-Grounded Code Representations | Qinpei Luo, Ruichun Ma, Xinyu Zhang | http://arxiv.org/abs/2605.30345v1 | cs.AI, cs.CL | May 28, 2026 | None | RDR 85 | Printed circuit board (PCB) schematic design defines nearly all electronic hardware, but it r... |