2026.02.27 | 诊断补课反超72B;三一致性考趴世界模型
HuggingFace 每日AI论文速递
【赞助商】通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34636f5a275f1cba40fd【目录】本期的 15 篇论文如下:[00:31] 🔍 From Blind Spots to Gains: Diagnostic-Driven Iterative Training for Large Multimodal Models(从盲点到增益:诊断驱动的迭代训练用于大型多模态模型)[01:16] 🌍 The Trinity of Consistency as a Defining Principle for General World Models(一致性三位一体:作为通用世界模型定义原则)[01:49] 🧭 MobilityBench: A Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios(MobilityBench:一个用于评估现实世界移动场景中路径规划智能体的基准)[02:52] 🧠 OmniGAIA: Towards Native Omni-Modal AI Agents(OmniGAIA:迈向原生全模态人工智能体)[03:44] 🔍 Imagination Helps Visual Reasoning, But Not Yet in Latent Space(想象力助力视觉推理,但尚未在潜在空间中实现)[04:26] 🧠 Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization(基于混合在线与离线策略优化的探索性记忆增强大语言模型智能体)[05:26] 🛡 AgentDropoutV2: Optimizing Information Flow in Multi-Agent Systems via Test-Time Rectify-or-Reject Pruning(AgentDropoutV2:通过测试时修正或拒绝剪枝优化多智能体系统中的信息流)[06:18] 🔍 Search More, Think Less: Rethinking Long-Horizon Agentic Search for Efficiency and Generalization(多搜索,少思考:重新思考长视野智能搜索的效率与泛化性)[06:54] 🩺 MediX-R1: Open Ended Medical Reinforcement Learning(MediX-R1:开放式医学强化学习框架)[07:42] ⚡ Accelerating Diffusion via Hybrid Data-Pipeline Parallelism Based on Conditional Guidance Scheduling(基于条件引导调度的混合数据-流水线并行加速扩散模型)[08:43] 🤖 EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents(EmbodMocap:面向具身智能体的野外4D人-场景重建)[09:41] 🎮 AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games(AI游戏商店:通过人类游戏对机器通用智能进行可扩展、开放式评估)[10:26] 🚶 Causal Motion Diffusion Models for Autoregressive Motion Generation(因果运动扩散模型用于自回归运动生成)[11:09] ⚡ veScale-FSDP: Flexible and High-Performance FSDP at Scale(veScale-FSDP:大规模灵活且高性能的FSDP)[11:51] 🚗 Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving(面向可泛化端到端自动驾驶的风险感知世界模型预测控制)【关注我们】您还可以在以下平台找到我们,获得播客内容以外更多信息小红书: AI速递在小宇宙查看该单集文稿