2026.04.03 | DataFlex让数据像乐高;潜在空间成AI新地图
HuggingFace 每日AI论文速递
【赞助商】通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34636f5a275f1cba40fd【目录】本期的 15 篇论文如下:[00:41] 🔄 DataFlex: A Unified Framework for Data-Centric Dynamic Training of Large Language Models(DataFlex:面向大语言模型数据中心化动态训练的统一框架)[01:48] 🧠 The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook(潜在空间:基础、演进、机制、能力与展望)[02:45] 🧠 SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization(SKILL0:用于技能内化的上下文智能体强化学习)[03:22] 🎮 Generative World Renderer(生成式世界渲染器)[04:09] 👁 EgoSim: Egocentric World Simulator for Embodied Interaction Generation(EgoSim:面向具身交互生成的第一人称世界模拟器)[05:24] 🧠 LatentUM: Unleashing the Potential of Interleaved Cross-Modal Reasoning via a Latent-Space Unified Model(LatentUM:通过潜在空间统一模型释放交错跨模态推理的潜力)[06:06] 🧠 Omni-SimpleMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory(Omni-SimpleMem:基于自主研究引导的终身多模态智能体记忆发现)[06:47] 🚗 UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving(UniDriveVLA:统一自动驾驶中的理解、感知与动作规划)[07:35] 🎯 Steerable Visual Representations(可操控的视觉表示)[08:12] 🎬 VOID: Video Object and Interaction Deletion(VOID:视频对象与交互删除)[09:06] 🤖 Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time(探究自主编码代理在真实项目中的贡献:活动模式与代码随时间的变化)[09:47] 🚀 ASI-Evolve: AI Accelerates AI(ASI-Evolve:人工智能加速人工智能发展)[10:50] 🎭 Tex3D: Objects as Attack Surfaces via Adversarial 3D Textures for Vision-Language-Action Models(Tex3D:通过对抗性3D纹理将物体作为视觉-语言-动作模型的攻击面)[11:36] 🤖 GPA: Learning GUI Process Automation from Demonstrations(GPA:通过演示学习图形用户界面流程自动化)[12:24] 🔍 VideoZeroBench: Probing the Limits of Video MLLMs with Spatio-Temporal Evidence Verification(VideoZeroBench:通过时空证据验证探究视频多模态大语言模型的极限)【关注我们】您还可以在以下平台找到我们,获得播客内容以外更多信息小红书: AI速递在小宇宙查看该单集文稿