主播
节目简介
来源:小宇宙
【赞助商】
通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事
传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34636f5a275f1cba40fd
【目录】
本期的 15 篇论文如下:
[00:33] 🧠 OpenWorldLib: A Unified Codebase and Definition of Advanced World Models(OpenWorldLib:一个统一代码库与高级世界模型定义)
[01:26] 📊 MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale(MinerU2.5-Pro:在规模上突破数据为中心文档解析的极限)
[02:07] 🧠 TriAttention: Efficient Long Reasoning with Trigonometric KV Compression(TriAttention:基于三角函数的KV压缩实现高效长序列推理)
[02:58] 🎥 AURA: Always-On Understanding and Real-Time Assistance via Video Streams(AURA:基于视频流的持续理解与实时辅助系统)
[03:42] 🔍 LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models(LIBERO-Para:面向VLA模型的释义鲁棒性诊断基准与度量)
[04:24] 🎯 SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing(SpatialEdit:细粒度图像空间编辑基准测试)
[05:07] 📈 Adam's Law: Textual Frequency Law on Large Language Models(亚当定律:大语言模型上的文本频率定律)
[05:56] 🗂 FileGram: Grounding Agent Personalization in File-System Behavioral Traces(FileGram:基于文件系统行为轨迹的智能体个性化研究)
[06:45] 🧪 ClawArena: Benchmarking AI Agents in Evolving Information Environments(ClawArena:在演化信息环境中对AI智能体进行基准测试)
[07:38] 🧠 LightThinker++: From Reasoning Compression to Memory Management(LightThinker++:从推理压缩到内存管理)
[08:12] 🔄 Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing(通过样本路由统一组相对与自蒸馏策略优化)
[08:50] 🧠 SkillX: Automatically Constructing Skill Knowledge Bases for Agents(SkillX:面向智能体的技能知识库自动构建框架)
[09:39] 🤖 Self-Execution Simulation Improves Coding Models(自执行模拟提升代码模型性能)
[10:22] 🧠 Vero: An Open RL Recipe for General Visual Reasoning(Vero:一种用于通用视觉推理的开放强化学习方案)
[11:12] 🛡 Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw(你的智能体,他们的资产:OpenClaw 的现实世界安全性分析)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事
传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34636f5a275f1cba40fd
【目录】
本期的 15 篇论文如下:
[00:33] 🧠 OpenWorldLib: A Unified Codebase and Definition of Advanced World Models(OpenWorldLib:一个统一代码库与高级世界模型定义)
[01:26] 📊 MinerU2.5-Pro: Pushing the Limits of Data-Centric Document Parsing at Scale(MinerU2.5-Pro:在规模上突破数据为中心文档解析的极限)
[02:07] 🧠 TriAttention: Efficient Long Reasoning with Trigonometric KV Compression(TriAttention:基于三角函数的KV压缩实现高效长序列推理)
[02:58] 🎥 AURA: Always-On Understanding and Real-Time Assistance via Video Streams(AURA:基于视频流的持续理解与实时辅助系统)
[03:42] 🔍 LIBERO-Para: A Diagnostic Benchmark and Metrics for Paraphrase Robustness in VLA Models(LIBERO-Para:面向VLA模型的释义鲁棒性诊断基准与度量)
[04:24] 🎯 SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing(SpatialEdit:细粒度图像空间编辑基准测试)
[05:07] 📈 Adam's Law: Textual Frequency Law on Large Language Models(亚当定律:大语言模型上的文本频率定律)
[05:56] 🗂 FileGram: Grounding Agent Personalization in File-System Behavioral Traces(FileGram:基于文件系统行为轨迹的智能体个性化研究)
[06:45] 🧪 ClawArena: Benchmarking AI Agents in Evolving Information Environments(ClawArena:在演化信息环境中对AI智能体进行基准测试)
[07:38] 🧠 LightThinker++: From Reasoning Compression to Memory Management(LightThinker++:从推理压缩到内存管理)
[08:12] 🔄 Unifying Group-Relative and Self-Distillation Policy Optimization via Sample Routing(通过样本路由统一组相对与自蒸馏策略优化)
[08:50] 🧠 SkillX: Automatically Constructing Skill Knowledge Bases for Agents(SkillX:面向智能体的技能知识库自动构建框架)
[09:39] 🤖 Self-Execution Simulation Improves Coding Models(自执行模拟提升代码模型性能)
[10:22] 🧠 Vero: An Open RL Recipe for General Visual Reasoning(Vero:一种用于通用视觉推理的开放强化学习方案)
[11:12] 🛡 Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw(你的智能体,他们的资产:OpenClaw 的现实世界安全性分析)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
小宇宙热评
HiYen
3周前
上海
0
论文是否link