主播
节目简介
来源:小宇宙
【赞助商】
通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事
传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34636f5a275f1cba40fd
【目录】
本期的 15 篇论文如下:
[00:33] 🎬 Helios: Real Real-Time Long Video Generation Model(Helios:实时长视频生成模型)
[01:12] 🤝 Heterogeneous Agent Collaborative Reinforcement Learning(异构智能体协作强化学习)
[01:56] 🧠 T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning(T2S-Bench与思维结构:全面文本到结构推理的基准测试与提示技术)
[02:50] 🤖 Proact-VL: A Proactive VideoLLM for Real-Time AI Companions(Proact-VL:面向实时AI伴侣的主动视频语言模型)
[03:28] 🧠 MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning(MemSifter:通过结果驱动的代理推理卸载LLM记忆检索)
[04:20] 🤖 ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors(ArtHOI:基于视频先验4D重建的关节化人-物交互合成)
[05:12] 🎥 CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video(CubeComposer:基于透视视频的时空自回归4K 360°视频生成)
[05:51] 🧠 Phi-4-reasoning-vision-15B Technical Report(Phi-4推理视觉-15B技术报告)
[06:41] 🧠 Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory(Memex(RL):通过索引化经验记忆扩展长程LLM智能体)
[07:20] 🔍 AgilePruner: An Empirical Study of Attention and Diversity for Adaptive Visual Token Pruning in Large Vision-Language Models(AgilePruner:针对大型视觉语言模型中自适应视觉令牌剪枝的注意力与多样性实证研究)
[08:12] 🎬 RIVER: A Real-Time Interaction Benchmark for Video LLMs(RIVER:面向视频大语言模型的实时交互基准)
[08:51] 🎬 InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions(InfinityStory:具有世界一致性和角色感知镜头转换的无限制视频生成)
[09:43] 🧠 EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding(EmbodiedSplat:面向开放词汇3D场景理解的在线前馈语义3D高斯泼溅)
[10:32] 🧠 BeamPERL: Parameter-Efficient RL with Verifiable Rewards Specializes Compact LLMs for Structured Beam Mechanics Reasoning(BeamPERL:基于可验证奖励的参数高效强化学习使紧凑型大语言模型专精于结构化梁力学推理)
[11:34] 🔄 SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration(SWE-CI:通过持续集成评估智能体在代码库维护中的能力)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事
传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34636f5a275f1cba40fd
【目录】
本期的 15 篇论文如下:
[00:33] 🎬 Helios: Real Real-Time Long Video Generation Model(Helios:实时长视频生成模型)
[01:12] 🤝 Heterogeneous Agent Collaborative Reinforcement Learning(异构智能体协作强化学习)
[01:56] 🧠 T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning(T2S-Bench与思维结构:全面文本到结构推理的基准测试与提示技术)
[02:50] 🤖 Proact-VL: A Proactive VideoLLM for Real-Time AI Companions(Proact-VL:面向实时AI伴侣的主动视频语言模型)
[03:28] 🧠 MemSifter: Offloading LLM Memory Retrieval via Outcome-Driven Proxy Reasoning(MemSifter:通过结果驱动的代理推理卸载LLM记忆检索)
[04:20] 🤖 ArtHOI: Articulated Human-Object Interaction Synthesis by 4D Reconstruction from Video Priors(ArtHOI:基于视频先验4D重建的关节化人-物交互合成)
[05:12] 🎥 CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video(CubeComposer:基于透视视频的时空自回归4K 360°视频生成)
[05:51] 🧠 Phi-4-reasoning-vision-15B Technical Report(Phi-4推理视觉-15B技术报告)
[06:41] 🧠 Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory(Memex(RL):通过索引化经验记忆扩展长程LLM智能体)
[07:20] 🔍 AgilePruner: An Empirical Study of Attention and Diversity for Adaptive Visual Token Pruning in Large Vision-Language Models(AgilePruner:针对大型视觉语言模型中自适应视觉令牌剪枝的注意力与多样性实证研究)
[08:12] 🎬 RIVER: A Real-Time Interaction Benchmark for Video LLMs(RIVER:面向视频大语言模型的实时交互基准)
[08:51] 🎬 InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions(InfinityStory:具有世界一致性和角色感知镜头转换的无限制视频生成)
[09:43] 🧠 EmbodiedSplat: Online Feed-Forward Semantic 3DGS for Open-Vocabulary 3D Scene Understanding(EmbodiedSplat:面向开放词汇3D场景理解的在线前馈语义3D高斯泼溅)
[10:32] 🧠 BeamPERL: Parameter-Efficient RL with Verifiable Rewards Specializes Compact LLMs for Structured Beam Mechanics Reasoning(BeamPERL:基于可验证奖励的参数高效强化学习使紧凑型大语言模型专精于结构化梁力学推理)
[11:34] 🔄 SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via Continuous Integration(SWE-CI:通过持续集成评估智能体在代码库维护中的能力)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
评价
空空如也
小宇宙热评
暂无小宇宙热门评论