HuggingFace 每日AI论文速递 - 2026.02.06 | RLVR去长度偏见；长镜头不换记忆 - EarsOnMe

主播

拨号上网 1 档播客

节目简介

来源：小宇宙

【赞助商】

通勤路上就听AI每周谈。AI每周谈，每周带你回顾上周AI大事

传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34636f5a275f1cba40fd

【目录】

本期的 15 篇论文如下：

[00:29] 📊 Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR（长度无偏序列策略优化：揭示与控制RLVR中的响应长度变化）

[01:20] 🎬 Context Forcing: Consistent Autoregressive Video Generation with Long Context（上下文强制：具有长上下文的一致自回归视频生成）

[02:11] 🧠 RISE-Video: Can Video Generators Decode Implicit World Rules?（RISE-Video：视频生成器能否解码隐含的世界规则？）

[02:57] 🔮 ProAct: Agentic Lookahead in Interactive Environments（ProAct：交互式环境中的前瞻性智能体规划）

[03:47] ⚡ Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations（Dr. Kernel：用于Triton内核生成的强化学习正确实现）

[04:39] 🧭 Steering LLMs via Scalable Interactive Oversight（通过可扩展的交互式监督引导大型语言模型）

[05:27] 🧠 Grounding and Enhancing Informativeness and Utility in Dataset Distillation（数据集约简中信息性与实用性的基础与增强）

[06:13] 🧪 Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities（检索增强推理沙盒：一个解耦检索与推理能力的基准）

[07:07] 🔍 Semantic Search over 9 Million Mathematical Theorems（对超过900万个数学定理的语义搜索）

[07:57] 🕷 Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening（Spider-Sense：基于内在风险感知的高效智能体防御与分层自适应筛查）

[08:39] 🧪 CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty（CAR-bench：评估现实世界不确定性下LLM智能体的一致性与极限感知能力）

[09:30] 🤖 InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions（InterPrior：基于物理的人-物交互生成控制扩展框架）

[10:22] 🎬 Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning（帧中思考：视觉上下文与测试时缩放如何赋能视频推理）

[11:14] 🔄 SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs（SwimBird：在混合自回归多模态大语言模型中引发可切换推理模式）

[12:20] 🔍 SAGE: Benchmarking and Improving Retrieval for Deep Research Agents（SAGE：深度研究智能体的检索基准评测与性能提升）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

在小宇宙查看该单集文稿

2026.02.06 | RLVR去长度偏见；长镜头不换记忆

加入我们的 Discord

扫描微信二维码

播放列表