时长:
13分钟
播放:
101
发布:
1周前
主播...
简介...
【赞助商】
通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事
传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34636f5a275f1cba40fd
【目录】
本期的 15 篇论文如下:
[00:29] 📊 Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR(长度无偏序列策略优化:揭示与控制RLVR中的响应长度变化)
[01:20] 🎬 Context Forcing: Consistent Autoregressive Video Generation with Long Context(上下文强制:具有长上下文的一致自回归视频生成)
[02:11] 🧠 RISE-Video: Can Video Generators Decode Implicit World Rules?(RISE-Video:视频生成器能否解码隐含的世界规则?)
[02:57] 🔮 ProAct: Agentic Lookahead in Interactive Environments(ProAct:交互式环境中的前瞻性智能体规划)
[03:47] ⚡ Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations(Dr. Kernel:用于Triton内核生成的强化学习正确实现)
[04:39] 🧭 Steering LLMs via Scalable Interactive Oversight(通过可扩展的交互式监督引导大型语言模型)
[05:27] 🧠 Grounding and Enhancing Informativeness and Utility in Dataset Distillation(数据集约简中信息性与实用性的基础与增强)
[06:13] 🧪 Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities(检索增强推理沙盒:一个解耦检索与推理能力的基准)
[07:07] 🔍 Semantic Search over 9 Million Mathematical Theorems(对超过900万个数学定理的语义搜索)
[07:57] 🕷 Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening(Spider-Sense:基于内在风险感知的高效智能体防御与分层自适应筛查)
[08:39] 🧪 CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty(CAR-bench:评估现实世界不确定性下LLM智能体的一致性与极限感知能力)
[09:30] 🤖 InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions(InterPrior:基于物理的人-物交互生成控制扩展框架)
[10:22] 🎬 Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning(帧中思考:视觉上下文与测试时缩放如何赋能视频推理)
[11:14] 🔄 SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs(SwimBird:在混合自回归多模态大语言模型中引发可切换推理模式)
[12:20] 🔍 SAGE: Benchmarking and Improving Retrieval for Deep Research Agents(SAGE:深度研究智能体的检索基准评测与性能提升)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
通勤路上就听AI每周谈。AI每周谈,每周带你回顾上周AI大事
传送门 🔗https://www.xiaoyuzhoufm.com/podcast/688a34636f5a275f1cba40fd
【目录】
本期的 15 篇论文如下:
[00:29] 📊 Length-Unbiased Sequence Policy Optimization: Revealing and Controlling Response Length Variation in RLVR(长度无偏序列策略优化:揭示与控制RLVR中的响应长度变化)
[01:20] 🎬 Context Forcing: Consistent Autoregressive Video Generation with Long Context(上下文强制:具有长上下文的一致自回归视频生成)
[02:11] 🧠 RISE-Video: Can Video Generators Decode Implicit World Rules?(RISE-Video:视频生成器能否解码隐含的世界规则?)
[02:57] 🔮 ProAct: Agentic Lookahead in Interactive Environments(ProAct:交互式环境中的前瞻性智能体规划)
[03:47] ⚡ Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations(Dr. Kernel:用于Triton内核生成的强化学习正确实现)
[04:39] 🧭 Steering LLMs via Scalable Interactive Oversight(通过可扩展的交互式监督引导大型语言模型)
[05:27] 🧠 Grounding and Enhancing Informativeness and Utility in Dataset Distillation(数据集约简中信息性与实用性的基础与增强)
[06:13] 🧪 Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities(检索增强推理沙盒:一个解耦检索与推理能力的基准)
[07:07] 🔍 Semantic Search over 9 Million Mathematical Theorems(对超过900万个数学定理的语义搜索)
[07:57] 🕷 Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening(Spider-Sense:基于内在风险感知的高效智能体防御与分层自适应筛查)
[08:39] 🧪 CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty(CAR-bench:评估现实世界不确定性下LLM智能体的一致性与极限感知能力)
[09:30] 🤖 InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions(InterPrior:基于物理的人-物交互生成控制扩展框架)
[10:22] 🎬 Thinking in Frames: How Visual Context and Test-Time Scaling Empower Video Reasoning(帧中思考:视觉上下文与测试时缩放如何赋能视频推理)
[11:14] 🔄 SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs(SwimBird:在混合自回归多模态大语言模型中引发可切换推理模式)
[12:20] 🔍 SAGE: Benchmarking and Improving Retrieval for Deep Research Agents(SAGE:深度研究智能体的检索基准评测与性能提升)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
评价...
空空如也
小宇宙热门评论...
暂无小宇宙热门评论