Host
Episode description
Source: 小宇宙 (Xiaoyuzhou)
【Sponsor】
Listen to AI每周谈 on your commute. Each week, AI每周谈 recaps the previous week's major AI news.
Link 🔗 https://www.xiaoyuzhoufm.com/podcast/688a34636f5a275f1cba40fd
【Contents】
The 15 papers in this episode:
[00:31] 🧠 The Past Is Not Past: Memory-Enhanced Dynamic Reward Shaping
[01:20] 🔍 Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation
[02:08] ⚛ QuanBench+: A Unified Multi-Framework Benchmark for LLM-Based Quantum Code Generation
[02:59] 🎬 OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation
[03:35] 🎨 Strips as Tokens: Artist Mesh Generation with Native UV Segmentation
[04:11] 🎬 Uni-ViGU: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator
[05:13] 🔍 Pseudo-Unification: Entropy Probing Reveals Divergent Information Patterns in Unified Multimodal Models
[05:57] 🔍 CodeTracer: Towards Traceable Agent States
[06:45] 🧪 CocoaBench: Evaluating Unified Digital Agents in the Wild
[07:32] 🕸 Tracing the Roots: A Multi-Agent Framework for Uncovering Data Lineage in Post-Training LLMs
[08:17] 🤔 Introspective Diffusion Language Models
[09:12] 🧠 Solving Physics Olympiad via Reinforcement Learning on Physics Simulators
[09:50] 🎬 Prompt Relay: Inference-Time Temporal Control for Multi-Event Video Generation
[10:38] 🎵 Audio Flamingo Next: Next-Generation Open Audio-Language Models for Speech, Sound, and Music
[11:33] ⚡ SPEED-Bench: A Unified and Diverse Benchmark for Speculative Decoding
【Follow us】
You can also find us on the following platforms for more than what the podcast covers:
小红书 (Xiaohongshu): AI速递