主播
节目简介
来源:小宇宙
【目录】
本期的 15 篇论文如下:
[00:31] 🎭 ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time?(ArcANE:角色扮演语言代理在正确时刻保持角色一致性吗?)
[01:26] 🔍 TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration(TIDE:通过模板引导的迭代实现主动多问题发现)
[02:27] 🤖 AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints(AdaPlanBench:在世界与用户约束下评估大语言模型智能体的自适应规划能力)
[03:14] 🎥 VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding(VideoKR:迈向知识和推理密集型视频理解)
[04:09] 🤖 RobotValues: Evaluating Household Robots When Human Values Conflict(机器人价值观:当人类价值观冲突时评估家用机器人)
[05:01] 🌐 Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation(强化学习引发对未见语言的上下文翻译学习)
[05:58] 🎬 LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing(LoomVideo:统一多模态输入的视频生成与编辑)
[06:49] 📸 Personal AI Agent for Camera Roll VQA(个人相机胶卷视觉问答的AI助手)
[07:36] 🧠 Rethinking Continual Experience Internalization for Self-Evolving LLM Agents(重新思考持续经验内化以实现自演化的大语言模型智能体)
[08:27] ⚖ Complexity-Balanced Diffusion Splitting(复杂度平衡扩散分割)
[09:28] 🤖 Dream.exe: Can Video Generation Models Dream Executable Robot Manipulation?(Dream.exe:视频生成模型能否构想出可执行的机器人操作?)
[10:33] 🔬 Unsupervised Skill Discovery for Agentic Data Analysis(面向智能体数据分析的无监督技能发现)
[11:25] 🔍 LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs(大型语言模型可能泄露训练数据,但它们愿意吗?一种基于倾向性的记忆评估方法)
[12:17] 🎯 Towards One-to-Many Temporal Grounding(迈向一对多时序定位)
[13:16] 💰 The Shadow Price of Reasoning: Economic Perspective on Optimal Budget Allocation for LLMs(推理的影子价格:大型语言模型最优预算分配的经济学视角)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
【赞助商】
OpenClaw快报
每天五分钟,听听 OpenClaw 快报,带你了解最新动态和业内讨论
传送门 https://www.xiaoyuzhoufm.com/podcast/6a1732a2dffa135d0ab5ef43
本期的 15 篇论文如下:
[00:31] 🎭 ArcANE: Do Role-Playing Language Agents Stay in Character at the Right Time?(ArcANE:角色扮演语言代理在正确时刻保持角色一致性吗?)
[01:26] 🔍 TIDE: Proactive Multi-Problem Discovery via Template-Guided Iteration(TIDE:通过模板引导的迭代实现主动多问题发现)
[02:27] 🤖 AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints(AdaPlanBench:在世界与用户约束下评估大语言模型智能体的自适应规划能力)
[03:14] 🎥 VideoKR: Towards Knowledge- and Reasoning-Intensive Video Understanding(VideoKR:迈向知识和推理密集型视频理解)
[04:09] 🤖 RobotValues: Evaluating Household Robots When Human Values Conflict(机器人价值观:当人类价值观冲突时评估家用机器人)
[05:01] 🌐 Reinforcement Learning Elicits Contextual Learning of Unseen Language Translation(强化学习引发对未见语言的上下文翻译学习)
[05:58] 🎬 LoomVideo: Unifying Multimodal Inputs into Video Generation and Editing(LoomVideo:统一多模态输入的视频生成与编辑)
[06:49] 📸 Personal AI Agent for Camera Roll VQA(个人相机胶卷视觉问答的AI助手)
[07:36] 🧠 Rethinking Continual Experience Internalization for Self-Evolving LLM Agents(重新思考持续经验内化以实现自演化的大语言模型智能体)
[08:27] ⚖ Complexity-Balanced Diffusion Splitting(复杂度平衡扩散分割)
[09:28] 🤖 Dream.exe: Can Video Generation Models Dream Executable Robot Manipulation?(Dream.exe:视频生成模型能否构想出可执行的机器人操作?)
[10:33] 🔬 Unsupervised Skill Discovery for Agentic Data Analysis(面向智能体数据分析的无监督技能发现)
[11:25] 🔍 LLMs Can Leak Training Data But Do They Want To? A Propensity-Aware Evaluation of Memorization in LLMs(大型语言模型可能泄露训练数据,但它们愿意吗?一种基于倾向性的记忆评估方法)
[12:17] 🎯 Towards One-to-Many Temporal Grounding(迈向一对多时序定位)
[13:16] 💰 The Shadow Price of Reasoning: Economic Perspective on Optimal Budget Allocation for LLMs(推理的影子价格:大型语言模型最优预算分配的经济学视角)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
【赞助商】
OpenClaw快报
每天五分钟,听听 OpenClaw 快报,带你了解最新动态和业内讨论
传送门 https://www.xiaoyuzhoufm.com/podcast/6a1732a2dffa135d0ab5ef43