Duration:
28 minutes
Plays:
155
Published:
6 days ago
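Have you ever wondered whether an AI's “inner life” stages dramas of its own? In this episode, we dive into the AI's brain to see how, much like us, it gets a gut feeling of “I've got this one” before it even solves a problem. Then we hand it a “map” and watch it go from lost tourist to city planner, making sense of an entire complex software world. Next, we watch a robot “learn on the sly”, picking up basketball purely by watching videos. Finally, we talk about how top mathematicians are setting AI a cheat-proof “closed-book exam”, and how a well-intentioned but counterproductive “traffic rule” on the AI training ground was corrected.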
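00:00:40 AI's “sixth sense”: how does it know it is about to get the answer right?
00:05:17 Give AI a map and let it make sense of the whole software world
00:10:47 A robot learns on the sly: how did it learn to play basketball just by watching videos?
00:18:33 A “closed-book exam” for AI: what are top mathematicians up to?
00:23:05 The “traffic rules” of the AI training ground: why do good intentions backfire?
Papers covered in this episode: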
[CL] Sparse Reward Subsystem in Large Language Models
[Tsinghua University & Stanford University]
https://arxiv.org/abs/2602.00986
---
[CL] Closing the Loop: Universal Repository Representation with RPG-Encoder
[Microsoft Research Asia]
https://arxiv.org/abs/2602.02084
---
[RO] HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos
[The Hong Kong University of Science and Technology]
https://arxiv.org/abs/2602.02473
---
[AI] First Proof
[Stanford University & Columbia University & EPFL]
https://arxiv.org/abs/2602.05192
---
[LG] Rethinking the Trust Region in LLM Reinforcement Learning
[Sea AI Lab & National University of Singapore]
https://arxiv.org/abs/2602.04879