Duration: 31 minutes
Plays: 176
Published: 1 week ago
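Today we look at the "other side" of AI that you may not know. Why does a clever AI sometimes suddenly "break character" and start talking in riddles? Why can it solve complex problems yet fail at something as simple as rolling dice? And how should we design a smart system that gives AI "persona guardrails", or even turns it into a "super intern" that costs less than one yuan an hour? In this episode, we start from five recent papers to uncover the little-known inner mechanisms of AI.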
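00:00:31 Where is AI's "persona" switch hidden?
00:07:06 AI's "logical brittle fracture": why do smart large models suddenly turn dumb?
00:13:20 AI's "personal bodyguard": how can it be both cheap and effective?
00:20:04 You think AI is an expert, but it can't even roll dice properly
00:25:35 Your "math tutor" for less than one yuan an hour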
Papers covered here:
[CL] The Assistant Axis: Situating and Stabilizing the Default Persona of Language Models
[MATS & Anthropic]
https://arxiv.org/abs/2601.10387
---
[CL] Logical Phase Transitions: Understanding Collapse in LLM Logical Reasoning
[Huazhong University of Science and Technology]
https://arxiv.org/abs/2601.02902
---
[LG] Building Production-Ready Probes For Gemini
[Google DeepMind]
https://arxiv.org/abs/2601.11516
---
[CL] Large Language Models Are Bad Dice Players: LLMs Struggle to Generate Random Numbers from Statistical Distributions
[Harvard University]
https://arxiv.org/abs/2601.05414
---
[LG] 130k Lines of Formal Topology in Two Weeks: Simple and Cheap Autoformalization for Everyone?
[AI4REASON]
https://arxiv.org/abs/2601.03298