AI可可AI生活 - 节目列表

[人人能懂] 造AI的AI，犯错的青春期，和通用好“板书”

这一期，我们脑洞大开。你会听到，顶尖AI的大脑里，原来天天都在开激烈的辩论会；而训练AI，竟然就像呵护一个需要犯错、需要折腾的“青春期”。我们还会聊聊，如何用优雅的数学工具给AI一套更聪明的“橡皮泥”，如何让大模型退居幕后帮你“造”一个更高效的AI，以及，怎么判断AI老师的“板书”是不是真的靠谱。准备好了吗？让我们一起出发。 00:00:33 AI建模，我们得到了一套更聪明的“橡皮泥”工具 00:07:19 AI的大脑里，原来天天在开会 00:12:55 聪明人的“笨功夫”，如何让AI帮你造一个AI？ 00:18:52 成大事者，为何要珍惜“犯错”的青春期？ 00:24:39 AI当老师，它的“板书”靠谱吗？本期介绍的几篇论文： [LG] Analytic Bijections for Smooth and Interpretable Normalizing Flows [University of Amsterdam] https://arxiv.org/abs/2601.10774 --- [CL] Reasoning Models Generate Societies of Thought [Google & University of Chicago] https://arxiv.org/abs/2601.10825 --- [LG] FORESTLLM: Large Language Models Make Random Forest Great on Few-shot Tabular Learning [National University of Singapore & Zhejiang University & University of British Columbia] https://arxiv.org/abs/2601.11311 --- [LG] Transient learning dynamics drive escape from sharp valleys in Stochastic Gradient Descent [Peking University & Zhejiang University] https://arxiv.org/abs/2601.10962 --- [CL] Do explanations generalize across large reasoning models? [Northeastern University & Microsoft Research] https://arxiv.org/abs/2601.11517

[人人能懂] 斜杠、专家、选择器与分寸感

今天，我们一同窥见了AI世界精巧的另一面：从注意力机制中类似“机械”的斜杠模式，到并行专家协作的优雅高效；从学会“如何选择”的元认知智慧，到预判趋势实现加速的数学之美，再到机器人通过巧妙设计获得的“分寸感”。这些最新论文告诉我们，通往更强人工智能的道路，不仅需要强大的算力，更充满了令人惊叹的巧思与智慧。 00:00:29 大模型里的‘斜杠’，一个被忽视的注意力模式 00:08:32 AI变聪明的秘密，不是读得更多，而是问得更巧 00:14:06 炼成全能AI的关键一步，选对方法，比埋头苦干更重要 00:20:15 AI绘画加速的秘密，如何让机器“预见”未来？ 00:25:36 机器人干活儿，差的那点“分寸感”怎么补？本期介绍的几篇论文： [LG] Demystifying the Slash Pattern in Attention: The Role of RoPE [National University of Singapore] https://arxiv.org/abs/2601.08297 --- [CL] Parallel Context-of-Experts Decoding for Retrieval Augmented Generation [EURECOM] https://arxiv.org/abs/2601.08670 --- [LG] SimMerge: Learning to Select Merge Operators from Similarity Signals [Cohere & Google] https://arxiv.org/abs/2601.09473 --- [LG] High-accuracy and dimension-free sampling with diffusions [UC Berkeley & Harvard University] https://arxiv.org/abs/2601.10708 --- [RO] In-the-Wild Compliant Manipulation with UMI-FT [Stanford University] https://arxiv.org/abs/2601.09988

30分钟

[人人能懂] AI如何“一箭双雕”、为何“叛逆”、怎样拥有“私密思想”

你有没有想过，AI也能像侦探一样，给蛋白质“看相”，给药丸“配对”吗？你有没有遇到过，你越不让AI说什么，它就越要说的“叛逆”时刻？本期节目，我们将一起钻进AI的“大脑”，看看最新论文是如何揭示AI的“语义引力井”，如何通过一个“私密小本本”让它告别失忆症，甚至让机器人学会“看着办”的灵巧跑酷，以及如何给AI装上一个聪明的“记忆管理员”，解决它的“内存焦虑”。准备好了吗？让我们一起出发！ 00:00:35 给蛋白质“看相”，给药丸“配对”，AI如何一箭双雕？ 00:07:43 为什么你越不让AI说什么，它就越要说？ 00:13:18 AI的“失忆症”，为什么你没法和它玩好一个猜谜游戏 00:19:06 让机器人“灵巧”起来，到底有多难？ 00:24:03 AI的“记忆”正在爆炸，我们能给它装个“忘得快”吗？本期介绍的几篇论文： [LG] Contrastive Geometric Learning Unlocks Unified Structure- and Ligand-Based Drug Design [Johannes Kepler University Linz & Merck Healthcare KGaA] https://arxiv.org/abs/2601.09693 --- [CL] Semantic Gravity Wells: Why Negative Constraints Backfire [Independent Researcher] https://arxiv.org/abs/2601.08070 --- [CL] LLMs Can't Play Hangman: On the Necessity of a Private Working Memory for Language Agents [Chandar Research Lab & LAMA-WeST Lab & Mila – Quebec AI Institute] https://arxiv.org/abs/2601.06973 --- [RO] Deep Whole-body Parkour [Tsinghua University] https://arxiv.org/abs/2601.07701 --- [LG] KVzap: Fast, Adaptive, and Faithful KV Cache Pruning [NVIDIA] https://arxiv.org/abs/2601.07891

30分钟

[人人能懂] 大力出奇迹的秘密与“抄作业”的智慧

这一期，我们将一口气潜入五篇最新论文的智慧深海，看看AI的世界又发生了哪些奇妙的变化。我们会一起探索，“大力出奇迹”这个口号背后，那张描绘AI生长规律的神秘“DNA”图谱；学习一种最高效的“偷懒”智慧，看看AI如何通过“抄自己的作业”来惊人地提速；我们还会给AI的大脑装上一部“专属字典”，让它的知识不仅能被检索，还能被精准地“手术”修改；更会戴上CT眼镜，看看聪明的AI解难题时，究竟是在严密推理，还是在玩一场高维度的“猜谜游戏”；最后，我们将学习一种资源管理的艺术，看AI如何像一位聪明的项目经理，把“好钢”用在最关键的“刀刃”上。准备好了吗？让我们一起出发！ 00:00:50 AI升级指南，大力出奇迹背后有地图？ 00:06:50 为什么说，最高效的偷懒是“抄作业”？ 00:11:56 给AI的大脑装一个“专属字典” 00:16:39 你的AI在思考，还是在蒙答案？ 00:22:20 AI世界的“好钢”，怎么用在刀刃上？本期介绍的几篇论文： [LG] On the origin of neural scaling laws: from random graphs to natural language [Meta Superintelligence Lab & Axiom Math] https://arxiv.org/abs/2601.10684 --- [LG] Single-Stage Huffman Encoder for ML Compression [Google LLC] https://arxiv.org/abs/2601.10673 --- [LG] STEM: Scaling Transformers with Embedding Modules [Meta AI & CMU] https://arxiv.org/abs/2601.10639 --- [LG] Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models [Shanghai Qi Zhi Institute] https://arxiv.org/abs/2601.10679 --- [CL] TRIM: Hybrid Inference via Targeted Stepwise Routing in Multi-Step Reasoning Tasks [Amazon & CMU] https://arxiv.org/abs/2601.10245

28分钟

[人人能懂] 从学徒、团队到工具大师

这期我们来聊聊，怎样把一个AI从“通才”培养成“专才”，甚至让它学会不只是解决问题，而是先为自己打造一套“专属工具”？一个AI团队如何开“聪明会”，以及它怎样才能记住过去的经验，不再傻乎乎地重复劳动？最后，我们会看到一个大胆的设想：要是我们干脆给AI换一个来自“流体力学”的引擎，又会发生什么？让我们一起进入今天的前沿探索。 00:00:31 给你一个好苗子，怎么把它培养成翻译大师？ 00:05:23 别让聪明人开“笨蛋会” 00:12:07 你的时间，只够用在刀刃上 00:17:22 高手做事，都是先打造工具 00:22:37 给AI换个“流体力学”引擎，会发生什么？本期介绍的几篇论文： [CL] TranslateGemma Technical Report [Google Translate Research Team] https://arxiv.org/abs/2601.09012 --- [LG] Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning [MIT & NYU & Microsoft] https://arxiv.org/abs/2601.09667 --- [LG] SRT: Accelerating Reinforcement Learning via Speculative Rollout with Tree-Structured Cache [Cornell University & University of Illinois Urbana-Champaign & University of Washington] https://arxiv.org/abs/2601.09083 --- [LG] Programming over Thinking: Efficient and Robust Multi-Constraint Planning [Nanyang Technological University & Agency for Science, Technology and Research (A*STAR)] https://arxiv.org/abs/2601.09097 --- [LG] Spectral Generative Flow Models: A Physics-Inspired Replacement for Vectorized Large Language Models [UC Berkeley] https://arxiv.org/abs/2601.08893

[人人能懂] 从“分身术”思考，到“反向”学习，再到“说人话”的KPI

你有没有想过，AI要如何像高手一样，同时“试驾”多种思路？我们又该如何给狂飙的AI装上“定速巡航”，让它在学习时永不“翻车”？今天，我们就从几篇最新的AI论文出发，聊一聊AI要如何学会“分身术”思考，如何跳出“思维定式”的陷阱，甚至，我们以后可能再也不用费劲地给AI设定KPI，直接“说人话”就能让它们完美协作。准备好了吗？让我们一起探索AI思考方式的深层变革。 00:00:35 如何像高手一样思考？答案可能在“分身术”里 00:05:07 给狂飙的AI装上定速巡航 00:09:57 思维定式是怎么炼成的？AI给了我们一个新答案 00:15:23 怎么让AI大模型学会“左右互搏”？ 00:21:37 AI界的“KPI”革命，未来我们不用再跟机器打哑谜本期介绍的几篇论文： [CL] Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge [Microsoft Research & University of Pennsylvania] https://arxiv.org/abs/2601.08808 --- [LG] Controlled LLM Training on Spectral Sphere [Microsoft Research Asia & Renmin University] https://arxiv.org/abs/2601.08393 --- [LG] Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs [MIT & NUS] https://arxiv.org/abs/2601.08763 --- [LG] Reverse Flow Matching: A Unified Framework for Online Reinforcement Learning with Diffusion and Flow Policies [MIT] https://arxiv.org/abs/2601.08136 --- [LG] The End of Reward Engineering: How LLMs Are Redefining Multi-Agent Coordination [New York University & Lerna AI] https://arxiv.org/abs/2601.08237

28分钟

[人人能懂] 记忆、谎言和顿悟的秘密

你有没有想过，一个更聪明的AI，是应该更会“思考”，还是更会“偷懒”？最新论文告诉我们，让AI学会用“记忆”分担计算，反而能让它更专注于难题。当AI面对一本几十万字的小说时，它又是如何像我们一样“做笔记”，避免“七秒记忆”的？更有趣的是，如果把AI关进小黑屋，不给任何学习资料，它竟能通过“左右互搏”实现自我进化。最后，我们会深入AI的内心世界，看看它“一本正经胡说八道”时，脑子里究竟走了哪两条路，以及它那令人惊叹的“举一反三”，可能根本不是在学习，而是在“对答案”。 00:00:48 为什么“偷懒”的AI，反而更会思考？ 00:06:15 AI的“七秒记忆”，有救了？ 00:11:46 AI的“闭关修炼”，不喂数据，如何变强？ 00:16:53 AI为什么会“一本正经地胡说八道”？它的脑子里有两条路 00:22:20 别再说AI在“学习”了，它可能只是在“对答案” 本期介绍的几篇论文： [LG] Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models [DeepSeek-AI] https://arxiv.org/abs/2505.11080 --- [LG] Gecko: An Efficient Neural Architecture Inherently Processing Sequences with Arbitrary Lengths [University of Southern California & Meta AI Research] https://arxiv.org/abs/2601.06463 --- [LG] Dr. Zero: Self-Evolving Search Agents without Training Data [Meta Superintelligence Labs] https://arxiv.org/abs/2601.07055 --- [CL] Two Pathways to Truthfulness: On the Intrinsic Encoding of LLM Hallucinations [Peking University & Microsoft Research Asia] https://arxiv.org/abs/2601.07422 --- [LG] Filtering Beats Fine Tuning: A Bayesian Kalman View of In Context Learning in LLMs [UC Berkeley] https://arxiv.org/abs/2601.06100

[人人能懂] 省内存、教笨蛋、做决策的三个锦囊

本期我们来聊聊AI世界里那些“反直觉”的智慧：当AI不再给商品打分而是直接“写”出排名，当语音助手不再被粗暴地对答案而是被“手把手”教会思考，当“不完美”的数据反而能帮我们做出更好的决策，一场关于效率和认知的革命正在悄然发生。最新论文告诉我们，解决难题最好的方法，有时是换一个全新的玩法。 00:00:28 你看到的结果，是谁为你排的序？ 00:07:42 AI大模型背后，一场关于“搬家”的效率革命 00:13:08 你的语音助手，为什么一开口就变笨了？ 00:18:01 AI的“思考开关”，是个美丽的误会？ 00:23:25 别再等了！“不完美”的数据也能做出好决策本期介绍的几篇论文： [IR] Autoregressive Ranking: Bridging the Gap Between Dual and Cross Encoders [Google DeepMind & University of Massachusetts Amherst] https://arxiv.org/abs/2601.05588 --- [LG] MoEBlaze: Breaking the Memory Wall for Efficient MoE Training on Modern GPUs [Meta Platforms Inc] https://arxiv.org/abs/2601.05296 --- [CL] Closing the Modality Reasoning Gap for Speech Large Language Models [Microsoft Corporation & The Chinese University of Hong Kong] https://arxiv.org/abs/2601.05543 --- [LG] Do Sparse Autoencoders Identify Reasoning Features in Language Models? [UC Berkeley] https://arxiv.org/abs/2601.05679 --- [LG] Good Allocations from Bad Estimates [Stanford University & Max Planck Institute for Intelligent Systems, Tübingen] https://arxiv.org/abs/2601.05597

[人人能懂] 从“左右脑”分工到“创新食谱”

你有没有想过，我们能否打造一个既有“文科生”的灵活，又有“理科生”严谨的AI？当一群“偏科”的AI专家聚在一起，如何才能组建一支高效的“梦之队”？本期节目，我们将一口气为你解读几篇最新论文，看看科学家们是如何通过巧妙的流程设计，让AI学会“左右脑”分工、进行词级别的精细协作，甚至拥有主动管理记忆的“断舍离”能力。最后，我们还会揭秘一份顶尖科学家的“创新食谱”。准备好了吗？让我们一起探索AI进化背后的智慧。 00:00:38 AI的“左右脑”，如何让它既灵活又靠谱 00:06:27 AI也“偏科”？我们如何组建一个“梦之队” 00:11:25 AI当科学家，为什么还是个“学徒”？ 00:20:09 让AI学会“断舍离”，它才能真正进化 00:25:12 科学家的创新，原来是有“食谱”的本期介绍的几篇论文： [LG] Structured Decomposition for LLM Reasoning: Cross-Domain Validation and Semantic Web Integration [Warsaw University of Technology] https://arxiv.org/abs/2601.01609 --- [CL] Token-Level LLM Collaboration via FusionRoute [Meta AI] https://arxiv.org/abs/2601.05106 --- [LG] Why LLMs Aren't Scientists Yet: Lessons from Four Autonomous Research Attempts [Lossfunk] https://arxiv.org/abs/2601.03315 --- [CL] Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents [Alibaba Group] https://arxiv.org/abs/2601.01885 --- [LG] Sci-Reasoning: A Dataset Decoding AI Innovation Patterns [Orchestra Research] https://arxiv.org/abs/2601.04577

31分钟

[人人能懂] 从复印机、免疫系统到作弊考生

你有没有想过，你每天使用的AI，可能正悄悄地把一整本《哈利波特》藏在“脑子”里？为了让AI变得更强，我们竟然要逼它和它所有的前辈“打群架”？本期节目，我们将一起揭开AI那些不为人知的“秘密”：从一个能让AI拥有完美记忆的“文件柜”，到一个既聪明又省钱的“免疫系统”，再到一场揪出AI“作弊考生”的全新考试。准备好了吗？让我们一起窥探AI大脑的奇妙内部。 00:00:32 你的AI，可能藏着一个图书馆 00:05:07 为什么说“盯住第一”是最大的陷阱？ 00:11:19 给AI安一个靠谱的“文件柜” 00:16:50 给AI装上一个“既聪明又省钱”的免疫系统 00:23:26 你的AI考了高分，但它真的看懂图了吗？本期介绍的几篇论文： [CL] Extracting books from production language models [Stanford University] https://arxiv.org/abs/2601.02671 --- [LG] Digital Red Queen: Adversarial Program Evolution in Core War with LLMs [MIT] https://arxiv.org/abs/2601.03335 --- [LG] Everything is Context: Agentic File System Abstraction for Context Engineering [University of New South Wales & ArcBlock, Inc & University of Tasmania] https://arxiv.org/abs/2512.05470 --- [LG] Constitutional Classifiers++: Efficient Production-Grade Defenses against Universal Jailbreaks [Anthropic] https://arxiv.org/abs/2601.04603 --- [LG] DatBench: Discriminative, Faithful, and Efficient VLM Evaluations [DatologyAI] https://arxiv.org/abs/2601.02316

30分钟

[人人能懂] 从潜在行动、结构化生成到奖励解耦

我们总希望AI更像一个聪明的伙伴，而不是一个笨拙的机器。但怎样才算“聪明”？本期节目，我们将透过几篇最新的研究，一起窥探AI学习智慧的深层秘密。我们会聊到，AI如何像婴儿一样，在无声的世界里自己“悟”出万物的规律；又如何像个特工，在“聊天模式”和“任务模式”间无缝切换；我们还会探讨，如何用一把精妙的尺子，量出AI学到的究竟是“真本事”还是“假把式”，以及如何避免它在多重目标下“偏科”，甚至沦为一个只会讨好规则的“马屁精”。 00:00:39 AI学会了“无师自通”，世界将有什么不同？ 00:06:21 给AI装上一个“万能遥控器” 00:12:57 AI上课也分“顿悟”和“补课”？一把尺子量出它学到了多少真本事 00:19:54 AI“偏科”怎么办？谈谈多目标奖励的艺术 00:25:33 “好学生”与“马屁精”，AI如何学会做个人本期介绍的几篇论文： [LG] Learning Latent Action World Models In The Wild [FAIR at Meta] https://arxiv.org/abs/2601.05230 --- [LG] XGrammar 2: Dynamic and Efficient Structured Generation Engine for Agentic LLMs [Shanghai Jiao Tong University & CMU] https://arxiv.org/abs/2601.04426 --- [LG] Excess Description Length of Learning Generalizable Predictors [UC Berkeley & Anthropic] https://arxiv.org/abs/2601.04728 --- [CL] GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization [NVIDIA] https://arxiv.org/abs/2601.05242 --- [CL] Learning to Simulate Human Dialogue [Stanford University] https://arxiv.org/abs/2601.04436

31分钟