[人人能懂] 如何打造一个我们能真正信赖的AI?
AI可可AI生活

[人人能懂] 如何打造一个我们能真正信赖的AI?

25分钟 140 6个月前
节目简介
来源:小宇宙

00:00:27  AI的“心术”:我们能给它装上一个诚实开关吗?


00:05:08 比正确更重要的,是正确地思考


00:10:18  省钱的艺术:如何让“实习生”干好“专家”的活?


00:15:44 AI的“一键删除”,真的能删除吗?


00:21:08 AI训练的“终局思维”:把答案直接告诉你


本期介绍的几篇论文:


[LG] Can LLMs Lie? Investigation beyond Hallucination  


[Carnegie Mellon University (CMU)]  


https://arxiv.org/abs/2509.03518  


---


[LG] Beyond Correctness: Harmonizing Process and Outcome Rewards through RL Training  


[Amazon]  


https://arxiv.org/abs/2509.03403  


---


[LG] Cut Costs, Not Accuracy: LLM-Powered Data Processing with Guarantees  


[University of California, Berkeley]  


https://arxiv.org/abs/2509.02896  


---


[LG] Unlearning That Lasts: Utility-Preserving, Robust, and Almost Irreversible Forgetting in LLMs  


[University of Tübingen & EPFL]  


https://arxiv.org/abs/2509.02820  


---


[LG] Imitate Optimal Policy: Prevail and Induce Action Collapse in Policy Gradient  


[University of Sydney & King Abdullah University of Science and Technology]  


https://arxiv.org/abs/2509.02737  

评价

空空如也

小宇宙热评

暂无小宇宙热门评论

加入我们的 Discord

与播客爱好者一起交流

立即加入

扫描微信二维码

添加微信好友,获取更多播客资讯

微信二维码

播放列表

自动播放下一个

播放列表还是空的

去找些喜欢的节目添加进来吧