时长:
29分钟
播放:
151
发布:
5天前
主播...
简介...
你有没有想过,最顶尖的AI,它的智慧可能不是体现在无所不知,而是敢于坦诚地说出“我不知道”?本期节目,我们将一起探索AI如何学会这项宝贵的品质。我们还会揭秘,如何给AI装上一双“眼睛”让它在嘈杂派对里也能跟你轻松对话,如何用一个优美的公式教会它“速读”长篇报告,甚至让一份200页的PDF自己开口说话,并在一秒内找到AI画作的灵感“祖先”。准备好了吗?让我们一起进入AI更深邃、更智慧的内心世界。
00:00:39 AI画画的灵感,能秒速溯源吗?
00:06:29 大模型读书慢?给它一副聪明的“速读眼镜”
00:12:13 给AI一双眼睛,让它学会“察言观色”
00:16:37 AI的最高智慧,是承认自己不知道
00:22:56 如何让一份200页的PDF,自己开口说话?
本期介绍的几篇论文:
[CV] Fast Data Attribution for Text-to-Image Models
[CMU & Adobe Research & UC Berkeley]
https://arxiv.org/abs/2511.10721
---
[LG] Optimizing Mixture of Block Attention
[MIT]
https://arxiv.org/abs/2511.11571
---
[CL] AV-Dialog: Spoken Dialogue Models with Audio-Visual Input
[University of Washington & Meta AI Research]
https://arxiv.org/abs/2511.11124
---
[LG] Honesty over Accuracy: Trustworthy Language Models through Reinforced Hesitation
[Toyota Technological Institute at Chicago & University of California, San Diego]
https://arxiv.org/abs/2511.11500
---
[CL] Information Extraction From Fiscal Documents Using LLMs
[Google Inc & XKDR Forum]
https://arxiv.org/abs/2511.10659
00:00:39 AI画画的灵感,能秒速溯源吗?
00:06:29 大模型读书慢?给它一副聪明的“速读眼镜”
00:12:13 给AI一双眼睛,让它学会“察言观色”
00:16:37 AI的最高智慧,是承认自己不知道
00:22:56 如何让一份200页的PDF,自己开口说话?
本期介绍的几篇论文:
[CV] Fast Data Attribution for Text-to-Image Models
[CMU & Adobe Research & UC Berkeley]
https://arxiv.org/abs/2511.10721
---
[LG] Optimizing Mixture of Block Attention
[MIT]
https://arxiv.org/abs/2511.11571
---
[CL] AV-Dialog: Spoken Dialogue Models with Audio-Visual Input
[University of Washington & Meta AI Research]
https://arxiv.org/abs/2511.11124
---
[LG] Honesty over Accuracy: Trustworthy Language Models through Reinforced Hesitation
[Toyota Technological Institute at Chicago & University of California, San Diego]
https://arxiv.org/abs/2511.11500
---
[CL] Information Extraction From Fiscal Documents Using LLMs
[Google Inc & XKDR Forum]
https://arxiv.org/abs/2511.10659
评价...
空空如也
小宇宙热门评论...
暂无小宇宙热门评论