Album
时长:
19分钟
播放:
224
发布:
3个月前
主播...
简介...
https://xiaoyuzhoufm.com

00:00:32 AI也会“偏科”?高手如何跳出舒适区 


00:05:55 AI进化论:从“伸手党”到“高手的秘密” 


00:10:42 AI的“体检”新思路:如何看穿一个模型的“小心思”? 


00:15:17 一百万学生教会我们的事:简单,可能就是最优解 


本期介绍的四篇论文:


[LG] RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization  


[Tongyi Lab, Alibaba Group & Peking University]  


https://arxiv.org/abs/2508.00222  


---



[LG] MetaAgent: Toward Self-Evolving Agent via Tool Meta-Learning  


[BAAI]  


https://arxiv.org/abs/2508.00271  


---



[LG] Watch the Weights: Unsupervised monitoring and control of fine-tuned LLMs  


[Carnegie Mellon University (CMU)]  


https://arxiv.org/abs/2508.00161  


---



[LG] Learning to Optimize Feedback for One Million Students: Insights from Multi-Armed and Contextual Bandits in Large-Scale Online Tutoring  


[Carnegie Mellon University (CMU) & CK-12 Foundation]  


https://arxiv.org/abs/2508.00270  

评价...

空空如也

小宇宙热门评论...

暂无小宇宙热门评论

EarsOnMe

加入我们的 Discord

与播客爱好者一起交流

立即加入

扫描微信二维码

添加微信好友,获取更多播客资讯

微信二维码

播放列表

自动播放下一个

播放列表还是空的

去找些喜欢的节目添加进来吧