AI可可AI生活 - AI前沿：从代码生成到自动化科研 - EarsOnMe

主播

节目简介

来源：小宇宙

[CL] DiffuCoder：Understanding and Improving Masked Diffusion Models for Code Generation

[Apple]

https://arxiv.org/abs/2506.20639

---

[LG] Language Modeling by Language Models

[Allen Institute for AI]

https://arxiv.org/abs/2506.20249

---

[CL] Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs

[Harvard University]

https://arxiv.org/abs/2506.20666

---

[LG] Mastering Multiple-Expert Routing: Realizable H-Consistency and Strong Guarantees for Learning to Defer

[Courant Institute of Mathematical Sciences & Google Research]

https://arxiv.org/abs/2506.20650

---

[LG] Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards

[FAIR at Meta]

https://arxiv.org/abs/2506.20520

AI前沿：从代码生成到自动化科研