[CL] DiffuCoder:Understanding and Improving Masked Diffusion Models for Code Generation
[Apple]
https://arxiv.org/abs/2506.20639
---
[LG] Language Modeling by Language Models
[Allen Institute for AI]
https://arxiv.org/abs/2506.20249
---
[CL] Inside you are many wolves: Using cognitive models to interpret value trade-offs in LLMs
[Harvard University]
https://arxiv.org/abs/2506.20666
---
[LG] Mastering Multiple-Expert Routing: Realizable H-Consistency and Strong Guarantees for Learning to Defer
[Courant Institute of Mathematical Sciences & Google Research]
https://arxiv.org/abs/2506.20650
---
[LG] Asymmetric REINFORCE for off-Policy Reinforcement Learning: Balancing positive and negative rewards
[FAIR at Meta]
https://arxiv.org/abs/2506.20520
空空如也
暂无小宇宙热门评论