The 10 papers in this episode are:
[00:40] TOP1 (🔥129) | 🤖 Training Language Models to Self-Correct via Reinforcement Learning
[02:41] TOP2 (🔥121) | 🚀 Qwen2.5-Coder Technical Report
[04:44] TOP3 (🔥96) | 🌐 Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
[06:30] TOP4 (🔥95) | 🖼 Guide-and-Rescale: Self-Guidance Mechanism for Effective Tuning-Free Real Image Editing
[08:23] TOP5 (🔥86) | 🧠 Attention Heads of Large Language Models: A Survey
[10:17] TOP6 (🔥85) | 🎥 Loopy: Taming Audio-Driven Portrait Avatar with Long-Term Motion Dependency
[11:56] TOP7 (🔥81) | 🌐 OmniGen: Unified Image Generation
[13:51] TOP8 (🔥81) | 🧠 Emu3: Next-Token Prediction is All You Need
[15:45] TOP9 (🔥78) | 📄 General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
[17:59] TOP10 (🔥77) | 🧠 OLMoE: Open Mixture-of-Experts Language Models

【Follow us】
You can also find us on the following platform for more beyond the podcast:
Xiaohongshu: AI速递
