The 15 papers covered in this episode:
[00:24] 🤖 Training Language Models to Self-Correct via Reinforcement Learning
[01:03] 📚 InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning
[01:40] 🔍 MMSearch: Benchmarking the Potential of Large Models as Multi-modal Search Engines
[02:19] 🌐 Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution
[02:55] 🎨 LVCD: Reference-based Lineart Video Colorization with Diffusion Models
[03:35] 🧠 B4: Towards Optimal Assessment of Plausible Code Solutions with Plausible Tests
[04:13] 📖 StoryMaker: Towards Holistic Consistent Characters in Text-to-image Generation
[04:59] 🌐 3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
[05:39] 🚀 Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization
[06:18] 🤖 Language Models Learn to Mislead Humans via RLHF
[06:59] 🎨 FlexiTex: Enhancing Texture Generation with Visual Guidance
[07:36] 🎥 Denoising Reuse: Exploiting Inter-frame Motion Consistency for Efficient Video Latent Generation
[08:13] 📚 MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions
[08:52] 🎙 CLAIR-A: Leveraging Large Language Models to Judge Audio Captions
[09:28] ⚡ 3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt

[Follow us]
You can also find us on the following platform for more content beyond the podcast:
Xiaohongshu (小红书): AI速递
