The 14 papers in this episode are:
[00:23] 👁 VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models
[01:08] 🎭 DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning
[01:46] 🌐 Trillion 7B Technical Report
[02:30] 💡 Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model
[03:11] 🧩 I-Con: A Unifying Framework for Representation Learning
[03:50] 🧩 Decoupled Global-Local Alignment for Improving Compositional Understanding
[04:30] 🎨 DreamO: A Unified Framework for Image Customization
[05:12] 💡 Tina: Tiny Reasoning Models via LoRA
[05:49] 🛡 A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment
[06:30] 🧐 RePOPE: Impact of Annotation Errors on the POPE Benchmark
[07:06] 💡 Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading
[07:46] 🛠 CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation
[08:29] ✅ Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA
[09:21] 🖼 Progressive Language-guided Visual Learning for Multi-Task Visual Grounding

[Follow Us]
You can also find us on the following platform for more information beyond the podcast:
Xiaohongshu: AI速递
