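HuggingFace Daily AI Paper Digest - 2025.10.20 | RPC pruning speeds up reasoning while preserving accuracy; OmniVinci tops cross-modal understanding with small data - EarsOnMe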

Episode description

Source: Xiaoyuzhou (小宇宙)

The 15 papers covered in this episode are:

[00:20] 🧠 A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

[01:04] 🌐 OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

[01:44] 🎬 Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

[02:28] ✂ NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks

[03:05] 🛰 Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery

[03:41] ⚠ Emergent Misalignment via In-Context Learning: Narrow in-context examples can produce broadly misaligned LLMs

[04:18] 🧬 Latent Diffusion Model without Variational Autoencoder

[04:52] 📸 LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal

[05:30] 🧠 MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning

[06:14] 🧠 A²FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning

[06:56] 🗣 Language Models Model Language

[07:36] 🖼 BLIP3o-NEXT: Next Frontier of Native Image Generation

[08:30] 🌐 Paper2Web: Let's Make Your Paper Alive!

[09:12] 🔬 Foundation Models for Scientific Discovery: From Paradigm Enhancement to Paradigm Transition

[09:55] 🔍 Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents

【Follow us】

You can also find us on the following platforms for more information beyond the podcast itself:

Xiaohongshu (小红书): AI速递
