HuggingFace Daily AI Paper Digest - 2026.01.06 | K-EXAONE MoE; NextFlow Unified Sequential Modeling for Multimodality - EarsOnMe

Episode description

Source: Xiaoyuzhou (小宇宙)

The 15 papers covered in this episode are:

[00:21] 🧠 K-EXAONE Technical Report

[00:56] 🚀 NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

[01:36] 🎭 DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer

[02:19] 🎨 VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation

[03:04] 🚀 GARDO: Reinforcing Diffusion Models without Reward Hacking

[03:41] 🎨 VINO: A Unified Visual Generator with Interleaved OmniModal Context

[04:17] ♾ InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams

[04:54] 🧠 Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits

[05:23] 🚀 Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling

[05:57] 🔄 Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes

[06:43] 🔄 Recursive Language Models

[07:12] 🧠 KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs

[07:51] ⚠ COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs

[08:52] 🛰 Toward Stable Semi-Supervised Remote Sensing Segmentation via Co-Guidance and Co-Fusion

[09:40] 🧱 SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving

【Follow us】

You can also find us on the following platform for more information beyond the podcast:

Xiaohongshu (小红书): AI速递
