HuggingFace 每日AI论文速递
10分钟速读热门AI论文

Album
主播:
拨号上网
出版方:
佚名
订阅数:
6772
集数:
255
最近更新:
1天前
评分
暂无评分
0人评价
5星
0%
4星
0%
3星
0%
2星
0%
1星
0%
播客简介...
每天10分钟,带您快速了解当日HuggingFace热门AI论文内容。每个工作日更新,欢迎订阅。 📢播客节目在小宇宙、Apple Podcast平台搜索【HuggingFace 每日AI论文速递】 🖼另外还有图文版,可在小红书搜索并关注【AI速递】
HuggingFace 每日AI论文速递的创作者...
拨号上网
HuggingFace 每日AI论文速递的音频...

2025.04.22 | LUFFY提升推理性能;FlowReasoner增强系统适应性。

本期的 15 篇论文如下: [00:25] 🧠 Learning to Reason under Off-Policy Guidance(离线策略指导下的推理学习) [01:00] 🤖 FlowReasoner: Reinforcing Query-Level Meta-Agents(FlowReasoner:强化查询级别元代理) [01:40] 🦅 Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models(Eagle 2.5:提升前沿视觉-语言模型长文本后训练性能) [02:22] 🧰 ToolRL: Reward is All Tool Learning Needs(工具强化学习:奖励是工具学习的全部) [03:07] 🌐 SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation(SphereDiff:通过球面潜在表示实现免调优全景图像和视频生成) [03:39] 🎨 StyleMe3D: Stylization with Disentangled Priors by Multiple Encoders on 3D Gaussians(StyleMe3D:基于3D高斯的解耦先验多编码器风格化) [04:18] 🤖 X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents(X-Teaming:基于自适应多智能体的多轮越狱与防御) [04:57] 🤖 UFO2: The Desktop AgentOS(UFO2:桌面AgentOS) [05:34] 🧑 LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs(LeetCodeDataset:一个用于代码大语言模型稳健评估和高效训练的时序数据集) [06:18] 👀 Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs(换个角度看世界:评估多模态大语言模型中的多视角理解能力) [07:02] 🤖 InfiGUI-R1: Advancing Multimodal GUI Agents from Reactive Actors to Deliberative Reasoners(InfiGUI-R1:推进多模态GUI智能体从反应式执行者到审慎推理者的演进) [07:42] 🕹 EasyEdit2: An Easy-to-use Steering Framework for Editing Large Language Models(EasyEdit2:一种用于编辑大型语言模型的简易操控框架) [08:23] 📱 LearnAct: Few-Shot Mobile GUI Agent with a Unified Demonstration Benchmark(LearnAct:基于统一演示基准的少样本移动GUI智能体) [09:06] 🖼 LookingGlass: Generative Anamorphoses via Laplacian Pyramid Warping(窥镜:基于拉普拉斯金字塔扭曲的生成式畸变图像) [09:50] 🎵 DRAGON: Distributional Rewards Optimize Diffusion Generative Models(DRAGON:利用分布奖励优化扩散生成模型) 【关注我们】 您还可以在以下平台找到我们,获得播客内容以外更多信息 小红书: AI速递

10分钟
41
1天前

2025.04.21 | 强化学习未提升新推理能力;MIG优化指令微调数据选择。

本期的 9 篇论文如下: [00:22] 🤔 Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?(强化学习真的能激励大语言模型产生超越基础模型的推理能力吗?) [00:59] 🧠 MIG: Automatic Data Selection for Instruction Tuning by Maximizing Information Gain in Semantic Space(MIG:通过最大化语义空间中的信息增益实现指令微调的自动数据选择) [01:41] 🤔 Could Thinking Multilingually Empower LLM Reasoning?(多语思考能否增强大型语言模型的推理能力?) [02:25] 🏙 AerialMegaDepth: Learning Aerial-Ground Reconstruction and View Synthesis(AerialMegaDepth:学习空中-地面重建与视角合成) [03:09] 🏠 HiScene: Creating Hierarchical 3D Scenes with Isometric View Generation(HiScene:利用等距视图生成创建分层3D场景) [03:52] 💡 NodeRAG: Structuring Graph-based RAG with Heterogeneous Nodes(NodeRAG:使用异构节点构建的基于图结构的RAG) [04:30] 🧠 It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization(一切皆有关联:一次关于测试时记忆、注意力偏差、保留和在线优化的探索之旅) [05:07] 🏞 Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images(令牌化图像块:用于大型图像中有效去雾的全局上下文融合) [05:51] 🧠 Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models(思想操控:外部思想能够有效应用于大型推理模型) 【关注我们】 您还可以在以下平台找到我们,获得播客内容以外更多信息 小红书: AI速递

6分钟
90
2天前

2025.04.18 | CLIMB提升领域模型表现;反蒸馏采样防止模型被盗用。

本期的 15 篇论文如下: [00:23] 🗂 CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training(CLIMB:基于聚类的迭代数据混合引导预训练方法) [01:03] 🧪 Antidistillation Sampling(反蒸馏采样) [01:41] 🤝 A Strategic Coordination Framework of Small LLMs Matches Large LLMs in Data Synthesis(小型LLM的策略协调框架在数据合成方面与大型LLM相媲美) [02:26] 🎬 Packing Input Frame Context in Next-Frame Prediction Models for Video Generation(视频生成中基于帧打包的下一帧预测模型) [03:02] 🤖 Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling(生成,但验证:通过回顾重采样减少视觉-语言模型中的幻觉) [03:43] 🧠 WORLDMEM: Long-term Consistent World Simulation with Memory(WORLDMEM:基于记忆的长期一致性世界模拟) [04:27] 🎬 VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models(VistaDPO:用于大型视频模型的分层时空直接偏好优化) [05:01] 🤖 NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation(NoisyRollout:利用数据增强强化视觉推理) [05:43] 🎨 DMM: Building a Versatile Image Generation Model via Distillation-Based Model Merging(DMM:构建基于蒸馏模型合并的通用图像生成模型) [06:20] 📊 ChartQAPro: A More Diverse and Challenging Benchmark for Chart Question Answering(ChartQAPro:一个更多样化和更具挑战性的图表问答基准) [07:07] 🤖 Exploring Expert Failures Improves LLM Agent Tuning(探索专家失败案例以提升LLM Agent的调优效果) [07:48] 🎨 InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework(InstantCharacter:使用可扩展的扩散Transformer框架个性化任何角色) [08:26] 📸 CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color Constancy(CCMNet:利用校准颜色校正矩阵实现跨相机色彩恒常性) [09:06] 🎬 FocusedAD: Character-centric Movie Audio Description(聚焦AD:以角色为中心的电影音频描述) [09:39] 🤔 Retrieval-Augmented Generation with Conflicting Evidence(检索增强生成与冲突证据) 【关注我们】 您还可以在以下平台找到我们,获得播客内容以外更多信息 小红书: AI速递

10分钟
85
5天前
HuggingFace 每日AI论文速递的评价...

空空如也

EarsOnMe

加入我们的 Discord

与播客爱好者一起交流

立即加入

播放列表

自动播放下一个

播放列表还是空的

去找些喜欢的节目添加进来吧