https://babi.com/

Podcast: HuggingFace 每日AI论文速递 (HuggingFace Daily AI Paper Digest) - EarsOnMe
Trending AI papers in a 10-minute quick read

Host: 拨号上网
Publisher: 佚名 (anonymous)
Subscribers: 12,500
Episodes: 508
Last updated: 1 day ago
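About this podcast
Ten minutes a day to bring you up to speed on the day's trending AI papers on HuggingFace. New episodes every weekday; subscriptions welcome. 📢 Find the show on 小宇宙 (Xiaoyuzhou) and Apple Podcasts by searching for 【HuggingFace 每日AI论文速递】. 🖼 An illustrated text edition is also available: search for and follow 【AI速递】 on 小红书 (Xiaohongshu).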
Episodes of HuggingFace 每日AI论文速递

2026.02.02 | ASTRA synthesizes trajectories to train tool use; THINKSAFE self-aligns for safety


【Sponsor】 Listen to AI每周谈 on your commute. Every week, AI每周谈 recaps the previous week's major AI news. Link 🔗 https://www.xiaoyuzhoufm.com/podcast/688a34636f5a275f1cba40fd

【Contents】 The 15 papers in this episode:
[00:33] 🤖 ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas
[01:22] 🛡 THINKSAFE: Self-Generated Safety Alignment for Reasoning Models
[02:18] 🧠 TTCS: Test-Time Curriculum Synthesis for Self-Evolving
[03:09] 🍌 PaperBanana: Automating Academic Illustration for AI Scientists
[03:51] 🔬 FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation
[04:40] 🧠 ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought
[05:22] 🎯 SSL: Sweet Spot Learning for Differentiated Guidance in Agentic Optimization
[06:02] 🎯 DenseGRPO: From Sparse to Dense Reward for Flow Matching Model Alignment
[07:08] 🧠 Pushing the Boundaries of Natural Reasoning: Interleaved Bonus from Formal-Logic Verification
[07:55] 📄 PaddleOCR-VL-1.5: Towards a Multi-Task 0.9B VLM for Robust In-the-Wild Document Parsing
[08:45] 🎬 DreamActor-M2: Universal Character Image Animation via Spatiotemporal In-Context Learning
[09:42] 🧠 MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning
[10:24] 🦢 Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text
[11:13] 📊 Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling
[12:00] ⚡ RM-RF: Reward Model for Run-Free Unit Test Evaluation

【Follow us】 You can also find us on the following platform for more than the podcast itself covers. 小红书 (Xiaohongshu): AI速递

13 minutes
56
1 day ago

【Month-End Special】 January's hottest AI papers | mHC stabilizes gradients; GDPO tackles multiple rewards


【Contents】 The 10 papers in this episode:
[00:42] TOP1 (🔥292) | 🧠 mHC: Manifold-Constrained Hyper-Connections
[03:06] TOP2 (🔥212) | 📈 GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization
[04:45] TOP3 (🔥209) | 🔍 Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning
[06:59] TOP4 (🔥193) | 👶 BabyVision: Visual Reasoning Beyond Language
[08:57] TOP5 (🔥190) | 🚀 STEP3-VL-10B Technical Report
[10:39] TOP6 (🔥186) | 🤖 Agentic Reasoning for Large Language Models
[12:58] TOP7 (🔥181) | 🧹 Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs
[15:19] TOP8 (🔥171) | 🧠 LongCat-Flash-Thinking-2601 Technical Report
[17:22] TOP9 (🔥165) | 🗺 Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization
[19:17] TOP10 (🔥158) | 🧠 Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives

22 minutes
98
2 days ago

2026.01.30 | Spatial intelligence benchmarks miss the mark; Idea2Story drafts a paper in one click


【Contents】 The 15 papers in this episode:
[00:29] 🧭 Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models
[01:21] 🧠 Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives
[02:19] ⚡ Scaling Embeddings Outperforms Scaling Experts in Language Models
[02:58] 🔍 OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models
[03:39] 🤖 DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation
[04:33] 🧠 MMFineReason: Closing the Multimodal Reasoning Gap via Open Data-Centric Methods
[05:20] 🔺 PLANING: A Loosely Coupled Triangle-Gaussian Framework for Streaming 3D Reconstruction
[06:08] 🧠 ConceptMoE: Adaptive Token-to-Concept Compression for Implicit Compute Allocation
[07:01] 🧩 AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts
[07:43] 🧠 Exploring Reasoning Reward Model for Agents
[08:39] 🎤 Qwen3-ASR Technical Report
[09:27] 🚀 Language-based Trial and Error Falls Behind in the Era of Experience
[10:16] 🌐 Typhoon-S: Minimal Open Post-Training for Sovereign Large Language Models
[11:02] ⚡ Scalable Power Sampling: Unlocking Efficient, Training-Free Reasoning for LLMs via Distribution Sharpening
[11:59] 🧠 MAD: Modality-Adaptive Decoding for Mitigating Cross-Modal Hallucinations in Multimodal Large Language Models

13 minutes
99+
4 days ago
Reviews of HuggingFace 每日AI论文速递

No reviews yet
