HuggingFace 每日AI论文速递
10分钟速读热门AI论文

Album
主播:
拨号上网
出版方:
佚名
订阅数:
9,199
集数:
378
最近更新:
1天前
播客简介...
每天10分钟,带您快速了解当日HuggingFace热门AI论文内容。每个工作日更新,欢迎订阅。 📢播客节目在小宇宙、Apple Podcast平台搜索【HuggingFace 每日AI论文速递】 🖼另外还有图文版,可在小红书搜索并关注【AI速递】
HuggingFace 每日AI论文速递的创作者...
HuggingFace 每日AI论文速递的节目...

2025.09.08 | 语言模型幻觉源于预训练;大模型图形编程性能提升

HuggingFace 每日AI论文速递

本期的 12 篇论文如下: [00:24] 🤔 Why Language Models Hallucinate(语言模型为何产生幻觉) [00:47] 🎨 Symbolic Graphics Programming with Large Language Models(使用大型语言模型进行符号化图形编程) [01:17] ⚡ Set Block Decoding is a Language Model Inference Accelerator(集合块解码:一种语言模型推理加速器) [01:43] 🎼 WildScore: Benchmarking MLLMs in-the-Wild Symbolic Music Reasoning(WildScore:多模态大语言模型在真实场景下的符号音乐推理基准测试) [02:14] 🌍 LatticeWorld: A Multimodal Large Language Model-Empowered Framework for Interactive Complex World Generation(LatticeWorld:基于多模态大语言模型的交互式复杂世界生成框架) [02:42] 💡 LuxDiT: Lighting Estimation with Video Diffusion Transformer(LuxDiT:基于视频扩散变换器的光照估计) [03:15] 📷 WinT3R: Window-Based Streaming Reconstruction with Camera Token Pool(WinT3R:基于窗口流式重建与相机令牌池) [03:44] 📉 On Robustness and Reliability of Benchmark-Based Evaluation of LLMs(基于基准测试的LLM评估的鲁棒性与可靠性研究) [04:07] 🔍 MedVista3D: Vision-Language Modeling for Reducing Diagnostic Errors in 3D CT Disease Detection, Understanding and Reporting(MedVista3D:用于减少3D CT疾病检测、理解和报告中诊断错误的视觉语言建模) [04:43] 🦾 U-ARM : Ultra low-cost general teleoperation interface for robot manipulation(U-ARM:用于机器人操作的超低成本通用遥操作接口) [05:16] 🔍 Behavioral Fingerprinting of Large Language Models(大型语言模型的行为指纹识别) [05:45] 🚀 Bootstrapping Task Spaces for Self-Improvement(自改进任务空间的引导构建) 【关注我们】 您还可以在以下平台找到我们,获得播客内容以外更多信息 小红书: AI速递

6分钟
85
1天前

2025.09.05 | 大型语言模型语义理解弱;图像编辑模型提升几何估计

HuggingFace 每日AI论文速递

本期的 13 篇论文如下: [00:22] 🤔 Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth(废话学:用深度解读无意义内容挑战大型语言模型) [00:47] 📐 From Editor to Dense Geometry Estimator(从编辑模型到密集几何估计器) [01:08] 🧠 Towards a Unified View of Large Language Model Post-Training(迈向大语言模型后训练的统一视角) [01:39] 🔄 Inverse IFEval: Can LLMs Unlearn Stubborn Training Conventions to Follow Real Instructions?(逆向IFEval:大型语言模型能否摒弃顽固训练惯例以遵循真实指令?) [02:05] 🔬 DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks(深度研究竞技场:基于研讨会任务对大语言模型研究能力的首次考核) [02:26] 🚀 Transition Models: Rethinking the Generative Learning Objective(过渡模型:重新思考生成式学习目标) [02:54] 🔍 NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings(NER检索器:基于类型感知嵌入的零样本命名实体检索) [03:24] ⚡ Few-step Flow for 3D Generation via Marginal-Data Transport Distillation(基于边缘数据传输蒸馏的少步流3D生成方法) [03:53] 🎬 Video-MTR: Reinforced Multi-Turn Reasoning for Long Video Understanding(视频多轮推理:面向长视频理解的强化多轮推理框架) [04:19] 🎭 Durian: Dual Reference-guided Portrait Animation with Attribute Transfer(Durian:基于双参考引导的肖像动画与属性迁移) [04:47] 📐 Drawing2CAD: Sequence-to-Sequence Learning for CAD Generation from Vector Drawings(Drawing2CAD:基于序列到序列学习的矢量绘图CAD生成) [05:24] 🧠 Delta Activations: A Representation for Finetuned Large Language Models(Delta激活:微调大型语言模型的一种表示方法) [06:01] ⚠ False Sense of Security: Why Probing-based Malicious Input Detection Fails to Generalize(虚假安全感:为何基于探测的恶意输入检测方法难以泛化) 【关注我们】 您还可以在以下平台找到我们,获得播客内容以外更多信息 小红书: AI速递

6分钟
86
4天前
HuggingFace 每日AI论文速递的评价...

空空如也

EarsOnMe

加入我们的 Discord

与播客爱好者一起交流

立即加入

扫描微信二维码

添加微信好友,获取更多播客资讯

微信二维码

播放列表

自动播放下一个

播放列表还是空的

去找些喜欢的节目添加进来吧