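2026.04.24 | LLaTiSA teaches models to read time series through four difficulty levels; WorldMark tests video world models with a unified benchmark
HuggingFace Daily AI Paper Digest
【Contents】
The 15 papers in this episode are: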
00:23 📈 LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics
01:11 🎮 WorldMark: A Unified Benchmark Suite for Interactive Video World Models
01:54 🤖 UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling
02:44 🎨 StyleID: A Perception-Aware Dataset and Metric for Stylization-Agnostic Facial Identity Recognition
03:56 ⏩ Seeing Fast and Slow: Learning the Flow of Time in Videos
04:39 ⚡ TingIS: Real-time Risk Event Discovery from Noisy Customer Incidents at Enterprise Scale
05:16 🧠 Hybrid Policy Distillation for LLMs
05:48 🧠 Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
06:44 🤖 VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation
07:43 🧩 Context Unrolling in Omni Models
08:31 🎨 EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model
09:34 🔗 UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection
10:25 🌐 WebGen-R1: Incentivizing Large Language Models to Generate Functional and Aesthetic Websites with Reinforcement Learning
11:14 🔍 Trust but Verify: Introducing DAVinCI -- A Framework for Dual Attribution and Verification in Claim Inference for Language Models
12:11 🔍 Explainable Disentangled Representation Learning for Generalizable Authorship Attribution in the Era of Generative AI
【Follow Us】
You can also find us on the following platforms for more information beyond the podcast itself
Xiaohongshu (小红书): AI速递