Album
时长:
5分钟
播放:
148
发布:
3周前
主播...
简介...
https://xiaoyuzhoufm.com
本期的 11 篇论文如下:
[00:20] 🚀 GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models(GLM-4.5:智能体、推理与编程(ARC)基础模型)
[00:47] 👕 Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off(Voost:一种统一且可扩展的双向虚拟试穿与试脱扩散Transformer)
[01:11] 🎯 InfiGUI-G1: Advancing GUI Grounding with Adaptive Exploration Policy Optimization(InfiGUI-G1:通过自适应探索策略优化推进 GUI 元素定位能力)
[01:34] 🧠 Memp: Exploring Agent Procedural Memory(Memp:探索智能体程序性记忆)
[02:03] ✂ Pruning the Unsurprising: Efficient Code Reasoning via First-Token Surprisal(剪枝非关键信息:基于首令牌惊奇度的高效代码推理)
[02:29] 🪄 GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing(GENIE:用于神经辐射场交互式编辑的高斯编码)
[02:50] 📚 Adapting Vision-Language Models Without Labels: A Comprehensive Survey(无标签视觉-语言模型适应:一项全面综述)
[03:15] 🌍 MELLA: Bridging Linguistic Capability and Cultural Groundedness for Low-Resource Language MLLMs(MELLA:弥合低资源语言多模态大语言模型的语言能力与文化扎根性)
[03:37] 🧱 MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh(MeshLLM:赋能大型语言模型逐步理解和生成3D网格)
[04:02] 🎯 UI-AGILE: Advancing GUI Agents with Effective Reinforcement Learning and Precise Inference-Time Grounding(UI-AGILE:以有效强化学习和精准推断时定位提升图形用户界面智能体)
[04:30] ✨ LightSwitch: Multi-view Relighting with Material-guided Diffusion(光开关:基于材料引导扩散的多视角重照明)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
评价...

空空如也

小宇宙热门评论...

暂无小宇宙热门评论

EarsOnMe

加入我们的 Discord

与播客爱好者一起交流

立即加入

扫描微信二维码

添加微信好友,获取更多播客资讯

微信二维码

播放列表

自动播放下一个

播放列表还是空的

去找些喜欢的节目添加进来吧