本期的 11 篇论文如下:
[00:23] ✍ RepText: Rendering Visual Text via Replicating(RepText:通过复制渲染视觉文本)
[01:02] 📱 LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects(LLM驱动的手机GUI代理:进展与展望)
[01:44] 🔐 CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges(CipherBank:通过密码学挑战探索大型语言模型推理能力的边界)
[02:30] 🤔 Clinical knowledge in LLMs does not translate to human interactions(大型语言模型中的临床知识未能转化为人际互动)
[03:16] ⬇ Group Downsampling with Equivariant Anti-aliasing(群等变抗锯齿降采样)
[03:59] 📐 TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving(TrustGeoGen:用于可信多模态几何问题求解的可扩展且形式验证的数据引擎)
[04:39] 🤖 SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning(SPC:通过对抗博弈演进自博弈评论器以提升大型语言模型推理能力)
[05:30] 🖼 Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency(基于显式视觉依赖的多模态数学推理能力基准测试)
[06:15] 🚀 MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention(MMInference:通过模态感知置换稀疏注意力加速长文本VLM的预填充)
[06:49] 🔑 ICL CIPHERS: Quantifying "Learning'' in In-Context Learning via Substitution Ciphers(ICL密码:通过替换密码量化上下文学习中的“学习”)
[07:30] 💡 ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development(ChiseLLM:释放推理LLM在Chisel敏捷硬件开发中的力量)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递

空空如也
暂无小宇宙热门评论