HuggingFace Daily AI Paper Digest - 2025.04.29 | RepText improves multilingual text rendering; LLMs improve phone GUI automation. - EarsOnMe


Duration: 8 minutes

Plays: 111

Published: 4 months ago

Host: 拨号上网

Description

The 11 papers in this episode are:

[00:23] ✍ RepText: Rendering Visual Text via Replicating

[01:02] 📱 LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects

[01:44] 🔐 CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges

[02:30] 🤔 Clinical knowledge in LLMs does not translate to human interactions

[03:16] ⬇ Group Downsampling with Equivariant Anti-aliasing

[03:59] 📐 TrustGeoGen: Scalable and Formal-Verified Data Engine for Trustworthy Multi-modal Geometric Problem Solving

[04:39] 🤖 SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning

[05:30] 🖼 Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency

[06:15] 🚀 MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention

[06:49] 🔑 ICL CIPHERS: Quantifying "Learning" in In-Context Learning via Substitution Ciphers

[07:30] 💡 ChiseLLM: Unleashing the Power of Reasoning LLMs for Chisel Agile Hardware Development

[Follow us]

You can also find us on the following platform for more information beyond the podcast content:

Xiaohongshu: AI速递
