2024.07.08 Daily AI Papers

Hugging Face Daily AI Paper Digest: in 10 minutes a day, get a quick overview of the day's trending AI papers on Hugging Face.

Today's 15 papers are:

🌐 Unveiling Encoder-Free Vision-Language Models
🗣️ FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs
🧠 AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents
📚 Learning to (Learn at Test Time): RNNs with Expressive Hidden States
📊 ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
📈 RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
🗣️ Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge
🧠 DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning
🛡️ Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks
📊 On scalable oversight with weak LLMs judging strong LLMs
🎥 Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams
📊 HEMM: Holistic Evaluation of Multimodal Foundation Models
🤝 LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs
📷 CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images
🔍 Granular Privacy Control for Geolocation with Vision Language Models

10 minutes
30
1 year ago
EarsOnMe
