节目列表: HuggingFace 每日AI论文速递 - EarsOnMe

HuggingFace 每日AI论文速递
节目列表

2024.07.08 每日AI论文

Hugging Face 每日AI论文速递每天10分钟，带您快速了解当日HuggingFace热门AI论文内容今天带来的 15 篇论文如下： 🌐 Unveiling Encoder-Free Vision-Language Models（揭示无编码器的视觉-语言模型） 🗣️ FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs（FunAudioLLM：用于增强人类与大型语言模型之间自然语音交互的语音理解和生成基础模型） 🧠 AriGraph: Learning Knowledge Graph World Models with Episodic Memory for LLM Agents（AriGraph：为LLM代理学习知识图世界模型与情景记忆） 📚 Learning to (Learn at Test Time): RNNs with Expressive Hidden States（学习在测试时学习：具有表达性隐藏状态的RNN） 📊 ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild（ChartGemma：针对野外图表推理的视觉指令调优） 📈 RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models（可靠的多模态RAG用于医学视觉语言模型的事实性） 🗣️ Stark: Social Long-Term Multi-Modal Conversation with Persona Commonsense Knowledge（STARK：具有人格常识知识的社会长期多模态对话） 🧠 DotaMath: Decomposition of Thought with Code Assistance and Self-correction for Mathematical Reasoning（DotaMath：利用代码辅助和自我修正的思维分解方法进行数学推理） 🛡️ Safe Unlearning: A Surprisingly Effective and Generalizable Solution to Defend Against Jailbreak Attacks（安全遗忘：一种有效且具有普遍性的防御越狱攻击解决方案） 📊 On scalable oversight with weak LLMs judging strong LLMs（关于可扩展监督协议下弱大型语言模型对强大型语言模型的监督研究） 🎥 Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams（基于内存的实时长视频流理解） 📊 HEMM: Holistic Evaluation of Multimodal Foundation Models（HEMM：多模态基础模型的整体评估） 🤝 LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs（LLM-jp：一个跨组织项目，用于完全开放的日本大型语言模型的研究与开发） 📷 CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images（CRiM-GS：从运动模糊图像中连续刚体运动感知的高斯喷溅） 🔍 Granular Privacy Control for Geolocation with Vision Language Models（视觉语言模型的粒度隐私控制：地理定位）

10分钟

1年前

2024.07.05 每日AI论文

Hugging Face 每日AI论文速递每天10分钟，带您快速了解当日HuggingFace热门AI论文内容今天带来的 3 篇论文如下： 🔄 Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion（扩散强制：下一词预测与全序列扩散的结合） 🔍 Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models（让专家专注于他的领域：稀疏架构大型语言模型的专家专业化微调） 📊 Planetarium: A Rigorous Benchmark for Translating Text to Structured Planning Languages（天文馆：一个严格的基准，用于评估将文本转换为结构化规划语言的能力）

2分钟

99+

1年前

2024.07.08 每日AI论文

2024.07.05 每日AI论文

推荐播单

加入我们的 Discord

扫描微信二维码

播放列表