HuggingFace 每日AI论文速递 - 2024.07.09 每日AI论文 - EarsOnMe

主播...

简介...

Hugging Face 每日AI论文速递

每天10分钟，带您快速了解当日HuggingFace热门AI论文内容

今天带来的 17 篇论文如下：

📊 MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?（MJ-Bench：你的多模态奖励模型真的是文本到图像生成的好评判吗？）

🌐 LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages（LLaMAX：通过增强翻译能力扩展大型语言模型的语言视野至100种以上语言）

🎥 Learning Action and Reasoning-Centric Image Editing from Videos and Simulations（从视频和模拟中学习以动作和推理为中心的图像编辑）

📚 Associative Recurrent Memory Transformer（关联循环记忆变换器）

🌐 ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation（ANOLE：一种开源、自回归、原生的大型多模态模型，用于交错图像-文本生成）

📚 Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction（评估语言模型上下文窗口：一种“工作记忆”测试与推理时校正）

🎥 Compositional Video Generation as Flow Equalization（组合视频生成作为流量均衡）

📊 PAS: Data-Efficient Plug-and-Play Prompt Augmentation System（PAS：数据高效的即插即用提示增强系统）

🚀 InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct（InverseCoder：通过逆向指令释放指令调优代码大型语言模型的潜力）

🛠️ Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images（Tailor3D：利用双面图像定制化编辑和生成3D资产）

🖼️ UltraEdit: Instruction-based Fine-Grained Image Editing at Scale（超编辑：基于指令的细粒度大规模图像编辑）

📚 Training Task Experts through Retrieval Based Distillation（通过检索基础提炼训练任务专家）

👁️‍🗨️ Multi-Object Hallucination in Vision-Language Models（视觉语言模型中的多对象幻觉现象）

🔍 Understanding Visual Feature Reliance through the Lens of Complexity（通过复杂度视角理解视觉特征依赖）

🎨 PartCraft: Crafting Creative Objects by Parts（PartCraft：通过部分创作创意物体）

📚 LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking（大型语言模型在实体链接中的上下文增强作用）

🔍 ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models（ANAH-v2：扩展大型语言模型幻觉标注的规模）

评价...

空空如也

小宇宙热门评论...

暂无小宇宙热门评论