Hugging Face 每日AI论文速递
每天10分钟,带您快速了解当日HuggingFace热门AI论文内容
今天带来的 17 篇论文如下:
📊 MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?(MJ-Bench:你的多模态奖励模型真的是文本到图像生成的好评判吗?)
🌐 LLaMAX: Scaling Linguistic Horizons of LLM by Enhancing Translation Capabilities Beyond 100 Languages(LLaMAX:通过增强翻译能力扩展大型语言模型的语言视野至100种以上语言)
🎥 Learning Action and Reasoning-Centric Image Editing from Videos and Simulations(从视频和模拟中学习以动作和推理为中心的图像编辑)
📚 Associative Recurrent Memory Transformer(关联循环记忆变换器)
🌐 ANOLE: An Open, Autoregressive, Native Large Multimodal Models for Interleaved Image-Text Generation(ANOLE:一种开源、自回归、原生的大型多模态模型,用于交错图像-文本生成)
📚 Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction(评估语言模型上下文窗口:一种“工作记忆”测试与推理时校正)
🎥 Compositional Video Generation as Flow Equalization(组合视频生成作为流量均衡)
📊 PAS: Data-Efficient Plug-and-Play Prompt Augmentation System(PAS:数据高效的即插即用提示增强系统)
🚀 InverseCoder: Unleashing the Power of Instruction-Tuned Code LLMs with Inverse-Instruct(InverseCoder:通过逆向指令释放指令调优代码大型语言模型的潜力)
🛠️ Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images(Tailor3D:利用双面图像定制化编辑和生成3D资产)
🖼️ UltraEdit: Instruction-based Fine-Grained Image Editing at Scale(超编辑:基于指令的细粒度大规模图像编辑)
📚 Training Task Experts through Retrieval Based Distillation(通过检索基础提炼训练任务专家)
👁️🗨️ Multi-Object Hallucination in Vision-Language Models(视觉语言模型中的多对象幻觉现象)
🔍 Understanding Visual Feature Reliance through the Lens of Complexity(通过复杂度视角理解视觉特征依赖)
🎨 PartCraft: Crafting Creative Objects by Parts(PartCraft:通过部分创作创意物体)
📚 LLMAEL: Large Language Models are Good Context Augmenters for Entity Linking(大型语言模型在实体链接中的上下文增强作用)
🔍 ANAH-v2: Scaling Analytical Hallucination Annotation of Large Language Models(ANAH-v2:扩展大型语言模型幻觉标注的规模)







空空如也
暂无小宇宙热门评论