Hugging Face Daily AI Paper Digest
In 10 minutes a day, get a quick overview of the day's trending AI papers on Hugging Face.
Today's 16 papers:
👓 Vision language models are blind
📹 Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision
🌐 Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence
👤 RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
📚 AgentInstruct: Toward Generative Teaching with Agentic Flows
📚 Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities
📹 MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions
🌐 Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions
🔍 Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps
📚 Knowledge Composition using Task Vectors with Learned Anisotropic Scaling
📚 TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts
⚡ BM25S: Orders of magnitude faster lexical search via eager sparse scoring
🎥 VIMI: Grounding Video Generation through Multi-modal Instruction
🔄 From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty
📚 How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions
📈 LETS-C: Leveraging Language Embedding for Time Series Classification

