Album
时长:
11分钟
播放:
15
发布:
1年前
主播...
简介...
https://xiaoyuzhoufm.com

Hugging Face 每日AI论文速递


每天10分钟,带您快速了解当日HuggingFace热门AI论文内容


今天带来的 16 篇论文如下:


👓 Vision language models are blind(视觉语言模型是盲的)


📹 Video-STaR: Self-Training Enables Video Instruction Tuning with Any Supervision(视频-STaR:自训练实现视频指令调整与任意监督)


🌐 Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence(代理互联网:编织异构代理网络以实现协作智能)


👤 RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models(RodinHD:使用扩散模型生成高保真3D虚拟形象)


📚 AgentInstruct: Toward Generative Teaching with Agentic Flows(AgentInstruct:通过代理流程实现生成教学)


📚 Adapting LLMs to Hebrew: Unveiling DictaLM 2.0 with Enhanced Vocabulary and Instruction Capabilities(适应希伯来语的大型语言模型:揭示DictaLM 2.0及其增强词汇和指令能力)


📹 MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions(MiraData:一个大规模视频数据集,具有长时长和结构化详细字幕)


🌐 Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions(基于图的描述:通过互联区域描述增强视觉描述)


🔍 Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps(回溯透镜:仅使用注意力映射检测和缓解大型语言模型中的上下文幻觉)


📚 Knowledge Composition using Task Vectors with Learned Anisotropic Scaling(使用任务向量的学习各向异性缩放进行知识组合)


📚 TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts(TheoremLlama:将通用大型语言模型转化为Lean4专家)


⚡ BM25S: Orders of magnitude faster lexical search via eager sparse scoring(BM25S:通过急切稀疏评分实现数量级更快的词汇搜索)


🎥 VIMI: Grounding Video Generation through Multi-modal Instruction(VIMI:通过多模态指令生成视频)


🔄 From Loops to Oops: Fallback Behaviors of Language Models Under Uncertainty(从循环到失误:语言模型在不确定性条件下的回退行为)


📚 How do you know that? Teaching Generative Language Models to Reference Answers to Biomedical Questions(如何知道?教学生成语言模型引用生物医学问题的答案)


📈 LETS-C: Leveraging Language Embedding for Time Series Classification(利用语言嵌入进行时间序列分类)


评价...

空空如也

小宇宙热门评论...

暂无小宇宙热门评论

EarsOnMe

加入我们的 Discord

与播客爱好者一起交流

立即加入

扫描微信二维码

添加微信好友,获取更多播客资讯

微信二维码

播放列表

自动播放下一个

播放列表还是空的

去找些喜欢的节目添加进来吧