Hugging Face 每日AI论文速递
每天10分钟,带您快速了解当日HuggingFace热门AI论文内容
今天带来的 14 篇论文如下:
🌐 PaliGemma: A versatile 3B VLM for transfer(PaliGemma:一种多功能3B视觉语言模型用于迁移)
🌐 LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models(LLaVA-NeXT-Interleave:在大规模多模态模型中处理多图像、视频和3D问题)
🚀 Inference Performance Optimization for Large Language Models on CPUs(CPU上大型语言模型推理性能优化)
🌐 Controlling Space and Time with Diffusion Models(使用扩散模型控制空间和时间)
🎥🔊 Video-to-Audio Generation with Hidden Alignment(基于隐藏对齐的视频到音频生成)
🎥 VEnhancer: Generative Space-Time Enhancement for Video Generation(VEnhancer:生成空间-时间增强的视频生成技术)
📊 On Leakage of Code Generation Evaluation Datasets(关于代码生成评估数据集泄露的问题)
🔍 Do Vision and Language Models Share Concepts? A Vector Space Alignment Study(视觉和语言模型是否共享概念?一项向量空间对齐研究)
🤖 This&That: Language-Gesture Controlled Video Generation for Robot Planning(This&That:基于语言和手势控制的机器人视频生成规划)
🌌 CosmoCLIP: Generalizing Large Vision-Language Models for Astronomical Imaging(CosmoCLIP:通用大型视觉语言模型在天文图像处理中的应用)
🎥 Still-Moving: Customized Video Generation without Customized Video Data(Still-Moving:无需定制视频数据的定制化视频生成)
📊 An accurate detection is not all you need to combat label noise in web-noisy datasets(在网络噪声数据集中对抗标签噪声的准确检测并非全部所需)
🤖 BiGym: A Demo-Driven Mobile Bi-Manual Manipulation Benchmark(BiGym:移动双手机器人演示驱动操作基准)
👥 CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation(CrowdMoGen:零样本文本驱动的人群运动生成)
【关注我们,获取更多信息】
小红书:AI速递

空空如也
暂无小宇宙热门评论