HuggingFace 每日AI论文速递 - 2024.07.23 每日AI论文 | 大型语言模型、多模态处理、3D世界生成 - EarsOnMe

时长：

11分钟

播放：

109

发布：

1年前

主播...

简介...

大家好，欢迎收听“Hugging Face 每日AI论文速递”。今天是2024年7月23日，我们将带您快速浏览今日的20篇热门AI论文，涵盖了大型语言模型、多模态处理、3D世界生成等多个前沿领域。现在，让我们立即进入精彩的论文世界。

[00:24] 📚 Knowledge Mechanisms in Large Language Models: A Survey and Perspective（大型语言模型中的知识机制：综述与展望）

[00:55] 🔍 NNsight and NDIF: Democratizing Access to Foundation Model Internals（NNsight与NDIF：普及基础模型内部访问）

[01:41] 📊 POGEMA: A Benchmark Platform for Cooperative Multi-Agent Navigation（POGEMA：合作多智能体导航的基准平台）

[02:15] 🎥 SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models（SlowFast-LLaVA：一种无需额外训练的视频大型语言模型的强基线方法）

[02:40] 📺 LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding（LongVideoBench：长上下文交错视频语言理解基准测试）

[03:14] 🎮 VideoGameBunny: Towards vision assistants for video games（VideoGameBunny：面向视频游戏的视觉助手）

[03:49] 🌐 BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes（BoostMVSNeRFs：提升基于MVS的NeRF在大规模场景中的通用视图合成质量）

[04:29] 🌐 AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?（AssistantBench：网络代理能否解决现实且耗时的任务？）

[05:04] 🌐 HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions（HoloDreamer：从文本描述生成全景3D世界的整体框架）

[05:36] 📚 BOND: Aligning LLMs with Best-of-N Distillation（BOND：将LLMs与Best-of-N蒸馏对齐）

[06:10] 📊 MIBench: Evaluating Multimodal Large Language Models over Multiple Images（MIBench：评估多模态大型语言模型在多图像场景下的表现）

[06:41] 🎶 MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation（MusiConGen：基于Transformer的文本到音乐生成中的节奏和和弦控制）

[07:19] 🔧 Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning（条件语言策略：可操纵多目标微调的通用框架）

[07:56] 🎭 Temporal Residual Jacobians For Rig-free Motion Transfer（无绑定运动转移的时间残差雅可比）

[08:28] 📉 Consent in Crisis: The Rapid Decline of the AI Data Commons（危机中的同意：AI数据共享的快速衰退）

[08:53] 🎨 Artist: Aesthetically Controllable Text-Driven Stylization without Training（Artist：无需训练的文本驱动美学可控风格化）

[09:26] 🎥 Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models（Cinemo：基于运动扩散模型的图像动画一致性与可控性）

[09:56] 🎥 Local All-Pair Correspondence for Point Tracking（局部全对应对应点跟踪）

[10:24] 🔥 ThermalNeRF: Thermal Radiance Fields（热辐射场：热辐射场）

[10:55] 🤖 GET-Zero: Graph Embodiment Transformer for Zero-shot Embodiment Generalization（GET-Zero：零样本实体泛化的图实体变换器）

【关注我们】

您还可以在以下平台找到我们，获得播客内容以外更多信息

小红书: AI速递

评价...

空空如也

小宇宙热门评论...

暂无小宇宙热门评论

去听...

小宇宙

谁收藏了...