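Hello everyone, and welcome to "Hugging Face Daily AI Paper Express". Today is July 23, 2024, and we will take you on a quick tour of today's 20 trending AI papers, covering large language models, multimodal processing, 3D world generation, and other frontier areas. Let's dive right in.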

[00:24] 📚 Knowledge Mechanisms in Large Language Models: A Survey and Perspective
[00:55] 🔍 NNsight and NDIF: Democratizing Access to Foundation Model Internals
[01:41] 📊 POGEMA: A Benchmark Platform for Cooperative Multi-Agent Navigation
[02:15] 🎥 SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models
[02:40] 📺 LongVideoBench: A Benchmark for Long-context Interleaved Video-Language Understanding
[03:14] 🎮 VideoGameBunny: Towards vision assistants for video games
[03:49] 🌐 BoostMVSNeRFs: Boosting MVS-based NeRFs to Generalizable View Synthesis in Large-scale Scenes
[04:29] 🌐 AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?
[05:04] 🌐 HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions
[05:36] 📚 BOND: Aligning LLMs with Best-of-N Distillation
[06:10] 📊 MIBench: Evaluating Multimodal Large Language Models over Multiple Images
[06:41] 🎶 MusiConGen: Rhythm and Chord Control for Transformer-Based Text-to-Music Generation
[07:19] 🔧 Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning
[07:56] 🎭 Temporal Residual Jacobians For Rig-free Motion Transfer
[08:28] 📉 Consent in Crisis: The Rapid Decline of the AI Data Commons
[08:53] 🎨 Artist: Aesthetically Controllable Text-Driven Stylization without Training
[09:26] 🎥 Cinemo: Consistent and Controllable Image Animation with Motion Diffusion Models
[09:56] 🎥 Local All-Pair Correspondence for Point Tracking
[10:24] 🔥 ThermalNeRF: Thermal Radiance Fields
[10:55] 🤖 GET-Zero: Graph Embodiment Transformer for Zero-shot Embodiment Generalization
[Follow Us]
You can also find us on the following platform for more information beyond the podcast:
Xiaohongshu: AI速递
