本期的 15 篇论文如下:
[00:20] 🎥 TPDiff: Temporal Pyramid Video Diffusion Model(TPDiff:时间金字塔视频扩散模型)
[00:58] 🎥 Reangle-A-Video: 4D Video Generation as Video-to-Video Translation(Reangle-A-Video:将4D视频生成作为视频到视频的转换)
[01:42] 🧠 Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models(块扩散:在自回归与扩散语言模型之间插值)
[02:18] 🎯 RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling(RewardSDS:通过奖励加权采样对齐分数蒸馏)
[02:55] 🧠 GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training(GTR:引导思维强化防止基于RL的VLM代理训练中的思维崩溃)
[03:36] 📄 More Documents, Same Length: Isolating the Challenge of Multiple Documents in RAG(更多文档,相同长度:隔离RAG中多文档的挑战)
[04:19] 💃 Motion Anything: Any to Motion Generation(运动万象:任意到运动生成)
[05:15] 📊 WildIFEval: Instruction Following in the Wild(野外交互评估:复杂条件下的指令遵循)
[05:49] 📹 VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary(VLog:通过生成性检索叙事词汇的视频-语言模型)
[06:29] 🤖 Quantizing Large Language Models for Code Generation: A Differentiated Replication(量化大型语言模型用于代码生成:差异化复现)
[07:13] 🧠 Cost-Optimal Grouped-Query Attention for Long-Context LLMs(长上下文大语言模型的成本最优分组查询注意力)
[07:53] 🧬 Multimodal Language Modeling for High-Accuracy Single Cell Transcriptomics Analysis and Generation(高精度单细胞转录组分析与生成中的多模态语言建模)
[08:33] 🔄 Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space(无别名潜在扩散模型:提升扩散潜在空间的分数位移等变性)
[09:15] 🔄 Self-Taught Self-Correction for Small Language Models(小语言模型的自教自纠)
[09:49] 🧩 MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System(MoC:检索增强生成系统中的文本分块学习混合模型)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递

空空如也
暂无小宇宙热门评论