时长:
10分钟
播放:
84
发布:
2天前
主播...
简介...
本期的 15 篇论文如下:
[00:20] ⚡ Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning(原生并行推理器:通过自蒸馏强化学习实现并行推理)
[01:04] 🧠 Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs(超越实数:用于长上下文大语言模型的旋转位置编码虚部扩展)
[01:54] 🎬 Unified Video Editing with Temporal Reasoner(基于时序推理的统一视频编辑)
[02:33] 🔍 DoVer: Intervention-Driven Auto Debugging for LLM Multi-Agent Systems(DoVer:面向LLM多智能体系统的干预驱动自动调试方法)
[03:24] 🎮 Voxify3D: Pixel Art Meets Volumetric Rendering(Voxify3D:像素艺术与体素渲染的融合)
[04:07] 🎬 Scaling Zero-Shot Reference-to-Video Generation(零样本参考到视频生成的规模化研究)
[04:39] 🧬 Distribution Matching Variational AutoEncoder(分布匹配变分自编码器)
[05:12] 🔭 Multi-view Pyramid Transformer: Look Coarser to See Broader(多视图金字塔Transformer:看粗以见广)
[05:47] 🎬 EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing(EgoEdit:用于第一人称视频编辑的数据集、实时流式模型与基准测试)
[06:25] 🖼 LongCat-Image Technical Report(LongCat-Image技术报告)
[06:50] 🎬 UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation(UnityVideo:统一多模态多任务学习以增强世界感知的视频生成)
[07:36] 🔗 Relational Visual Similarity(关系视觉相似性)
[08:13] 🔬 On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models(论预训练、中期训练与强化学习在推理语言模型中的相互作用)
[08:57] 🎥 ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation(ReCamDriving:无需LiDAR的相机控制新轨迹视频生成)
[09:30] 🚀 Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning(超越词级监督:通过强化学习解锁基于解码的回归潜力)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
[00:20] ⚡ Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning(原生并行推理器:通过自蒸馏强化学习实现并行推理)
[01:04] 🧠 Beyond Real: Imaginary Extension of Rotary Position Embeddings for Long-Context LLMs(超越实数:用于长上下文大语言模型的旋转位置编码虚部扩展)
[01:54] 🎬 Unified Video Editing with Temporal Reasoner(基于时序推理的统一视频编辑)
[02:33] 🔍 DoVer: Intervention-Driven Auto Debugging for LLM Multi-Agent Systems(DoVer:面向LLM多智能体系统的干预驱动自动调试方法)
[03:24] 🎮 Voxify3D: Pixel Art Meets Volumetric Rendering(Voxify3D:像素艺术与体素渲染的融合)
[04:07] 🎬 Scaling Zero-Shot Reference-to-Video Generation(零样本参考到视频生成的规模化研究)
[04:39] 🧬 Distribution Matching Variational AutoEncoder(分布匹配变分自编码器)
[05:12] 🔭 Multi-view Pyramid Transformer: Look Coarser to See Broader(多视图金字塔Transformer:看粗以见广)
[05:47] 🎬 EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing(EgoEdit:用于第一人称视频编辑的数据集、实时流式模型与基准测试)
[06:25] 🖼 LongCat-Image Technical Report(LongCat-Image技术报告)
[06:50] 🎬 UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation(UnityVideo:统一多模态多任务学习以增强世界感知的视频生成)
[07:36] 🔗 Relational Visual Similarity(关系视觉相似性)
[08:13] 🔬 On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models(论预训练、中期训练与强化学习在推理语言模型中的相互作用)
[08:57] 🎥 ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation(ReCamDriving:无需LiDAR的相机控制新轨迹视频生成)
[09:30] 🚀 Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning(超越词级监督:通过强化学习解锁基于解码的回归潜力)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
评价...
空空如也
小宇宙热门评论...
暂无小宇宙热门评论