时长:
11分钟
播放:
50
发布:
8小时前
主播...
简介...
本期的 15 篇论文如下:
[00:20] 🗺 Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization(借助地图思考:用于地理定位的强化并行地图增强智能体)
[01:03] 🧠 MMFormalizer: Multimodal Autoformalization in the Wild(MMFormalizer:面向真实世界的多模态自动形式化方法)
[01:38] 🧬 The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning(思维分子结构:长链思维推理的拓扑映射)
[02:21] 🎭 CaricatureGS: Exaggerating 3D Gaussian Splatting Faces With Gaussian Curvature(CaricatureGS:基于高斯曲率夸张3D高斯泼溅人脸)
[03:04] 🔍 Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards(证据链构建:基于引文感知评分奖励的深度搜索智能体鲁棒强化学习)
[03:47] ⚙ EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis(EnvScaler:通过程序化合成扩展LLM智能体的工具交互环境)
[04:22] 🔮 Can We Predict Before Executing Machine Learning Agents?(我们能在执行前预测机器学习智能体的行为吗?)
[04:59] 🖼 AgentOCR: Reimagining Agent History via Optical Self-Compression(AgentOCR:通过光学自压缩重构智能体历史)
[05:39] 🎬 VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction(VideoAR:通过下一帧与尺度预测的自回归视频生成)
[06:29] 🔍 Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking(Qwen3-VL-Embedding与Qwen3-VL-Reranker:用于最先进多模态检索与排序的统一框架)
[07:23] 🔍 Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency(自信的幻觉?通过邻域一致性诊断大语言模型的真实性)
[08:07] 🔄 Orient Anything V2: Unifying Orientation and Rotation Understanding(Orient Anything V2:统一物体朝向与旋转理解的增强基础模型)
[08:37] 🔍 SmartSearch: Process Reward-Guided Query Refinement for Search Agents(SmartSearch:面向搜索代理的流程奖励引导查询优化框架)
[09:23] ⚙ Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals(目标力:教导视频模型实现物理条件目标)
[10:11] 📊 Same Claim, Different Judgment: Benchmarking Scenario-Induced Bias in Multilingual Financial Misinformation Detection(相同声明,不同判断:多语言金融虚假信息检测中场景诱导偏见的基准测试)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
[00:20] 🗺 Thinking with Map: Reinforced Parallel Map-Augmented Agent for Geolocalization(借助地图思考:用于地理定位的强化并行地图增强智能体)
[01:03] 🧠 MMFormalizer: Multimodal Autoformalization in the Wild(MMFormalizer:面向真实世界的多模态自动形式化方法)
[01:38] 🧬 The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning(思维分子结构:长链思维推理的拓扑映射)
[02:21] 🎭 CaricatureGS: Exaggerating 3D Gaussian Splatting Faces With Gaussian Curvature(CaricatureGS:基于高斯曲率夸张3D高斯泼溅人脸)
[03:04] 🔍 Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards(证据链构建:基于引文感知评分奖励的深度搜索智能体鲁棒强化学习)
[03:47] ⚙ EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis(EnvScaler:通过程序化合成扩展LLM智能体的工具交互环境)
[04:22] 🔮 Can We Predict Before Executing Machine Learning Agents?(我们能在执行前预测机器学习智能体的行为吗?)
[04:59] 🖼 AgentOCR: Reimagining Agent History via Optical Self-Compression(AgentOCR:通过光学自压缩重构智能体历史)
[05:39] 🎬 VideoAR: Autoregressive Video Generation via Next-Frame & Scale Prediction(VideoAR:通过下一帧与尺度预测的自回归视频生成)
[06:29] 🔍 Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking(Qwen3-VL-Embedding与Qwen3-VL-Reranker:用于最先进多模态检索与排序的统一框架)
[07:23] 🔍 Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency(自信的幻觉?通过邻域一致性诊断大语言模型的真实性)
[08:07] 🔄 Orient Anything V2: Unifying Orientation and Rotation Understanding(Orient Anything V2:统一物体朝向与旋转理解的增强基础模型)
[08:37] 🔍 SmartSearch: Process Reward-Guided Query Refinement for Search Agents(SmartSearch:面向搜索代理的流程奖励引导查询优化框架)
[09:23] ⚙ Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals(目标力:教导视频模型实现物理条件目标)
[10:11] 📊 Same Claim, Different Judgment: Benchmarking Scenario-Induced Bias in Multilingual Financial Misinformation Detection(相同声明,不同判断:多语言金融虚假信息检测中场景诱导偏见的基准测试)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
评价...
空空如也
小宇宙热门评论...
暂无小宇宙热门评论