本期的 17 篇论文如下:
[00:24] 🇵 Bielik 7B v0.1: A Polish Language Model -- Development, Insights, and Evaluation(Bielik 7B v0.1:波兰语言模型——开发、洞察与评估)
[01:00] 🤖 AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant(AgentStore:可扩展的异构代理作为专业化通才计算机助手集成)
[01:39] 🤖 GPT-4o System Card(GPT-4o系统卡片)
[02:21] 📄 Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction(文档解析揭秘:结构化信息提取的技术、挑战与前景)
[03:08] 🤖 LongReward: Improving Long-context Large Language Models with AI Feedback(长奖励:通过AI反馈提升长上下文大语言模型)
[03:43] 🎥 MarDini: Masked Autoregressive Diffusion for Video Generation at Scale(MarDini:大规模视频生成的掩码自回归扩散模型)
[04:22] 🌟 DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation(DreamClear:高容量真实世界图像修复与隐私安全数据集构建)
[05:10] 🧩 GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation(GrounDiT:基于噪声补丁移植的扩散变换器空间定位)
[05:49] 📚 A Survey of Small Language Models(小语言模型综述)
[06:23] 💾 COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training(COAT:压缩优化器状态和激活以实现高效的FP8训练)
[06:58] ⚡ Fast Best-of-N Decoding via Speculative Rejection(基于推测拒绝的快速最佳N解码)
[07:36] 🔍 Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines(视觉搜索助手:赋能视觉-语言模型作为多模态搜索引擎)
[08:25] 🎥 LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior(LARP:利用学习到的自回归生成先验进行视频标记化)
[09:00] 🤖 Neural Fields in Robotics: A Survey(机器人学中的神经场:综述)
[09:40] 🗣 Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction(对话2流程:预训练软对比动作驱动句子嵌入用于自动对话流程提取)
[10:15] 🩺 Language Models And A Second Opinion Use Case: The Pocket Professional(语言模型与第二意见应用案例:口袋专家)
[10:55] 🤖 Leveraging Locality to Boost Sample Efficiency in Robotic Manipulation(利用局部性提升机器人操作的样本效率)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递

空空如也
暂无小宇宙热门评论