本期的 11 篇论文如下:
[00:26] 🌐 MM1.5: Methods, Analysis & Insights from Multimodal LLM Fine-tuning(MM1.5:多模态大语言模型微调的方法、分析与见解)
[01:04] 📏 Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models(Ruler:一种用于控制大型语言模型生成长度的模型无关方法)
[01:41] 🗣 DiaSynth -- Synthetic Dialogue Generation Framework(DiaSynth -- 合成对话生成框架)
[02:22] 📊 Hyper-Connections(OLMo-1B:探索DHC和SHC中的规模与训练)
[02:57] 🤖 UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models(UniAff:一种结合视觉语言模型的工具使用和关节运动的统一表示方法)
[03:35] 🔍 Cottention: Linear Transformers With Cosine Attention(Cottention:基于余弦注意力的线性变换器)
[04:10] 🤖 Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers(通过异构预训练Transformer扩展本体感觉-视觉学习)
[04:49] 🏋 Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code(咖啡健身房:评估和改进错误代码的自然语言反馈环境)
[05:29] 🖼 Image Copy Detection for Diffusion Models(扩散模型图像复制检测)
[06:09] 🧠 Can Models Learn Skill Composition from Examples?(模型能否从示例中学习技能组合?)
[06:43] 🎧 IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding(IDEAW:具有可逆双嵌入的鲁棒神经音频水印)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递

空空如也
暂无小宇宙热门评论