本期的 13 篇论文如下:
[00:26] 🎵 Seed-Music: A Unified Framework for High Quality and Controlled Music Generation(Seed-Music:高质量和可控音乐生成的统一框架)
[01:03] ⚡ RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval(通过向量检索加速长上下文大语言模型推理)
[01:46] 🌐 Ferret: Federated Full-Parameter Tuning at Scale for Large Language Models(Ferret:大规模联邦学习中大型语言模型的全参数微调)
[02:35] 🔍 Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types(指导视觉语言模型选择用于跨任务、领域和知识类型的视觉问答)
[03:20] 🔊 ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds(ReCLAP:通过描述声音改进零样本音频分类)
[04:04] 📚 One missing piece in Vision and Language: A Survey on Comics Understanding(视觉与语言中的缺失一环:漫画理解综述)
[04:42] 🌐 jina-embeddings-v3: Multilingual Embeddings With Task LoRA(Jina-embeddings-v3:多语言嵌入与任务LoRA)
[05:28] 🧠 On the Diagram of Thought(关于思维图的探讨)
[06:10] 🔊 AudioBERT: Audio Knowledge Augmented Language Model(音频BERT:增强语言模型的音频知识)
[06:40] 🔍 Policy Filtration in RLHF to Fine-Tune LLM for Code Generation(在RLHF中进行策略过滤以微调LLM进行代码生成)
[07:20] 📊 Towards Predicting Temporal Changes in a Patient's Chest X-ray Images based on Electronic Health Records(基于电子健康记录预测患者胸部X光图像的时间变化)
[07:57] 🤖 Breaking reCAPTCHAv2(破解 reCAPTCHAv2)
[08:27] 🐝 beeFormer: Bridging the Gap Between Semantic and Interaction Similarity in Recommender Systems(beeFormer:在推荐系统中弥合语义和交互相似性之间的差距)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递

空空如也
暂无小宇宙热门评论