本期的 11 篇论文如下:
[00:26] 🎨 Imagine yourself: Tuning-Free Personalized Image Generation(想象自己:无调优个性化图像生成)
[01:02] 😂 YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models(YesBut:评估视觉语言模型讽刺理解能力的高质量多模态数据集)
[01:40] 🌍 Prithvi WxC: Foundation Model for Weather and Climate(Prithvi WxC:天气和气候的基础模型)
[02:15] 🎵 MuCodec: Ultra Low-Bitrate Music Codec(MuCodec:超低比特率音乐编解码器)
[02:51] 🌈 Colorful Diffuse Intrinsic Image Decomposition in the Wild(在野外进行彩色漫反射内在图像分解)
[03:29] 🎥 Portrait Video Editing Empowered by Multimodal Generative Priors(基于多模态生成先验的肖像视频编辑)
[04:01] 🎥 Temporally Aligned Audio for Video with Autoregression(基于自回归的视频音频时间对齐生成)
[04:38] 📱 V^3: Viewing Volumetric Videos on Mobiles via Streamable 2D Dynamic Gaussians(V^3:通过可流式2D动态高斯函数在移动设备上观看体积视频)
[05:21] 📚 Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation(事实、获取与推理:检索增强生成的统一评估)
[05:57] 🛡 Hackphyr: A Local Fine-Tuned LLM Agent for Network Security Environments(Hackphyr:用于网络安全环境的本地微调LLM代理)
[06:34] 🎻 Minstrel: Structural Prompt Generation with Multi-Agents Coordination for Non-AI Experts(Minstrel:面向非AI专家的多智能体协同结构化提示生成)

【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递

空空如也
暂无小宇宙热门评论