Hugging Face 每日AI论文速递
每天10分钟,带您快速了解当日HuggingFace热门AI论文内容

今天带来的 15 篇论文如下:
📊 Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On(Skywork-Math:大型语言模型中数学推理能力的数据规模定律 -- 故事继续)
📊 MAVIS: Mathematical Visual Instruction Tuning(MAVIS:数学视觉指令调优)
📹 Video Diffusion Alignment via Reward Gradients(通过奖励梯度实现视频扩散对齐)
🔍 MambaVision: A Hybrid Mamba-Transformer Vision Backbone(MambaVision:一种混合Mamba-Transformer视觉骨干网络)
📊 GTA: A Benchmark for General Tool Agents(GTA:通用工具代理基准)
📊 The Synergy between Data and Multi-Modal Large Language Models: A Survey from Co-Development Perspective(数据与多模态大型语言模型的协同作用:从协同发展角度的调查)
🌐 DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception(DenseFusion-1M:整合视觉专家以实现全面多模态感知)
🎥 Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models(Live2Diff:基于单向注意力机制的视频扩散模型实现直播翻译)
🌲 Gradient Boosting Reinforcement Learning(梯度提升强化学习)
📉 Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients(Q-GaLore:使用INT4投影和层适应低秩梯度的量化GaLore)
📖 SEED-Story: Multimodal Long Story Generation with Large Language Model(SEED-Story:基于大型语言模型的多模态长故事生成)
📹 Generalizable Implicit Motion Modeling for Video Frame Interpolation(可泛化的隐式运动建模用于视频帧插值)
📊 OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects(OmniNOCS:用于2D物体3D提升的统一NOCS数据集与模型)
🎤 Autoregressive Speech Synthesis without Vector Quantization(无需向量量化的自回归语音合成)
🌍 WildGaussians: 3D Gaussian Splatting in the Wild(WildGaussians:自然环境中的3D高斯喷洒)
【关注我们,获取更多信息】
小红书: AI速递

空空如也
暂无小宇宙热门评论