Album
时长:
11分钟
播放:
94
发布:
2周前
主播...
简介...
https://xiaoyuzhoufm.com

本期的 15 篇论文如下:


[00:20] 🧠 A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning(大模型推理中内部概率与自洽性桥接的理论研究)


[01:04] 🌐 OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM(OmniVinci:面向全模态理解大模型的架构与数据增强)


[01:44] 🎬 Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset(用百万级合成数据集放大指令式视频编辑)


[02:28] ✂ NANO3D: A Training-Free Approach for Efficient 3D Editing Without Masks(NANO3D:无需训练与掩码的高效3D编辑新方法)


[03:05] 🛰 Skyfall-GS: Synthesizing Immersive 3D Urban Scenes from Satellite Imagery(Skyfall-GS:仅凭卫星影像合成沉浸式3D城市场景)


[03:41] ⚠ Emergent Misalignment via In-Context Learning: Narrow in-context examples can produce broadly misaligned LLMs(情境学习中的突发错位:狭窄示例可让大模型广泛失准)


[04:18] 🧬 Latent Diffusion Model without Variational Autoencoder(无需变分自编码器的潜在扩散模型)


[04:52] 📸 LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal(LightsOut:基于扩散的延展补全提升镜头眩光去除)


[05:30] 🧠 MorphoBench: A Benchmark with Difficulty Adaptive to Model Reasoning(MorphoBench:随模型推理能力自适应难度的评测基准)


[06:14] 🧠 A$^2$FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning(A²FM:面向工具感知混合推理的自适应智能体基础模型)


[06:56] 🗣 Language Models Model Language(语言模型即语言本身)


[07:36] 🖼 BLIP3o-NEXT: Next Frontier of Native Image Generation(BLIP3o-NEXT:原生图像生成的下一个前沿)


[08:30] 🌐 Paper2Web: Let's Make Your Paper Alive!(Paper2Web:让你的论文“活”起来!)


[09:12] 🔬 Foundation Models for Scientific Discovery: From Paradigm Enhancement to Paradigm Transition(面向科学发现的基础模型:从范式增强到范式跃迁)


[09:55] 🔍 Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents(探索以进化:通过主动在线探索扩展深度研究智能体的聚合逻辑)





【关注我们】


您还可以在以下平台找到我们,获得播客内容以外更多信息


小红书: AI速递

评价...

空空如也

小宇宙热门评论...

暂无小宇宙热门评论

EarsOnMe

加入我们的 Discord

与播客爱好者一起交流

立即加入

扫描微信二维码

添加微信好友,获取更多播客资讯

微信二维码

播放列表

自动播放下一个

播放列表还是空的

去找些喜欢的节目添加进来吧