Album
时长:
10分钟
播放:
84
发布:
6天前
主播...
简介...
https://xiaoyuzhoufm.com
本期的 15 篇论文如下:
[00:21] 🧠 K-EXAONE Technical Report(K-EXAONE技术报告)
[00:56] 🚀 NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation(NextFlow:统一序列建模激活多模态理解与生成)
[01:36] 🎭 DreamID-V:Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer(DreamID-V:通过扩散Transformer弥合图像到视频的鸿沟以实现高保真人脸交换)
[02:19] 🎨 VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation(VAR强化学习优化之道:解决视觉自回归生成中的异步策略冲突)
[03:04] 🚀 GARDO: Reinforcing Diffusion Models without Reward Hacking(GARDO:无需奖励黑客攻击的扩散模型强化方法)
[03:41] 🎨 VINO: A Unified Visual Generator with Interleaved OmniModal Context(VINO:一种具有交错式全模态上下文的统一视觉生成器)
[04:17] ♾ InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams(InfiniteVGGT:面向无尽流数据的视觉几何基础Transformer)
[04:54] 🧠 Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits(大型语言模型能否预测自身失败?通过内部电路实现自我感知)
[05:23] 🚀 Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling(Falcon-H1R:通过混合模型实现高效测试时扩展,推动推理前沿)
[05:57] 🔄 Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes(Talk2Move:基于强化学习的文本指令场景物体几何变换框架)
[06:43] 🔄 Recursive Language Models(递归语言模型)
[07:12] 🧠 KV-Embedding: Training-free Text Embedding via Internal KV Re-routing in Decoder-only LLMs(KV-嵌入:通过仅解码器大语言模型内部KV重路由实现免训练文本嵌入)
[07:51] ⚠ COMPASS: A Framework for Evaluating Organization-Specific Policy Alignment in LLMs(COMPASS:评估大语言模型中组织特定政策对齐性的框架)
[08:52] 🛰 Toward Stable Semi-Supervised Remote Sensing Segmentation via Co-Guidance and Co-Fusion(通过协同引导与协同融合实现稳定的半监督遥感分割)
[09:40] 🧱 SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving(SWE-Lego:推动软件问题解决的监督微调极限)
【关注我们】
您还可以在以下平台找到我们,获得播客内容以外更多信息
小红书: AI速递
评价...

空空如也

小宇宙热门评论...

暂无小宇宙热门评论

EarsOnMe

加入我们的 Discord

与播客爱好者一起交流

立即加入

扫描微信二维码

添加微信好友,获取更多播客资讯

微信二维码

播放列表

自动播放下一个

播放列表还是空的

去找些喜欢的节目添加进来吧