报告预览

阿里云：2025 Ovis2.5技术报告（英文版）（30页）.pdf

编号：870497

PDF 中文版 DOCX 30页 16.15MB 下载积分：VIP专享

下载报告请您先登录！

阿里云：2025 Ovis2.5技术报告（英文版）（30页）.pdf

1、2025-08-19Ovis2.5 Technical ReportOvis Team,Alibaba Grouphttps:/huggingface.co/AIDC-AI/Ovis2.5-9Bhttps:/ present Ovis2.5,a successor to Ovis2 designed for native-resolution visual perceptionand strong multimodal reasoning.Ovis2.5 integrates a native-resolution vision transformerthat processes images

2、 at their native,variable resolutions,avoiding the degradationfrom fixed-resolution tiling and preserving both fine detail and global layoutcrucial forvisually dense content like complex charts.To strengthen reasoning,we train the model tomove beyond linear chain-of-thought and perform reflectioninc

3、luding self-checking andrevision.This advanced capability is exposed as an optional“thinking mode”at inferencetime,allowing users to trade latency for enhanced accuracy on difficult inputs.The modelis trained via a comprehensive five-phase curriculum that progressively builds its skills.The process

4、begins with foundational visual and multimodal pretraining,advances throughlarge-scale instruction tuning,and culminates in alignment and reasoning enhancementusing DPO and GRPO.To scale these upgrades efficiently,we employ multimodal datapacking and hybrid parallelism,yielding a significant end-to-

5、end speedup.We releasetwo open-source models:Ovis2.5-9B and Ovis2.5-2B.The latter continues the“smallmodel,big performance”philosophy of Ovis2,making it ideal for resource-constrained,on-device scenarios.On the OpenCompass multimodal leaderboard,Ovis2.5-9B averages78.3,marking a substantial improvem

6、ent over its predecessor,Ovis2-8B,and achievingstate-of-the-art results among open-source MLLMs in the sub-40B parameter range;Ovis2.5-2B scores 73.9,establishing SOTA for its size.Beyond aggregate scores,Ovis2.5achieves leading results on STEM benchmarks,exhibits strong capabilities on groundingand

友情提示

1、下载报告失败解决办法
2、PDF文件下载后，可能会被浏览器默认打开，此种情况可以点击浏览器菜单，保存网页到桌面，就可以正常下载了。
3、本站不支持迅雷下载，请使用电脑自带的IE浏览器，或者360浏览器、谷歌浏览器下载即可。
4、本站报告下载后的文档和图纸-无水印,预览文档经过压缩，下载后原文更清晰。

本文（阿里云：2025 Ovis2.5技术报告（英文版）（30页）.pdf）为本站（111111）主动上传，三个皮匠报告文库仅提供信息存储空间，仅对用户上传内容的表现方式做保护处理，对上载内容本身不做任何修改或编辑。若此文所含内容侵犯了您的版权或隐私，请立即通知三个皮匠报告文库（点击联系客服），我们立即给予删除！

温馨提示：如果因为网速或其他原因下载失败请重新下载，重复下载不扣分。