
OpenDataLab
@OpenDataLab_AI
MinerU2.5 is a compact 1.2B VLM with a smart two-stage, coarse-to-fine pipeline (global layout → native-res crops) that delivers state-of-the-art doc parsing with low compute.
MinerU2.5 正式发布 🎉,这个参数规模仅 1.2B 的视觉-语言模型,通过创新的解耦架构和数据引擎,实现 SOTA 准确率,同时显著降低计算开销!!团队也公布了技术报告,一起看看它的模型组成、训练细节和实战表现 👇 1. 背景与挑战…



testing MineU, 1.2B VLM for 'efficient' document parsing. its not heavy, im really optimistic. huggingface.co/opendatalab/Mi…
🚀 The MinerU2.5 Technical Report is officially released!




MinerU是上海AI 实验室的哦,如果觉得vlm慢,可以使用Sglang加速,快到起飞。当然,这对设备有一定性能要求和门槛
#MathFusion is a novel framework that enhances mathematical reasoning through cross-problem instruction synthesis. 🦾Experimental results demonstrate that it achieves substantial improvements in mathematical reasoning while maintaining high data efficiency. #AI #Datasets




Vis3 is a visualization tool for #LLM and machine learning data, supporting cloud storage platforms with S3 protocol and various data formats. It offers interactive visualization through JSON, HTML, Markdown, and image views for efficient #data analysis.

#MinerU has officially cooperated with @CherryStudioHQ. You can directly call the MinerU function in Cherry Studio. MinerU officially provides each Cherry Studio user with a document processing quota of up to 500 pages per day.

#LabelLLM introduces an open-source platform dedicated to optimizing the #data annotation process integral to the development of #LLM. There are key features of LabelLLM. Try me👉 labelu-llm-demo.shlab.tech/supplier/task

#MinerU leverages the sophisticated PDF-Extract-Kit models to extract content from diverse documents effectively and ensure the accuracy of the final results. As its core, MinerU commits to facilitating the #mathematical and extended formulas parsing.

Agentic Document Extraction just got much faster! From previous 135sec median processing time down to 8sec. Extracts not just text but diagrams, charts, and form fields from PDFs to give LLM-ready output. Please see the video for details and some application ideas.
United States เทรนด์
- 1. #TORQSports N/A
- 2. Malcolm Brogdon 1,677 posts
- 3. Argentina 475K posts
- 4. Russ 18.9K posts
- 5. Waddle 3,823 posts
- 6. Big Balls 25.2K posts
- 7. Rickey 2,384 posts
- 8. $HIMS 4,330 posts
- 9. Olave 3,246 posts
- 10. #BeyondTheGates 5,503 posts
- 11. #ClockTower1Year N/A
- 12. Aphrodite 5,026 posts
- 13. Voting Rights Act 30.4K posts
- 14. Kings 161K posts
- 15. Maybe in California N/A
- 16. Capitol Police 28.5K posts
- 17. #TrumpsShutdownDragsOn 7,526 posts
- 18. Supreme Court Justice 9,773 posts
- 19. Justice Jackson 21.9K posts
- 20. Martha 21.3K posts
Something went wrong.
Something went wrong.