OpenMMLab's profile picture. From MMDetection to AI Exploration. Empowering AI research and development with OpenMMLab.

Discord:http://discord.gg/raweFPmdzG

OpenMMLab

@OpenMMLab

From MMDetection to AI Exploration. Empowering AI research and development with OpenMMLab. Discord:http://discord.gg/raweFPmdzG

OpenMMLab reposted

🚀Introducing #CapRL, the first study of applying GRPO for the open-ended and subjective image captioning task. 🤯 🤖The trained CapRL-3B model achieves image captioning performance comparable to Qwen2.5-VL-72B. ✨CapRL introduces a novel training framework that redefines caption…

intern_lm's tweet image. 🚀Introducing #CapRL, the first study of applying GRPO for the open-ended and subjective image captioning task. 🤯
🤖The trained CapRL-3B model achieves image captioning performance comparable to Qwen2.5-VL-72B.
✨CapRL introduces a novel training framework that redefines caption…
intern_lm's tweet image. 🚀Introducing #CapRL, the first study of applying GRPO for the open-ended and subjective image captioning task. 🤯
🤖The trained CapRL-3B model achieves image captioning performance comparable to Qwen2.5-VL-72B.
✨CapRL introduces a novel training framework that redefines caption…
intern_lm's tweet image. 🚀Introducing #CapRL, the first study of applying GRPO for the open-ended and subjective image captioning task. 🤯
🤖The trained CapRL-3B model achieves image captioning performance comparable to Qwen2.5-VL-72B.
✨CapRL introduces a novel training framework that redefines caption…
intern_lm's tweet image. 🚀Introducing #CapRL, the first study of applying GRPO for the open-ended and subjective image captioning task. 🤯
🤖The trained CapRL-3B model achieves image captioning performance comparable to Qwen2.5-VL-72B.
✨CapRL introduces a novel training framework that redefines caption…

OpenMMLab reposted

🚀 Big news for #lmdeploy v0.10.1! 🥳Our #FP8 high-performance inference is no longer limited to the latest #GPUs. It now supports all #NVIDIA architectures from V100 onwards, bringing major speedups to more users. 🤗github.com/InternLM/lmdep…

intern_lm's tweet image. 🚀  Big news for #lmdeploy v0.10.1! 
🥳Our #FP8 high-performance inference is no longer limited to the latest #GPUs. It now supports all #NVIDIA architectures from V100 onwards, bringing major speedups to more users.
🤗github.com/InternLM/lmdep…

OpenMMLab reposted

🔥LMDeploy v0.10.0 released! 😊Supercharges OpenAI’s GPT-OSS MXFP4 models. 😊Delivers exceptional performance for GPT-OSS models on V100 and higher GPUs. 😊On H800 & A100, LMDeploy outperforms vLLM across all scenarios—faster, more efficient inference! 🤗github.com/InternLM/lmdep…

intern_lm's tweet image. 🔥LMDeploy v0.10.0 released!
😊Supercharges OpenAI’s GPT-OSS MXFP4 models.
😊Delivers exceptional performance for GPT-OSS models on V100 and higher GPUs. 
😊On H800 & A100, LMDeploy outperforms vLLM across all scenarios—faster, more efficient inference!
🤗github.com/InternLM/lmdep…
intern_lm's tweet image. 🔥LMDeploy v0.10.0 released!
😊Supercharges OpenAI’s GPT-OSS MXFP4 models.
😊Delivers exceptional performance for GPT-OSS models on V100 and higher GPUs. 
😊On H800 & A100, LMDeploy outperforms vLLM across all scenarios—faster, more efficient inference!
🤗github.com/InternLM/lmdep…

🔥China’s Open-source VLMs boom—Intern-S1, MiniCPM-V-4, GLM-4.5V, Step3, OVIS 🧐Join the AI Insight Talk with @huggingface, @OpenCompassX, @ModelScope2022 and @ZhihuFrontier 🚀Tech deep-dives & breakthroughs 🚀Roundtable debates ⏰Aug 21, 5 AM PDT 📺Live: youtube.com/live/kh0WSMoVZ…

OpenMMLab's tweet image. 🔥China’s Open-source VLMs boom—Intern-S1, MiniCPM-V-4, GLM-4.5V, Step3, OVIS
🧐Join the AI Insight Talk with @huggingface, @OpenCompassX,  @ModelScope2022 and @ZhihuFrontier
🚀Tech deep-dives & breakthroughs
🚀Roundtable debates
⏰Aug 21, 5 AM PDT
📺Live: youtube.com/live/kh0WSMoVZ…


🔥China’s Open-source VLMs boom—Intern-S1, MiniCPM-V-4, GLM-4.5V, Step3, OVIS 🧐Join the AI Insight Talk with @huggingface, @OpenCompassX, @ModelScope2022 and @ZhihuFrontier 🚀Tech deep-dives & breakthroughs 🚀Roundtable debates ⏰Aug 21, 5 AM PDT 📺Live: youtube.com/live/kh0WSMoVZ…

OpenMMLab's tweet image. 🔥China’s Open-source VLMs boom—Intern-S1, MiniCPM-V-4, GLM-4.5V, Step3, OVIS
🧐Join the AI Insight Talk with @huggingface, @OpenCompassX,  @ModelScope2022 and @ZhihuFrontier
🚀Tech deep-dives & breakthroughs
🚀Roundtable debates
⏰Aug 21, 5 AM PDT
📺Live: youtube.com/live/kh0WSMoVZ…

OpenMMLab reposted

🚀 Introducing #CompassVerifier: A unified and robust answer verifier for #LLMs evaluation and #RLVR! ✨LLM progress is bottlenecked by weak evaluation, looking for an alternative to rule-based verifiers? CompassVerifier can handle multiple domains including math, science, and…

OpenCompassX's tweet image. 🚀 Introducing #CompassVerifier: A unified and robust answer verifier for  #LLMs evaluation and #RLVR!
✨LLM progress is bottlenecked by weak evaluation, looking for an alternative to rule-based verifiers? CompassVerifier can handle multiple domains including math, science, and…
OpenCompassX's tweet image. 🚀 Introducing #CompassVerifier: A unified and robust answer verifier for  #LLMs evaluation and #RLVR!
✨LLM progress is bottlenecked by weak evaluation, looking for an alternative to rule-based verifiers? CompassVerifier can handle multiple domains including math, science, and…
OpenCompassX's tweet image. 🚀 Introducing #CompassVerifier: A unified and robust answer verifier for  #LLMs evaluation and #RLVR!
✨LLM progress is bottlenecked by weak evaluation, looking for an alternative to rule-based verifiers? CompassVerifier can handle multiple domains including math, science, and…
OpenCompassX's tweet image. 🚀 Introducing #CompassVerifier: A unified and robust answer verifier for  #LLMs evaluation and #RLVR!
✨LLM progress is bottlenecked by weak evaluation, looking for an alternative to rule-based verifiers? CompassVerifier can handle multiple domains including math, science, and…

OpenMMLab reposted

Our paper won an outstanding paper on ACL 2025. Try our best open-source multimodal reasoning model Intern-S1 at huggingface.co/internlm/Inter…. This 241B MoE model combines strong general-task capabilities with state-of-the-art performance on a wide range of scientific tasks,…

intern_lm's tweet image. Our paper won an outstanding paper on ACL 2025.

Try our best open-source multimodal reasoning model Intern-S1 at huggingface.co/internlm/Inter….

This 241B MoE model combines strong general-task capabilities with state-of-the-art performance on a wide range of scientific tasks,…
intern_lm's tweet image. Our paper won an outstanding paper on ACL 2025.

Try our best open-source multimodal reasoning model Intern-S1 at huggingface.co/internlm/Inter….

This 241B MoE model combines strong general-task capabilities with state-of-the-art performance on a wide range of scientific tasks,…

OpenMMLab reposted

🚀Introducing Intern-S1, our most advanced open-source multimodal reasoning model yet! 🥳Strong general-task capabilities + SOTA performance on scientific tasks, rivaling leading closed-source commercial models. 🥰Built upon a 235B MoE language model and a 6B Vision encoder.…

intern_lm's tweet image. 🚀Introducing Intern-S1, our most advanced open-source multimodal reasoning model yet!
🥳Strong general-task capabilities + SOTA performance on scientific tasks, rivaling leading closed-source commercial models.
🥰Built upon a 235B MoE language model and a 6B Vision encoder.…
intern_lm's tweet image. 🚀Introducing Intern-S1, our most advanced open-source multimodal reasoning model yet!
🥳Strong general-task capabilities + SOTA performance on scientific tasks, rivaling leading closed-source commercial models.
🥰Built upon a 235B MoE language model and a 6B Vision encoder.…

OpenMMLab reposted

🚀 Introducing #POLAR: Bring Reward Model into a New Pre-training Era! ✨ Say goodbye to reward models with poor generalization! POLAR (Policy Discriminative Learning) is a groundbreaking pre-training paradigm that trains reward models to distinguish policy distributions,…

intern_lm's tweet image. 🚀 Introducing #POLAR: Bring Reward Model into a New Pre-training Era! 
✨ Say goodbye to reward models with poor generalization! POLAR (Policy Discriminative Learning) is a groundbreaking pre-training paradigm that trains reward models to distinguish policy distributions,…
intern_lm's tweet image. 🚀 Introducing #POLAR: Bring Reward Model into a New Pre-training Era! 
✨ Say goodbye to reward models with poor generalization! POLAR (Policy Discriminative Learning) is a groundbreaking pre-training paradigm that trains reward models to distinguish policy distributions,…

OpenMMLab reposted

We invited 3 top HF daily papers authors to deliver talks. Topics of this session: Reinforcement Learning Speakers: - Qi-Chen Zhao — Absolute Zero Reasoner: self-play RL that reaches SOTA reasoning with zero external data - Shu-Huai Ren — MiMo-VL: Xiaomi’s unified and…


OpenMMLab reposted

🥳Trained through #InternBootcamp, #InternThinker now combines pro-level Go skills with transparent reasoning. 😉In each game, it acts as a patient, insightful coach—analyzing the board, comparing moves, and clearly explaining each decision. 🤗Try it now: chat.intern-ai.org.cn/internthinker/…

intern_lm's tweet image. 🥳Trained through #InternBootcamp,  #InternThinker now combines pro-level Go skills with transparent reasoning.
😉In each game, it acts as a patient, insightful coach—analyzing the board, comparing moves, and clearly explaining each decision.
🤗Try it now: chat.intern-ai.org.cn/internthinker/…
intern_lm's tweet image. 🥳Trained through #InternBootcamp,  #InternThinker now combines pro-level Go skills with transparent reasoning.
😉In each game, it acts as a patient, insightful coach—analyzing the board, comparing moves, and clearly explaining each decision.
🤗Try it now: chat.intern-ai.org.cn/internthinker/…

🥳Introducing #InternBootcamp, an easy-to-use and extensible library for training large reasoning models. Unlimited automatic question generation and result verification. Over 1,000 verifiable tasks covering logic, puzzles, algorithms, games, and more. 🤗github.com/InternLM/Inter…

intern_lm's tweet image. 🥳Introducing #InternBootcamp, an easy-to-use and extensible library for training large reasoning models.
Unlimited automatic question generation and result verification.
Over 1,000 verifiable tasks covering logic, puzzles, algorithms, games, and more.
 🤗github.com/InternLM/Inter…


OpenMMLab reposted

🥳Introducing #InternBootcamp, an easy-to-use and extensible library for training large reasoning models. Unlimited automatic question generation and result verification. Over 1,000 verifiable tasks covering logic, puzzles, algorithms, games, and more. 🤗github.com/InternLM/Inter…

intern_lm's tweet image. 🥳Introducing #InternBootcamp, an easy-to-use and extensible library for training large reasoning models.
Unlimited automatic question generation and result verification.
Over 1,000 verifiable tasks covering logic, puzzles, algorithms, games, and more.
 🤗github.com/InternLM/Inter…

🥳#FaceShot generates animations for your "imaginary friends", like Teddy Bear, and brings them into life! 😉Project page: faceshot2024.github.io/faceshot/ 😉Paper link: arxiv.org/abs/2503.00740 😉Code: github.com/open-mmlab/Fac…


OpenMMLab reposted

🥳#StructFlowBench is a structurally annotated multi-turn benchmark that leverages a structure-driven generation paradigm to enhance the simulation of complex dialogue scenarios. 🥳StructFlowBench is now part of the #CompassHub! 😉Feel free to download and explore it—available…

OpenCompassX's tweet image. 🥳#StructFlowBench is a structurally annotated multi-turn benchmark that leverages a structure-driven generation paradigm to enhance the simulation of complex dialogue scenarios.
🥳StructFlowBench is now part of the #CompassHub! 😉Feel free to download and explore it—available…
OpenCompassX's tweet image. 🥳#StructFlowBench is a structurally annotated multi-turn benchmark that leverages a structure-driven generation paradigm to enhance the simulation of complex dialogue scenarios.
🥳StructFlowBench is now part of the #CompassHub! 😉Feel free to download and explore it—available…

OpenMMLab reposted

🥳Thrill to release the full RL training code of #OREAL! 😊Now you can fully reproduce the results of OREAL-7B/32B. Using #DeepSeek-R1-Distill-Qwen-32B, you can further obtain a model has 95.6 on MATH-500! 🤗Code: github.com/InternLM/OREAL 🤗Based on: github.com/InternLM/xtuner

🥳Introducing #OREAL, a new RL method for math reasoning. 😊With OREAL, a 7B model achieves 94.0 pass@1 on MATH-500, matching many 32B models, while OREAL-32B achieves 95.0 pass@1, surpassing #DeepSeek-R1 Distilled models. 🤗Paper/Model/Data: huggingface.co/papers/2502.06…

intern_lm's tweet image. 🥳Introducing #OREAL, a new RL method for math reasoning. 

😊With OREAL, a 7B model achieves 94.0 pass@1 on MATH-500, matching many 32B models, while OREAL-32B achieves 95.0 pass@1, surpassing #DeepSeek-R1 Distilled models. 

🤗Paper/Model/Data:
huggingface.co/papers/2502.06…
intern_lm's tweet image. 🥳Introducing #OREAL, a new RL method for math reasoning. 

😊With OREAL, a 7B model achieves 94.0 pass@1 on MATH-500, matching many 32B models, while OREAL-32B achieves 95.0 pass@1, surpassing #DeepSeek-R1 Distilled models. 

🤗Paper/Model/Data:
huggingface.co/papers/2502.06…


OpenMMLab reposted

🥳Introducing #OREAL, a new RL method for math reasoning. 😊With OREAL, a 7B model achieves 94.0 pass@1 on MATH-500, matching many 32B models, while OREAL-32B achieves 95.0 pass@1, surpassing #DeepSeek-R1 Distilled models. 🤗Paper/Model/Data: huggingface.co/papers/2502.06…

intern_lm's tweet image. 🥳Introducing #OREAL, a new RL method for math reasoning. 

😊With OREAL, a 7B model achieves 94.0 pass@1 on MATH-500, matching many 32B models, while OREAL-32B achieves 95.0 pass@1, surpassing #DeepSeek-R1 Distilled models. 

🤗Paper/Model/Data:
huggingface.co/papers/2502.06…
intern_lm's tweet image. 🥳Introducing #OREAL, a new RL method for math reasoning. 

😊With OREAL, a 7B model achieves 94.0 pass@1 on MATH-500, matching many 32B models, while OREAL-32B achieves 95.0 pass@1, surpassing #DeepSeek-R1 Distilled models. 

🤗Paper/Model/Data:
huggingface.co/papers/2502.06…

OpenMMLab reposted

🥳🚀🥳Try it now on: internlm-chat.intern-ai.org.cn

🚀Introducing InternLM3-8B-Instruct with Apache License 2.0. -Trained on only 4T tokens, saving more than 75% of the training cost. -Supports deep thinking for complex reasoning and normal mode for chat. Model:@huggingface huggingface.co/internlm/inter… GitHub: github.com/InternLM/Inter…

intern_lm's tweet image. 🚀Introducing InternLM3-8B-Instruct with Apache License 2.0.
-Trained on only 4T tokens, saving more than 75% of the training cost.
-Supports deep thinking for complex reasoning and normal mode for chat.
Model:@huggingface
huggingface.co/internlm/inter…
GitHub:
github.com/InternLM/Inter…
intern_lm's tweet image. 🚀Introducing InternLM3-8B-Instruct with Apache License 2.0.
-Trained on only 4T tokens, saving more than 75% of the training cost.
-Supports deep thinking for complex reasoning and normal mode for chat.
Model:@huggingface
huggingface.co/internlm/inter…
GitHub:
github.com/InternLM/Inter…


OpenMMLab reposted

🚀Introducing InternLM3-8B-Instruct with Apache License 2.0. -Trained on only 4T tokens, saving more than 75% of the training cost. -Supports deep thinking for complex reasoning and normal mode for chat. Model:@huggingface huggingface.co/internlm/inter… GitHub: github.com/InternLM/Inter…

intern_lm's tweet image. 🚀Introducing InternLM3-8B-Instruct with Apache License 2.0.
-Trained on only 4T tokens, saving more than 75% of the training cost.
-Supports deep thinking for complex reasoning and normal mode for chat.
Model:@huggingface
huggingface.co/internlm/inter…
GitHub:
github.com/InternLM/Inter…
intern_lm's tweet image. 🚀Introducing InternLM3-8B-Instruct with Apache License 2.0.
-Trained on only 4T tokens, saving more than 75% of the training cost.
-Supports deep thinking for complex reasoning and normal mode for chat.
Model:@huggingface
huggingface.co/internlm/inter…
GitHub:
github.com/InternLM/Inter…

Loading...

Something went wrong.


Something went wrong.