ms_swift2023's profile picture. llm&mllm training framework from modelscope

modelscope swift

@ms_swift2023

llm&mllm training framework from modelscope

你可能會喜歡

We add some features to improve the GRPO training speed up to 200% based on the awesome work of TRL and Open-R1, check script here: github.com/modelscope/ms-… check wandb here: wandb.ai/tastelikefeet/…

ms_swift2023's tweet image. We add some features to improve the GRPO training speed up to 200% based on the awesome work of TRL and Open-R1, check script here: github.com/modelscope/ms-… check wandb here: wandb.ai/tastelikefeet/…

SWIFT has supported the finetuning(VQA/Grounding/OCR/etc) of Qwen2-VL series models, check: github.com/modelscope/ms-…


And also support the finetuning of Florence series models: github.com/modelscope/swi…


SWIFT has support the finetuning of Gemma2: github.com/modelscope/swi…


SWIFT is an LLM/MLLM training framework from the ModelScope community and supports models like LLaMA3/Mistral/DeepSeek-VL/YI-VL/Qwen-VL/LLaVA. We have newly supported the EETQ and HQQ QLoRA training for LLM and MLLMs: github.com/modelscope/swi… @_akhaliq


We support the training of qwen2-110b models, use this script: github.com/modelscope/swi… to train!


United States 趨勢

你可能會喜歡

Loading...

Something went wrong.


Something went wrong.