modelscope swift

@ms_swift2023

LLM & MLLM training framework from ModelScope

Joined October 2018

14 posts, 9 followers, 49 following

modelscope swift

@ms_swift2023

February 23

We've added some features to improve GRPO training speed up to 200%, based on the awesome work of TRL and Open-R1. Check the script here: github.com/modelscope/ms-… Check the wandb run here: wandb.ai/tastelikefeet/…

modelscope swift

@ms_swift2023

August 29, 2024

SWIFT now supports fine-tuning (VQA/Grounding/OCR/etc.) of the Qwen2-VL series models. Check: github.com/modelscope/ms-…

modelscope swift

@ms_swift2023

June 28, 2024

We also support fine-tuning of the Florence series models: github.com/modelscope/swi…

modelscope swift

@ms_swift2023

June 28, 2024

SWIFT now supports fine-tuning of Gemma2: github.com/modelscope/swi…

modelscope swift

@ms_swift2023

June 7, 2024

SWIFT now supports fine-tuning of Qwen2/GLM4/GLM4v: github.com/modelscope/swi…

GitHub - modelscope/ms-swift: Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, GLM4.5, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, Llava, GLM4v, Ph...

Source: github.com

modelscope swift

@ms_swift2023

May 18, 2024

We've started a Space on Hugging Face: huggingface.co/spaces/modelsc… Meanwhile, we now support peft 0.11.0 with HQQ/EETQ/VeRA/PiSSA training!

Swift (LLM lightweight framework for fine-tuning and inference) - a Hugging Face Space by modelscope

Source: huggingface.co

modelscope swift

@ms_swift2023

May 13, 2024

Try the quantized versions of Yi1.5-6b/9b/34b! The models can be found here: huggingface.co/modelscope

modelscope (modelscope)

Source: huggingface.co

modelscope swift

@ms_swift2023

May 11, 2024

SWIFT is an LLM/MLLM training framework from the ModelScope community that supports models such as LLaMA3/Mistral/DeepSeek-VL/YI-VL/Qwen-VL/LLaVA. We have added support for EETQ and HQQ QLoRA training for LLMs and MLLMs: github.com/modelscope/swi… @_akhaliq