Simon

@AI_Homelab

Teacher, PICTS School Admin, Student in MAS Didactics of media and computer science (UZH, PHSZ, HSLU), Techno Optimist, tinkerer, ML enthusiast

Simon reposted

Finally, I'd like to thank the following contributors for this release: @anthonywu (great meeting you in person in SF last month!) @Azrahel @ivanfioravanti for spreading the word and excitement around this model! For the full release notes, please see: github.com/filipstrand/mf…


Simon reposted

Ring-1T🔥 the trillion-parameter thinking model released by Ant group, the company behind Alipay @AntLingAGI huggingface.co/inclusionAI/Ri… ✨ 1T params (50B active)- MIT license ✨ 128K context (YaRN) ✨ RLVR, Icepop, and ASystem make trillion-scale RL stable


Ready to go to Bourgogne en France! One week break! ✌️ (besides some work)


Simon reposted

some models next week


Simon reposted

🛋️ Get Comfy With Comfy - WAN Alpha
We're kicking off a new series of short tutorials / guides to get you started with the latest technologies in ComfyUI. In just 2 minutes, we'll break down Wan Alpha, the new setup for creating layered RGBA videos in ComfyUI. Workflow below!


Simon reposted

Introducing Figure 03


Will there be a Phi 5? Gemma 4? Llama 5? A full set of OPEN Mistral models (new series)? What's your guess? Which main development are you looking forward to?


Simon reposted

Qwen Image Edit 2509 is the new leading open weights image editing model, ranking #3 overall in the Artificial Analysis Image Editing Arena and introducing multi-image editing capabilities! The latest release from Alibaba Qwen trails only Gemini 2.5 Flash (Nano-Banana) and…


Simon reposted

🚨 New Top Open Model Update! A relative newcomer to the Arena, @zai_org's GLM-4.6 takes the clear, undisputed #1 spot for Top Open Model. 🏆 It also ranks #4 overall, which is not an easy feat! The next top open model, DeepSeek R1 0528, has been the standing champion for…

Introducing GLM-4.6: Advanced Agentic, Reasoning and Coding Capabilities
As our new flagship model, GLM-4.6 brings significant advancements across real-world coding, long-context processing (up to 200K tokens), reasoning, search, writing, and agentic applications. API:…


Simon reposted

🔥 Ming-UniAudio: The 「Nano Banana」moment for speech is here! A single model for universal understanding, generation & free-form editing. First Unified Continuous Tokenizer 「MingTok-Audio」and Unified Und & Gen Speech LLM built on it. First Universal Free-form Speech Editing…


Simon reposted

The small VL model that you want! Smaller models are coming soon as well!

🚀 Qwen3-VL-30B-A3B-Instruct & Thinking are here! Smaller size, same powerhouse performance 💪—packed with all the capabilities of Qwen3-VL! 🔧 With just 3B active params, it’s rivaling GPT-5-Mini & Claude4-Sonnet — and often beating them across STEM, VQA, OCR, Video, Agent…

