Deepseek casually drops a 685 Billion Parameter model on Hugging Face with MIT license! The higher variant performance on par with GPT-5 and Gemini 3.0 Pro! And yet, this model is open weights coming with a Technical Paper!

1littlecoder's tweet image.

The new Deepseek is here! Gold-medal performance in the 2025 International Mathematical Olympiad (IMO) and International Olympiad in Informatics (IOI).

1littlecoder's tweet image.


I prefer my privacy-invading apps to be watched by the US intelligence community.


It's an open-weights model. You can download it and run it if you have a big enough computer. It's like pirating a movie and watching it on your computer: no one at the movie studio will know. Except here you don't have to pirate; it's free to download and redistribute.


Why guess when you can know?


These behemoth parameter models are great, especially when hosted externally in the cloud. Good instructional models that run locally with sub 16 billion parameters with solid tool access and live internet access will win the day, at least in my book.


I guess they, or someone using their models, will release distilled versions


Yet again, another great Chinese release.


Yet again 🔥🔥🔥


It's funny that no one in the West is speaking about DeepSeek.


This is amazing. Open-source evolves quickly


Open weights + top-tier benchmarks = best of both worlds.




Wow, LittleCoder, that's a game changer, but open source might face challenges, no?


It's defaulting to responding in Chinese for me. My language setting is "system".

brite_owl's tweet image.

But it’s a Chinese model…


Open-source models this large are as good as unusable for any normal person. How can you run this locally without spending tens of thousands on GPUs? It's really just a marketing ploy.


@grok explain


It's great we have open source like Deepseek keeping the paid models more honest. Can't wait until they get good enough to be run on the average person's PC.


@grok what does this mean


Great for the open-source community, and important proof of their new attention mechanism working, but this type of performance is almost standard with the public recipe for making models. The bigger question is why the frontier labs aren't further ahead; maybe they only release models a…




DeepSeek dropping this beast on Hugging Face feels like the open-source revolution just leveled up big time. With its sparse attention slashing inference costs by 70%, that MIT-licensed powerhouse turns elite reasoning into everyday reality for devs everywhere.


V3 is actually a 671B MoE with about 37B active params, trained on 14.8T tokens in roughly 2.8M H800 hours, using MLA and multi-token prediction. Frontier-level performance with open weights and an MIT license is a crazy unlock for anyone building serious agents.


Is this really on GPT-5 level or is that just benchmark talk? Anyone tried it yet?


China, casually wiping out moats. They seemingly want to fill the world with machine intelligence that runs on everything. Just don't ask it about Tiananmen Square 😁 The PRC being more open AI than OpenAI wasn't on my futures checklist.


I just wish @GroqInc or @cerebras did their thing with this model! Please I need speed!


This is how it should be. No hype, no drama, pure engineering. All the other releases are nothing but more drama less impact.


Still, the common man can't download and use it. You need a high-end, costly system. Even the quantized versions will be huge.


In reality it comes maybe 30% as close, if even that, to models like Gemini 3 or GPT-5. That's the reality: these models are trained for the tests, not for actual usage. They're useless in practice.


Shit, how many rented H100s are needed to run it, a whole rack?
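A back-of-the-envelope sketch of that question (assumptions: 685B total params, FP8 weights at 1 byte per parameter, 80 GB H100s; this counts weights only and ignores KV cache and activation memory, which push the real requirement higher):

```python
# Rough estimate of GPUs needed just to hold the model weights in memory.
TOTAL_PARAMS = 685e9      # 685B parameters (from the headline figure)
BYTES_PER_PARAM = 1       # assuming FP8 quantization (1 byte/param)
GPU_MEMORY_GB = 80        # one H100, 80 GB variant

weights_gb = TOTAL_PARAMS * BYTES_PER_PARAM / 1e9  # 685 GB of weights
gpus_needed = -(-weights_gb // GPU_MEMORY_GB)      # ceiling division -> 9

print(f"Weights: {weights_gb:.0f} GB, GPUs needed: {gpus_needed:.0f}")
```

So even at FP8, the weights alone overflow a standard 8×H100 node; serving setups typically use more GPUs or lower-bit quantization.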


685B open weights with MIT license is wild. The open frontier just leveled up.


Great to see DeepSeek V3.2 supporting interleaved thinking!


China is on a level the West cannot comprehend. Elon, Dalio, and Sachs understand fully, world’s richest man, world’s largest hedge fund manager, and top 3 diplomats in the world. But what do they know…


When casually means 685 billion parameters and performance that rivals the big players, that's not casual that's revolutionary 🚀


Open weights + MIT license + GPT-5-tier performance… This isn't a release. This is a declaration of war on closed-model economies.


GPT-5 & Gemini 3.0 Pro, watch out! 🤣 Open source is coming for you! Seriously though, congrats to Deepseek on this impressive release. Can't wait to try it out.


Humanity will be forever grateful to DeepSeek.


Honestly I need to invest in GPUs for this. Can't be wasting my money anymore. Or hmm, remote server for inference? Yh.

