Chris 🇨🇦

@llm_wizard

Open Source Model Lover @ NVIDIA AI Views my own.

I only negatively update on the IQ of people who have these kinds of absolutist takes on non-consensus philosophy

i knowww this take will be universally hated but i negatively update on the iq of anyone who believes in qualia or the hard problem of consciousness



That's the same face I made when I got my Spark - so glad we did this.

One last surprise for our #NVIDIAGTC golden ticket winners 👀 🎁 Yesterday, we took the winners to see our brand new NVIDIA DGX Spark on the show floor... and then gave them each one. We can't wait to see what they create. #SparkSomethingBig



Chris 🇨🇦 reposted

We built ProfBench to raise the bar for LLMs - literally. At @NVIDIA, we worked with domain experts to create a benchmark that goes far beyond trivia and short answers. ProfBench tests LLMs on complex, multi-step tasks that demand the kind of reasoning, synthesis, and clarity…


This is my favourite description about this video I've ever read. From one of the finalists from the World's Shortest Hackathon from @NVIDIAGTC - incredible work.


Just want to shout out the incredible work of our media team - who can take my incoherent shouting and turn it into this (go Nemotron btw) youtube.com/shorts/3m_48SI…

New Nemotron Open-Source Models and Datasets | #NVIDIAGTC


Like, the Anthropic folks just do the coolest shit sometimes.

New Anthropic research: Signs of introspection in LLMs. Can language models recognize their own internal thoughts? Or do they just make up plausible answers when asked about them? We found evidence for genuine—though limited—introspective capabilities in Claude.



But more know every day!

Few know about it but Nvidia’s open-source AI game is insane.
Just look at this lead on HF activity Jensen’s flexing at GTC today.
Once they dominated with Software 1.0 (more than hardware).
Now they’re setting up to win Software 2.0 aka AI models + datasets.



Chris 🇨🇦 reposted

Devs need specialized AI agents for domain-specific workflows, real-world deployment, and compliance. 📝 See our tech blog to learn about our newest open Nemotron models for building multimodal agents, RAG pipelines, and AI with content safety ➡️ nvda.ws/4ob5bvM


Chris 🇨🇦 reposted

“Researchers need open source. Developers need open source. Companies around the world — we need open source.” -Jensen Huang #NVIDIAGTC


NVIDIA is really committed to this open models thing - like you wouldn’t believe! New models dropped during the keynote:
- NVIDIA Nemotron Nano 2 VL
- NVIDIA Nemotron Parse 1.1
- Llama 3.1 Nemotron Safety Guard
With a bunch of other models being moved from closed to open on…


Open Model section LESS GO


People always ask: “But why NVIDIA make models? They sell GPUs”, Jensen at the keynote just explained it: “Extreme co-design”, better models help us make better chips help us make better networks help us make better models help us make better chips.


Tired: Data Centres
Wired: AI Factory


GTC Keynote so far:
- 6G
- Hundreds of qubits and error correction for days


Had "Lemon Meringue" pie today - but they forgot the fuckin' lemon bro.


It's not nearly as big as I thought it was!


MoNoLiTh 👁️

