Johan Ferret

@johanferret

Research Scientist at Google DeepMind. PhD from @InriaScool. All things Reinforcement Learning. Into generative art, roguelikes & music.

Pinned

We just released Gemma 3n, a mobile-first & multimodal LLM that works with as little as 2GB of RAM.

Feels crazy to interact with a model whose training I contributed to, hosted on my *own* phone (see screenshot!) 🤯

It packs so much for its size, give it a try (how to in thread)!

Johan Ferret reposted

@OfficialLoganK: Introducing two new Gemini 2.5 models (Flash and Flash-Lite) which are more intelligent, cost effective, and token efficient. You can keep up with our latest models through `gemini-flash-latest` and `gemini-flash-lite-latest`!!

Johan Ferret reposted

@JeffDean: AI efficiency is important. Today, Google is sharing a technical paper detailing our comprehensive methodology for measuring the environmental impact of Gemini inference. We estimate that the median Gemini Apps text prompt uses 0.24 watt-hours of energy (equivalent to watching an…

Johan Ferret reposted

@osanseviero: 🏥Introducing MedGemma, part 2, including:

🔥A 27B multimodal MedGemma
👀MedSigLIP, a lightweight image/text encoder for medical image retrieval/classification
📜A technical report with details

Blog: research.google/blog/medgemma-…
Paper: arxiv.org/abs/2507.05201

Johan Ferret reposted

I'm really impressed by the new Gemma 3n. I tried a 7.5GB model from Ollama and a 15GB model through mlx-vlm. They seem very capable, and this is the first model of that size I've tried that can handle both image AND audio input in addition to text! simonwillison.net/2025/Jun/26/ge…


Johan Ferret reposted

@GoogleDeepMind: We’re fully releasing Gemma 3n, which brings powerful multimodal AI capabilities to edge devices. 🛠️

Here’s a snapshot of its innovations 🧵

Johan Ferret reposted

🚀 Gemma 3n is HERE with day-0 MLX support! Thrilled to have collaborated with @GoogleDeepMind and @huggingface to bring the MLX community instant access to this groundbreaking model. Why Gemma 3n changes everything: 📹 True Multimodal: text, audio, image & video ⚡ Runs on…


Johan Ferret reposted

@ai_for_success: 🚨 Breaking: Google DeepMind has released the full version of Gemma 3n.

Huge, because now we have full multimodal AI capabilities for edge devices.
> Built to understand text, images, audio, and video
> Comes in two sizes: E2B and E4B but performs like 5B and 8B parameter…

Johan Ferret reposted

Announcing the full release of Gemma 3n, bringing powerful multimodal capabilities to edge devices for developers 🙌 ↓ developers.googleblog.com/en/introducing…


Johan Ferret reposted

Our open source Gemma models are the most powerful single GPU/TPU models out there! Our latest model Gemma 3n has amazing performance, multimodal understanding, & can run with as little as 2GB of memory - perfect for edge devices - enjoy building at ai.studio !

Quoted tweet from @GoogleDeepMind: We’re fully releasing Gemma 3n, which brings powerful multimodal AI capabilities to edge devices. 🛠️ Here’s a snapshot of its innovations 🧵


Johan Ferret reposted

@_philschmid: Gemma 3n is generally available! Gemma 3n is now available across all major open source libraries and platforms, including @huggingface transformers, @ollama, llama.cpp, @unslothai, @lmstudio, @vllm_project, @sgl_project, mlx, and others. 🚀

Gemma 3n
- is the first model under 10B…

Johan Ferret reposted

@osanseviero: I’m so excited to announce Gemma 3n is here! 🎉

🔊Multimodal (text/audio/image/video) understanding
🤯Runs with as little as 2GB of RAM
🏆First model under 10B with @lmarena_ai score of 1300+

Available now on @huggingface, @kaggle, llama.cpp, ai.dev, and more

Johan Ferret reposted

@_philschmid: The new 2.5 Flash-Lite is nuts! Faster and better than 2.0 Flash but with the same pricing. 🤯

Johan Ferret reposted

@sundarpichai: Gemini 2.5 Pro + 2.5 Flash are now stable and generally available. Plus, get a preview of Gemini 2.5 Flash-Lite, our fastest + most cost-efficient 2.5 model yet. 🔦

Exciting steps as we expand our 2.5 series of hybrid reasoning models that deliver amazing performance at the…

Johan Ferret reposted

@GoogleAI: 🎉 It's a BIG day for Gemini 2.5

- 2.5 Flash and 2.5 Pro are now stable and generally available in AI Studio, Vertex AI, and the @GeminiApp
- We're launching a preview of the new 2.5 Flash-Lite, our most cost-efficient and fastest 2.5 model yet

More info on each model below ⬇️

Johan Ferret reposted

@arankomatsuzaki: Gemini 2.5 tech report was released!

Johan Ferret reposted

@_philschmid: Gemini 2.5 is production ready! We just launched 3 new Gemini models with 2.5 Pro and Flash being now generally available and a new Gemini 2.5 Flash Lite preview! 🧠⚡️🔦

Here is all you need to know:
🔦 New Gemini 2.5 Flash Lite (Preview) with Thinking, 1M context, only…

Johan Ferret reposted

@ai_for_success: 🚨 BREAKING: Google Launches Gemini 2.5 Flash-Lite, Makes Gemini 2.5 Flash & Pro Generally Available

- Custom versions of 2.5 Flash-Lite and Flash added to Search.

Full details 👇

Johan Ferret reposted

@itsPaulAi: Google has just released Gemini 2.5 Flash Lite

This is the cheapest and fastest model available:

You can literally:

- Process the entire Harry Potter series for $0.22
- Analyze a 3-hour video for less than $0.35

And you can also enable thinking mode to enhance its…

Johan Ferret reposted

Hello Gemini 2.5 Flash-Lite! So fast, it codes *each screen* on the fly (Neural OS concept 👇). The frontier isn't always about large models and beating benchmarks. In this case, a super fast & good model can unlock drastic use cases. Read more: blog.google/products/gemin…


Johan Ferret reposted

@arena: 🚨 The votes are in and the new Gemini-2.5-Flash-Lite has officially landed on the LMArena leaderboards!

Let's see how it stacks up with the community:
💠 #12 Overall on Text Arena
💠 #3 in Creative Writing, #14 in Coding, #17 in Hard Prompt category
💠 3-6x cheaper than…

Hot Gemini updates off the press. 🚀 Anyone can now use 2.5 Flash and Pro to build and scale production-ready AI applications. 🙌 We’re also launching 2.5 Flash-Lite in preview: the fastest model in the 2.5 family to respond to requests, with the lowest cost too. 🧵


