Vivek Kumar

@vivek_kumar

Sr. Manager, Foundational Research @GoogleDeepMind 🦋 @v1vekkumar.bsky.social Ex @Dolby & @Broadcom Talks and Investments 👉🏽 http://portfolio.v1vek.com

Vivek Kumar reposted

I always thought the decline in fundamental AI research funding would happen because AI didn’t generate enough value to be worth the cost. But it seems like it’s happening because it generated too much value. And the race to capture that value is taking priority. Just…


AI for Finance done right 🚀 Congrats @KrisBennatti and @HudsonLabs team 🎉

Most AI fails in finance ❌ Hallucinations. Missed guidance. Inaccurate numbers. Today we launch The Co-Analyst → high-precision AI for institutional investors. Used by funds managing $1T+ AUM. Only the facts. Direct-from-source. Every time.


Veo3-audio/Veo3-fast-audio at #1 🚀🚀🚀

🎬 The Video Arena Leaderboard is now live! 14,000+ community votes have ranked the top Text-to-Video and Image-to-Video models.

📝 Text-to-Video rankings:
- #1 Veo3 (audio on)
- #3 Veo3, Veo3-fast
- #5 Hailuo 02 [Standard], Seedance 1.0 pro
- #6 Kling 2.1 Master
- #9 Wan 2.2…


Vivek Kumar reposted

We just discovered the 🔥 COOLEST 🔥 trick in Flow that we have to share: Instead of wordsmithing the perfect prompt, you can just... draw it. Take the image of your scene, doodle what you'd like on it (through any editing app), and then briefly describe what needs to happen…


Vivek Kumar reposted

We're looking for people to join us to work on Gemini Diffusion and help revolutionize language modeling! Details below: job-boards.greenhouse.io/deepmind/jobs/…

Excited to share what my team has been working on lately - Gemini diffusion! We bring diffusion to language modeling, yielding more power and blazing speeds! 🚀🚀🚀 Gemini diffusion is especially strong at coding. In this example the model generates at 2000 tokens/sec,…



Magenta RealTime - open weights live music model! 🚀🚀🚀

Excited to announce 🎵Magenta RealTime, the first open weights music generation model capable of real-time audio generation with real-time control. 👋 **Try Magenta RT on Colab TPUs**: colab.research.google.com/github/magenta… 👀 Blog post: g.co/magenta/rt 🧵 below



Vivek Kumar reposted

Or train every model as a MatFormer and have native elasticity (akin to virtualization) that can help you span the entire accuracy-vs-resource pareto frontier. 🪆

For years, we've been saying that bigger isn't always better for AI and that smaller specialized models are usually faster, cheaper and more accurate for your specific constraints. So super happy to release the long-overdue capability of finding the best model based on size on…


Vivek Kumar reposted

read the first letter of every name in the gemini contributors list

Exciting! The Magenta team is sharing tools for real-time, interactive music generation. The new Lyria RealTime API is ready for you to build with 🚀

On the occasion of returning to Magenta's roots at @sonarplusd, we're dusting off the blog to share news and insights about what we're working on at @GoogleDeepMind on the Lyria Team. g.co/magenta/lyria-… Our latest post is about the Lyria RealTime API, providing access to…



Vivek Kumar reposted

Try the native audio dialog with thinking 👌

Gemini 2.5 Flash Preview now supports native audio output via the Live API for seamless, natural spoken interactions and greater voice control. A new experimental thinking version of this audio model supports reasoning capabilities for more complex tasks. ai.google.dev/gemini-api/doc…



reshaping reality ✨ - #veo3 just casually blowing up the internet - one generated video at a time. This is just the beginning 🚀

The Veo3 effect on traffic to @GoogleDeepMind. Have you tried it yet?


Veo 3 now available in more countries 🚀

Veo 3 dropped about 100 hours ago, and it's been on 🔥🔥🔥 ever since. Now, we’re excited to announce:
+ 71 new countries have access
+ Pro subscribers get a trial pack of Veo 3 on the web (mobile soon)
+ Ultra subscribers get the highest # of Veo 3 gens w/ refreshes
How to try…



Vivek Kumar reposted

So proud of our team's contributions!

Again very happy to see launches powered by Gemini Native Audio Output capabilities, where Google DeepMind team members in Tokyo🗼 made significant contributions! Text-to-Speech: aistudio.google.com/generate-speech Dialog: aistudio.google.com/live?model=gem…



Vivek Kumar reposted

The audio team released new dialog and TTS models. Check them out at aistudio.google.com/live

💬 Smarter dialogue: Gemini-powered native audio means Project Astra has better context and customizable accents. 🕹️ Takes action: Computer control lets it open and engage with apps at your direction. 🤝 Personalized help: Integrates with your @Gmail, @GoogleCalendar and more…



Vivek Kumar reposted

Cutting-edge tomatoes 🔪 #Veo3 (Sound on!) 1. Slicing a felt tomato 👇


Vivek Kumar reposted

Veo 3 is multi-lingual, here is some speech in Turkish #Veo3


Still buzzing from this! grokking physics + synced audio 🎧 generation - the realism is insane! So proud of our team's contributions in making this a reality!

