Vivek Kumar

@vivek_kumar

Sr. Manager, Foundational Research @GoogleDeepMind 🦋 @v1vekkumar.bsky.social Ex @Dolby & @Broadcom Talks and Investments 👉🏽 http://portfolio.v1vek.com

Science & Technology

Mountain View, CA

v1vek.com

Joined April 2008

2KPosts 2KFollowers 595Following

You might like

@justin_salamon

@yuwang_tw

@keunwoochoi

@jordiponsdotme

@csteinmetz1

@serrjoa

@mittu1204

@deeplearnmusic

@NicholasJBryan

@marcoamaram

@sebastian_ewert

@affige_yang

@GauthamMysore

@elio_elioo

@QiuqiangK

Vivek Kumar reposted

Awni Hannun

@awnihannun

Oct 23

I always thought the decline in fundamental AI research funding would happen because AI didn’t generate enough value to be worth the cost. But it seems like it’s happening because it generated too much value. And the race to capture that value is taking priority. Just…

Vivek Kumar

@vivek_kumar

Sep 9

AI for Finance done right 🚀 Congrats @KrisBennatti and @HudsonLabs team 🎉

Kris

@KrisBennatti

Sep 9

Most AI fails in finance ❌ Hallucinations. Missed guidance. Inaccurate numbers. Today we launch The Co-Analyst → high-precision AI for institutional investors. Used by funds managing $1T+ AUM. Only the facts. Direct-from-source. Every time.

KrisBennatti's tweet image. Most AI fails in finance ❌

Hallucinations. Missed guidance. Inaccurate numbers.

Today we launch The Co-Analyst → high-precision AI for institutional investors.

Used by funds managing $1T+ AUM.

Only the facts. Direct-from-source. Every time.

Vivek Kumar

@vivek_kumar

Aug 6

Veo3-audio/Veo3-fast-audio at #1 🚀🚀🚀

lmarena.ai

@arena

Aug 6

🎬 The Video Arena Leaderboard is now live! 14,000+ community votes have ranked the top Text-to-Video and Image-to-Video models. 📝 Text-to-Video rankings: - #1 Veo3 (audio on) - #3 Veo3, Veo3-fast - #5 Hailuo 02 [Standard], Seedance 1.0 pro - #6 Kling 2.1 Master - #9 Wan 2.2…

arena's tweet image. 🎬 The Video Arena Leaderboard is now live!

14,000+ community votes have ranked the top Text-to-Video and Image-to-Video models.

📝 Text-to-Video rankings:

- #1 Veo3 (audio on)
- #3 Veo3, Veo3-fast
- #5 Hailuo 02 [Standard], Seedance 1.0 pro
- #6 Kling 2.1 Master
- #9 Wan 2.2…

Vivek Kumar reposted

Google Labs

@GoogleLabs

Jul 24

We just discovered the 🔥 COOLEST 🔥 trick in Flow that we have to share: Instead of wordsmithing the perfect prompt, you can just... draw it. Take the image of your scene, doodle what you'd like on it (through any editing app), and then briefly describe what needs to happen…

Vivek Kumar reposted

Brendan O'Donoghue

@bodonoghue85

Jul 4

We're looking for people to join us to work on Gemini Diffusion and help revolutionize language modeling! Details below: job-boards.greenhouse.io/deepmind/jobs/…

Brendan O'Donoghue

@bodonoghue85

May 20

Excited to share what my team has been working on lately - Gemini diffusion! We bring diffusion to language modeling, yielding more power and blazing speeds! 🚀🚀🚀 Gemini diffusion is especially strong at coding. In this example the model generates at 2000 tokens/sec,…

Vivek Kumar

@vivek_kumar

Jun 20

Magenta RealTime - open weights live music model! 🚀🚀🚀

Chris Donahue

@chrisdonahuey

Jun 20

Excited to announce 🎵Magenta RealTime, the first open weights music generation model capable of real-time audio generation with real-time control. 👋 **Try Magenta RT on Colab TPUs**: colab.research.google.com/github/magenta… 👀 Blog post: g.co/magenta/rt 🧵 below

Vivek Kumar reposted

Aditya Kusupati

@adityakusupati

Jun 17

Or train every model as a MatFormer and have native elasticity (akin to virtualization) that can help you span the entire accuracy-vs-resource pareto frontier. 🪆

clem 🤗

@ClementDelangue

Jun 16

For years, we've been saying that bigger isn't always better for AI and that smaller specialized models are usually faster, cheaper and more accurate for your specific constraints. So super happy to release the long-overdue capability of finding the best model based on size on…

ClementDelangue's tweet image. For years, we've been saying that bigger isn't always better for AI and that smaller specialized models are usually faster, cheaper and more accurate for your specific constraints.

So super happy to release the long-overdue capability of finding the best model based on size on…

Vivek Kumar reposted

varepsilon

@var_epsilon

Jun 17

read the first letter of every name in the gemini contributors list

Vivek Kumar

@vivek_kumar

Jun 12

Exciting! The Magenta team is sharing tools for real-time, interactive music generation. The new Lyria RealTime API are ready for you to build with 🚀

Adam Roberts

@ada_rob

Jun 12

On the occasion of returning to Magenta's roots at @sonarplusd, we're dusting off the blog to share news and insights about what we're working on at @GoogleDeepMind on the Lyria Team. g.co/magenta/lyria-… Our latest post is about the Lyria RealTime API, providing access to…

Vivek Kumar reposted

Ankur Bapna

@ankurbpn

May 27

Try the native audio dialog with thinking 👌

Google AI Developers

@googleaidevs

May 27

Gemini 2.5 Flash Preview now supports native audio output via the Live API for seamless, natural spoken interactions and greater voice control. A new experimental thinking version of this audio model supports reasoning capabilities for more complex tasks. ai.google.dev/gemini-api/doc…

googleaidevs's tweet card. Learn about Google's most advanced AI models including Gemini 2.5 Pro

Gemini models | Gemini API | Google AI for Developers

Source: ai.google.dev

Vivek Kumar

@vivek_kumar

May 27

reshaping reality ✨ - #veo3 just casually blowing up the internet - one generated video at a time. This is just the beginning 🚀

Similarweb

@Similarweb

May 27

The Veo3 effect on traffic to @GoogleDeepMind. Have you tried it yet?

Vivek Kumar

@vivek_kumar

May 25

Veo 3 now available in more countries 🚀

Josh Woodward

@joshwoodward

May 24

Veo 3 dropped about 100 hours ago, and it's been on 🔥🔥🔥 ever since Now, we’re excited to announce: + 71 new countries have access + Pro subscribers get a trial pack of Veo 3 on the web (mobile soon) + Ultra subscribers get the highest # of Veo 3 gens w/ refreshes How to try…

Vivek Kumar reposted

Manish Gupta

@ManishGuptaMG1

May 23

So proud of our team's contributions!

Heiga Zen (全炳河)

@heiga_zen

May 22

Again very happy to see launches powered by Gemini Native Audio Output capabilities, where Google DeepMind team members in Tokyo🗼 made significant contributions! Text-to-Speech: aistudio.google.com/generate-speech Dialog: aistudio.google.com/live?model=gem…

Vivek Kumar reposted

Tara Sainath

@tnsainath

May 22

The audio team released new dialog and TTS models. check it out at aistudio.google.com/live

Google DeepMind

@GoogleDeepMind

May 20

💬 Smarter dialogue: Gemini-powered native audio means Project Astra has better context and customizable accents. 🕹️ Takes action: Computer control lets it open and engage with apps at your direction. 🤝 Personalized help: Integrates with your @Gmail, @GoogleCalendar and more…

Vivek Kumar reposted

Inbar Mosseri

@inbar_mosseri

May 21

Cutting-edge tomatoes 🔪 #Veo3 (Sound on!) 1. Slicing a felt tomato 👇

Vivek Kumar reposted

Hakan Erdogan

@HakanErdoganPhD

May 21

Veo 3 is multi-lingual, here is some speech in Turkish #Veo3

Vivek Kumar

@vivek_kumar

May 21

Still buzzing from this! grokking physics + synced audio 🎧 generation - the realism is insane! So proud of our team's contributions in making this a reality!