
Rishabh Joshi

@rishabh_joshi4

Google DeepMind @GoogleAI. Ex @LTIatCMU @tsvetshop @BITS_Pilani.

Rishabh Joshi reposted

We are launching Gemini-pro-exp-03-25, which is our most capable model and generally useful on a wide variety of real-world tasks. It's #1 on LMArena and SOTA on a wide set of benchmarks. This is a massive Gemini-wide effort and I am incredibly proud of the team behind this. 🚀🚀

BREAKING: Gemini 2.5 Pro is now #1 on the Arena leaderboard - the largest score jump ever (+40 pts vs Grok-3/GPT-4.5)! 🏆 Tested under codename "nebula"🌌, Gemini 2.5 Pro ranked #1🥇 across ALL categories and UNIQUELY #1 in Math, Creative Writing, Instruction Following, Longer…



Rishabh Joshi reposted

Exciting News from Chatbot Arena! @GoogleDeepMind's new Gemini 1.5 Pro (Experimental 0801) has been tested in Arena for the past week, gathering over 12K community votes. For the first time, Google Gemini has claimed the #1 spot, surpassing GPT-4o/Claude-3.5 with an impressive…


Today, we are making an experimental version (0801) of Gemini 1.5 Pro available for early testing and feedback in Google AI Studio and the Gemini API. Try it out and let us know what you think! aistudio.google.com



Rishabh Joshi reposted

One more day until #GoogleIO! We’re feeling 🤩. See you tomorrow for the latest news about AI, Search and more.


Rishabh Joshi reposted

Quickly start your chat with Gemini using the new shortcut in the Chrome desktop address bar👇
Step 1: Type “@” in the desktop address bar and select Chat with Gemini
Step 2: Write your prompt
Step 3: Get your response on gemini.google.com
Seriously. It’s that easy ✨


Rishabh Joshi reposted

Gemini 1.5 Pro has entered the (LMSys) Arena! Some highlights:
- The only "mid" tier model at the highest level alongside "top" tier models from OpenAI and Anthropic ♊️
- The model excels at multimodal and long context (not measured here) 🐍
- This model is also state-of-the-art…

More exciting news today -- Gemini 1.5 Pro result is out! Gemini 1.5 Pro API-0409-preview now achieves #2 on the leaderboard, surpassing #3 GPT4-0125-preview to almost top-1! Gemini shows even stronger performance on longer prompts, in which it ranks joint #1 with the latest…



Really grateful to be a part of this amazing team!

We’re excited to announce 𝗚𝗲𝗺𝗶𝗻𝗶: @Google’s largest and most capable AI model. Built to be natively multimodal, it can understand and operate across text, code, audio, image and video - and achieves state-of-the-art performance across many tasks. 🧵 dpmd.ai/announcing-gem…



Glad to be part of this effort and contribute to the team!

Introducing Gemini 1.0, our most capable and general AI model yet. Built natively to be multimodal, it’s the first step in our Gemini-era of models. Gemini is optimized in three sizes - Ultra, Pro, and Nano. Gemini Ultra’s performance exceeds current state-of-the-art results on…



Rishabh Joshi reposted

Exciting times, welcome Gemini (and MMLU>90)! State-of-the-art on 30 out of 32 benchmarks across text, coding, audio, images, and video, with a single model 🤯 Co-leading Gemini has been my most exciting endeavor, fueled by a very ambitious goal. And that is just the beginning!…


Rishabh Joshi reposted

student researcher applications for next year @GoogleDeepMind are open! i don't yet know if i will have any on my team, but there will be lots of fantastic projects to choose from, so please apply! google.com/about/careers/…


Rishabh Joshi reposted

People are realizing RLHF can be easy with DPO and SLiC-HF. If you were wondering how they compare, the answer is they are pretty similar and our paper (arxiv.org/abs/2309.06657 led by @Terenceliu4444) shows the math. The biggest question is whether you should train a preference…


Aligning LLMs with Human Preferences is one of the most active research areas🧪  RLHF, DPO, and SLiC are all techniques for aligning LLMs, but they come with challenges. 🥷 @GoogleDeepMind proposes a new method, “Statistical Rejection Sampling Optimization (RSO)” 🧶



A great summary of our most recent paper, where we build on SLiC-HF and show that sampling from the policy and then optimizing is better than direct optimization (DPO). We introduce a nice way to sample candidates for training.

Aligning LLMs with Human Preferences is one of the most active research areas🧪  RLHF, DPO, and SLiC are all techniques for aligning LLMs, but they come with challenges. 🥷 @GoogleDeepMind proposes a new method, “Statistical Rejection Sampling Optimization (RSO)” 🧶



A few friends, exactly a decade ago on 08/23/2013, won a NASA competition on making a design patch. They decided to name it Chandrayaan 3. Exactly 10 years later, on 08/23/2023, the Chandrayaan 3 mission landed on the moon! Coincidence... or prophecy? @isro @NASAKennedy @PMOIndia

India made history by being the first country to descend on the Moon’s South Pole on August 23, 2023. Congratulations ISRO and to every single Indian! Here's a story of 3 girls who dreamt big! #Chandrayaan3 #Chandrayaan3Success #Chandrayaan3Mission #ISRO #PMModi #IndiaOnTheMoon



Rishabh Joshi reposted

We also showed in the original SLiC paper that if your model likelihood is well-calibrated you can just decode a lot and rank by likelihood to filter arxiv.org/abs/2210.00045


Tired of trying to get RL to work with Human Feedback? Try our method - SLiC: Sequence level calibration using human feedback! Work with @yaozhaoai @peterjliu @khalman_m @Mohamma78108419 and Tianqi at @GoogleAI @DeepMind

Here is our “slick” RLHF-alternative without RL: arxiv.org/abs/2305.10425 (SLiC-HF) TL;DR: Works as well as RLHF, but a lot simpler. About as easy and efficient as fine-tuning. Much better than simply fine-tuning on good examples. From great collaborators: @yaozhaoai,…



Rishabh Joshi reposted

The phenomenal teams from Google Research’s Brain and @DeepMind have made many of the seminal research advances that underpin modern AI, from Deep RL to Transformers. Now we’re joining forces as a single unit, Google DeepMind, which I’m thrilled to lead! dpmd.ai/announcing-goo…


Rishabh Joshi reposted

𝗚𝗼𝗼𝗴𝗹𝗲 𝗗𝗲𝗲𝗽𝗠𝗶𝗻𝗱


Rishabh Joshi reposted

Excited and looking forward to this tomorrow! If you are around and interested, please stop by.

For this week's NLP seminar, we are delighted to host @danish037 ! Danish will talk about Evaluating Explanations. The talk will be Thursday at 11AM PT. Non-Stanford affiliate registration form: forms.gle/KF1Tdjar3Ud4Yz…



Rishabh Joshi reposted

Which training examples taught my LLM to do that? 🤔 New from Google Research: Simfluence tracks how much "smarter" your model gets after consuming each example. It can then simulate scenarios like “What if I removed X dataset from my training corpus?” arxiv.org/abs/2303.08114 🧵


Rishabh Joshi reposted

I am looking for a few PhD/MTech (research) students for my lab at IISc Bangalore. The institute-wide applications for these programs are now open (deadline: March 23). Please email me if you have any questions. iisc.ac.in/admissions/

I am beyond thrilled to share that I'll be starting as an assistant professor at the Indian Institute of Science (IISc), Bangalore in April 2023. I couldn’t have been luckier—I'm grateful for the support of many kind mentors, peers, students, friends and family members. (1/4)



Rishabh Joshi reposted

We’re super excited to share our #eacl2023 paper - Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey Paper: arxiv.org/abs/2210.07700 w/ @shocheen, Lucille Njoo, @anas_ant, Yulia Tsvetkov from @tsvetshop lab. 1/7

