
Saurabh Singh

@saurasingh

Research Scientist @ Poetiq, Ex-Google DeepMind

Pinned

Peek into our groundbreaking results -- this is what takeoff looks like 🚀.

Is more intelligence always more expensive? Not necessarily. Introducing Poetiq. We’ve established a new SOTA and Pareto frontier on @arcprize using Gemini 3 and GPT-5.1.



Poetiq + Gemini 3 = New Global SOTA. We just broke the ARC-AGI benchmarks (and the cost barrier) using Google's latest model. @GeminiApp

Is more intelligence always more expensive? Not necessarily. Introducing Poetiq. We’ve established a new SOTA and Pareto frontier on @arcprize using Gemini 3 and GPT-5.1.



Someone might claim vision is a language problem: turn the image into a sequence and train a Transformer from scratch 😂.

You can just train ViT from scratch to solve ARC.



Love this analysis.

Gemini 3 Pro has around ~7.5T params
(vibe-mathing with explanation)

> the naive fit with an R^2 of 0.8816 yields a mean estimate of 2.325 quadrillion parameters
> ummm, that's not it

> let's only take sparse MoE reasoning models
> this includes gpt-oss-20B and 120B,…

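For anyone curious, the "naive fit" described above is just ordinary least squares in log space. A toy sketch with made-up numbers (the variables and data scaling01 actually regressed on are assumptions here, purely for illustration):

```python
import numpy as np

# Hypothetical (API price, parameter count) pairs -- purely illustrative;
# the features and data used in the original analysis are not shown here.
price = np.array([0.5, 1.0, 2.0, 4.0, 10.0])      # $/Mtok, made up
params_b = np.array([20, 120, 400, 700, 2000])    # billions, made up

# "Naive fit": ordinary least squares in log-log space,
# log(params) ~ a * log(price) + b.
a, b = np.polyfit(np.log(price), np.log(params_b), 1)

resid = np.log(params_b) - (a * np.log(price) + b)
r2 = 1 - resid.var() / np.log(params_b).var()
print(f"extrapolation at price=3.0: {np.exp(a * np.log(3.0) + b):.0f}B params, R^2={r2:.3f}")
```

The pitfall the thread runs into is exactly what you would expect from this setup: extrapolating a log-log line far outside the range of the fitted points amplifies small slope errors into orders-of-magnitude errors, hence the absurd quadrillion-parameter first estimate.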


From the look of it, TRM seems to be recurrent (a function called in a loop) rather than recursive (a function calling itself).

For those who don't know, "Deep Thinking" is an older recursive model from 2021. I suspect that "Gemini 3 Deep Think" uses some TRM-style recursion.
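A toy sketch of the distinction being drawn (control flow only, nothing to do with TRM's actual implementation): both variants apply the same refinement function a fixed number of times, but one iterates and the other self-calls.

```python
def refine_recurrent(f, x, steps):
    # Recurrent: the same function is applied repeatedly in a loop.
    for _ in range(steps):
        x = f(x)
    return x

def refine_recursive(f, x, steps):
    # Recursive: the function calls itself until the base case.
    if steps == 0:
        return x
    return refine_recursive(f, f(x), steps - 1)

# Both compute the same result; only the control flow differs.
assert refine_recurrent(lambda z: z + 1, 0, 5) == refine_recursive(lambda z: z + 1, 0, 5) == 5
```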



These are amazing results from Gemini 3 on the ARC-2 challenge.

After we got early access to Gemini 3 Pro, it was SOTA and we were impressed. Then Google told us, "there's one more thing..." Deep Think sets the new high-water mark on ARC-AGI-2.



I think diffusion should be done in latent space. A very simple and principled (upper bound on likelihood) training objective is described here, with all the math worked out for anyone interested. No need to be discrete; we can just assume continuous latents…

Just found the dLLM library for creating diffusion language models. It's still early, but it's insanely fun to experiment with diffusion (training, inference, eval). dLLM has the potential to become the main library for diffusion LLMs.
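For reference, the "upper bound on likelihood" mentioned above is the standard variational (ELBO-style) bound used to train diffusion models. Schematically, for a continuous latent chain x_1, …, x_T (this is the generic DDPM-form bound, not necessarily the exact objective in the linked write-up):

```latex
-\log p_\theta(x_0) \le
\mathbb{E}_q\!\Big[
  \sum_{t=2}^{T} D_{\mathrm{KL}}\big(q(x_{t-1}\mid x_t, x_0)\,\|\,p_\theta(x_{t-1}\mid x_t)\big)
  - \log p_\theta(x_0 \mid x_1)
\Big]
+ D_{\mathrm{KL}}\big(q(x_T \mid x_0)\,\|\,p(x_T)\big)
```

With continuous latents every term is a KL between Gaussians, available in closed form, which is the sense in which the objective is "simple and principled."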



Actually, it's called a model.

All the great breakthroughs in science are, at their core, compression. They take a complex mess of observations and say, "it's all just this simple rule". Symbolic compression, specifically. Because the rule is always symbolic -- usually expressed as mathematical equations. If…



Saurabh Singh reposted

LLMs are powerful sequence modeling tools! They can generate not only language, but also actions for playing video games, or numerical values for forecasting time series. Can we help LLMs better model these continuous "tokens"? Our answer: Fourier series! Let me explain… 🧵(1/n)
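A minimal sketch of the core idea (illustrative only, not the paper's actual parameterization): encode a continuous value as Fourier features at multiple frequencies so a sequence model can consume it, instead of forcing it through a discrete token vocabulary.

```python
import numpy as np

def fourier_features(x, num_frequencies=8):
    # Encode a scalar x (normalized to [0, 1]) as sin/cos features at
    # increasing frequencies -- a smooth, multi-scale representation.
    k = np.arange(1, num_frequencies + 1)
    return np.concatenate([np.sin(2 * np.pi * k * x),
                           np.cos(2 * np.pi * k * x)])

print(fourier_features(0.37).shape)  # (16,)
```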


Congrats to all the best paper award winners at ICML; no doubt a lot of hard work went into them. However, some of these papers clearly can't be reproduced, due to missing details etc. I didn't realize reproducibility is no longer necessary (to be best).


Saurabh Singh reposted

Happy to announce our paper ‘Fourier Basis Density Model’, joint work with @saurasingh and Johannes Ballé, won Best Paper Award at Picture Coding Symposium (PCS 2024)🏅 arxiv.org/abs/2402.15345


Saurabh Singh reposted

Can we use Fourier series to model univariate PDFs?
(esp. useful in neural compression)

We introduce a lightweight, flexible, and end-to-end trainable Fourier basis density model.

w/ @saurasingh Johannes Ballé

Paper: arxiv.org/abs/2402.15345
Code: github.com/google/codex
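A minimal sketch of the general idea, assuming the simplest possible setup (the paper's actual model differs in its details): write a function f(x) as a truncated Fourier series and take the density to be |f(x)|², which is nonnegative by construction and, with unit-norm coefficients, integrates to 1 on [0, 1] by orthonormality of the complex exponentials.

```python
import numpy as np

def fourier_density(x, coeffs):
    # p(x) = |sum_k c_k * e^{2*pi*i*k*x}|^2 -- nonnegative by construction.
    k = np.arange(len(coeffs))
    basis = np.exp(2j * np.pi * np.outer(x, k))
    f = basis @ coeffs
    return np.abs(f) ** 2

coeffs = np.array([0.8, 0.5 + 0.2j, 0.3j])
coeffs = coeffs / np.linalg.norm(coeffs)   # unit norm => density integrates to 1
xs = np.linspace(0.0, 1.0, 5)
print(fourier_density(xs, coeffs))
```

The appeal for neural compression is that the coefficients are unconstrained (normalization is a single division), so the whole density model is trainable end-to-end by gradient descent.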

Saurabh Singh reposted

Pleased to announce that the GPV-1 paper will be presented as an Oral at #CVPR2022. An updated version of the paper that conveys the key ideas much more clearly is now available on arxiv arxiv.org/abs/2104.00743 #GeneralPurposeVision #GPV @allen_ai

Excited to share snippets from our latest video explaining the ideas behind "General Purpose Vision".
Video: youtu.be/ok2-Y58PGAY
Paper, code & demo: prior.allenai.org/projects/gpv
Work done with collaborators @kamath_amita @anikembhavi @HoiemDerek @allen_ai @IllinoisCS 🧵



Saurabh Singh reposted

An excellent implementation of a keepalive loop.


Saurabh Singh reposted

Very excited about the renewed focus on iterative refinement as a powerful tool for generative modelling! Here are a few relevant ICLR 2021 submissions: (image credit: github.com/ermongroup/ncsn) (1/3)
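The "iterative refinement" here refers to sampling procedures like the annealed Langevin dynamics of NCSN. A minimal sketch under simplified assumptions (`score_fn` is a stand-in for a learned score model; here the known score of a unit Gaussian, so the loop visibly refines samples toward N(0, 1)):

```python
import numpy as np

def langevin_refine(score_fn, x, step_size=1e-2, n_steps=2000, seed=0):
    # Langevin-style refinement: repeatedly nudge samples along the score
    # of the target distribution while injecting Gaussian noise.
    rng = np.random.default_rng(seed)
    for _ in range(n_steps):
        noise = rng.standard_normal(x.shape)
        x = x + 0.5 * step_size * score_fn(x) + np.sqrt(step_size) * noise
    return x

x0 = np.full(1000, 5.0)                      # start far from the target
samples = langevin_refine(lambda x: -x, x0)  # score of N(0, 1) is -x
print(samples.mean(), samples.std())         # ~0, ~1
```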


Saurabh Singh reposted

Our group just released an arXiv paper that reviews Nonlinear Transform Coding approaches. We hope this will make the development of new end-to-end neural/learned compression methods easier to understand for those new to the field. arxiv.org/abs/2007.03034

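For newcomers, the recipe the review covers can be sketched in a few lines (the transforms and rate term below are toy stand-ins I made up; real systems use learned networks and learned entropy models):

```python
import numpy as np

def rd_loss(x, analysis, synthesis, rate_fn, lam=0.01):
    # Nonlinear transform coding: analysis transform -> quantization ->
    # synthesis transform, trained on a rate-distortion objective R + lam*D.
    y = analysis(x)                 # g_a: signal -> latent
    y_hat = np.round(y)             # scalar quantization
    x_hat = synthesis(y_hat)        # g_s: latent -> reconstruction
    distortion = np.mean((x - x_hat) ** 2)
    rate = rate_fn(y_hat)           # proxy for -log2 p(y_hat)
    return rate + lam * distortion

x = np.random.default_rng(0).normal(size=64)
print(rd_loss(x, analysis=lambda v: 4.0 * v, synthesis=lambda v: v / 4.0,
              rate_fn=lambda v: np.mean(np.abs(v))))  # toy rate proxy
```

The lambda in the objective sweeps out the rate-distortion trade-off: larger lambda buys fidelity at the cost of bits.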

Saurabh Singh reposted

1/8 What does it mean to filter an image (or signal)? Often we choose, or design, a set of weights and apply them to the input image. But what loss/objective function does this process optimize (if any)? Should we care?

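One classical answer to the thread's question, for the curious: a linear smoothing filter (e.g., a Gaussian blur) is exactly the closed-form minimizer of a locally weighted least-squares fit of a constant to each neighborhood. A minimal sketch:

```python
import numpy as np

def weighted_average(y, w):
    # argmin_c sum_i w_i * (y_i - c)^2  has closed form  c* = sum(w*y)/sum(w),
    # i.e., filtering with normalized weights IS this optimization.
    return np.sum(w * y) / np.sum(w)

patch = np.array([1.0, 2.0, 10.0, 2.0, 1.0])             # noisy local window
weights = np.exp(-0.5 * (np.arange(-2, 3) / 1.0) ** 2)   # Gaussian kernel
print(weighted_average(patch, weights))
```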

Saurabh Singh reposted

Welcome to May. As I feared, the federal government wasted April much as it wasted February. That is a harsh assessment given how much the country has been suffering. But without competent federal leadership, the best we are managing is to tread water. Some stats:

