Kevin Swersky

@kswersk

Research Scientist at Deepmind.

Toronto, Ontario

Inscrit en Juin 2015

391Posts 8KAbonnés 524Abonnements

Vous pourriez aimer

@_rockt

@brandondamos

@dpkingma

@yeewhye

@latentjasper

@yaringal

@j_foerst

@Luke_Metz

@quocleix

@nickfrosst

@marcgbellemare

@poolio

@tejasdkulkarni

@dustinvtran

@tom_rainforth

Kevin Swersky a reposté

Paul Vicol

@PaulVicol

25 sept.

🚀 All this and more in our paper! arXiv: arxiv.org/abs/2509.20328 Project page: video-zero-shot.github.io By @thwiedemer, Yuxuan Li, @PaulVicol, @shaneguML, @nmatares, @kswersk, @_beenkim, @priyankjaini, and Robert Geirhos.

Kevin Swersky a reposté

Do generative video models learn physical principles from watching videos? Very excited to introduce the Physics-IQ benchmark, a challenging dataset of real-world videos designed to test physical understanding of video models. Webpage: physics-iq.github.io

Kevin Swersky a reposté

Alex Wiltschko

@awiltschko

29 oct.

Well, we actually did it. We digitized scent. A fresh summer plum was the first fruit and scent to be fully digitized and reprinted with no human intervention. It smells great. Holy moly, I’m still processing the magnitude of what we’ve done. And yet, it feels like as we cross…

Osmo

@Osmo_Labs

29 oct.

Scent Teleportation Update: WE DID IT! #Osmo #TechNews #AI #Scent

Kevin Swersky a reposté

Hanie Sedghi

@HanieSedghi

25 juin 2024

🆕🔥We show that LLMs *can* plan if instructed well! 🔥Instructing the model using ICL leads to a significant boost in planning performance, + can be further improved by using long context. arxiv.org/abs/2406.13094 w/ @Azade_na @bohnetbd A.Parisi @Kgoshvadi @kswersk @hanjundai +

HanieSedghi's tweet image. 🆕🔥We show that LLMs *can* plan if instructed well! 🔥Instructing the model using ICL leads to a significant boost in planning performance, + can be further improved by using long context. arxiv.org/abs/2406.13094
w/ @Azade_na @bohnetbd A.Parisi @Kgoshvadi @kswersk @hanjundai +

Kevin Swersky a reposté

AK

@_akhaliq

28 mai 2024

Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models We address the long-standing problem of how to learn effective pixel-based image diffusion models at scale, introducing a remarkably simple greedy growing method for stable training of large-scale,

_akhaliq's tweet image. Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models

We address the long-standing problem of how to learn effective pixel-based image diffusion models at scale, introducing a remarkably simple greedy growing method for stable training of large-scale,

Kevin Swersky a reposté

Priyank Jaini

@priyankjaini

4 déc. 2023

We have a student researcher opportunity in our team @GoogleDeepMind in Toronto 🍁 If you’re excited about research on diffusion models, and generative video models, please fill the form : forms.gle/auNq61N35AvTZS… and apply here: deepmind.google/about/careers/…

priyankjaini's tweet card. At the heart of Google DeepMind’s mission is our commitment to act as responsible pioneers in the field of AI, in service of society’s needs and expectations.

Careers

Source: deepmind.google

Kevin Swersky a reposté

Paul Vicol

@PaulVicol

2 oct. 2023

Check out @clark_kev’s and my paper on fine-tuning diffusion models on differentiable rewards! We present DRaFT, which computes gradients through diffusion sampling. DRaFT is efficient & works across many reward functions. With @kswersk, @fleet_dj arXiv: arxiv.org/abs/2309.17400

Kevin Clark

@clark_kev

2 oct. 2023

@PaulVicol and I are excited to introduce DRaFT, a method that fine-tunes diffusion models on rewards (such as scores from human preference models) by backpropagating through the diffusion sampling! with @kswersk, @fleet_dj arXiv: arxiv.org/abs/2309.17400 (1/5)

clark_kev's tweet image. @PaulVicol and I are excited to introduce DRaFT, a method that fine-tunes diffusion models on rewards (such as scores from human preference models) by backpropagating through the diffusion sampling!

with @kswersk, @fleet_dj
arXiv: arxiv.org/abs/2309.17400
(1/5)

Kevin Swersky

@kswersk

2 oct. 2023

I’m really excited about this project! Backpropagation and variations are extremely effective at fine-tuning diffusion models on downstream rewards.

Kevin Clark

@clark_kev

2 oct. 2023

Kevin Swersky

@kswersk

6 avr. 2023

This is a really natural framework to improve Bayesian optimization when you have access to related optimization tasks arxiv.org/abs/2109.08215 Joint work with @ziwphd, @GeorgeEDahl, Chansoo Lee, @zacharynado, @jmgilmer, @latentjasper, @ZoubinGhahrama1

Google AI

@GoogleAI

6 avr. 2023

Hyper Bayesian optimization (HyperBO) is a highly customizable interface that pre-trains a Gaussian process model and automatically defines model parameters, making Bayesian optimization easier to use while outperforming traditional methods. Learn more → goo.gle/3GnMYHG

Kevin Swersky a reposté

Ting Chen

@tingchenai

13 oct. 2022

📢Introducing Pix2Seq-D, a generalist framework casting panoptic segmentation as a discrete data generation task conditioned on pixels. Works for both images and videos, with minimal task engineering. arxiv.org/abs/2210.06366 work w/ Lala Li, @srbhsxn @geoffreyhinton @fleet_dj

Kevin Swersky

@kswersk

13 oct. 2022

This was a really interesting project for me to learn about and apply neural fields, with some great collaborators!

Andrea Tagliasacchi 🇨🇦

@taiyasaki

13 oct. 2022

📢📢📢 𝐂𝐔𝐅 – 𝐂𝐨𝐧𝐭𝐢𝐧𝐮𝐨𝐮𝐬 𝐔𝐩𝐬𝐚𝐦𝐩𝐥𝐢𝐧𝐠 𝐅𝐢𝐥𝐭𝐞𝐫𝐬 Neural-fields beat classical CNNs in (regressive) super-res: cuf-paper.github.io/cuf.pdf AFAIK, a first for @neural_fields in 2D deep learning? Mostly their wins are in sparse, higher-dim signals ~NeRF

taiyasaki's tweet image. 📢📢📢 𝐂𝐔𝐅 – 𝐂𝐨𝐧𝐭𝐢𝐧𝐮𝐨𝐮𝐬 𝐔𝐩𝐬𝐚𝐦𝐩𝐥𝐢𝐧𝐠 𝐅𝐢𝐥𝐭𝐞𝐫𝐬

Neural-fields beat classical CNNs in (regressive) super-res: cuf-paper.github.io/cuf.pdf

AFAIK, a first for @neural_fields in 2D deep learning?
Mostly their wins are in sparse, higher-dim signals ~NeRF

Kevin Swersky a reposté

Sergey Levine

@svlevine

27 oct. 2021

The overall recipe is general, and the same method could be applied to many other design problems in principle! More in the paper: arxiv.org/abs/2110.11346 Awesome collaboration led by @aviral_kumar2 & @ayazdanb! w/ Milad Hashemi & @kswersk

svlevine's tweet image. The overall recipe is general, and the same method could be applied to many other design problems in principle! More in the paper: arxiv.org/abs/2110.11346

Awesome collaboration led by @aviral_kumar2 &amp; @ayazdanb! w/ Milad Hashemi &amp; @kswersk

Kevin Swersky a reposté

Kory Mathewson

@korymath

14 sept. 2021

I made a bot improvise for 1000 hours and then asked it to come up with a few short-form improv games of it's own. Here's the first three...

korymath's tweet image. I made a bot improvise for 1000 hours and then asked it to come up with a few short-form improv games of it's own. Here's the first three...

Kevin Swersky a reposté

Jakub Tomczak

@jmtomczak

13 sept. 2021

Big shout-out to @wgrathwohl @kcjacksonwang @jh_jacobsen @DavidDuvenaud @kswersk @mo_norouzi for their amazing paper "Your classifier is secretly an energy-based model and you should treat it like one"!

Kevin Swersky

@kswersk

19 juil. 2021

This was a very fun project: an elegant algorithm that works well on the difficult task of sampling from discrete EBMs. Congratulations @wgrathwohl and team!

ICML Conference

@icmlconf

19 juil. 2021

ICML 2021 Outstanding Paper Award Honorable Mentions: 2/4. Will Grathwohl, Kevin Swersky, Milad Hashemi, David Duvenaud, and Chris Maddison 📜Oops I Took A Gradient: Scalable Sampling for Discrete Distributions (Tuesday 9am US Eastern)

Kevin Swersky a reposté

ICML Conference

@icmlconf

19 juil. 2021

Kevin Swersky a reposté

will grathwohl

@wgrathwohl

10 mai 2021

Hi all! Very pleased to share that my latest paper: "Oops I Took A Gradient: Scalable Sampling for Discrete Distributions" (arxiv.org/abs/2102.04509) has been accepted to ICML for a long presentation. Energy-Based Models have seen amazing progress in the last few years...

Kevin Swersky a reposté

Geoffrey Hinton

@geoffreyhinton

26 févr. 2021

I have a new paper on how to represent part-whole hierarchies in neural networks. arxiv.org/abs/2102.12627

Kevin Swersky a reposté

Amir Yazdan

@ayazdanb

5 févr. 2021

Leveraging #MachineLearning for accelerator design enables faster exploration of the architecture search space leading to more efficient hardware across a range of applications. Collaboration w/: @cangermueller, Berkin Akin, Yanqi Zhou, @miladhash, @kswersk.

Google AI

@GoogleAI

4 févr. 2021

Check out new work on ML-driven design and exploration of custom accelerators, showing how #MachineLearning facilitates architecture exploration by rapidly identifying high-performing configurations across a range of applications. Learn more ↓ goo.gle/3rngiEh

Kevin Swersky a reposté

Google AI

@GoogleAI

4 févr. 2021

hardmaru

@hardmaru

Eric Jang

@ericjang11

Soumith Chintala

@soumithchintala

Delip Rao e/σ

@deliprao

Dan Roy

@roydanroy

Kyunghyun Cho

@kchonyc

Andrew Gordon Wilson

@andrewgwils

Rosanne Liu

@savvyRL

Petar Veličković

@PetarV_93

Ferenc Huszár

@fhuszar

Sander Dieleman

@sedielem

Richard Socher

@RichardSocher

$sarahookr's profile picture. Adaptive Intelligence. Built @Cohere_Labs, @GoogleBrain, @GoogleDeepmind. ML Efficiency, Multimodal\lingual. Changing spaces where breakthroughs happen.$

Sara Hooker

@sarahookr

Sasha Rush

@srush_nlp

Jeff Dean

@JeffDean

Chris J. Maddison

@cjmaddison

Miles Brundage

@Miles_Brundage

Taco Cohen

@TacoCohen

Frank Nielsen

@FrnkNlsn

Danijar Hafner

@danijarh

Luke Ben

@Benluke22Luke

achilles wang

@achilleswang8

Ariel

@ArielZhu0729

wenjun tao

@TaoWenjun24561

SanchosDonkey

@sanchos_donkey

Autodidac

@Autodidac178306

Dhindsa

@dhindsa4825

Iris Chen

@IrisChen117

Jess Adams

@JessAdams46249

Bernardo Pamplona

@Bgpamplona

Abhishek Peri

@Abhishekperi1

AI Force

@YunZhi87214

Brim

@cullinan902

dragonAI

@AIMLforEdu

oahqb

@xyqoqvka

G Deer 😽

@mew_narita

DatBoiWithNoLife

@No_Life_Nobody

Finny Teker

@FinnyTeker

Luis Mejia

@LuisMejiaCanada

Salem Hanagh Gamy

@GamySalem

Yixin Lin

@yixin_lin_

Matej Kajinic

@matejkajinic

Sidharth Malhotra

@sidml

Andrew Davison

@AjdDavison

SAI RAM

@rsai31385_ram

Moon

@moonwonlee

bhushan

@bhushanpawar23

Høudini7🗼

@Houdini_7M

kovariance

@kovariance

hou.mon | هومان

@ihouman

Jamie Pullman

@Pullman

Florent (Flo) Nduwayezu

@nduwaflorent

Robert Scoble

@Scobleizer

mahmoud abbasid

@AbbasidMahmoud

Yash Kr Gupta

@ykgup

David Qiu

@qiucisco

CCIIFANG

@cciifang

Humans of the Latent Space

@latenthumans

Kazi Ershed Ahmed

@ErshedAhmed1965

tiffinita

@tiffinita

fofr

@fofrAI

Skalz

@_skalz_

Hernan Moraldo

@hhm

Krishna Pal Deora

@deorakp1

Murad Hemmadi

@muradhem

Z2

@zeetuuuu

Evan Chu

@evan_j_chu

Sohaibyousafzai

@SuhaibYusufzai

Shrikar

@Shrikarhaha

Theylor

@theylorz

hardmaru

@hardmaru

Peyman Milanfar

@docmilanfar

Eric Jang

@ericjang11

Soumith Chintala

@soumithchintala

(((ل()(ل() 'yoav))))👾

@yoavgo

Dan Roy

@roydanroy

Kyunghyun Cho

@kchonyc

Andrew Gordon Wilson

@andrewgwils

Rosanne Liu

@savvyRL

Kosta Derpanis

@CSProfKGD

NeurIPS Conference

@NeurIPSConf

Wojciech Zaremba

@woj_zaremba

Petar Veličković

@PetarV_93

Ferenc Huszár

@fhuszar

Sander Dieleman

@sedielem

Richard Socher

@RichardSocher

$sarahookr's profile picture. Adaptive Intelligence. Built @Cohere_Labs, @GoogleBrain, @GoogleDeepmind. ML Efficiency, Multimodal\lingual. Changing spaces where breakthroughs happen.$