
Ethan

@torchcompiled

trying to feel the magic. cofounder at @leonardoai_, directing research at @canva

Pinned

personally I feel like the inflection point was early 2022. The sweet spot where CLIP-guided diffusion was just taking off, forcing unconditional models to be conditional through a strange patchwork of CLIP evaluating slices of the canvas at a time. It was like improv, always…

[four images attached]
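For context, the "strange patchwork" refers to the cutout trick from that era: score random crops of the canvas with CLIP and follow the gradient of text-image similarity to steer an unconditional model. A minimal sketch, assuming an OpenAI-style clip_model with an encode_image method, a precomputed CLIP text embedding, and a canvas larger than CLIP's input size (all names and hyperparameters here are illustrative):

```python
import torch
import torch.nn.functional as F

def clip_guidance_grad(x, text_emb, clip_model, n_cuts=16, cut_size=224):
    """Estimate the gradient of CLIP text-image similarity w.r.t. the canvas
    by scoring random square cutouts (the 'patchwork' trick)."""
    x = x.detach().requires_grad_(True)
    _, _, h, w = x.shape
    cuts = []
    for _ in range(n_cuts):
        # sample a random slice of the canvas and resize it to CLIP's input size
        size = int(torch.randint(cut_size // 2, min(h, w), ()))
        top = int(torch.randint(0, h - size + 1, ()))
        left = int(torch.randint(0, w - size + 1, ()))
        patch = x[:, :, top:top + size, left:left + size]
        cuts.append(F.interpolate(patch, size=cut_size, mode="bilinear"))
    cuts = torch.cat(cuts)
    img_emb = F.normalize(clip_model.encode_image(cuts), dim=-1)
    sim = (img_emb @ F.normalize(text_emb, dim=-1).T).mean()
    return torch.autograd.grad(sim, x)[0]

# inside an unconditional sampler's loop, nudge the sample toward the prompt:
# x = x + guidance_scale * clip_guidance_grad(x, text_emb, clip_model)
```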

@eprombeats: Image synthesis used to look so good. These are from 2021. I feel like this was an inflection point, and the space has metastasized into something abhorrent today (Grok, etc). Even with no legible representational forms, there was so much possibility in these images.

[four images attached]


I think this aged well. There's been quite a bit of work on training VAEs to have favorable representations, aligning them with embedding models. Why not just use the embedding models themselves as the latent space?


@sainingxie: three years ago, DiT replaced the legacy unet with a transformer-based denoising backbone. we knew the bulky VAEs would be the next to go -- we just waited until we could do it right. today, we introduce Representation Autoencoders (RAE). >> Retire VAEs. Use RAEs. 👇(1/n)

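The recipe, as the thread describes it at a high level, is to freeze a pretrained representation encoder and train only a decoder to invert it, so the diffusion transformer denoises directly in the representation space. A minimal sketch of that structure, assuming a DINOv2-style frozen encoder and a hypothetical decoder module (the plain MSE reconstruction loss and all architecture details here are assumptions, not the paper's exact setup):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class RepresentationAutoencoder(nn.Module):
    """Frozen pretrained encoder + trained decoder: the diffusion model
    operates in the encoder's representation space instead of a VAE latent."""

    def __init__(self, encoder: nn.Module, decoder: nn.Module):
        super().__init__()
        self.encoder = encoder.eval()
        for p in self.encoder.parameters():  # encoder stays frozen
            p.requires_grad_(False)
        self.decoder = decoder               # only the decoder is trained

    @torch.no_grad()
    def encode(self, images):   # pixels -> frozen embeddings
        return self.encoder(images)

    def decode(self, latents):  # embeddings -> pixels
        return self.decoder(latents)

def reconstruction_step(rae, images, opt):
    # train the decoder to invert the frozen representation
    # (decoder is assumed to output images of the same shape)
    recon = rae.decode(rae.encode(images))
    loss = F.mse_loss(recon, images)
    opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()
```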


In getting the outcomes we want from models, it all comes down to search. There are two strategies here:
- reducing the size of the search space
- searching efficiently
Finetuning, RL, and prompt engineering all tighten the generative distribution around the outputs we want. Searching…
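The tweet is truncated, but one concrete instance of the "searching efficiently" strategy is best-of-N sampling: draw several candidates and keep the one a scorer prefers. A minimal sketch, where generate and reward are hypothetical stand-ins for a generative model and a reward function:

```python
def best_of_n(generate, reward, prompt, n=16):
    """Search the model's output distribution: sample n candidates
    and keep the one the reward function scores highest."""
    candidates = [generate(prompt) for _ in range(n)]
    return max(candidates, key=lambda c: reward(prompt, c))
```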


Ethan reposted

New post! As opposed to building reward models over human ratings and using them for RL, can a model develop its own reward function? Humans seem to develop their own aesthetic preferences through exploration and socializing. How can we mimic this for generative models?


Two solutions raised here. The first captures the social aspect while leaving out the difficulty of exploration: given that one's taste develops by learning about others' tastes, we could imagine training a generative model over a dataset of many reward models, and sampling new plausible…

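As a speculative sketch of that first solution (everything here, names and shapes included, is an assumption, matching the tweet's "we could imagine"): treat each reward model's flattened weights as one training example, fit a small VAE over that dataset, and sample new plausible "tastes" from the prior:

```python
import torch
import torch.nn as nn

class RewardModelVAE(nn.Module):
    """Generative model over a dataset of reward models, represented
    as flattened weight vectors of dimension weight_dim."""

    def __init__(self, weight_dim, latent_dim=64):
        super().__init__()
        self.enc = nn.Sequential(nn.Linear(weight_dim, 512), nn.SiLU(),
                                 nn.Linear(512, 2 * latent_dim))
        self.dec = nn.Sequential(nn.Linear(latent_dim, 512), nn.SiLU(),
                                 nn.Linear(512, weight_dim))
        self.latent_dim = latent_dim

    def forward(self, w):
        mu, logvar = self.enc(w).chunk(2, dim=-1)
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()  # reparameterize
        kl = -0.5 * (1 + logvar - mu.pow(2) - logvar.exp()).sum(-1).mean()
        return self.dec(z), kl

    @torch.no_grad()
    def sample_reward_weights(self, n=1):
        # draw a new plausible reward function from the learned distribution
        return self.dec(torch.randn(n, self.latent_dim))
```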



Ethan reposted

New post! I believe we can think of ourselves through two different lenses: an exact point of experience, and the history of our patterns of behavior. Though the two are deeply interconnected.

