
Gilad

@giladturok

CS PhD student @cornell_tech 🐻 | diffusion language models 🤖 | novelty seeker 🤠 | Prev: @Uber @FlatironInst @Columbia

Pinned

I'm currently in my JAX era:

- Write my own JAX projects for ML + stats
- Do serious deep learning with {model, tensor, data} parallelism
- Write JAX-style code (functional) with grad, vmap, pmap, jit
- Understand why JAX is fast (jit, SPMD, lax)
- PyTorch vs JAX trade-offs
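Not the tweet's code, just a minimal sketch of the functional style the list mentions, under toy assumptions (a made-up linear-regression loss): jax.grad derives the gradient of a pure loss function, jax.jit compiles the update step, and jax.vmap vectorizes per-example gradients.

    import jax
    import jax.numpy as jnp

    # A pure loss function: params in, scalar out, no hidden state.
    def loss(params, x, y):
        w, b = params
        pred = x @ w + b
        return jnp.mean((pred - y) ** 2)

    # jax.grad transforms `loss` into its gradient function;
    # jax.jit compiles the whole update step with XLA.
    @jax.jit
    def update(params, x, y, lr=0.1):
        grads = jax.grad(loss)(params, x, y)
        return jax.tree_util.tree_map(lambda p, g: p - lr * g, params, grads)

    # jax.vmap maps grad over the batch axis: per-example gradients
    # with no Python loop.
    per_example_grads = jax.vmap(jax.grad(loss), in_axes=(None, 0, 0))

    key = jax.random.PRNGKey(0)
    x = jax.random.normal(key, (32, 3))
    y = x @ jnp.array([1.0, -2.0, 0.5])

    params = (jnp.zeros(3), jnp.zeros(()))
    for _ in range(200):
        params = update(params, x, y)
    g = per_example_grads(params, x, y)  # pytree of grads with leading axis 32

The pure-function discipline is what lets these transforms (and pmap/SPMD-style parallelism) compose cleanly.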

Big moves in diffusion LLMs

🤯 Diffusion LLMs: The New Frontier?
@TheInclusionAI has released LLaDA 2.0, the first diffusion model to scale to 100B params, matching frontier LLMs while achieving 2× faster inference! 🚀

Analysis from Zhihu contributor 赵俊博 Jake (Zhejiang Uni & Project Member):

The Bet:…



NeurIPS Starter Pack


Gilad reposted

🧵 New paper: "Simple Context Compression" - we show that mean-pooling beats the widely-used compression-tokens method for compressing contexts in LLMs, while being simpler and more efficient!
with @yoavartzi
(1/7)

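To make the comparison concrete, here is a minimal sketch of the mean-pooling idea (my own illustration, not the paper's code; the function name, shapes, and compression rate are assumptions): average every `rate` consecutive context embeddings into one vector, so downstream attention runs over a sequence that is `rate`× shorter.

    import jax.numpy as jnp

    def mean_pool_compress(h, rate):
        # h: (seq_len, d) context token embeddings; `rate` is the
        # compression factor. Zero-pad so seq_len divides evenly,
        # then average each window of `rate` vectors into one.
        seq_len, d = h.shape
        pad = (-seq_len) % rate
        h = jnp.pad(h, ((0, pad), (0, 0)))
        return h.reshape(-1, rate, d).mean(axis=1)

    h = jnp.ones((1000, 768))            # toy context embeddings
    z = mean_pool_compress(h, rate=8)    # (125, 768): an 8x shorter sequence

Unlike learned compression tokens, the pooling step itself has nothing to train, which is presumably where the simplicity and efficiency claims come from.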

Amazing place to work!

Want to do fundamental ML research in NYC? 🧠 The Center for Computational Mathematics @FlatironInst @SimonsFdn is hiring!
– Flatiron Research Fellow (postdoc, by Dec 1): apply.interfolio.com/173401
– Open Rank (by Jan 15): apply.interfolio.com/173640



Another day, another issue with Python env management/dependencies


Gilad reposted

I made a LaTeX CV template that's nice looking + easy to use (link below).

I added some nice features:
1. CV/resume dual mode
2. publication author annotations (equal contribution *)
3. Fancy contact bar

Looking for feedback! Trying to make it drop dead simple to use! Thanks.


Anyone else find most LaTeX CV templates to be ugly and/or hard to use?? Like why can’t they just work and look nice? I’ve truly never *once* been able to understand the style file for these templates!!



Found on the NYC subway this week.


Diffusion LLMs (dLLMs) aim to speed up inference by unmasking multiple tokens at once. Yet most top dLLMs only perform well when unmasking ~1 token at a time. I wonder if the key is to let them remask—unmask multiple tokens, then selectively mask again as needed.
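A sketch of what one remasking-enabled decode step could look like (my own illustration, not a published algorithm; MASK_ID, unmask_k, and remask_thresh are all hypothetical): commit the k most confident masked positions, then send any previously committed token whose confidence has dropped under the new context back to [MASK].

    import jax
    import jax.numpy as jnp

    MASK_ID = 0  # hypothetical [MASK] token id

    def decode_step(tokens, logits, unmask_k, remask_thresh):
        # tokens: (seq_len,) int ids, some equal to MASK_ID.
        # logits: (seq_len, vocab) from one forward pass of the dLLM.
        # Assumes at least unmask_k positions are still masked.
        probs = jax.nn.softmax(logits, axis=-1)
        conf = probs.max(axis=-1)     # per-position model confidence
        pred = probs.argmax(axis=-1)  # greedy prediction per position

        masked = tokens == MASK_ID
        # Unmask: commit the unmask_k most confident masked positions.
        scores = jnp.where(masked, conf, -jnp.inf)
        top = jnp.argsort(scores)[-unmask_k:]
        tokens = tokens.at[top].set(pred[top])

        # Remask: positions committed on earlier steps (prompt tokens
        # excluded in a real system) whose confidence fell below the
        # threshold go back to [MASK] for a later pass.
        stale = (~masked) & (conf < remask_thresh)
        return jnp.where(stale, MASK_ID, tokens)

The threshold gives the model a way to undo early parallel commitments instead of being stuck with them, which is the trade-off the tweet is pointing at.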


Autumn in NYC with views from @cornell_tech!


Gilad reposted

early career scholar and their first publication

