Hussein Hazimeh
@hazimeh_h
Research scientist @OpenAI. @MIT PhD
Today OpenAI announced o3, its next-gen reasoning model. We've worked with OpenAI to test it on ARC-AGI, and we believe it represents a significant breakthrough in getting AI to adapt to novel tasks. It scores 75.7% on the semi-private eval in low-compute mode (for $20 per task…
o3 and o3-mini are my favorite models ever. o3 essentially solves AIME (>90%), GPQA (~90%), and ARC-AGI (~90%), and it gets a quarter of FrontierMath. To understand how insane 25% on FrontierMath is, see this quote by Tim Gowers. The sparks are intensifying ...
Scaling Laws for Downstream Task Performance of Large Language Models: studies how the choice of pretraining data and its size affect downstream cross-entropy and BLEU score. arxiv.org/abs/2402.04177
Very excited to share the paper from my last @GoogleAI internship: Scaling Laws for Downstream Task Performance of LLMs. arxiv.org/pdf/2402.04177… w/ Natalia Ponomareva, @hazimeh_h, Dimitris Paparas, Sergei Vassilvitskii, and @sanmikoyejo 1/6
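Not code from the paper, just a minimal sketch of the kind of fit such scaling-law studies perform: a power law with an irreducible floor for downstream cross-entropy as a function of pretraining data size. The data points and the exact parameterization below are made up for illustration.

```python
import numpy as np
from scipy.optimize import curve_fit

# Power law with an irreducible-loss floor, a standard scaling-law form.
def power_law(D, A, alpha, E):
    return A * D ** (-alpha) + E

D = np.array([1.0, 3.0, 10.0, 30.0, 100.0])    # pretraining size, in 1e8 tokens
ce = np.array([3.80, 3.16, 2.69, 2.41, 2.20])  # synthetic downstream cross-entropy
(A, alpha, E), _ = curve_fit(power_law, D, ce, p0=(2.0, 0.3, 1.5))
print(f"fit: CE(D) = {A:.2f} * D^(-{alpha:.2f}) + {E:.2f}")
```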
'LibMTL: A Python Library for Deep Multi-Task Learning', by Baijiong Lin, Yu Zhang. jmlr.org/papers/v24/22-… #mtl #python #libmtl
'L0Learn: A Scalable Package for Sparse Learning using L0 Regularization', by Hussein Hazimeh, Rahul Mazumder, Tim Nonet. jmlr.org/papers/v24/22-… #sparse #regularization #l0learn
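L0Learn's actual solver combines coordinate descent with local combinatorial search; as a hedged illustration of what L0 regularization buys, here is the textbook iterative-hard-thresholding baseline for the same L0-constrained regression problem (all names and data are mine).

```python
import numpy as np

def iht_l0(X, y, k, iters=200):
    """Iterative hard thresholding for min ||y - X b||^2 s.t. ||b||_0 <= k.
    (Not L0Learn's algorithm, just the simplest solver for the same problem.)"""
    n, p = X.shape
    beta = np.zeros(p)
    step = 1.0 / (np.linalg.norm(X, 2) ** 2 / n)   # 1 / Lipschitz constant
    for _ in range(iters):
        beta = beta + step * X.T @ (y - X @ beta) / n   # gradient step
        beta[np.argsort(np.abs(beta))[:p - k]] = 0.0    # keep k largest
    return beta

rng = np.random.default_rng(0)
X = rng.standard_normal((300, 100))
beta_true = np.zeros(100); beta_true[:5] = 2.0
y = X @ beta_true + 0.3 * rng.standard_normal(300)
print(np.nonzero(iht_l0(X, y, k=5))[0])   # typically recovers the true support
```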
A new preprint on ALL-SUM: an ensemble method built on L0Learn for faster, more accurate polygenic risk scores (PRSs) using GWAS summary stats. It ensembles L0+L2-regularized PRSs across tuning grids. Applied to many UKB phenotypes. Congrats to @Tony_Chen6, @AndrewHaoyu, Rahul Mazumder
Very excited to share our preprint on a new method for constructing polygenic risk scores from GWAS summary statistics. Big thanks to my amazing advisors @AndrewHaoyu, Rahul Mazumder (MIT), and @XihongLin for their support on this project. biorxiv.org/content/10.110…
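The ensembling idea described above, at toy scale: fit an L0+L2-style score at each point of a small tuning grid and average the resulting predictions, rather than picking a single winner. A sketch on synthetic individual-level data; the actual ALL-SUM method works from GWAS summary statistics, and all names here are mine.

```python
import numpy as np

def fit_l0l2(X, y, k, lam2, iters=200):
    """Toy L0+L2 fit via projected gradient: ridge objective, then keep
    only the k largest-magnitude coefficients after each step."""
    n, p = X.shape
    beta = np.zeros(p)
    step = 1.0 / (np.linalg.norm(X, 2) ** 2 / n + lam2)
    for _ in range(iters):
        grad = -X.T @ (y - X @ beta) / n + lam2 * beta
        beta = beta - step * grad
        beta[np.argsort(np.abs(beta))[:p - k]] = 0.0
    return beta

rng = np.random.default_rng(1)
X = rng.standard_normal((400, 80))
beta_true = np.zeros(80); beta_true[:6] = 1.0
y = X @ beta_true + 0.5 * rng.standard_normal(400)

# Average the scores produced across the (sparsity, ridge) tuning grid.
grid = [(k, lam2) for k in (6, 12, 24) for lam2 in (0.01, 0.1, 1.0)]
score = np.mean([X @ fit_l0l2(X, y, k, lam2) for k, lam2 in grid], axis=0)
```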
Check out our work on Flexible Modeling and Multitask Learning using Differentiable Tree Ensembles at KDD'22. Paper: dl.acm.org/doi/10.1145/35… Our paper won the Best Student Paper Award at KDD'22 (Research Track).
Great article covering six papers on Mixture of Experts, of which one is ours 🙂 (DSelect-K with @hazimeh_h, @achowdhery, and others): arxiv.org/abs/2106.03760
my article about MoE routing layers is out! I took it down to 6 routing papers:
I've got a big ol' post coming out in artificial fintelligence where I read 7 routing papers for MoE models and summarize them. hopefully coming out later this week (maybe weekend).
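For context on what these routing papers study, here is a minimal numpy sketch of the classic top-k softmax gate (Shazeer et al., 2017), the hard, non-differentiable selector that methods like DSelect-K aim to replace with a smooth one. Everything below is my own illustration, not any paper's code.

```python
import numpy as np

def topk_softmax_gate(x, W_g, k=2):
    """Classic sparse MoE routing: send each token to the k experts with
    the highest gate logits, with softmax weights over just those k."""
    logits = x @ W_g                                   # (tokens, n_experts)
    topk = np.argsort(logits, axis=-1)[:, -k:]         # indices of k best
    masked = np.full_like(logits, -np.inf)
    np.put_along_axis(masked, topk,
                      np.take_along_axis(logits, topk, axis=-1), axis=-1)
    exp = np.exp(masked - masked.max(axis=-1, keepdims=True))
    return exp / exp.sum(axis=-1, keepdims=True)       # sparse gate weights

rng = np.random.default_rng(0)
gates = topk_softmax_gate(rng.standard_normal((4, 16)),   # 4 tokens
                          rng.standard_normal((16, 8)))   # 8 experts
print((gates > 0).sum(axis=-1))   # -> [2 2 2 2]: two experts per token
```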
Really excited to share the beta release of #SynthID, a watermarking tool created by @GoogleDeepMind and made available via @GoogleCloud to help tag and identify AI-generated images
(1/2) I used ML to make Chess more fun in Single-Player. If you: - are an ML practitioner / work in AI, - like chess, math or puzzle games, - enjoy stories abt code, hacks, and setbacks, then this tech story is for you. A deep dive into the ML side of puzzle design.
Check out our work on pruning neural nets! The method leverages several ideas from combinatorial optimization and high-dimensional statistics to enable faster and more accurate pruning Blog: goo.gle/3E12kAc ICML paper: proceedings.mlr.press/v202/benbaki23…
Introducing CHITA, an optimization-based approach for pruning pre-trained neural networks at scale. Learn how it leverages advances from several fields and outperforms state-of-the-art pruning methods in terms of scalability and performance tradeoffs → goo.gle/3E12kAc
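CHITA itself poses pruning as a combinatorial optimization over which weights to keep; as a point of contrast, here is the one-shot magnitude-pruning baseline such methods are measured against. A sketch, not the paper's algorithm.

```python
import numpy as np

def magnitude_prune(W, sparsity=0.9):
    """One-shot magnitude pruning: zero out the smallest-magnitude weights.
    Optimization-based methods like CHITA instead choose the support
    jointly, using a local quadratic model of the loss."""
    threshold = np.quantile(np.abs(W), sparsity)
    return np.where(np.abs(W) > threshold, W, 0.0)

W = np.random.default_rng(0).standard_normal((256, 256))
W_pruned = magnitude_prune(W, sparsity=0.9)
print(f"nonzero fraction: {np.mean(W_pruned != 0):.2f}")   # ~0.10
```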
New Article: "How to DP-fy ML: A Practical Guide to Machine Learning with Differential Privacy" by Ponomareva, Hazimeh, Kurakin, Xu, Denison, McMahan, Vassilvitskii, Chien, and Thakurta jair.org/index.php/jair…
Today, we discuss the current state of differentially private ML (DP-ML) research with an overview of common techniques for obtaining DP-ML models, engineering challenges, mitigation techniques and current open questions. Learn more ↓ goo.gle/3Iril5k
Excited to announce that our image immunization paper is accepted as an *Oral* at #ICML2023! 🔥 Come chat with us about it in Hawaii!
Last week on @TheDailyShow, @Trevornoah asked @OpenAI @miramurati a (v. important) Q: how can we safeguard against AI-powered photo editing for misinformation? youtu.be/Ba_C-C6UwlI?t=… My @MIT students hacked a way to "immunize" photos against edits: gradientscience.org/photoguard/ (1/8)
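The "immunization" idea, at toy scale: take projected gradient steps that push an image's latent representation toward a useless target while keeping the pixel-space change imperceptibly small. The real attack differentiates through a diffusion model's image encoder; the linear "encoder" below is a stand-in, and all names are mine.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((32, 256)) / 16.0   # stand-in linear "encoder"
encode = lambda x: A @ x

def immunize(x, target_latent, eps=0.03, step=0.005, iters=200):
    """PGD-style immunization: perturb x within an L-inf ball of radius eps
    so encode(x + delta) moves toward target_latent (e.g. a blank image)."""
    delta = np.zeros_like(x)
    for _ in range(iters):
        residual = encode(x + delta) - target_latent
        grad = A.T @ residual                  # grad of 0.5 * ||residual||^2
        delta = np.clip(delta - step * np.sign(grad), -eps, eps)
    return np.clip(x + delta, 0.0, 1.0)        # stay a valid image

x = rng.uniform(size=256)                      # flattened toy "image"
x_immune = immunize(x, target_latent=np.zeros(32))
print("latent norm before/after:",
      np.linalg.norm(encode(x)), np.linalg.norm(encode(x_immune)))
```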
How easy is it for adversaries to hide image content from classifiers through obfuscations? Our new benchmark allows you to evaluate this! Dataset and evaluation code: github.com/deepmind/image… Paper: arxiv.org/abs/2301.12993. Joint work between @DeepMind, @Google & @GoogleAI 🧵
Training ML models with differential privacy can be challenging. To aid practitioners, we wrote a detailed survey with known best practices for DP-training of ML models: arxiv.org/abs/2303.00654
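The core recipe the survey covers, DP-SGD (Abadi et al., 2016), fits in a few lines: clip each per-example gradient, add calibrated Gaussian noise, then average. A hedged numpy sketch for logistic regression; a real deployment would use a vetted library and a privacy accountant to track the (epsilon, delta) budget.

```python
import numpy as np

def dp_sgd_step(w, X, y, lr=0.1, clip=1.0, noise_mult=1.0, rng=None):
    """One DP-SGD step: per-example clipping + Gaussian noise. The scale
    noise_mult * clip is what an accountant converts into a DP guarantee."""
    rng = rng or np.random.default_rng()
    grads = []
    for xi, yi in zip(X, y):
        p = 1.0 / (1.0 + np.exp(-xi @ w))                     # prediction
        g = (p - yi) * xi                                     # per-example grad
        grads.append(g / max(1.0, np.linalg.norm(g) / clip))  # clip L2 norm
    noise = noise_mult * clip * rng.standard_normal(w.shape)
    return w - lr * (np.sum(grads, axis=0) + noise) / len(X)

rng = np.random.default_rng(0)
X = rng.standard_normal((64, 10)); y = (X[:, 0] > 0).astype(float)
w = np.zeros(10)
for _ in range(50):
    w = dp_sgd_step(w, X, y, rng=rng)
```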
Mark your calendars! The 2022 MIP Workshop feat. DANniversary, will be held May 23-26, 2022, at DIMACS, Rutgers University. This year MIP will also host a computational competition, to be launched on November 16th. More info at mixedinteger.org/2022/