
Adit Jain

@aditjain1980

PhD Candidate @ Cornell ECE. Interested in Machine Learning and Reinforcement Learning.

Pinned

1/ Chain of thought reasoning can be significantly improved using RLVR, but can we improve the generation process for reasoning tokens during training for better exploration, efficiency, and performance? @brendanh0gan and I explore this question in our recent work 🧵 (tldr: yes!)

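Aside: a minimal sketch of the "verifiable reward" idea that RLVR refers to - the policy earns reward only when its final answer checks out against ground truth. The function name, the \boxed{} answer format, and the exact-match rule below are illustrative assumptions, not the setup from the paper.

import re

def verifiable_reward(completion: str, ground_truth: str) -> float:
    """Return 1.0 if the completion's boxed final answer matches the ground truth, else 0.0."""
    match = re.search(r"\\boxed\{([^}]*)\}", completion)
    if match is None:
        return 0.0  # no parseable final answer -> no reward
    return 1.0 if match.group(1).strip() == ground_truth.strip() else 0.0

# Example: a chain of thought that ends in \boxed{42}
print(verifiable_reward(r"... so the result is \boxed{42}", "42"))  # prints 1.0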

Sign up as a reviewer or AE if you can! I have been a reviewer for TMLR for almost 2 years now and it has been a very positive learning experience.

As Transactions on Machine Learning Research (TMLR) grows in number of submissions, we are looking for more reviewers and action editors. Please sign up! Only one paper to review at a time and <= 6 per year; reviewers report greater satisfaction than reviewing for conferences!

Very cool work!

We’re excited to introduce ShinkaEvolve: An open-source framework that evolves programs for scientific discovery with unprecedented sample-efficiency. Blog: sakana.ai/shinka-evolve/ Code: github.com/SakanaAI/Shink… Like AlphaEvolve and its variants, our framework leverages LLMs to…



Adit Jain reposted

My acceptance speech at the Turing award ceremony: Good evening ladies and gentlemen. The main idea of reinforcement learning is that a machine might discover what to do on its own, without being told, from its own experience, by trial and error. As far as I know, the first…


Gemma-3 270M has interesting collapse behavior - it uses words from different Indian languages - Hindi, Tamil, Marathi, Bangla (the task is English-based) - perhaps a multilingual pretraining quirk?


Adit Jain reposted

introducing qqWen: our fully open-sourced project (code+weights+data+detailed technical report) for full-stack finetuning (pretrain+SFT+RL) of a series of models (1.5b, 3b, 7b, 14b & 32b) for a niche financial programming language called Q. All details below!


the em-dashes live on

