Sourav

@srvmshr

ML University of Tokyo. Prev: Microsoft Research RF, @virginia_tech. Personal opinions. Coasting life with @jnchrltte

Science & Technology

Tokyo-to, Japan

於四月 2013 加入

1千貼文 610位跟隨者 974個跟隨中

你可能會喜歡

@tanmay2099

@DhruvBatra_

@neu_rips

@AmlabUva

@avt_im

@andrew_n_carr

@maxjaderberg

@NagraniArsha

@_sam_sinha_

@SingularMattrix

@yubai01

@RickyTQChen

@yash2kant

@misovalko

@BlackHC

Sourav

@srvmshr

年11月5日

I jumped into the @windsurf camp today. Some of the features they integrate are so cool - like deepwiki and codemaps. Only feedback: please integrate other models via Openrouter for e.g... BYOK is great but limited only to Anthropic

Sourav 已轉發

Claude

@claudeai

年10月31日

Claude Code's native installer is now generally available. It's simpler, more stable, and doesn't require Node.js. We recommend this as the default installation method for all Claude Code users going forward.

claudeai's tweet image. Claude Code's native installer is now generally available.

It's simpler, more stable, and doesn't require Node.js. We recommend this as the default installation method for all Claude Code users going forward.

Sourav 已轉發

Shane Gu

@shaneguML

年10月31日

Hot take: DAgger (Ross 2011) should be the first paper you read to get into RL, instead of Sutton's book. Maybe also read scheduled sampling (Bengio 2015). And before RL, study supervised learning thoroughly.

shaneguML's tweet image. Hot take: DAgger (Ross 2011) should be the first paper you read to get into RL, instead of Sutton's book. Maybe also read scheduled sampling (Bengio 2015). And before RL, study supervised learning thoroughly.

Sourav 已轉發

Simons Institute for the Theory of Computing

@SimonsInstitute

年10月30日

Ever wondered about graph learning? Watch Ameya Velingker (@ameya_pa) and Haggai Maron (@HaggaiMaron) give a masterful introduction at the Simons Institute's workshop on Graph Learning Meets Theoretical Computer Science. Video: simons.berkeley.edu/talks/ameya-ve…

SimonsInstitute's tweet image. Ever wondered about graph learning? Watch Ameya Velingker (@ameya_pa) and Haggai Maron (@HaggaiMaron) give a masterful introduction at the Simons Institute's workshop on Graph Learning Meets Theoretical Computer Science. Video: simons.berkeley.edu/talks/ameya-ve…

Sourav

@srvmshr

年10月29日

Good deep dive after you digest 😀 1. sander.ai/2023/07/20/per… 2. arxiv.org/abs/2406.08929 3. lilianweng.github.io/posts/2021-07-…

Chieh-Hsin (Jesse) Lai

@JCJesseLai

年10月29日

Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on! 📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon. It traces the core…

JCJesseLai's tweet image. Tired to go back to the original papers again and again? Our monograph: a systematic and fundamental recipe you can rely on!

📘 We’re excited to release 《The Principles of Diffusion Models》— with @DrYangSong, @gimdong58085414, @mittu1204, and @StefanoErmon.

It traces the core…

Sourav

@srvmshr

年10月28日

Different places, different systems 'Tenure' makes no sense in S Asia, but has a lot of weight in N-Am univ. Similarly, one could be a Lecturer/Asst Professor with just a Masters in Asia, and only seek job progression after a doctorate. Best not to have preconceived biases

Xin Eric Wang @ EMNLP 2025

@xwang_lk

年10月27日

Received a PhD application email from an Assistant Professor at another county (for themselves). I am so confused at multiple levels.

Sourav 已轉發

Jürgen Schmidhuber

@SchmidhuberAI

年10月27日

Our Huxley-Gödel Machine learns to rewrite its own code, estimating its own long-term self-improvement potential. It generalizes on new tasks (SWE-Bench Lite), matching the best officially checked human-engineered agents. Arxiv 2510.21614 With @Wenyi_AI_Wang, @PiotrPiekosAI,…

SchmidhuberAI's tweet image. Our Huxley-Gödel Machine learns to rewrite its own code, estimating its own long-term self-improvement potential. It generalizes on new tasks (SWE-Bench Lite), matching the best officially checked human-engineered agents. Arxiv 2510.21614 With @Wenyi_AI_Wang, @PiotrPiekosAI,…

Sourav

@srvmshr

年10月28日

Guide-coded a high throughput document pipeline using @allen_ai OLMO ocr + LayoutLM3 today. Combining these two proved sticky - especially for runs in parallel. LayoutLM has natural proclivity to use easyocr tesseract type of OCR engine. Too many rough edges to patch over

Sourav

@srvmshr

年10月27日

Slightly non-technical take as recounted by my late landlady Ginny 50s~70s had strong hopes. Post war, there was only one way - UP! People worked more towards common good. Individualism was less. Things got done easier because the path of least resistance also was the fastest.

Patrick Collison

@patrickc

年10月27日

What’s the best thing written about why the remarkably vigorous and inventive France of the 70s and 80s (TGV, Minitel, Ariane, Rafale, Concorde, the world’s preeminent nuclear grid…) has not been nearly as visible in the 21st century? What went wrong?

Sourav

@srvmshr

年10月27日

There is no moat when you have open source players like @MiniMax__AI coming in hot 🔥 Congratulations on the nice release

MiniMax (official)

@MiniMax__AI

年10月27日

We’re open-sourcing MiniMax M2 — Agent & Code Native, at 8% Claude Sonnet price, ~2x faster ⚡ Global FREE for a limited time via MiniMax Agent & API - Advanced Coding Capability: Engineered for end-to-end developer workflows. Strong capability on a wide-range of applications…

MiniMax__AI's tweet image. We’re open-sourcing MiniMax M2 — Agent &amp; Code Native, at 8% Claude Sonnet price, ~2x faster
⚡ Global FREE for a limited time via MiniMax Agent &amp; API
- Advanced Coding Capability: Engineered for end-to-end developer workflows. Strong capability on a wide-range of applications…

Sourav 已轉發

Sebastian Raschka

@rasbt

年10月21日

DeepSeek finally released a new model and paper. And because this DeepSeek-OCR release is a bit different from what everyone expected, and DeepSeek releases are generally a big deal, I wanted to do a brief explainer of what it is all about. In short, they explore how vision…

rasbt's tweet image. DeepSeek finally released a new model and paper. And because this DeepSeek-OCR release is a bit different from what everyone expected, and DeepSeek releases are generally a big deal, I wanted to do a brief explainer of what it is all about.

In short, they explore how vision…

Sourav

@srvmshr

年10月19日

I wonder what does it take for @SonyAlpha to spot amateur photographers outside of Instagram (ughh!). Do they even notice people on other channels? Genuinely curious

srvmshr's tweet image. I wonder what does it take for @SonyAlpha to spot amateur photographers outside of Instagram (ughh!).

Do they even notice people on other channels? Genuinely curious

Sourav

@srvmshr

年10月19日

Weekend hikes be like

Sourav 已轉發

Gabriel Synnaeve

@syhw

年10月9日

This is an excellent history of LLMs, doesn't miss seminal papers I know. Reminds you we're standing on the shoulders of giants, and giants are still being born today. gregorygundersen.com/blog/2025/10/0…

Sourav 已轉發

Mathias Niepert

@Mniepert

年10月11日

Really interesting comparison between recent equilibrium flow matching and equilibrium matching papers arxiv.org/abs/2507.16521

Samir dar

@Samir_Darouich

年10月10日

I just read the Equilibrium Matching (EqM) paper; it’s excellent and insightful work! Interestingly, we recently published a related method called Adaptive Equilibrium Flow Matching (AEFM). Leaving out “adaptive” reveals strong conceptual parallels between the two approaches.

Sourav

@srvmshr

年10月10日

Tried shooting & digitizing on film camera (Fujifilm). The render is so beautiful & something DSLRs can only mimic by film recipes. OG film stock has a different appeal

srvmshr's tweet image. Tried shooting &amp; digitizing on film camera (Fujifilm). The render is so beautiful &amp; something DSLRs can only mimic by film recipes. OG film stock has a different appeal

Sebastian Raschka

@rasbt

Michael Bronstein

@mmbronstein

Talia Ringer 🕊

@TaliaRinger

rohan anil

@_arohan_

$sarahookr's profile picture. Adaptive Intelligence. Built @Cohere_Labs, @GoogleBrain, @GoogleDeepmind. ML Efficiency, Multimodal\lingual. Changing spaces where breakthroughs happen.$

Sara Hooker

@sarahookr

Nathan Benaich

@nathanbenaich

Ulugbek S. Kamilov

@ukmlv

Symmetry and Geometry in Neural Representations

@neur_reps

Jeje

@Jj54761863

Woody Lee

@writerwoody

Aayush Karan

@aakaran31

Zara

@ZaraZetlin

alexmolas

@molasalex

Robert Scoble

@Scobleizer

Christian S. Perone

@tarantulae

Adhit

@5_4dh1t

dwasf

@dwasf79850

Luigui Sánchez

@LuiguiSnchez3

Ali

@AliAlmu02285303

Guaki

@Guaki306970

Eflercut

@Eflercut086

Xiang Yue

@xiangyue96

Marco Fumero

@marco_fumero

云舒的AI实践笔记

@wuhao8480867921

Hashir Omer Farooqi

@hashiromer621

Sandeep Sharma

@Sandeep1066116

Thad Bogan

@ThadBogan8510

🌝

@mathphysicsquit

Wolf Rowell

@wolfrowell

Lin May insulinpumplife.com 🧃💉

@Lin62866960

Sairam Ravu

@ravusairam

Ali S

@AliS1535131

ZKIWU

@zkiwu

Tatiset

@Tatiset23JbS

Nismesrare

@NismesrareavmS

Kaoss D.

@AreX_CorSa

Reausue

@Reausuer2zeACx

Therpe

@Therpei5C

pratyush kumar karna

@PKKARNA11

McTisue

@McTisueV90aBcq

Direct Handle

@DirectHandle12

Breakaway

@chrisbe1968

wen👩🏻‍💻

@ds_wen_

ADAM

@noadm18

Searknea

@SearkneaYsc

Yogesh

@yogesh_s_danu

.

@e____no_

Thomas Miller

@livingsoulz

Dario Salvati

@dw4rez

Kulendu

@cool_endu

Sachin bapat

@bapat_sach29684

Wang Ma

@WangMa70190365

Whawdror

@Whawdrorxtjeai

Emile van Krieken

@EmilevanKrieken

DΞΞP in JΛPΛN ♨️

@DogePunk2077

sourav roy

@souravros

Andrej Karpathy

@karpathy

Sebastian Raschka

@rasbt

Behnam Neyshabur

@bneyshabur

NeurIPS Conference

@NeurIPSConf

Michael Bronstein

@mmbronstein

Tom Goldstein

@tomgoldsteincs

Jason Wei

@_jasonwei

Dileep George

@dileeplearning

Aran Komatsuzaki

@arankomatsuzaki

Soumith Chintala

@soumithchintala

Sasha Rush

@srush_nlp

Talia Ringer 🕊

@TaliaRinger

rohan anil

@_arohan_

Ferenc Huszár

@fhuszar

$sarahookr's profile picture. Adaptive Intelligence. Built @Cohere_Labs, @GoogleBrain, @GoogleDeepmind. ML Efficiency, Multimodal\lingual. Changing spaces where breakthroughs happen.$