
Ishan Gupta

@code_igx

25 🇮🇳, Hustler @RITtigers NY 🇺🇸 | RnD on Quantum AI, Superintelligence & Systems | Ex- @Broadcom @VMware

Ishan Gupta reposted

Recommender systems can improve by modeling users. TagCF uses an LLM to extract tag-based logic graphs that reveal user roles and behavioral logic, then integrates them to boost ranking performance. Online and offline results show user role modeling can outperform item topic…

[jiqizhixin's tweet image]
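
A toy sketch of the idea in code, with heavy caveats: the blending function, tag sets, and weight below are hypothetical illustrations, not TagCF's method, which builds tag-based logic graphs rather than flat tag overlaps.

```python
# Hypothetical helper, not TagCF's API: blend a collaborative-filtering
# score with the overlap between LLM-extracted user-role tags and item tags.
def rank_score(cf_score, user_role_tags, item_tags, weight=0.3):
    overlap = len(set(user_role_tags) & set(item_tags)) / max(len(item_tags), 1)
    return (1 - weight) * cf_score + weight * overlap

# A "foodie" user role nudges food items up the ranking.
print(rank_score(0.72, {"budget traveler", "foodie"}, {"foodie", "cooking"}))
```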

Ishan Gupta reposted

Google just released the SIMA 2 paper on arXiv with Demis Hassabis's name on it. SIMA 2: A Generalist Embodied Agent for Virtual Worlds. Paper: arxiv.org/abs/2512.04797

[jiqizhixin's tweet image]

Ishan Gupta reposted

Multimodal fusion is key to building AI that truly understands the world. But it’s still hard to find the right way to do it, partly because diffusion is dynamic while text is static. @AIatMeta and @AI_KAUST proposed MoS – Mixture of States, which fixes this mismatch by routing…

[TheTuringPost's tweet image]

Ishan Gupta reposted

Qwen just won Best Paper Award at NeurIPS. And it wasn’t for a flashy new architecture. It was for fixing a problem Transformers had for years. Here’s what you need to know:

[LiorOnAI's tweet image]

Ishan Gupta reposted

NeurIPS 2025 Best Paper Award: Attention lets language models decide which tokens matter at each position, but it has limitations—for example, a tendency to over-focus on early tokens regardless of their relevance. Gating mechanisms, which selectively suppress or amplify…

[burkov's tweet image]
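
A minimal sketch of output gating on attention in this spirit; the single-head layout, gate placement, and shapes are simplifications rather than the paper's exact design.

```python
import torch
import torch.nn.functional as F

def gated_attention(x, q, k, v, w_gate):
    # standard scaled dot-product attention
    scores = (q @ k.transpose(-2, -1)) / q.shape[-1] ** 0.5
    out = F.softmax(scores, dim=-1) @ v
    # query-dependent sigmoid gate on the attention output: values near 0
    # let a position suppress its output instead of dumping probability
    # mass on early "sink" tokens
    gate = torch.sigmoid(x @ w_gate)
    return gate * out

x = torch.randn(8, 64)                       # 8 tokens, model dim 64
q, k, v = (torch.randn(8, 64) for _ in range(3))
print(gated_attention(x, q, k, v, torch.randn(64, 64)).shape)  # torch.Size([8, 64])
```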

Ishan Gupta reposted

This interesting week started with DeepSeek V3.2!

I just wrote up a technical tour of the predecessors and components that led up to this:

🔗 magazine.sebastianraschka.com/p/technical-de…

- Multi-Head Latent Attention
- RLVR
- Sparse Attention
- Self-Verification
- GRPO Updates (see the GRPO sketch below)

[rasbt's tweet image]
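
A minimal sketch of the group-relative advantage at the core of GRPO, the last item above: each sampled response is scored against its own group's mean and std, which is what replaces the learned value model of PPO-style training. Tensor shapes are illustrative.

```python
import torch

def grpo_advantages(rewards):
    # rewards: (num_prompts, group_size), one scalar reward per sampled
    # response; each response's advantage is relative to its own group
    mean = rewards.mean(dim=-1, keepdim=True)
    std = rewards.std(dim=-1, keepdim=True)
    return (rewards - mean) / (std + 1e-6)

print(grpo_advantages(torch.tensor([[1.0, 0.0, 0.0, 1.0]])))
```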

Ishan Gupta reposted

Today at #NeurIPS2025, we present Titans, a new architecture that combines the speed of RNNs with the performance of Transformers. It uses deep neural memory to learn in real-time, effectively scaling to contexts larger than 2 million tokens. More at: goo.gle/3Kd5ojF

[GoogleResearch's tweet image]
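
A toy sketch of a test-time neural memory in this spirit, assuming only the high-level recipe (a small MLP trained online on key-value reconstruction; SGD momentum and weight decay stand in for the paper's surprise momentum and forgetting). Sizes and hyperparameters are illustrative.

```python
import torch

# The memory is a small MLP trained online to map keys to values; the
# gradient of its reconstruction loss acts as a "surprise" signal, so
# novel inputs change the memory the most.
memory = torch.nn.Sequential(
    torch.nn.Linear(64, 128), torch.nn.SiLU(), torch.nn.Linear(128, 64)
)
opt = torch.optim.SGD(memory.parameters(), lr=0.1, momentum=0.9, weight_decay=0.01)

def memorize(k, v):
    loss = (memory(k) - v).pow(2).mean()  # reconstruction error = surprise
    opt.zero_grad()
    loss.backward()
    opt.step()

def recall(q):
    with torch.no_grad():
        return memory(q)

memorize(torch.randn(1, 64), torch.randn(1, 64))
print(recall(torch.randn(1, 64)).shape)   # torch.Size([1, 64])
```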

Ishan Gupta reposted

Twitter is cool. But it's 10x better when you connect with people who like building and scaling GenAI systems. If you're into LLMs, GenAI, distributed systems, or backend, say hi.


Ishan Gupta reposted

Yup

Humanity has so thoroughly banished hunger that, as of this year, there are more obese kids than there are underweight kids.

[cremieuxrecueil's tweet image]


Ishan Gupta reposted

Beautiful Tencent paper. Shows a language model that keeps improving itself using only 1% to 5% human-labeled questions while reaching the level of systems trained on about 20 times more data. Earlier self-play systems let a model write and solve its own questions, but over…

[rohanpaul_ai's tweet image]

Ishan Gupta reposted

I have been fine-tuning LLMs for over 2 years now! Here are the top 5 LLM fine-tuning techniques, explained with visuals: First of all, what's so different about LLM fine-tuning? Traditional fine-tuning is impractical for LLMs (billions of params; hundreds of GB). Since this kind of…

[_avichawla's tweet image]
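
As one concrete illustration of a parameter-efficient alternative, here is a minimal LoRA-style adapter; whether LoRA is among the thread's five is not visible in the truncated text, so treat this as a generic sketch.

```python
import torch

class LoRALinear(torch.nn.Module):
    """Frozen base weight plus a trainable low-rank update: W x + (alpha/r) B A x."""
    def __init__(self, base: torch.nn.Linear, r: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False            # the big matrix never trains
        self.A = torch.nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = torch.nn.Parameter(torch.zeros(base.out_features, r))
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + (x @ self.A.T @ self.B.T) * self.scale

layer = LoRALinear(torch.nn.Linear(512, 512))
# only the two small low-rank factors are trainable
print(sum(p.numel() for p in layer.parameters() if p.requires_grad))  # 8192
```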

Ishan Gupta reposted

The paper behind DeepSeek-V3.2. Its high-compute Speciale version reaches gold-medal level on top math and coding contests and competes with leading closed models. Standard attention makes the model compare every token with every other token, so compute explodes as inputs get…

[rohanpaul_ai's tweet image]
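
A toy sketch of the top-k idea behind sparse attention: each query keeps only its highest-scoring keys. Note this illustrative version still computes the full score matrix; DeepSeek's actual design avoids that with a cheap indexer, which is not reproduced here.

```python
import torch
import torch.nn.functional as F

def topk_sparse_attention(q, k, v, topk=64):
    # q, k, v: (seq, d). Each query attends only to its top-k keys instead
    # of all seq keys, which is what cuts the quadratic cost in practice.
    scores = (q @ k.T) / q.shape[-1] ** 0.5
    idx = scores.topk(min(topk, k.shape[0]), dim=-1).indices
    mask = torch.full_like(scores, float("-inf"))
    mask.scatter_(-1, idx, 0.0)                 # keep only the selected keys
    return F.softmax(scores + mask, dim=-1) @ v

q, k, v = (torch.randn(128, 32) for _ in range(3))
print(topk_sparse_attention(q, k, v, topk=16).shape)  # torch.Size([128, 32])
```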

Ishan Gupta reposted

Here’s to delaying gratification. The future belongs to the patient. @elonmusk


Ishan Gupta reposted

Interview with Nikhil

Out now @elonmusk



Ishan Gupta reposted

Can’t believe how human-like Tesla’s Optimus moves.


Ishan Gupta reposted

Running robot

Just set a new PR in the lab



Ishan Gupta reposted

.@elonmusk "One way to frame civilizational progress is the percentage completion on the Kardashev scale. Kardashev I is what percentage of a planet's energy are you successfully turning into useful work. Kardashev II would be, what percentage of the sun's energy are you…


Ishan Gupta reposted

Congrats @SpaceX team and thank you @USSpaceForce!

We’ve received approval to develop Space Launch Complex-37 for Starship operations at Cape Canaveral Space Force Station. Construction has started. With three launch pads in Florida, Starship will be ready to support America’s national security and Artemis goals as the world’s…



Ishan Gupta reposted

Test-time scaling of diffusions with flow maps

This paper is pretty cool, providing a better way to guide image generation with a reward function. The standard approach evaluates the reward function on intermediate steps to get a reward gradient to modify sampling. However the…

[iScienceLuvr's tweet image]
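
A minimal sketch of that standard reward-gradient baseline (not the paper's flow-map method); the denoiser and reward here are toy stand-ins.

```python
import torch

def reward_guided_step(x_t, denoise_step, reward, scale=0.5):
    # Differentiate a reward evaluated at the intermediate sample and nudge
    # the next sample along that gradient.
    x_t = x_t.detach().requires_grad_(True)
    grad = torch.autograd.grad(reward(x_t).sum(), x_t)[0]
    return denoise_step(x_t.detach()) + scale * grad

x = torch.randn(1, 3, 8, 8)
out = reward_guided_step(x, lambda z: 0.9 * z, lambda z: -(z ** 2).mean())
print(out.shape)  # torch.Size([1, 3, 8, 8])
```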

Ishan Gupta reposted

This Google paper from last year went almost unnoticed by the public, but it's really an alternative architecture to the transformer that proves more parameter-efficient and effective on similar tasks. As you might know, Transformers scale quadratically with sequence length…

[burkov's tweet image]
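
A quick back-of-the-envelope on the quadratic claim:

```python
# The attention score matrix alone is n x n per head per layer, so 10x
# more tokens means 100x more pairwise work.
for n in [1_000, 10_000, 100_000]:
    print(f"{n:>7} tokens -> {n * n:.1e} pairwise token comparisons")
```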
