Ishan Gupta

@code_igx

25 🇮🇳, Hustler @RITtigers NY 🇺🇸 | RnD on Quantum AI, Superintelligence & Systems | Ex- @Broadcom @VMware

Software developer/Programmer/Software engineer

New York, USA

linkedin.com/in/ishangupta-…

เข้าร่วมเมื่อ พฤษภาคม 2017

9พันโพสต์ 1พันผู้ติดตาม 3พันกําลังติดตาม

คุณอาจชื่นชอบ

@prithvi137

@rhatdan

@OCI_ORG

@muratdemirbas

@VedantShrotria

@petecheslock

@BretFisher

@LitmusChaos

@Uma_Mukkara

@syke

@rawkode

@kathyzant

@felixrieseberg

@matvelloso

Ishan Gupta รีโพสต์แล้ว

Lior Alexander

@LiorOnAI

28 พ.ย.

Qwen just won Best Paper Award at NeurIPS. And it wasn’t for a flashy new architecture. It was for fixing a problem Transformers had for years. Here’s what you need to know:

LiorOnAI's tweet image. Qwen just won Best Paper Award at NeurIPS.

And it wasn’t for a flashy new architecture.

It was for fixing a problem Transformers had for years.

Here’s what you need to know:

Ishan Gupta รีโพสต์แล้ว

NeurIPS 2025 Best Paper Award: Attention lets language models decide which tokens matter at each position, but it has limitations—for example, a tendency to over-focus on early tokens regardless of their relevance. Gating mechanisms, which selectively suppress or amplify…

burkov's tweet image. NeurIPS 2025 Best Paper Award:

Attention lets language models decide which tokens matter at each position, but it has limitations—for example, a tendency to over-focus on early tokens regardless of their relevance.

Gating mechanisms, which selectively suppress or amplify…

Ishan Gupta รีโพสต์แล้ว

Sebastian Raschka

@rasbt

3 ธ.ค.

This interesting week started with DeepSeek V3.2! I just wrote up a technical tour of the predecessors and components that led up to this: 🔗 magazine.sebastianraschka.com/p/technical-de… - Multi-Head Latent Attention - RLVR - Sparse Attention - Self-Verification - GRPO Updates

rasbt's tweet image. This interesting week started with DeepSeek V3.2!

I just wrote up a technical tour of the predecessors and components that led up to this:

🔗 magazine.sebastianraschka.com/p/technical-de…

- Multi-Head Latent Attention
- RLVR
- Sparse Attention
- Self-Verification
- GRPO Updates

Ishan Gupta รีโพสต์แล้ว

Google Research

@GoogleResearch

4 ธ.ค.

Today at #NeurIPS2025, we present Titans, a new architecture that combines the speed of RNNs with the performance of Transformers. It uses deep neural memory to learn in real-time, effectively scaling to contexts larger than 2 million tokens. More at: goo.gle/3Kd5ojF

GoogleResearch's tweet image. Today at #NeurIPS2025, we present Titans, a new architecture that combines the speed of RNNs with the performance of Transformers. It uses deep neural memory to learn in real-time, effectively scaling to contexts larger than 2 million tokens. More at: goo.gle/3Kd5ojF

Ishan Gupta รีโพสต์แล้ว

Ashutosh Maheshwari

@asmah2107

4 ธ.ค.

Twitter is cool. But it’s 10x better when you connect with people who like building and scaling GenAI systems. If you’re into LLMs, GenAI, Distributed Systems or backend. say hi.

Ishan Gupta รีโพสต์แล้ว

Elon Musk

@elonmusk

4 ธ.ค.

Yup

Crémieux

@cremieuxrecueil

4 ธ.ค.

Humanity has so thoroughly banished hunger that, as of this year, there are more obese kids than there are underweight kids.

cremieuxrecueil's tweet image. Humanity has so thoroughly banished hunger that, as of this year, there are more obese kids than there are underweight kids.

Ishan Gupta รีโพสต์แล้ว

Rohan Paul

@rohanpaul_ai

4 ธ.ค.

Beautiful Tencent paper. Shows a language model that keeps improving itself using only 1% to 5% human labeled questions while reaching the level of systems trained on about 20 times more data. Earlier self play systems let a model write and solve its own questions, but over…

rohanpaul_ai's tweet image. Beautiful Tencent paper.

Shows a language model that keeps improving itself using only 1% to 5% human labeled questions while reaching the level of systems trained on about 20 times more data.

Earlier self play systems let a model write and solve its own questions, but over…

Ishan Gupta รีโพสต์แล้ว

Avi Chawla

@_avichawla

4 ธ.ค.

I have been fine-tuning LLMs for over 2 years now! Here are the top 5 LLM fine-tuning techniques, explained with visuals: First of all, what's so different about LLM finetuning? Traditional fine‑tuning is impractical for LLMs (billions of params; 100s GB). Since this kind of…

_avichawla's tweet image. I have been fine-tuning LLMs for over 2 years now!

Here are the top 5 LLM fine-tuning techniques, explained with visuals:

First of all, what's so different about LLM finetuning?

Traditional fine‑tuning is impractical for LLMs (billions of params; 100s GB).

Since this kind of…

Ishan Gupta รีโพสต์แล้ว

Rohan Paul

@rohanpaul_ai

4 ธ.ค.

The paper behind DeepSeek-V3.2 Its high-compute Speciale version reaches gold medal level on top math and coding contests and competes with leading closed models. Standard attention makes the model compare every token with every other token, so compute explodes as inputs get…

rohanpaul_ai's tweet image. The paper behind DeepSeek-V3.2

Its high-compute Speciale version reaches gold medal level on top math and coding contests and competes with leading closed models.

Standard attention makes the model compare every token with every other token, so compute explodes as inputs get…

Ishan Gupta รีโพสต์แล้ว

Nikhil Kamath

@nikhilkamathcio

30 พ.ย.

Here’s to delaying gratification. The future belongs to the patient. @elonmusk

Ishan Gupta รีโพสต์แล้ว

Elon Musk

@elonmusk

30 พ.ย.

Interview with Nikhil

Nikhil Kamath

@nikhilkamathcio

30 พ.ย.

Out now @elonmusk

Ishan Gupta รีโพสต์แล้ว

Srishti 🫧

@18islove_

2 ธ.ค.

Can’t believe how human-like Tesla’s Optimus moves.

จาก Dima Zeniuk

Ishan Gupta รีโพสต์แล้ว

Elon Musk

@elonmusk

2 ธ.ค.

Running robot

Tesla Optimus

@Tesla_Optimus

2 ธ.ค.

Just set a new PR in the lab

Ishan Gupta รีโพสต์แล้ว

Tesla Owners Silicon Valley

@teslaownersSV

30 พ.ย.

.@elonmusk "One way to frame civilizational progress is the percentage completion on the Kardashev scale. Kardashev I is what percentage of a planet's energy are you successfully turning into useful work. Concept II would be, what percentage of the sun's energy are you…

Ishan Gupta รีโพสต์แล้ว

Elon Musk

@elonmusk

2 ธ.ค.

Congrats @SpaceX team and thank you @USSpaceForce!

SpaceX

@SpaceX

1 ธ.ค.

We’ve received approval to develop Space Launch Complex-37 for Starship operations at Cape Canaveral Space Force Station. Construction has started. With three launch pads in Florida, Starship will be ready to support America’s national security and Artemis goals as the world’s…

Ishan Gupta รีโพสต์แล้ว

Tanishq Abraham @ NeurIPS

@iScienceLuvr

2 ธ.ค.

Test-time scaling of diffusions with flow maps This paper is pretty cool, providing a better to guide image generation with a reward function. The standard approach evaluates the reward function on intermediate steps to get a reward gradient to modify sampling. However the…

iScienceLuvr's tweet image. Test-time scaling of diffusions with flow maps

This paper is pretty cool, providing a better to guide image generation with a reward function. The standard approach evaluates the reward function on intermediate steps to get a reward gradient to modify sampling. However the…

Ishan Gupta รีโพสต์แล้ว

BURKOV

@burkov

2 ธ.ค.

This Google's paper from last year came almost unnoticed by the public, but it's really an alternative architecture to the transformer that proves more parameter-efficient and effective on similar tasks. As you might know, Transformers scale quadratically with sequence length.…

burkov's tweet image. This Google's paper from last year came almost unnoticed by the public, but it's really an alternative architecture to the transformer that proves more parameter-efficient and effective on similar tasks.

As you might know, Transformers scale quadratically with sequence length.…

Ishan Gupta รีโพสต์แล้ว

机器之心 JIQIZHIXIN

@jiqizhixin

1 ธ.ค.

Why has neural B-frame video compression lagged behind P-frame methods despite its potential for better performance with bi-directional references? This study addresses this gap by tackling unbalanced reference frame contributions in hierarchical coding—their BRHVC method uses…

jiqizhixin's tweet image. Why has neural B-frame video compression lagged behind P-frame methods despite its potential for better performance with bi-directional references?

This study addresses this gap by tackling unbalanced reference frame contributions in hierarchical coding—their BRHVC method uses…

Ishan Gupta รีโพสต์แล้ว

DAIR.AI

@dair_ai

1 ธ.ค.

Agentic AI Overview This report provides a comprehensive overview of architectures, applications, and future directions. Great read for AI devs and enthusiasts. It introduces a new dual-paradigm framework that categorizes agentic systems into two distinct lineages: the…

dair_ai's tweet image. Agentic AI Overview

This report provides a comprehensive overview of architectures, applications, and future directions.

Great read for AI devs and enthusiasts.

It introduces a new dual-paradigm framework that categorizes
agentic systems into two distinct lineages: the…

Ishan Gupta รีโพสต์แล้ว

DailyPapers

@HuggingPapers

30 พ.ย.

OPPO AI Agent Team introduces O-Mem, an Omni Memory System for LLM Agents It brings personalized, long-horizon, self-evolving capabilities to conversational AI by actively profiling users and supporting hierarchical retrieval for adaptive responses.

HuggingPapers's tweet image. OPPO AI Agent Team introduces O-Mem, an Omni Memory System for LLM Agents

It brings personalized, long-horizon, self-evolving capabilities to conversational AI by actively profiling users and supporting hierarchical retrieval for adaptive responses.

developer-guy

@developerguyba

Michael Cade

@MichaelCade1

harness.io

@harnessio

unnati

@Unnati_twts

Uma Mukkara

@Uma_Mukkara

Marie Antons 💻

@MarieAntons

Prithvi Raj

@prithvi137

Vedant Kakde

@vedantstwt

Kunal Verma

@kverma_

Trevor Campbell

@TrevCampbell

Ashna Arora

@ashnaaroraAE

ho9xev3kt

@ho9xev3kt4286

Ingrid @ NeurIPS

@ingowoo

Daugi

@Daugi5284668

Bohuslav

@bohuslav20492

NCR vala👍

@NCR_vala

Eliot

@Eliot1985059

Ielievu

@Ielievu917

Gourav Sharma

@sharmagourav

Milan Stankovic, PhD

@milstan

Kalho

@Kalho03591

Ophelia

@guidaboni183206

SaraAdelaide

@8D58hfXtu8DhDMm

CynthiaHarry

@797SXG9csiNmU4T

Joel

@Joel87956976597

CeciliaBessie

@WiEf0kB0Plqe6

wen👩🏻‍💻

@ds_wen_

Nimaw

@Nimaw0404

Kesha

@_rskai

NellyMary

@YtEP4DTJTA2ulh

Tracy

@Tracy5et9

Aayush Karan

@aakaran31

Sebfox

@Sebfox1

Jane Viviana

@janeviviana1

Chandu Mirji

@ChanduMirj007

Otorhu

@Otorhu3426659

CoraLattimore

@e5DdOH333kIKfU

Buzzytalk Devhub : For Developers. By Developers

@BuzzytalkDevHub

fafa.👩🏻‍💻

@ds_fafa_

MartinaEvelina

@mGZ2a9sh4cgRITM

PrincessEvelynClark

@FPC217brAweEec

jason

@jasonth0

PhoebeStephen

@Jq5RxoFAyR80xm

Dylan S Phillips

@dylsphillips

JUDITH AURELIA

@JUDITHAURELIA_

VeronicaSmith

@322Zyywp01AEt9

Ytlalqeat

@Ytlalqeat24053

HeddaHarry

@e2e0sRmB80v5hz

Wirawmo

@Wirawmo847437

bun.bun.🐽

@ds_bun_

Gimall

@Gimall223

Meadow

@n711zRBEyoZf9t

Crorma

@Crorma511504

Iwuiawdek

@Iwuiawdek82689

NeuralNetAlpha🇺🇸

@Sloqer37296

Poofxot

@Poofxot2775776

🧯🧯

@Jorcauc47039

Philip Ginah

@philipginah

Swaran bharat

@BharatSwaran

Kunal Kushwaha

@kunalstwt

Saiyam Pathak

@SaiyamPathak

Khushboo Verma

@khushbooverma

developer-guy

@developerguyba

ahmetb

@ahmetb

Aditya Oberai

@adityaoberai

Michael Cade

@MichaelCade1

Tanya Rajhans

@tanyarajhans7

harness.io

@harnessio

Bartłomiej Płotka

@bwplotka

Priyansh Agarwal

@Priyansh_31Dec

Rishika Gupta

@rishikagupta__

Bilgin Ibryam

@bibryam

unnati

@Unnati_twts

CNCF

@CloudNativeFdn

Saumya Singh

@saumya1singh

Archie Sengupta

@archiexzzz

Uma Mukkara

@Uma_Mukkara

Kunal Shah

@kunalb11

Bhavya Sachdeva

@bhavya_58

Liwei Jiang @ NeurIPS 2025

@liweijianglw

Columbia Business School

@Columbia_Biz

Imade.

@ImadeIyamu

Pierce Alexander Lilholt

@PierceLilholt

david fant

@da_fant

Quanta Magazine

@QuantaMagazine

Ashna Arora

@ashnaaroraAE

Ingrid @ NeurIPS

@ingowoo

Nikhil Kamath

@nikhilkamathcio

United States Space Force

@USSpaceForce

Aleksander Holynski

@holynski_

Mallikarjuna @ NeurIPS ✈️

@_neuralnerd_

Ruiqi Gao

@RuiqiGao

Andrew Beam

@AndrewLBeam

Adi Chinchure

@adityachinchure

Ahmad Beirami ✈️ NeurIPS

@abeirami

Gauri Gupta @ NeurIPS'25

@gauri__gupta

NeoSigma

@NeoSigmaAI

Ritvik Kapila @ NeurIPS

@RitvikKapila

Kartik Aggarwal

@Kartik__A

Pedro Pachuca

@PedroPachuca4

Adit Shah @ NeurIPS 2025

@aditshah00

Cua @ NeurIPS 2025

@trycua

Francesco

@francedot

$sarahookr's profile picture. Adaptive Intelligence. Built @Cohere_Labs, @GoogleBrain, @GoogleDeepmind. ML Efficiency, Multimodal\lingual. Changing spaces where breakthroughs happen.$