
Akash Srivastava

@variational_i

Director, Core AI, IBM. Chief Architect, http://instructLAB.ai. Founder, Red Hat AI Innovation Team. PI @MITIBMLab. ❤️ Density Ratios.

What does it take to scale AI beyond the lab? At #RedHatSummit, @ishapuri101 and I spoke with Red Hat CEO Matt Hicks & CTO Chris Wright on inference-time scaling, open infra (LLMD), and making AI affordable for enterprise. 🎧 youtu.be/mj1dwrPfvb4 #NoMathAI @RedHat_AI

Linked video: Inference Time Scaling for Enterprises | No Math AI (YouTube)


🚀 How is generative AI transforming the way we design cars, planes, and entire systems? In Ep 2 of No Math AI, @ishapuri101 and I chat with Dr. @_faezahmed (@MIT DeCoDE Lab) about how AI boosts creativity, cuts design time, and works with engineers—not against them.

How is generative AI reshaping engineering design? In Episode 2 of No Math AI, hosts Dr. Akash Srivastava (@variational_i) and MIT PhD student Isha Puri (@ishapuri101) sit down with Dr. Faez Ahmed (@_faezahmed) from MIT DeCoDE Lab to explore just that. 👇



SQuat: KV-cache quantization for making reasoning models go 🚀 📄 paper: lnkd.in/emKhAVZu 💻 code: lnkd.in/e8TJ7N3R From my awesome collaborators @RedHat_AI

[1/x] 🚀 We're excited to share our latest work on improving inference-time efficiency for LLMs through KV cache quantization, a key step toward making long-context reasoning more scalable and memory-efficient.


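The quoted thread is about KV-cache quantization in general. As a rough illustration of the basic idea, here is a minimal sketch assuming simple symmetric int8 quantization with per-channel scales; this is not the SQuat algorithm itself, and all names and shapes are made up.

```python
# Minimal sketch of generic KV-cache quantization (illustrative only,
# NOT the SQuat algorithm): store cached keys/values as int8 with
# per-channel scales and dequantize when attention reads the cache.
import torch

def quantize_kv(x: torch.Tensor):
    """Symmetric int8 quantization along the last dim.
    x: (batch, heads, seq_len, head_dim) float tensor."""
    scale = x.abs().amax(dim=-1, keepdim=True).clamp(min=1e-8) / 127.0
    q = (x / scale).round().clamp(-127, 127).to(torch.int8)
    return q, scale

def dequantize_kv(q: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    return q.float() * scale

# usage: a fake key cache; int8 storage is ~4x smaller than fp32
k = torch.randn(1, 8, 1024, 64)
k_q, k_scale = quantize_kv(k)
k_hat = dequantize_kv(k_q, k_scale)   # what attention would actually see
print("max abs error:", (k - k_hat).abs().max().item())
```

Int8 storage cuts the cache to roughly a quarter of its fp32 size; the paper's method presumably goes well beyond this naive baseline in how the quantization is chosen.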

Akash Srivastava reposted

Excited to share our preliminary work on customizing reasoning models using Red Hat AI Innovation’s Synthetic Data Generation (SDG) package! 📄 Turn your documents into training data for LLMs. 🧵👇

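As a hedged sketch of what "turn your documents into training data" can look like in code (illustrative only; this is not the SDG package's actual API, and `ask_teacher` is a hypothetical wrapper around whatever chat model serves as the teacher):

```python
# Hedged sketch of a document-to-training-data flow (illustrative;
# not the actual sdg package API). `ask_teacher` is a hypothetical
# callable wrapping whatever chat model you use as the teacher.
from typing import Callable, Iterator

def chunks(text: str, size: int = 2000) -> Iterator[str]:
    for i in range(0, len(text), size):
        yield text[i:i + size]

def document_to_qa(doc: str, ask_teacher: Callable[[str], str]) -> list:
    """Ask the teacher for one question/answer pair per chunk and
    return them in a chat-style messages format for fine-tuning."""
    examples = []
    for passage in chunks(doc):
        q = ask_teacher(f"Write one question answerable from:\n{passage}")
        a = ask_teacher("Answer the question using only this passage.\n"
                        f"Passage:\n{passage}\nQuestion: {q}")
        examples.append({"messages": [
            {"role": "user", "content": q},
            {"role": "assistant", "content": a},
        ]})
    return examples
```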

Akash Srivastava reposted

had a great time giving a talk about probabilistic inference scaling and the power of small models at the IBM Research ML Seminar Series - the best talks end with tons of questions, and it was great to see everyone so engaged : ) youtube.com/watch?v=--3rsQ…

Linked video: Scaling Small LLMs to o1 level! Probabilistic Methods for Inference... (YouTube)


Come along and help us build reasoning in small LLMs

🚀 Exploring LLM reasoning—live! We, the @RedHat AI Innovation Team, are working on reproducing R1-like reasoning in small LLMs without distilling R1 or its derivatives. We’re documenting our journey in real-time: 🔗 Follow along: red-hat-ai-innovation-team.github.io/posts/r1-like-…



Excited to share our latest work with @ishapuri101 et al.! 🚀 We introduce a probabilistic inference approach for inference-time scaling of LLMs using particle-based Monte Carlo methods, achieving 4–16x better scaling on math reasoning tasks and o1-level performance on MATH500.

[1/x] can we scale small, open LMs to o1 level? Using classical probabilistic inference methods, YES! Joint @MIT_CSAIL / @RedHat AI Innovation Team work introduces a particle filtering approach to scaling inference w/o any training! check out …abilistic-inference-scaling.github.io


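For intuition, a hedged sketch of a particle-filtering loop for inference-time scaling (generic, not the exact method in the paper; `generate_step` and `reward` are hypothetical stand-ins for one step of LLM generation and a process reward model):

```python
# Hedged sketch of particle filtering for inference-time scaling
# (generic; not the paper's exact algorithm). `generate_step` extends
# a partial solution by one reasoning step; `reward` stands in for a
# process reward model returning a score for a partial solution.
import math
import random

def particle_filter(prompt, generate_step, reward, n_particles=8, n_steps=10):
    particles = [prompt] * n_particles
    for _ in range(n_steps):
        # propose: extend every particle by one step
        particles = [generate_step(p) for p in particles]
        # weight: score partial solutions with the reward model
        weights = [math.exp(reward(p)) for p in particles]
        total = sum(weights)
        # resample: keep promising particles, drop weak ones
        particles = random.choices(particles,
                                   weights=[w / total for w in weights],
                                   k=n_particles)
    return max(particles, key=reward)

# toy usage: "generation" appends a random digit, "reward" counts 7s
best = particle_filter("", lambda p: p + random.choice("0123456789"),
                       lambda p: p.count("7"))
print(best)
```

Because low-weight particles are dropped at every resampling step, compute concentrates on the reasoning traces the reward model currently favors, which is what lets a small model benefit from extra inference-time compute.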

Akash Srivastava reposted

🧩 Why do task vectors exist in pretrained LLMs? Our new research uncovers how transformers form internal abstractions and the mechanisms behind in-context learning (ICL).


Akash Srivastava reposted

Neural activity is correlated among animals performing the same task and across sequential trials. Led by @zhang_yizi and @hl3616, we develop a reduced-rank model that exploits shared structure across animals to improve neural decoding. biorxiv.org/content/10.110…


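For readers unfamiliar with the term, a reduced-rank model in this setting is roughly a linear decoder whose coefficient matrix is constrained to low rank, so decoding flows through a small set of latent factors that can be shared (for example, across animals). A generic reduced-rank regression sketch (illustrative; not the preprint's model):

```python
# Generic reduced-rank regression sketch (illustrative; not the
# preprint's model). The decoder's coefficient matrix is constrained
# to rank r, so decoding flows through a low-dimensional subspace
# that could in principle be shared across animals or sessions.
import numpy as np

def fit_reduced_rank(X, Y, rank=2):
    """X: (trials, neurons) neural activity, Y: (trials, behavior_dims)."""
    B, *_ = np.linalg.lstsq(X, Y, rcond=None)     # ordinary least squares
    _, _, Vt = np.linalg.svd(X @ B, full_matrices=False)
    P = Vt[:rank].T @ Vt[:rank]                   # projector onto top-r output subspace
    return B @ P                                  # rank-r coefficient matrix

# usage with synthetic data
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 50))                    # 200 trials, 50 neurons
Y = X @ rng.normal(size=(50, 3)) + 0.1 * rng.normal(size=(200, 3))
B_rr = fit_reduced_rank(X, Y, rank=2)
print("decoding MSE:", ((X @ B_rr - Y) ** 2).mean())
```
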
Akash Srivastava reposted

What will a foundation model for the brain look like? We argue that it must be able to solve a diverse set of tasks across multiple brain regions and animals. Check out our preprint where we introduce a multi-region, multi-animal, multi-task model (MtM): arxiv.org/abs/2407.14668


Akash Srivastava reposted

🚀 Stronger, simpler, and better! 🚀 Introducing Value Augmented Sampling (VAS) - our new algorithm for LLM alignment and personalization that outperforms existing methods!


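Loosely, value-augmented decoding means a frozen base LM proposes next-token logits and a separately trained value model re-scores the top-k candidates before sampling. The sketch below illustrates that flavor only; it is not the VAS algorithm as published, and both inputs are random stand-ins:

```python
# Hedged sketch of value-guided decoding in the spirit of VAS
# (illustrative; not the published algorithm). A frozen base LM gives
# next-token logits, a learned value model scores the top-k candidate
# tokens, and we sample from logits + beta * value.
import torch

def value_augmented_step(base_logits, value_scores, beta=1.0, top_k=20):
    """base_logits, value_scores: (vocab_size,) tensors; value_scores[i]
    stands in for the value model's expected-reward estimate for token i."""
    topk = torch.topk(base_logits, top_k)
    adjusted = topk.values + beta * value_scores[topk.indices]
    probs = torch.softmax(adjusted, dim=-1)
    choice = torch.multinomial(probs, num_samples=1)
    return topk.indices[choice].item()

# usage with random stand-ins for both models
vocab_size = 1000
token_id = value_augmented_step(torch.randn(vocab_size), torch.randn(vocab_size))
print("sampled token id:", token_id)
```
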
Akash Srivastava reposted

Excited to give a talk on our hottest, newest work “Value Augmented Sampling for Language Model Alignment and Personalization” at 2:30p Halle A3 in #ICLR2024 Reliable and Responsible Foundation Models Workshop 🥳🥳

📢Workshop on Reliable and Responsible Foundation Models will happen today (8:50am - 5:00pm). Join us at #ICLR2024 room Halle A 3 for a wonderful lineup of speakers, along with 63 amazing posters and 4 contributed talks! Schedule: iclr-r2fm.github.io/#program.



Attending #ICLR2024, interested in continual learning, and like probabilistic modeling? Lazar from the @MITIBMLab will be presenting our latest work that takes a probabilistic approach to modular continual learning on Tuesday, 7 May, Halle B #222 (iclr.cc/virtual/2024/p…).

I’ll be presenting our #ICLR2024 paper on a probabilistic approach to scaling modular continual learning algorithms while achieving different types of knowledge transfer. (arxiv.org/abs/2306.06545, in collaboration with @variational_i @swarat @RandomlyWalking ). A tldr (1/8):



Akash Srivastava reposted

Check out our work titled "From Automation to Augmentation: Redefining Engineering Design and Manufacturing in the Age of NextGen-AI", where we highlight the requirements for NextGenAI suitable for design, engineering, and manufacturing. mit-genai.pubpub.org/pub/9s6690gd/r…

Instead of continuing to emphasize automation, a human-centric approach to the next generation of #AI technologies in #manufacturing could enhance workers' skills and boost productivity. mit-genai.pubpub.org/pub/9s6690gd/r… @AustinLentsch @DAcemogluMIT @baselinescene @_faezahmed @MITMechE



Akash Srivastava reposted

New work from @MITIBMLab researchers on large-scale alignment of LLMs. Check out the models at HF: huggingface.co/ibm/merlinite-…

Hey, we did a thing: "LAB: Large-scale Alignment for chatBots"—a new synthetic data-driven LLM alignment method that yields great results without using large-scale human or proprietary model data. arxiv.org/abs/2403.01081 models: huggingface.co/ibm/labradorit…, huggingface.co/ibm/merlinite-…



New work on automated red-teaming in LLMs using curiosity-driven exploration! #iclr24

(1/4) 🎉 Excited to share our ICLR'24 paper on "Curiosity-driven Red-teaming for Large Language Models"! We bridge curiosity-driven exploration in reinforcement learning (RL) with red-teaming, introducing the Curiosity-driven Red-teaming (CRT) method. #ICLR24 #AI #LLMSecurity


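A hedged sketch of the reward-shaping idea behind curiosity-driven red-teaming (illustrative; the paper's actual objective and novelty terms may differ): the red-team policy is rewarded both for eliciting unsafe target responses and for producing prompts unlike those it has generated before.

```python
# Hedged sketch of the reward shaping behind curiosity-driven
# red-teaming (illustrative; the paper's exact objective may differ).
# The red-team policy is rewarded for eliciting unsafe responses AND
# for prompts that are dissimilar to everything it produced before.
import numpy as np

def curiosity_reward(prompt_emb, response_toxicity, seen_embs, novelty_coef=0.5):
    """prompt_emb: embedding of the red-team prompt.
    response_toxicity: classifier score in [0, 1] for the target's reply.
    seen_embs: embeddings of previously generated prompts."""
    if seen_embs:
        sims = [float(prompt_emb @ e /
                      (np.linalg.norm(prompt_emb) * np.linalg.norm(e)))
                for e in seen_embs]
        novelty = 1.0 - max(sims)   # far from anything tried before
    else:
        novelty = 1.0
    return response_toxicity + novelty_coef * novelty

# usage with random stand-ins
rng = np.random.default_rng(0)
emb = rng.normal(size=64)
history = [rng.normal(size=64) for _ in range(3)]
print(curiosity_reward(emb, response_toxicity=0.8, seen_embs=history))
```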

❤️ #NeurIPS2023. After 4 years, met my advisor and my advisor's advisor at the same time.


Akash Srivastava reposted

Uniform sampling hampers offline RL. How to fix it? Check our paper at #NeurIPS2023. Time: Wed 13 Dec, 5–7 p.m. CST Location: Great Hall & Hall B1+B2 (level 1) #1908 Paper: openreview.net/forum?id=TW99H… Code: github.com/Improbable-AI/…


Interested in learning how generative models can help with constrained design generation and topology optimization? Come to poster 540, 10:45am session today (neurips.cc/virtual/2023/p…), where @georgosgeorgos will be presenting our work on aligning TO with diffusion models. #NeurIPS23

