
Satyabrat Singh
@satyabratsingh
Interested in Software, ML, Quant Research. MSc in ML from UCL, MSc Maths from IIT
Glad to introduce our new work "Game-Theoretic Regularized Self-Play Alignment of Large Language Models". arxiv.org/abs/2503.00030 🎉 We introduce RSPO, a general, provably convergent framework to bring different regularization strategies into self-play alignment. 🧵👇

Thrilled to introduce our test-time algorithm for robust multi-objective alignment! Huge kudos to my incredible collaborators for making this happen!
❓No clue about the priorities of the objectives? ❗️ Focus on robustness at test-time! 🚀Robust Multi-Objective Decoding (RMOD) is a novel inference-time alignment algorithm that produces robust responses under multiple objectives to consider.




🚀Sampling = Reinforcement Learning🤖 This means you can train a neural sampler using RL! We introduce the Value Gradient Sampler (VGS)—a novel diffusion sampler that leverages value functions to generate samples from an unnormalized density. 📄 Paper: arxiv.org/abs/2502.13280

(1/9) Flying to #NeurIPS2024 ? Our paper arxiv.org/abs/2405.20304 and blog shorturl.at/aIShm might be an interesting read on ur long flight to Vancouver! Accepted at #NeurIPS2024 and excited to present it as a poster on 13th December (1-4pm)!
On my way to #NeurIPS2024 ✈️ We are presenting several papers this year, including REDUCER, ARDT, GR-DPO/IPO, invariant BO. I’d love to connect and chat about topics like Alignment, RL/RLHF, LLM deception, robustness, and reasoning!
🚀🚀🚀 Introducing Adversarially Robust Decision Transformer (ARDT) 🚀🚀🚀 The first Decision Transformer for adversarial game-solving and robust decision-making, accepted to #NeurIPS #NeurIPS2024 🚨Change slightly : Replacing returns-to-go with minimax return. 🚨 Improve…

15 years ago today, I got a second chance at life… never realized how close death could be #MumbaiTerrorAttack #GratefulForLife
📣 If you've got an objective that exhibits symmetries, you should be using invariant kernel BO 📣 🚀 More sample efficient than constrained/naive BO! 🚀 More compute efficient than data augmentation! 🧵 1/4 #NeurIPS2024 #BayesianOptimisation #ai
This book is an absolute gem for understanding the intricacies of neural nets. Huge thanks to @SimonScardapane #MachineLearning #DeepLearning #AI


DeepSets are useful where we need permutation invariance. Imagine a batch of data with shape (n,m): we split this batch into k sets, each of size (n/k,m), feed them through a neural network, and aggregate the outputs as: f(X) = ∑(i=1 to k) g(x_i). This method captures the essence…
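A minimal sketch of the sum-pooling idea in the post above, using NumPy. Everything here is an assumption for illustration: the encoder `g` is a single random linear layer with ReLU standing in for a neural network, each x_i is read as one row of a set, and the batch is split evenly so each set has n/k rows. The names `g`, `deep_set_sum`, and the sizes are hypothetical, not from the original post.

```python
import numpy as np

rng = np.random.default_rng(0)

def g(x, W, b):
    """Per-element encoder: one linear layer + ReLU (stand-in for a neural net)."""
    return np.maximum(x @ W + b, 0.0)

def deep_set_sum(X, W, b):
    """Sum-pool the per-element encodings: f(X) = sum_i g(x_i)."""
    return g(X, W, b).sum(axis=0)

# Hypothetical sizes: n rows, m features, k sets, d encoder outputs.
n, m, k, d = 12, 4, 3, 8
batch = rng.normal(size=(n, m))
W = rng.normal(size=(m, d))
b = rng.normal(size=d)

# Split the (n, m) batch into k sets of shape (n // k, m).
sets = np.split(batch, k)

# One pooled vector per set, stacked into shape (k, d).
outputs = np.stack([deep_set_sum(S, W, b) for S in sets])

# Shuffling the rows of a set should not change its pooled output.
shuffled = sets[0][rng.permutation(len(sets[0]))]
invariant = np.allclose(deep_set_sum(sets[0], W, b),
                        deep_set_sum(shuffled, W, b))
print(outputs.shape, invariant)
```

Because addition is commutative, reordering the rows of a set leaves the pooled vector unchanged up to floating-point rounding, which is the permutation invariance the post refers to.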
The 2nd edition of my #ReinforcementLearning 477-page textbook for my course at ASU has just been published and is freely available at the book's website web.mit.edu/dimitrib/www/R… which also contains slides, videolectures, and supporting material
Competition Launch Alert! Realtime Marketdata hosted by @JaneStreetGroup 🎯 Challenge: Develop an ML forecasting model using real-world data derived from production systems 💰 Prize Pool: $120,000 ⏰ Entry Deadline: 12/30/2024 Explore the difficult dynamics that shape financial…
Detailed Thread on Option Pricing with Deep Learning Original paper link -cs230.stanford.edu/projects_fall_… Like, Retweet and comment "OPDL", to receive the code for the Deep Learning for Option Pricing

The human mind is a complex network. I inadvertently saw some gross pictures of an accident and feel really disturbed; shouldn't @X censor them?
Impressed by Cursor, makes programming better :) cursor.com
🔒 Discover the hidden risks of AI-driven devices with Dr Anna Maria Mandalari. Gain essential insights on security & privacy from the session at the Festival of Research 2024. 🎥 Watch now: buff.ly/3AdtpBD #CyberSecurity #FOR24

🚀 Exciting PhD opportunity in IoT, edge devices, security & privacy at UCL! I have a fully-funded 4-year studentship available. More info: ucl.ac.uk/electronic-ele… #PhDOpportunity #IoT @UCL_ICCS
Wonder how many times #AI is mentioned in town-hall meetings these days; it takes up merely 99.99% of the time :) #AI #MachineLearning