Shridhar
@JupyterAI
PhD student in ML/NLP @eth_en | Past: @MSFTResearch @AIatMeta @AmazonScience @rptu_kl_ld | He/him. Views are my own.
Can Large Language Models (LLMs) accurately judge their own generative output? Introducing ART: Ask, Refine, and Trust. 1. ASK important questions to decide if refinement is needed 2. execute REFINEMENT 3. affirm or withhold TRUST in refinement
The ART of LLM Refinement: Ask, Refine, and Trust Achieves a performance gain of 5 points over self-refinement baselines, while using a much smaller model as the decision maker arxiv.org/abs/2311.07961
A student should confidently exploit what it knows and then explore its limits with a teacher. “Learning is often a mix of confidence and curiosity” Check out how we can balance the two for knowledge distillation at @aclmeeting in Vienna! #ACL2025
I am happy to share that our work SIKeD has been accepted to ACL 2025 @aclmeeting findings! More details below
“Reasoning’s like dominoes—nudge that first piece just right and the rest fall perfectly into place.” See you all in Vienna!! #ACL2025
Happy to share that this paper has now been accepted to ACL 2025 @aclmeeting findings! Paper link: arxiv.org/abs/2311.07945 Details in🧵
Gave a simple image to @OpenAI O3 to look into and it zoomed in and out to make sure it’s “q” and not “9”. Is zooming in and out with corresponding text part of some alignment strategy? Or some form of augmentation that works? Any papers in this direction?
Thought #OpenAI's deep research would add URLs to BibTeX entries easily. Seemed like a perfect use case given I provide all the sources to look into. But NO, it decided to choose a couple of the entries and ignored all the others.
Game the system!
Llama 4 quietly dropped from 1417 to 1273 ELO, on par with DeepSeek v2.5
Claude 3.7 Max Thinking in Cursor is hands down the best for cloning anything 💻✨
It’s crazy how the definition of small models is changing so fast. Now it’s 17B MoE with over 100B parameters. Not sure if this will help the open source community to train their own models which was the main reason why llama was so popular.
Introducing our first set of Llama 4 models! We’ve been hard at work doing a complete re-design of the Llama series. I’m so excited to share it with the world today and mark another major milestone for the Llama herd as we release the *first* open source models in the Llama 4…
What’s the equivalent of Vibe coding for AI agent? Anything specific people are testing?
ICML 2025's rebuttal process be like🤣: 👨💻 Authors: spend a whole week writing a careful rebuttal ✅ Reviewer: clicks "acknowledge" without reading 🚫 Author: not allowed to reply anymore So what does acknowledge mean here? "You speak. I pretend to listen. Conversation over."🙃
My congratulations to @DGukesh on his victory today. He has summited the highest peak of all: making his mother happy!
If @narendramodi ji is interested, I would be down to figuring out an economic structure where all Indian students, faculty and researchers can get Perplexity Pro.
Indian gov't is buying a subscription to 13,000 academic journals, and then making them all available to "18 million students, faculty, and researchers" for free. The cost is $715 million over 3 years. It includes Elsevier, Nature, and AAAS. Have any other countries done this?