Yong Zheng-Xin (Yong)
@yong_zhengxin
reasoning models @BrownCSDept || ex-intern/collab @AIatMeta @Cohere_Labs || sometimes write on http://yongzx.substack.com
You might like
Congrats on the job and thanks for sharing your experience! It's a great read and we need more articles like this. link for those interested: rona.substack.com/p/becoming-a-c…
i finally got a job as a compiler engineer!! it took months of grinding, so i wrote a biiig post about how i recruited, what the interviews are like, etc. link in bio 🥰
From multilingual models to diverse benchmarks and multimodal learning — Day 1 of Connect brings together researchers expanding what’s possible in global AI. 🖇️ Our lightning talks spotlight collaborative work that make AI more representative of the world’s languages. ⚡
How is memorized data stored in a model? We disentangle MLP weights in LMs and ViTs into rank-1 components based on their curvature in the loss, and find representational signatures of both generalizing structure and memorized training data
Can LLMs use tens of thousands of tools to navigate complex enterprise environments? In my @Microsoft internship work, we - introduce TheMCPCompany, a benchmark with 18,000+ tools - show that using a massive tool set is cheaper, faster, and more effective than web browsing…
📢Thrilled to introduce ATLAS 🗺️: scaling laws beyond English, for pretraining, finetuning, and the curse of multilinguality. The largest public, multilingual scaling study to-date—we ran 774 exps (10M-8B params, 400+ languages) to answer: 🌍Are scaling laws different by…
New research with @AdtRaghunathan, Nicholas Carlini and Anthropic! We built ImpossibleBench to measure reward hacking in LLM coding agents 🤖, by making benchmark tasks impossible and seeing whether models game tests or follow specs. (1/9)
Published today in @ScienceMagazine: a landmark study led by Microsoft scientists with partners, showing how AI-powered protein design could be misused—and presenting first-of-its-kind red teaming & mitigations to strengthen biosecurity in the age of AI.
As I am working on presentation for our multilingual safety survey work at EMNLP, I came across this interesting recent report by OpenAI: "Disrupting malicious uses of AI: October 2025" At least 4 out of 7 case studies involve multilingual safety openai.com/global-affairs…
United States Trends
- 1. #SmackDown 38.8K posts
- 2. Giulia 12.7K posts
- 3. Caleb Wilson 4,821 posts
- 4. #BostonBlue 4,016 posts
- 5. #OPLive 1,751 posts
- 6. Supreme Court 172K posts
- 7. Rockets 19.6K posts
- 8. Tulane 2,956 posts
- 9. #TheLastDriveIn 2,425 posts
- 10. Northwestern 4,401 posts
- 11. #Dateline N/A
- 12. Lash Legend 5,203 posts
- 13. Podz 1,528 posts
- 14. Justice Jackson 3,451 posts
- 15. Chelsea Green 5,529 posts
- 16. NBA Cup 8,942 posts
- 17. Harrison Barnes N/A
- 18. Reed 23.8K posts
- 19. Justice Ketanji Brown Jackson 2,110 posts
- 20. Sengun 4,019 posts
You might like
-
Wenhu Chen
@WenhuChen -
Zhaofeng Wu ✈️ EMNLP
@zhaofeng_wu -
Niklas Muennighoff
@Muennighoff -
Shunyu Yao
@ShunyuYao12 -
Jialu Li
@JialuLi96 -
Omar Khattab
@lateinteraction -
Prithviraj (Raj) Ammanabrolu
@rajammanabrolu -
Nouha Dziri
@nouhadziri -
Louis Castricato @ lovecraftian horrors
@lcastricato -
Tristan Thrush
@TristanThrush -
Weijia Shi
@WeijiaShi2 -
Ofir Press
@OfirPress -
Pan Lu
@lupantech -
Samuel Albanie 🇬🇧
@SamuelAlbanie -
Wangchunshu Zhou
@wangchunshu
Something went wrong.
Something went wrong.