rackingroll's profile picture. Sr. Scientist at @Amazon Search, CS Ph.D from @RiceUniversity. ML, IR, NLP. Love traveling, adventuring, and having fun. Warriors Fan.

Chen Luo

@rackingroll

Sr. Scientist at @Amazon Search, CS Ph.D from @RiceUniversity. ML, IR, NLP. Love traveling, adventuring, and having fun. Warriors Fan.

Amazing! Alex lives beyond the boundaries of what’s “fair.” 💪

Alex Honnold’s selfie from the top of Taipei 101 after his historic free solo. #SkyscraperLIVE

netflix's tweet image. Alex Honnold’s selfie from the top of Taipei 101 after his historic free solo. #SkyscraperLIVE


Chen Luo reposted

We’re recruiting postdocs this year! Help us spread the world 🙏

🚀 InfiniAI Lab @ CMU is hiring Postdocs! We are looking for outstanding postdoctoral researchers in ML systems and security to join InfiniAI Lab at Carnegie Mellon University. Research directions include (but are not limited to): 🤖 AI Agents & RL 🔐 Machine Learning…



Chen Luo reposted

Really enjoyed Matt’s keynote at #AWSreInvent today. So much innovation happening in @awscloud, and you could see it with the array of launches he unveiled. So many parts of the keynote worth watching, but will point to a few: 1/ Excited about the availability of Trainium3.…

ajassy's tweet image. Really enjoyed Matt’s keynote at #AWSreInvent today. So much innovation happening in @awscloud, and you could see it with the array of launches he unveiled.

So many parts of the keynote worth watching, but will point to a few:

1/ Excited about the availability of Trainium3.…
ajassy's tweet image. Really enjoyed Matt’s keynote at #AWSreInvent today. So much innovation happening in @awscloud, and you could see it with the array of launches he unveiled.

So many parts of the keynote worth watching, but will point to a few:

1/ Excited about the availability of Trainium3.…
ajassy's tweet image. Really enjoyed Matt’s keynote at #AWSreInvent today. So much innovation happening in @awscloud, and you could see it with the array of launches he unveiled.

So many parts of the keynote worth watching, but will point to a few:

1/ Excited about the availability of Trainium3.…
ajassy's tweet image. Really enjoyed Matt’s keynote at #AWSreInvent today. So much innovation happening in @awscloud, and you could see it with the array of launches he unveiled.

So many parts of the keynote worth watching, but will point to a few:

1/ Excited about the availability of Trainium3.…

Great talk!

Full video of the ICCV '25 presentation



Chen Luo reposted

Full video of the ICCV '25 presentation


Congratulations to these awesome young AI researchers!

Excited to announce @amazon's new AI PhD Fellowship Program supporting 100+ students across 9 universities like Carnegie Mellon, MIT & Stanford. Fellows will be paired with senior scientists working in related fields, plus receive financial support and AWS credits for research.…

RohitPrasadAI's tweet image. Excited to announce @amazon's new AI PhD Fellowship Program supporting 100+ students across 9 universities like Carnegie Mellon, MIT & Stanford. Fellows will be paired with senior scientists working in related fields, plus receive financial support and AWS credits for research.…


I'm expecting a bunch of XXX-OCR papers to come out in the next few weeks — e.g., Search-OCR, Shopping-OCR, Ads-OCR, Rec-OCR, lol.

I quite like the new DeepSeek-OCR paper. It's a good OCR model (maybe a bit worse than dots), and yes data collection etc., but anyway it doesn't matter. The more interesting part for me (esp as a computer vision at heart who is temporarily masquerading as a natural language…



GPT-5 is great, but honestly, I was expecting a more powerful model with additional abilities, such as interacting with the physical world, rather than just incremental improvements on benchmarks.


We’re revealing the magic behind aligning LLMs to power conversational shopping for customers of Amazon Rufus at hashtag#SIGIR2025! If you’re in Padua this year, come check out our presentation and posters! Details: lnkd.in/gu4Sx5nJ

rackingroll's tweet image. We’re revealing the magic behind aligning LLMs to power conversational shopping for customers of Amazon Rufus at hashtag#SIGIR2025!

If you’re in Padua this year, come check out our presentation and posters!
Details: lnkd.in/gu4Sx5nJ

Chen Luo reposted

Say hello to Multiverse — the Everything Everywhere All At Once of generative modeling. 💥 Lossless, adaptive, and gloriously parallel 🌀 Now open-sourced: multiverse4fm.github.io I was amazed how easily we could extract the intrinsic parallelism of even SOTA autoregressive…

🔥 We introduce Multiverse, a new generative modeling framework for adaptive and lossless parallel generation. 🚀 Multiverse is the first open-source non-AR model to achieve AIME24 and AIME25 scores of 54% and 46% 🌐 Website: multiverse4fm.github.io 🧵 1/n



💪💪💪

Until next season 💙

NBCSWarriors's tweet image. Until next season 💙


Chen Luo reposted

New blog post: let's talk about latents! sander.ai/2025/04/15/lat…


Cool!

This post is unavailable.

Chen Luo reposted

This is it: The world’s smartest AI, Grok 3, now available for free (until our servers melt). Try Grok 3 now: x.com/i/grok X Premium+ and SuperGrok users will have increased access to Grok 3, in addition to early access to advanced features like Voice Mode


Chen Luo reposted

⏰📢After years of working on long-context efficiency, I’ve started to doubt if it’s truly necessary (Many of you have probably noticed the decline of interest in long llms). Despite strong models like Gemini, short-context + retrieval often do the trick—faster, cheaper, and…

🚀 RAG vs. Long-Context LLMs: The Real Battle ⚔️ 🤯Turns out, simple-to-build RAG can match million-dollar long-context LLMs (LC LLMs) on most existing benchmarks. 🤡So, do we even need long-context models? YES. Because today’s benchmarks are flawed: ⛳ Too Simple –…

InfiniAILab's tweet image. 🚀 RAG vs. Long-Context LLMs: The Real Battle ⚔️ 
🤯Turns out, simple-to-build RAG can match million-dollar long-context LLMs (LC LLMs) on most existing benchmarks. 
🤡So, do we even need long-context models? 

YES. Because today’s benchmarks are flawed:
⛳ Too Simple –…


The moment I realized Andrej post a YT video. I immediately stop everything, find a meeting room and start watching. lol

New 3h31m video on YouTube: "Deep Dive into LLMs like ChatGPT" This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It is covers the full training stack of how the models are developed, along with mental…

karpathy's tweet image. New 3h31m video on YouTube:
"Deep Dive into LLMs like ChatGPT"

This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It is covers the full training stack of how the models are developed, along with mental…


🫣

It's 1000% real:

ShamsCharania's tweet image. It's 1000% real:


Chen Luo reposted

📢Call for Papers: LLM for E-Commerce Workshop @ WWW'25 📅April 28-29, 2025 | Sydney, Australia 🌍 Explore how LLMs are transforming e-commerce: foundations, applications & systems. 📝Submit: openreview.net/group?id=ACM.o… (by Jan 26, 2025 AoE) 👉Details: llm4ecommerce.github.io


Chen Luo reposted

🚀 DeepSeek-R1 is here! ⚡ Performance on par with OpenAI-o1 📖 Fully open-source model & technical report 🏆 MIT licensed: Distill & commercialize freely! 🌐 Website & API are live now! Try DeepThink at chat.deepseek.com today! 🐋 1/n

deepseek_ai's tweet image. 🚀 DeepSeek-R1 is here!

⚡ Performance on par with OpenAI-o1
📖 Fully open-source model & technical report
🏆 MIT licensed: Distill & commercialize freely!

🌐 Website & API are live now! Try DeepThink at chat.deepseek.com today!

🐋 1/n

Loading...

Something went wrong.


Something went wrong.