ray

@raydistributed

The AI framework trusted by OpenAI, Uber, and Airbnb. Created and developed by @anyscalecompute.

docs.ray.io

Beigetreten im August 2019

2KPosts 11KFollower 2Folge ich

Was dir gefallen könnte

@anyscalecompute

@robertnishihara

@DbrxMosaicAI

@ClementDelangue

@Gradio

@LightningAI

@julien_c

@wandb

@MLflow

@QodoAI

@Tim_Dettmers

@_philschmid

@srush_nlp

@BigscienceW

@LysandreJik

ray hat repostet

swyx #DevWritersRetreat

@swyx

02.12.

very honored to have @leerob grace the AIE stage for the first of hopefully many times. Cursor's execution is world-class and I heavily appreciate the detail Lee went into on Composer's arch and training - check out the custom MXFP8 kernels used, @anyscalecompute Ray load…

swyx's tweet image. very honored to have @leerob grace the AIE stage for the first of hopefully many times.

Cursor's execution is world-class and I heavily appreciate the detail Lee went into on Composer's arch and training - check out the custom MXFP8 kernels used, @anyscalecompute Ray load…

AI Engineer

@aiDotEngineer

02.12.

🆕Building Cursor Composer youtube.com/watch?v=fL1iJH… @leerob joined us for the first time to complement @srush_nlp's popular talk on the technical journey behind Cursor Composer, @cursor_ai's first full fledged coding LLM!

aiDotEngineer's tweet card. Building Cursor Composer – Lee Robinson, Cursor

youtube.com

YouTube

Building Cursor Composer – Lee Robinson, Cursor

Quelle: youtube.com

ray

@raydistributed

01.12.

Come join us for Ray LLM Office Hours tomorrow! Feel free to bring any questions, comments, or simply good vibes. We'd love to have you there!

Seiji Eicher

@seiji_________

01.12.

Ray Serve/Data LLM office hours tomorrow 12/2, 9:30-10:30a PT. Come through to chat distributed LLM inference 🚀 @nikhil_r_ghosh giving away free alpha on batch embeddings workloads; I'll demo the new wide-EP and disaggregated serving APIs for Ray Serve

ray hat repostet

Seiji Eicher

@seiji_________

26.11.

Wide-EP and prefill/decode disaggregation APIs for vLLM are now available in Ray 2.52 🚀🚀 Validated at 2.4k tokens/H200 on Anyscale Runtime, these patterns maximize sparse MoE model inference efficiency, but often require non-trivial orchestration logic. Here’s how they…

seiji_________'s tweet image. Wide-EP and prefill/decode disaggregation APIs for vLLM are now available in Ray 2.52 🚀🚀

Validated at 2.4k tokens/H200 on Anyscale Runtime, these patterns maximize sparse MoE model inference efficiency, but often require non-trivial orchestration logic.

Here’s how they…

ray

@raydistributed

26.11.

In 2.52, we're introducing ray symmetric-run -- a CLI command to simplify and improve the large model interactive development experience on Ray and @vllm_project! Spinning up multi-node vLLM with Ray on interactive environments can be tedious, requiring users to juggle separate…

raydistributed's tweet image. In 2.52, we're introducing ray symmetric-run -- a CLI command to simplify and improve the large model interactive development experience on Ray and @vllm_project!

Spinning up multi-node vLLM with Ray on interactive environments can be tedious, requiring users to juggle separate…

ray

@raydistributed

25.11.

Last call for Ray x AI21 Labs Tel Aviv meetup! 🇮🇱 Efficient LLM inference at scale with Ray + vLLM, real-world lessons from Anyscale & AI21 Labs. 🗓️ Nov 26 | 🕕 6–8pm (GMT+2) 📍 AI21 Labs, Tel Aviv 👉 Register here: luma.com/6skfv9ob

raydistributed's tweet image. Last call for Ray x AI21 Labs Tel Aviv meetup! 🇮🇱

Efficient LLM inference at scale with Ray + vLLM, real-world lessons from Anyscale &amp; AI21 Labs.

🗓️ Nov 26 | 🕕 6–8pm (GMT+2)
📍 AI21 Labs, Tel Aviv
👉 Register here: luma.com/6skfv9ob

ray hat repostet

Malhar Patel

@malharhar

18.11.

am asked a lot by early-stage robotics companies how to build scalable multimodal data infra -- so here's a sneak peek 🙂 thanks @robertnishihara for the invite! youtu.be/pkkV1US2IKc?si…

malharhar's tweet card. Applied Intuition’s Blueprint for Scalable RL + Batch Inference | Ray...

youtube.com

YouTube

Applied Intuition’s Blueprint for Scalable RL + Batch Inference | Ray...

Quelle: youtube.com

ray

@raydistributed

17.11.

Which dataset trained that model? Which Ray job created it? Lineage Tracking shows the full picture:datasets, models & compute. Webinar 11/20, 10 AM PT → na2.hubs.ly/H0227Y20

raydistributed's tweet image. Which dataset trained that model? Which Ray job created it?

Lineage Tracking shows the full picture:datasets, models &amp; compute.

Webinar 11/20, 10 AM PT → na2.hubs.ly/H0227Y20

ray

@raydistributed

14.11.

It’s only been a week since #RaySummit and the energy’s still running high. Big thanks to our Ray Summit sponsors and partners for being part of the event where AI builders shape what’s next.

raydistributed's tweet image. It’s only been a week since #RaySummit and the energy’s still running high.

Big thanks to our Ray Summit sponsors and partners for being part of the event where AI builders shape what’s next.

ray

@raydistributed

13.11.

NEW: Ray Direct Transport (RDT) — native RDMA for Ray Core. GPU-to-GPU transfers up to 1000x faster via NVLink & InfiniBand. Ideal for RL loops, rollout pipelines, & multi-GPU inference. 🔗 na2.hubs.ly/H0224V00

raydistributed's tweet image. NEW: Ray Direct Transport (RDT) — native RDMA for Ray Core.

GPU-to-GPU transfers up to 1000x faster via NVLink &amp; InfiniBand. Ideal for RL loops, rollout pipelines, &amp; multi-GPU inference.

🔗 na2.hubs.ly/H0224V00

ray

@raydistributed

10.11.

Last call to join the Ray Community's first Amsterdam Meetup hosted by @Adyen! 🇳🇱 Learn how Ray + Anyscale power scalable AI -- from data processing to model training and RL. 🕓 Nov 11, 5:30 PM CET 📍 Adyen Office Register now: luma.com/nou9t2um

raydistributed's tweet image. Last call to join the Ray Community's first Amsterdam Meetup hosted by @Adyen! 🇳🇱 Learn how Ray + Anyscale power scalable AI -- from data processing to model training and RL.

🕓 Nov 11, 5:30 PM CET
📍 Adyen Office

Register now: luma.com/nou9t2um

ray

@raydistributed

10.11.

This was one of the highlights of Ray Summit last week.

Sasha Rush

@srush_nlp

09.11.

Talk at Ray Summit on "Building Cursor Composer." Overview of the work from our research team. youtube.com/watch?v=md8D8e…

srush_nlp's tweet card. Ray Summit 2025 Keynote: Building Cursor Composer with Sasha Rush

youtube.com

YouTube

Ray Summit 2025 Keynote: Building Cursor Composer with Sasha Rush

Quelle: youtube.com

ray

@raydistributed

07.11.

New Anyscale releases announced at Ray Summit, from Developer Central to Anyscale Runtime to Cluster Controller. Read the roll up blog: na2.hubs.ly/H01X3pd0

raydistributed's tweet image. New Anyscale releases announced at Ray Summit, from Developer Central to Anyscale Runtime to Cluster Controller.

Read the roll up blog: na2.hubs.ly/H01X3pd0

ray

@raydistributed

06.11.

Ray Summit 2025 just wrapped, and what an incredible week it's been! Thank you all for coming! Thousands of builders, researchers, and platform engineers came together in San Francisco to share how they’re scaling AI in production with Ray.

ray hat repostet

Robert Nishihara

@robertnishihara

06.11.

Yesterday at Ray Summit, we announced a partnership between @anyscalecompute and @Azure! This brings Anyscale's managed & optimized @raydistributed experience to the Azure ecosystem. Anyscale on Azure is now in private preview and is accessible directly through the Azure Portal.…

robertnishihara's tweet card. The path from prototype to production for AI/ML workloads is rarely straightforward. As data pipelines expand and model complexity grows, teams can find

Powering Distributed AI/ML at Scale with Azure and Anyscale | All things Azure

Quelle: devblogs.microsoft.com

ray

@raydistributed

05.11.

Announcing Anyscale Runtime for Ray: same APIs, faster and cheaper. Benchmarks: 2-3x image throughput, 10x feature eng, 2.5x TPC-H Q1, 7x recsys QPS, 40% faster video. Geotab 43x; Tripadvisor ~70% cost cut. 👉 Read the blog: na2.hubs.ly/H01X2Cm0

raydistributed's tweet card. Faster, cheaper and more resilient distributed AI processing with Anyscale Runtime powered by the Ray open-source framework

Announcing Anyscale Runtime for Faster, Cheaper and More Resilient AI, Powered by Ray

Quelle: anyscale.com

ray

@raydistributed

03.11.

Your infra can’t fix what it can’t see 👀 At #RaySummit 2025, Mengjin Yan & Nikita Vemuri from @anyscalecompute reveal Ray-native observability – dashboards built for distributed AI, storing telemetry in your own cloud Plus: a new Ray Export API to extend monitoring to your stack

raydistributed's tweet image. Your infra can’t fix what it can’t see 👀
At #RaySummit 2025, Mengjin Yan &amp; Nikita Vemuri from @anyscalecompute reveal Ray-native observability – dashboards built for distributed AI, storing telemetry in your own cloud
Plus: a new Ray Export API to extend monitoring to your stack

ray

@raydistributed

01.11.

SGLang 🤝 Ray! We're super excited to have @ying11231 and @liin1211 talk about SGLang and its new features at Ray Summit! They'll highlight the newest SGLang features and also talk about SGLang's integration with Ray Data LLM. Hope to see you there!

raydistributed's tweet image. SGLang 🤝 Ray!

We're super excited to have @ying11231 and @liin1211 talk about SGLang and its new features at Ray Summit!

They'll highlight the newest SGLang features and also talk about SGLang's integration with Ray Data LLM.

Hope to see you there!

LMSYS Org

@lmsysorg

31.10.

SGLang at Ray Summit 2025 is coming! 📍 San Francisco • Nov 3–5 • Hosted by @anyscalecompute 🗓 On Nov 5, SGLang is invited to give a talk on Efficient LLM Serving 🎤 @ying11231 & @liin1211 will introduce core features, high-throughput & low-latency tricks, real-world…