#vllm search results

After a few weeks of fiddling, last night I finally saw traffic ramping up normally without errors. Only once you scale up do you realize how many vLLM pitfalls there are to step in. Today we're heading for 6M TPM, still 99% short of maxing out. #vllm

Docker Model Runner + @vllm_project - run safetensors models and scale to production without leaving your Docker workflow ⚡️ 🔗 Try it out: bit.ly/4psZN7z #Docker #vLLM #ModelRunner #AI #DevTools


(1/n) We are drastically overestimating the cost of LLMs, because we sometimes over-focus on single-query speed. Had the privilege to talk about this topic at the #vllm meetup yesterday. An average human reads 350 words per minute, which translates to roughly 5.8 words per second.

Thank you to everyone who filed issues, reviewed PRs, ran benchmarks, and helped shape this release. vLLM grows because the community does. Easy, fast, and cheap LLM serving for everyone. 🧡 #vLLM #AIInfra #OpenSource

Has vLLM solved the NVIDIA RTX 6000 Pro multi-GPU problem? Local LLM setups are about to take a dramatic leap forward! 🎉 #vLLM #ローカルLLM

Working on adding MLX/ MLX-LLM / vLLM to Fabric - you can run local LLM models in nodes alongside metal shaders, geometry, compute, realtime video processing, segmentations and key point analysis and make weird shit. #mlx #llm #vllm cc @awnihannun

The secret to blazing-fast LLM inference: the "essence" of vLLM, revealed ✨ A must-read for anyone who wants AI services to be fast and cheap! This deep dive into vLLM's internals will help you speed up inference and cut costs 🚀 #vLLM #AI活用

Having seen way too many vLLM forks, this looks like a great way forward - Building Clean, Maintainable #vLLM Modifications Using the Plugin System blog.vllm.ai/2025/11/20/vll… #AI #LLM
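
As a rough illustration of what the plugin route looks like (a minimal sketch assuming vLLM's entry-point-based plugin mechanism; the package, entry-point name, and model class below are hypothetical):

```python
# vllm_my_plugin/__init__.py -- a hypothetical out-of-tree vLLM plugin.
# vLLM discovers plugins through Python entry points, so the package's
# pyproject.toml would declare something like:
#
#   [project.entry-points."vllm.general_plugins"]
#   my_plugin = "vllm_my_plugin:register"


def register():
    """Called once by vLLM at startup for each registered plugin."""
    from vllm import ModelRegistry

    # Register a custom model class without forking vLLM itself.
    # The lazy "module:ClassName" string defers the heavy import until
    # the model is actually requested.
    ModelRegistry.register_model(
        "MyCustomModelForCausalLM",
        "vllm_my_plugin.my_model:MyCustomModelForCausalLM",
    )
```

Keeping modifications behind an entry point like this is what lets them survive vLLM upgrades without maintaining a fork.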


The secret to turbocharging LLMs is vLLM! 🚀 vLLM dramatically speeds up AI model inference and cuts operating costs ✨ Great support for individuals and small businesses rolling out AI services. AI adoption gets a lot more comfortable! #vLLM

DeepSeek's (@deepseek_ai) latest—MLA, Multi-Token Prediction, 256 Experts, FP8 block quantization—shines with @vllm_project. Catch the office hours session where we discuss all the DeepSeek goodies and explore their integration and benchmarks with #vLLM.


Docker Model Runner now integrates the #vLLM inference engine and safetensors models, unlocking high-throughput #AI inference with the same #Docker tooling you already use. docker.com/blog/docker-mo… #LLM


Getting 10x speedup in #vLLM is easier than you think 📈 I just discovered speculative decoding with ngram lookup and the results speak for themselves. Here's what you add to your vLLM serve command: speculative_config={ "method": "ngram", "num_speculative_tokens": 8,…
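
For reference, a minimal offline sketch of the n-gram setup the tweet is describing (assuming a recent vLLM release where `speculative_config` is passed as a dict; the model and parameter values are illustrative, and the gains depend heavily on how much of the output can be copied from the prompt):

```python
from vllm import LLM, SamplingParams

# N-gram speculation drafts tokens by matching repeated substrings in the
# prompt itself, so no separate draft model is needed. It helps most when the
# output copies heavily from the input (extraction, summarization, code edits).
llm = LLM(
    model="meta-llama/Llama-3.1-8B-Instruct",  # illustrative model choice
    speculative_config={
        "method": "ngram",
        "num_speculative_tokens": 8,  # tokens drafted per step
        "prompt_lookup_max": 4,       # longest n-gram to match in the prompt
    },
)

outputs = llm.generate(
    ["Summarize the following document:\n..."],
    SamplingParams(max_tokens=256, temperature=0.0),
)
print(outputs[0].outputs[0].text)
```

The same dict can reportedly be passed to `vllm serve` via `--speculative-config`.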

Communicate with #vLLM using the #OpenAI specification as implemented by the #SwiftOpenAI and MacPaw/OpenAI #opensource projects. 🔗 red.ht/3GfSQWs
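
vLLM serves an OpenAI-compatible API, so any OpenAI client can talk to it; the Swift libraries in the post follow the same pattern as this minimal Python sketch (the local server address and model name are assumptions):

```python
from openai import OpenAI

# Point the standard OpenAI client at a local vLLM server started with e.g.:
#   vllm serve Qwen/Qwen2.5-7B-Instruct
client = OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="EMPTY",  # vLLM ignores the key unless --api-key is set
)

response = client.chat.completions.create(
    model="Qwen/Qwen2.5-7B-Instruct",
    messages=[{"role": "user", "content": "Explain PagedAttention in one sentence."}],
    max_tokens=128,
)
print(response.choices[0].message.content)
```

Swap the base URL and keep the OpenAI request/response shapes; nothing else about the client has to change.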

🚀 llm-d v0.3.1 is LIVE! 🚀 This patch release is packed with key follow-ups from v0.3.0, including new hardware support, expanded cloud provider integration, and streamlined image builds. Dive into the full changelog: github.com/llm-d/llm-d/re… #llmd #OpenSource #vLLM #Release

🥳AutoRound landed in @vllm_project llm-compressor, supporting INT2 - INT8, MXFP4, NVFP4, FP8 and MXFP8 quantization for LLMs/VLMs on Intel CPUs/GPUs/HPUs and CUDA. Thanks to team & community. Github github.com/intel/auto-rou… and PR: github.com/vllm-project/l… #intel #autoround #vllm


Full house at the #vLLM and @ollama meetup in SF hosted by @ycombinator. Great to see familiar faces and meet new ones!

Disaggregated Inference at Scale with #PyTorch & #vLLM: Meta’s vLLM disagg implementation improves inference efficiency in latency & throughput vs its internal stack, with optimizations now being upstreamed to the vLLM community. 🔗 hubs.la/Q03J87tS0

Batch Inference with Qwen2 Vision LLM (Sparrow) I explain several tips for optimizing Qwen2 Vision LLM performance for batch processing. Complete video: youtube.com/watch?v=9SmQxT… Code: github.com/katanaml/sparr… Sparrow UI: katanaml-sparrow-ui.hf.space @katana_ml #vllm #ocr
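
The gist of the batching advice is to submit many documents in one call so vLLM's continuous batching can overlap them; here is a rough sketch using vLLM's offline multimodal API (the model name, file names, and bare prompt string are illustrative; a real prompt must follow the model's chat template with its image placeholder tokens):

```python
from PIL import Image
from vllm import LLM, SamplingParams

llm = LLM(model="Qwen/Qwen2-VL-7B-Instruct", max_model_len=8192)

# One request per document image.
# NOTE: Qwen2-VL expects its image placeholder tokens in the prompt; build the
# real prompt with the processor/tokenizer chat template rather than this
# simplified string.
images = [Image.open(p) for p in ["invoice_01.png", "invoice_02.png"]]
requests = [
    {
        "prompt": "Extract all line items from this document as JSON.",
        "multi_modal_data": {"image": img},
    }
    for img in images
]

# Passing the whole list lets vLLM batch prefill and decode across requests
# instead of processing documents one by one.
outputs = llm.generate(requests, SamplingParams(max_tokens=512, temperature=0.0))
for out in outputs:
    print(out.outputs[0].text)
```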


Successfully deployed WiNGPT-3.5 30BA3B and MedEvidence on NVIDIA’s new DGX Spark using vLLM. 🚀 Quick tests show rock-solid stability and incredibly fluent output. Impressed with the performance! #AI #NVIDIA #vLLM #LLM #MedicalAI


💡 Local LLM head-to-head: vLLM vs llama.cpp. Running Llama-3-8B on an RTX 4090: vLLM: 120-180 tokens/s; llama.cpp: 25-30 tokens/s. vLLM is 4-6x faster! Choosing the right tool for your use case matters. #ローカルLLM #AIHack #vLLM
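
Numbers like these are workload- and quantization-dependent, but they are easy to sanity-check; a minimal throughput measurement with vLLM's offline API could look like the sketch below (model name, batch size, and token counts are illustrative):

```python
import time
from vllm import LLM, SamplingParams

llm = LLM(model="meta-llama/Meta-Llama-3-8B-Instruct")
params = SamplingParams(max_tokens=256, temperature=0.8, ignore_eos=True)

# A batch of prompts: vLLM's throughput advantage comes largely from
# continuous batching, so measure with concurrency, not one prompt at a time.
prompts = [f"Write a short story about robot #{i}." for i in range(32)]

start = time.perf_counter()
outputs = llm.generate(prompts, params)
elapsed = time.perf_counter() - start

generated = sum(len(o.outputs[0].token_ids) for o in outputs)
print(f"{generated} tokens in {elapsed:.1f}s -> {generated / elapsed:.1f} tok/s")
```

For a fair comparison with llama.cpp, use a comparable quantization level and batch size; vLLM's lead grows with concurrency, while single-stream numbers are much closer.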


GPU server vs. local LLM... bzzzt... which one to pick, you ask? 🤔 vLLM (Python) or llama.cpp (C++)... hmm, the oracle says "consult your wallet 💰"! #LLM #vLLM #llama_cpp tinyurl.com/26bwpo8p


vLLM v0.11.1 makes LLMs blazing fast, even on older GPUs! 🚀 vLLM v0.11.1 is out, dramatically improving LLM inference speed even on Turing GPUs (e.g. the RTX 2080). Prefill in particular is faster, a welcome update that squeezes the most out of existing hardware. Give it a try! ✨ #vLLM #LLM高速化

Accelerating the take-up of some of our SOTA research into the Enterprise Search and Reason market. #LightOnOCR is now compatible with #vLLM.

LightOnOCR-1B is now part of @vllm_project v0.11.2! Transform documents into structured Markdown in a single pass: 6.5x faster than dots.ocr, 2.7x faster than PaddleOCR. Try production-ready end-to-end OCR now! Zero pipeline complexity, provided to you by @LightOnIO and #vLLM:…



Try building #vllm from source for $META cwm

Giving #ComfyUI-Molmo a try. What it does is simple: image → VLLM → image generation with FLUX.1 dev + Depth. The key piece is the #VLLM model #Molmo-7B-D, reportedly somewhere between GPT-4V and GPT-4o in capability. In a quick test NSFW was fine too. Which is better, this or #JoyCaption? #AI美女 #AIグラビア github.com/CY-CHENYUE/Com…

My testing of #Vllm inside the #Kubernetes cluster. That is my first question to test.

#Grok3 can do #VLLM work too! Not sure how much you can use per day on a #Premiumアカウント, but it might be pretty good!? (lol)

Don't pay for closed source proprietary solutions that you get for free, with plain old open source. Support those who value honesty and transparency. FP8 #vLLM with @AMD #MI300x

Something we're doing differently this time around, we added a #vLLM track to #RaySummit! @vllm_project is one of the most popular inference engines, and is often used together with @raydistributed for scaling LLM inference. Can't wait to hear from these companies about how…
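
A common way the two compose is to shard work across GPU-pinned Ray actors, each owning its own vLLM engine; a minimal sketch (the actor layout and model name are illustrative, not the only way to combine Ray and vLLM):

```python
import ray
from vllm import LLM, SamplingParams


@ray.remote(num_gpus=1)
class VLLMWorker:
    """One vLLM engine pinned to one GPU (Ray sets CUDA_VISIBLE_DEVICES)."""

    def __init__(self, model: str):
        self.llm = LLM(model=model)

    def generate(self, prompts: list[str]) -> list[str]:
        outputs = self.llm.generate(prompts, SamplingParams(max_tokens=128))
        return [o.outputs[0].text for o in outputs]


ray.init()
workers = [VLLMWorker.remote("Qwen/Qwen2.5-7B-Instruct") for _ in range(2)]

prompts = [f"Question {i}: why is batching important?" for i in range(8)]
# Stride the batch across workers and gather the results.
futures = [w.generate.remote(prompts[i::len(workers)]) for i, w in enumerate(workers)]
print(sum(ray.get(futures), []))
```

At larger scale, vLLM can also use Ray directly as its distributed executor for multi-node tensor parallelism.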

Tried #Whisk. So it uses a #VLLM to turn images into prompts (you can also write prompts directly), mixes them prompt-style across subject, scene, and style, and renders the result with #Imagen3. Supports portrait, landscape, and square.

Good news for anyone who never quite got the hang of image-generation AI! Google's latest image-generation AI, "Whisk", has arrived!! Now you can surely create all those images you've been wanting to make!!! Get started here ↓ labs.google/whisk *For some features, using English is recommended.



#Cerebras just pruned 40% of GLM-4.6's 355B parameters, and it still codes like a beast. No custom stack needed: runs in #vLLM banandre.com/blog/2025-10/p…

Chat and image generation using this #VLLM #gemma-3-27b-it-qat-q4_0-gguf, photoreal. The 3rd image came out from just asking "what?"; the 2nd from adding "please have a woman who matches the mood stand in the foreground" to the English prompt (it's 27B, so Japanese is fine); then the 1st. Quite usable ♪ < With caption-style tools you might get the 2nd, but the 1st would have to be added by hand. #AI美女 #AIグラビア

Analyzed an image with the #VLLM #gemma-3-27b-it-qat-q4_0-gguf (on an #RTX3090), then used the resulting prompt to generate an image with #FLUX.1 [dev]. Nearly a perfect match ♪ #LM_Studio #OpenWebUI huggingface.co/google/gemma-3…
