#exllamav2 search results
Comfy now has a chat AI built in 🤭 ComfyUI ExLlama Nodes github.com/Zuellni/ComfyU… You can use #ExLlamaV2 inside #ComfyUI, and it writes prompts for you as you chat with it lol x.com/toyxyz3/status…
Check out ExLlamaV2, the fastest library to run LLMs. #AI #MachineLearning #ExLlamaV2 towardsdatascience.com/exllamav2-the-…
towardsdatascience.com
ExLlamaV2: The Fastest Library to Run LLMs | Towards Data Science
Quantize and run EXL2 models
In the top menu, to the right of "Select a model", there is a gear icon that opens the Settings modal. Select Connections; it has an OpenAI API section. Add your tabbyAPI's http://ip:port/v1 URL and your API key. That's it. #exllamav2 #exl2 #llm #localLlama
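The same connection can be sanity-checked from code before wiring it into the UI. A minimal sketch, assuming a local tabbyAPI instance and the official openai Python client; the host, port, key, and model name below are placeholders, not values from the post:

```python
# Hypothetical check that a tabbyAPI OpenAI-compatible endpoint is reachable.
from openai import OpenAI

client = OpenAI(
    base_url="http://127.0.0.1:5000/v1",  # the same http://ip:port/v1 entered under Connections
    api_key="your-tabbyapi-key",          # the API key tabbyAPI generated at startup
)

# List whatever model tabbyAPI currently has loaded.
for model in client.models.list():
    print(model.id)

# Send a short chat completion through the OpenAI-compatible endpoint.
resp = client.chat.completions.create(
    model="local-model",  # placeholder; tabbyAPI serves the model it has loaded
    messages=[{"role": "user", "content": "Say hello in five words."}],
    max_tokens=32,
)
print(resp.choices[0].message.content)
```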
#exllamav2 #Python A fast inference library for running LLMs locally on modern consumer-class GPUs gtrending.top/content/3391/
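For context, a minimal local-inference sketch with the ExLlamaV2 Python API, following the pattern in the repository's example scripts; the model directory is a placeholder and exact class names can vary between versions:

```python
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache, ExLlamaV2Tokenizer
from exllamav2.generator import ExLlamaV2BaseGenerator, ExLlamaV2Sampler

config = ExLlamaV2Config()
config.model_dir = "/models/Mistral-7B-exl2"  # placeholder path to a quantized model
config.prepare()

model = ExLlamaV2(config)
cache = ExLlamaV2Cache(model, lazy=True)  # defer allocation so autosplit can size it
model.load_autosplit(cache)               # spread weights across available GPUs

tokenizer = ExLlamaV2Tokenizer(config)
generator = ExLlamaV2BaseGenerator(model, cache, tokenizer)

settings = ExLlamaV2Sampler.Settings()
settings.temperature = 0.8
settings.top_p = 0.9

print(generator.generate_simple("ExLlamaV2 is", settings, 128))
```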
The #EXL2 #quantization format introduced in #ExLlamaV2 supports 2- to 8-bit precision and can mix precisions within a model, giving smaller model files with minimal perplexity loss and high performance on consumer GPUs. Find EXL2 models at llm.extractum.io/list/?exl2 #MachineLearning #EXL2 #LLMs
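A back-of-envelope calculation shows why the adjustable bits-per-weight matters for consumer GPUs; the parameter count and target bpw below are illustrative, not taken from the post:

```python
# Rough weight-size estimate for an EXL2 quantization target.
params = 7.24e9          # e.g. a Mistral-7B-class model (assumed)
bits_per_weight = 5.0    # EXL2 average bpw, anywhere in the 2.0-8.0 range
approx_gib = params * bits_per_weight / 8 / 2**30
print(f"~{approx_gib:.1f} GiB of weights at {bits_per_weight} bpw")  # ~4.2 GiB
```

At lower averages (e.g. around 3 bpw) the same model shrinks enough to leave room for the KV cache on a single 24 GB card.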
Exllama v2 now on @huggingface spaces by the awesome @turboderp_ huggingface.co/spaces/pabloce… #exllamav2 #exllama #opensource #communitybuilding
huggingface.co
Exllama - a Hugging Face Space by pabloce
If you happen to have a total of 64gb of VRAM at your disposal #exl2 #exllamav2 #GenerativeAI #mixtral huggingface.co/machinez/zephy…
#ExllamaV2 is currently the fastest inference framework for the Mixtral 8x7B MoE. It is so good. It can run Mixtral in 4-bit GPTQ across a 24 GB + 8 GB GPU pair, or in 3-bit on a single 24 GB GPU, and its automatic VRAM-split loading is amazing. github.com/turboderp/exll…
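The "auto VRAM split" the post praises refers to how the loader places layers across GPUs. A small sketch of the two loading styles, using placeholder paths and split sizes and the class names from the repository's examples:

```python
# Two ways ExLlamaV2 can place a large model (e.g. Mixtral 8x7B) across GPUs.
from exllamav2 import ExLlamaV2, ExLlamaV2Config, ExLlamaV2Cache

config = ExLlamaV2Config()
config.model_dir = "/models/Mixtral-8x7B-exl2"  # placeholder
config.prepare()

model = ExLlamaV2(config)

# Option A: manual split, roughly GB of weights per visible GPU (24 GB + 8 GB here).
# model.load(gpu_split=[21, 7])

# Option B: automatic split -- fill each GPU in turn, with a lazily allocated
# cache so it ends up alongside the final layers.
cache = ExLlamaV2Cache(model, lazy=True)
model.load_autosplit(cache)
```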