#SmallLLM search results

Home-lab gap: Mac Studio has the bandwidth but no CUDA; DGX Spark has CUDA but not the bandwidth. Both have big unified memory for inference, but 7–14B fine-tuning is still bandwidth-bound. We need Studio-class bandwidth plus CUDA in one box. #SmallLLM #EdgeLLM #LoRA #QLoRA

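A back-of-envelope sketch of why 7–14B fine-tuning is bandwidth-bound (the bandwidth figures are commonly quoted specs for these machines and the traffic model is deliberately simplified, so treat the numbers as illustrative):

```python
# Back-of-envelope floor on fine-tuning step time from memory
# bandwidth alone. Assumes QLoRA-style 4-bit base weights that must
# be streamed through memory once forward and once backward per step;
# bandwidth figures are commonly quoted specs, not measurements.

params = 7e9                      # a 7B base model
bytes_per_weight = 0.5            # 4-bit quantized weights
weight_traffic = params * bytes_per_weight * 2   # bytes per step

for name, bw in [("Mac Studio class (~800 GB/s)", 800e9),
                 ("DGX Spark class (~273 GB/s)", 273e9)]:
    print(f"{name}: >= {weight_traffic / bw * 1e3:.0f} ms/step "
          "from weight reads alone")
```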

💡 What if building an AI assistant didn’t need billions or NDAs — just curiosity, a GPU & $100? Discover how NanoChat reshapes the cost barrier of AI creation 🔍 👉 medium.com/@rogt.x1997/th… #SmallLLM #AITraining #OpenSourceAI


Hugging Face's newest SmolLM3 3B is now live on our WebAI platform, and it now runs locally inside your users' browsers. #SmallLLM


$100 training run. 1.9B parameters. 38B tokens. Four training stages. Control your AI instead of renting it. ⚙️ #SmallLLM #DIYAI #AIOwnership medium.com/p/the-100-shoc…
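
Those headline numbers line up with the Chinchilla rule of thumb of roughly 20 training tokens per parameter, and the usual dense-transformer estimate C ≈ 6ND gives the total compute; a quick check applied to the figures in the post:

```python
# Sanity-check the post's numbers: 1.9B parameters on 38B tokens is
# the Chinchilla-style ~20 tokens per parameter, and C ~= 6*N*D is
# the standard dense-transformer compute approximation.

N = 1.9e9                          # parameters
D = 38e9                           # training tokens

print(f"tokens per parameter: {D / N:.0f}")               # -> 20
print(f"approx training compute: {6 * N * D:.1e} FLOPs")  # -> ~4.3e+20
```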


📽️ Watch this! An AI server running privately on my home network. Not only is this possible (and relatively easy), it's also fast, secure, and incredibly useful! Better than creepy Alexa - cleverer and less eavesdroppy. 😜 #SmallLLM #PrivateAI @OLLAMA
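
For anyone wanting to reproduce the setup, a private Ollama server exposes a plain HTTP API on the local network; a minimal client sketch (the model tag is an example and assumes it has already been pulled with `ollama pull`):

```python
# Minimal client for a local Ollama server (default port 11434).
# The model tag is an example; substitute whatever you have pulled.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llama3.2",        # example: any locally pulled model
        "prompt": "Why run an LLM on your own home network?",
        "stream": False,            # one JSON reply instead of a stream
    },
    timeout=120,
)
print(resp.json()["response"])
```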


Is it possible to separate the reasoning and memory parts out of an LLM? If so, we could make personalized LLMs much more cheaply and affordably. Think of it as a person who doesn't have much memory but is still quite a reasonable guy. #ML #Training #SmallLLM #AI
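
One existing technique in exactly this spirit is retrieval-augmented generation: keep the "memory" in an external store and let a small model do the "reasoning". A toy sketch of the split (the stored facts and the word-overlap scorer are stand-ins for a real vector index and embedding model):

```python
# Toy illustration of splitting "memory" from "reasoning": facts live
# in an external store and are retrieved per query, so the model that
# reasons over them can stay small. The lexical-overlap scorer below
# is a stand-in for a real embedding model and vector index.
from collections import Counter

memory = [
    "TinyLlama 1.1B is roughly a 638 MB download.",
    "BitNet b1.58 uses ternary weights in {-1, 0, +1}.",
]

def overlap(query: str, doc: str) -> int:
    q, d = Counter(query.lower().split()), Counter(doc.lower().split())
    return sum((q & d).values())

query = "How large is the TinyLlama 1.1B download?"
context = max(memory, key=lambda doc: overlap(query, doc))

# The retrieved fact plus the question is what a small "reasoning"
# model would actually see:
print(f"Context: {context}\nQuestion: {query}\nAnswer:")
```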


A fine-tuned small LLM delivers faster, more accurate, and cost-effective user assistance—tailored to your product’s needs. 🚀 zurl.co/c8fUN #AIAssistant #LLM #SmallLLM #FineTuning #UserAssistance #TechInnovation #DigitalTransformation #AI #MachineLearning

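One common way to make that kind of fine-tuning cheap is LoRA; a minimal sketch with Hugging Face `transformers` and `peft` (the base checkpoint and hyperparameters are illustrative choices, not what the original post uses):

```python
# Hedged sketch: attach LoRA adapters to a small causal LM so that
# product-specific fine-tuning only trains a tiny fraction of the
# weights. Checkpoint and hyperparameters are illustrative.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = "HuggingFaceTB/SmolLM-360M"          # any small causal LM
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

config = LoraConfig(
    r=16,                                   # adapter rank
    lora_alpha=32,
    target_modules=["q_proj", "v_proj"],    # attention projections
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, config)
model.print_trainable_parameters()          # well under 1% trainable
```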

SmolLM is offering a remarkably small AI at 360 million parameters (the standard is billions). "Small models can achieve impressive results?" It's right 30% of the time, every time. huggingface.co/spaces/Hugging… #SmolLM #SmallLLM #SLM #SmallLanguageModel Info: huggingface.co/blog/smollm
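
Trying the 360M checkpoint locally takes a few lines with the `transformers` pipeline API (a minimal sketch; the checkpoint name is the one published on the Hugging Face Hub):

```python
# Run the 360M-parameter SmolLM locally; small enough for CPU.
from transformers import pipeline

generate = pipeline("text-generation", model="HuggingFaceTB/SmolLM-360M")
out = generate("Small models can achieve impressive results because",
               max_new_tokens=40)
print(out[0]["generated_text"])
```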


I am hopeful about this new ChatGPT AI angle for LLMs, but I can tell you that I successfully ran TinyLlama 1.1B, which is only a 638-megabyte download, on a Raspberry Pi 5 at quite a fast speed. #SmallLLM #SLM #TinyLlama #1BitLLM #1BitAI #TinyAI #TinyLLM github.com/jzhang38/TinyL…
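
A sketch of a comparable setup through `llama-cpp-python`, the Python binding for the llama.cpp engine that makes Pi-class inference practical (the GGUF filename is an example, not a specific release; a 4-bit quant of TinyLlama 1.1B lands in roughly the size range the post mentions):

```python
# Hedged sketch: run a quantized TinyLlama GGUF with llama-cpp-python.
# The model path is an example filename, not a specific release.
from llama_cpp import Llama

llm = Llama(model_path="tinyllama-1.1b-chat.Q4_K_M.gguf", n_ctx=2048)
out = llm("Q: Why do small models suit edge devices?\nA:", max_tokens=64)
print(out["choices"][0]["text"])
```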

✨ Microsoft's 1-bit era paper (released in Feb) is really a masterpiece. BitNet b1.58 70B was 4.1 times faster and delivered 8.9 times higher throughput than the corresponding FP16 LLaMA. 📌 It requires almost no multiplication operations for matrix multiplication and can be highly…

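The "almost no multiplications" point follows directly from the ternary weights: with every weight in {-1, 0, +1}, a matrix-vector product collapses into additions and subtractions of activations. An illustrative check:

```python
# With ternary weights in {-1, 0, +1}, matmul reduces to adding the
# activations where w = +1 and subtracting them where w = -1.
import numpy as np

rng = np.random.default_rng(0)
W = rng.integers(-1, 2, size=(4, 8))      # ternary weight matrix
x = rng.standard_normal(8)                # activations

y = np.array([x[row == 1].sum() - x[row == -1].sum() for row in W])

assert np.allclose(y, W @ x)              # matches the ordinary matmul
print(y)
```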

