#sparsemodels search results

Smarter routing, not bigger models. Elegant work by Apple’s researchers — important progress in scalable AI design. 👏 #Apple #AppleAI #SparseModels #AIArchitecture


5/5 What aspects of trillion-param MoE deployment interest you most? Memory offloading strategies? Dynamic routing budgets? Hierarchical expert organization? Drop your thoughts below 👇 #MoE #LLMs #SparseModels #AIResearch
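
For anyone digging into the thread's question, here is a minimal sketch of top-k expert routing, the mechanism behind "dynamic routing budgets" (an assumed generic setup, not any particular trillion-parameter system's code): each token activates only k of E experts, so active compute stays small even as total parameter count grows.

```python
import torch

# Minimal top-k MoE routing sketch (assumed generic setup, not any
# specific system's implementation). Each token activates only k of
# E experts, so compute stays sparse as parameters grow.
E, k, d = 8, 2, 16                        # experts, experts per token, hidden dim
router = torch.nn.Linear(d, E)            # learned gating network
experts = torch.nn.ModuleList(torch.nn.Linear(d, d) for _ in range(E))

x = torch.randn(4, d)                     # a batch of 4 token embeddings
scores, idx = router(x).topk(k, dim=-1)   # choose k experts per token
gates = scores.softmax(dim=-1)            # normalise over the chosen experts

out = torch.zeros_like(x)
for t in range(x.size(0)):                # naive per-token dispatch
    for j in range(k):
        out[t] += gates[t, j] * experts[int(idx[t, j])](x[t])
print(out.shape)                          # torch.Size([4, 16])
```

Real deployments replace the per-token loop with batched dispatch and add capacity limits per expert, which is where the memory-offloading and routing-budget questions above come in.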


#AI & #MachineLearning need to converge. Check out this new theory that could make that possible! 🤔 #SparseModels forbes.com/sites/johnwern…


Neural Magic announced the Sparse Llama 3.1 8B language model, smaller and more efficient than its predecessor. The new model aims to make AI technology accessible to everyone, since it can run on less powerful hardware. #AI #MachineLearning #SparseModels #NeuralMagic #Llama_3_1_8B marktechpost.com/2024/11/25/neu…

marktechpost.com

Neural Magic Releases 2:4 Sparse Llama 3.1 8B: Smaller Models for Efficient GPU Inference
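
Context for the "2:4" in that headline, as a hedged sketch: in 2:4 structured sparsity, every aligned group of four weights keeps at most two nonzeros, a pattern modern GPU tensor cores can exploit for faster inference. The `prune_2_4` helper below is hypothetical, not Neural Magic's actual pipeline:

```python
import numpy as np

def prune_2_4(w: np.ndarray) -> np.ndarray:
    """Keep the 2 largest-magnitude weights in each aligned group of 4, zero the rest."""
    groups = w.reshape(-1, 4)
    # Indices of the 2 smallest-magnitude entries per group of 4
    drop = np.argsort(np.abs(groups), axis=1)[:, :2]
    pruned = groups.copy()
    np.put_along_axis(pruned, drop, 0.0, axis=1)
    return pruned.reshape(w.shape)

w = np.random.randn(2, 8)
print(prune_2_4(w))   # every aligned group of 4 now has exactly 2 zeros
```

Magnitude-based pruning like this is only the simplest heuristic; real releases typically recover accuracy with additional retraining after imposing the pattern.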


@Stanford H2O.ai advisors, Trevor Hastie & Rob Tibshirani, are holding a 2-day course in #MachineLearning #DeepLearning #SparseModels.


Tech terms decoded! 🛠️ Attention techies, it’s time for #TermOfTheDay. Today, we are learning about: Sparse Models! ⚡ #TechTerms #SparseModels #AI #MachineLearning #DeepLearning #TechEducation


@sarahookr, @KaliTessera, and Benjamin Rosman take a broader view of training #sparsenetworks and consider the role of regularization, optimization, and architecture choices in #sparsemodels. They propose a simple experimental framework, #SameCapacitySparse vs #DenseComparison.

Tomorrow at @ml_collective DLTC reading group, @KaliTessera will be presenting our work on how initialization is only one piece of the puzzle for training sparse networks. Can taking a wider view of model design choices unlock sparse training? bit.ly/3xFtHKI



Jonathan Schwarz et al. introduce #Powerpropagation, a new weight-parameterisation for #neuralnetworks that leads to inherently #sparsemodels. Exploiting the behavior of gradient descent, their method gives rise to weight updates exhibiting a "rich get richer" dynamic.

Powerpropagation: A sparsity inducing weight reparameterisation pdf: arxiv.org/pdf/2110.00296… abs: arxiv.org/abs/2110.00296 a new weight-parameterisation for neural networks that leads to inherently sparse models

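
A rough sketch of the mechanism, assuming the paper's reparameterisation w = v·|v|^(α−1) with α ≥ 1 (the exact form is in the linked arXiv abstract): the chain rule scales each update to v by α·|v|^(α−1), so small-magnitude weights stall near zero while large ones keep growing, the "rich get richer" dynamic that yields inherently sparse models.

```python
import torch

# Sketch of the Powerpropagation reparameterisation (assuming the form
# w = v * |v|**(alpha - 1), alpha >= 1). The chain rule multiplies every
# gradient to v by alpha * |v|**(alpha - 1), so already-small weights
# receive ever smaller updates: "rich get richer".
alpha = 2.0
v = torch.randn(6, requires_grad=True)   # underlying trainable parameters
x = torch.randn(6)

w = v * v.abs() ** (alpha - 1)           # effective weights in the forward pass
loss = (w @ x) ** 2                      # toy scalar loss
loss.backward()

# grad wrt v = (grad wrt w) * alpha * |v|**(alpha - 1): magnitude-dependent
print(v.grad)
print(alpha * v.abs() ** (alpha - 1))    # the per-weight scaling factor
```

After training, the many weights parked near zero can be pruned, which is what makes the resulting models "inherently sparse".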

