
Ramin Hasani
@ramin_m_h
building @LiquidAI_
Building a foundation model is an art! It involves many complex dimensions and stages, from architecture, data, pre-training, and post-training to inference. Getting it all right requires masterful and tasteful execution. There are very few teams around the world that can make…
Check out this fun work that explores fine-tuning small LFM2 and Qwen3 models for French. arxiv.org/abs/2510.05846
📚 Efficient Language Specialization for Small Language Models
@maxencelsb and @SinoueG have released a preprint about their excellent work on fine-tuning small models in French. It shows a solid post-training pipeline to improve French performance while preserving English…



new LFM for Japanese PII data extraction, on par with GPT-5! 💪🏽 enjoy
We have a new nano LFM that is on par with GPT-5 on data extraction, with only 350M parameters. Introducing LFM2-350M-PII-Extract-JP 🇯🇵 It extracts personally identifiable information (PII) from Japanese text → returns structured JSON for on-device masking of sensitive data. Delivers…
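For context, a minimal sketch of how one might run this kind of extractor through Hugging Face transformers. The repo id LiquidAI/LFM2-350M-PII-Extract-JP, the chat-template prompt, and the sample text are assumptions based on the announcement, not a verified recipe; check the model card for the exact usage.

```python
# Hedged sketch: PII extraction with a small LFM via transformers.
# Repo id and prompt format are assumptions from the announcement.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "LiquidAI/LFM2-350M-PII-Extract-JP"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

text = "山田太郎さんの電話番号は090-1234-5678です。"  # sample Japanese text containing PII
messages = [{"role": "user", "content": text}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)

# The model is described as returning structured JSON with the extracted PII,
# which downstream code can parse for on-device masking.
output = model.generate(inputs, max_new_tokens=256)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```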

LFMs fly on iPhone! you can try a wide range of LFMs on Apollo: apps.apple.com/us/app/apollo-…
Apple wasn’t kidding, the iPhone 17 Pro is really built for running LLMs
Here’s LFM2 8B A1B by @LiquidAI_ running on-device with MLX in @LocallyAIApp, the iPhone runs the 8B model with zero struggle
Thanks @Prince_Canuma for the port to MLX, it made the MLX Swift port possible
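For anyone who wants to try this outside the app, a minimal sketch using the mlx-lm Python package on Apple Silicon. The repo id is an assumption; in practice an MLX-converted or quantized variant of LFM2-8B-A1B may be required, so treat this as the general shape of the call rather than an exact command.

```python
# Hedged sketch: on-device generation with mlx-lm on an Apple Silicon Mac.
# The model id is assumed; an MLX/quantized conversion may be needed.
from mlx_lm import load, generate

model, tokenizer = load("LiquidAI/LFM2-8B-A1B")  # assumed repo id
text = generate(
    model,
    tokenizer,
    prompt="Write a haiku about on-device AI.",
    max_tokens=64,
)
print(text)
```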
Day 1 of the @LiquidAI_ fine-tuning hackathon in Tokyo this weekend. Jointly organized with @weights_biases and @LambdaAPI
Liquid AI Releases LFM2-8B-A1B: An On-Device Mixture-of-Experts with 8.3B Params and 1.5B Active Params per Token
How much capability can a sparse 8.3B-parameter MoE with a ~1.5B active path deliver on your phone without blowing latency or memory? Liquid AI has released…
Btw, it should get faster on the next version of MLX Swift. We made some improvements to 1D grouped convs that will speed up this model nicely.
Hello everyone! Let me (re)introduce myself!
People should give @LiquidAI_ models a try with Spectrum. You can SFT/RLFT your models with a VERY LOW memory footprint, without having to do LoRA or QLoRA... This beautiful approach prevents a lot of catastrophic forgetting. LFM models work out of the box.
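A minimal sketch of the Spectrum idea, assuming a simple name-based layer selection: freeze everything, unfreeze a small targeted subset, then run ordinary SFT. The repo id and the layer patterns below are illustrative only; the actual Spectrum tooling chooses layers from a signal-to-noise analysis of the pretrained weights.

```python
# Hedged sketch of Spectrum-style selective fine-tuning: train only a targeted
# subset of layers in full precision so optimizer state stays small.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("LiquidAI/LFM2-1.2B")  # assumed repo id

# Illustrative subset; NOT Spectrum's actual SNR-based selection.
targeted = ("layers.10.", "layers.11.", "lm_head")

for name, param in model.named_parameters():
    param.requires_grad = any(tag in name for tag in targeted)

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
print(f"training {trainable / total:.1%} of parameters")

# From here the model plugs into a standard SFT loop (e.g. TRL's SFTTrainer);
# only the unfrozen slice carries gradients and optimizer state.
```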

I added LFM2 8B A1B in @LocallyAIApp for iPhone 17 Pro and iPhone Air
The first mixture-of-experts model by @LiquidAI_: 8B total parameters (1B active), performance similar to 3-4B models but the speed of a 1B model
Runs great on the 17 Pro with Apple MLX

We just released LFM2-8B-A1B, a small MoE optimized for latency-sensitive applications on-device. Larger-model quality with the speed of a 1.5B-class model.
Huggingface: huggingface.co/LiquidAI/LFM2-…
Blog: liquid.ai/blog/lfm2-8b-a…
Meet LFM2-8B-A1B, our first on-device Mixture-of-Experts (MoE)! 🐘
> LFM2-8B-A1B is the best on-device MoE in terms of both quality and speed.
> Performance of a 3B-4B model class, with up to 5x faster inference profile on CPUs and GPUs.
> Quantized variants fit comfortably on…

LFM2-8B-A1B just dropped on @huggingface! 8.3B params with only 1.5B active/token 🚀
> Quality ≈ 3–4B dense, yet faster than Qwen3-1.7B
> MoE designed to run on phones/laptops (llama.cpp / vLLM)
> Pre-trained on 12T tokens → strong math/code/IF
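Since the post calls out llama.cpp / vLLM support, here is a minimal vLLM sketch, assuming the Hugging Face repo id is LiquidAI/LFM2-8B-A1B; check the model card for the exact name and any minimum vLLM version.

```python
# Hedged sketch: serving LFM2-8B-A1B with vLLM. Repo id assumed from the post.
from vllm import LLM, SamplingParams

llm = LLM(model="LiquidAI/LFM2-8B-A1B")  # assumed Hugging Face repo id
params = SamplingParams(temperature=0.7, max_tokens=128)

outputs = llm.generate(["Explain what a sparse mixture-of-experts model is."], params)
print(outputs[0].outputs[0].text)
```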
Small MoEs are on the rise. @LiquidAI_ drops LFM2-8B-A1B.
Enjoy our even better on-device model! 🐘 Running on @amd AI PCs with the fastest inference profile!

Meet LFM2-8B-A1B by @LiquidAI_
- 8B total and 1B active params 🐘
- 5x faster on CPUs and GPUs ⚡️
- Perfect for fast, private, edge 📱/💻/🚗/🤖



LFM2-8B-A1B
Liquid AI’s first on-device MoE, with 8.3B total parameters and 1.5B active per token. It matches 3–4B dense model quality while running faster than Qwen3-1.7B.
Architecture
- 18 gated short-conv blocks, 6 GQA blocks (LFM2 backbone)
- Sparse MoE feed-forward layers…
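To make the "sparse MoE feed-forward" point concrete, a minimal top-k routed MoE layer sketch: the router picks a few experts per token and only those experts run, which is why the active parameter count stays near 1.5B while the total is 8.3B. The sizes and top_k below are illustrative, not the LFM2-8B-A1B configuration.

```python
# Minimal sketch of a top-k routed MoE feed-forward layer (illustrative sizes,
# NOT the LFM2-8B-A1B configuration).
import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMoE(nn.Module):
    def __init__(self, d_model=512, d_ff=1024, num_experts=8, top_k=2):
        super().__init__()
        self.top_k = top_k
        self.router = nn.Linear(d_model, num_experts)
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_ff), nn.SiLU(), nn.Linear(d_ff, d_model))
            for _ in range(num_experts)
        ])

    def forward(self, x):                          # x: (tokens, d_model)
        scores = self.router(x)                    # (tokens, num_experts)
        weights, idx = scores.topk(self.top_k, dim=-1)
        weights = F.softmax(weights, dim=-1)
        out = torch.zeros_like(x)
        for slot in range(self.top_k):             # only the selected experts run
            for e in range(len(self.experts)):
                mask = idx[:, slot] == e
                if mask.any():
                    w = weights[mask, slot].unsqueeze(-1)
                    out[mask] += w * self.experts[e](x[mask])
        return out

moe = TopKMoE()
tokens = torch.randn(4, 512)
print(moe(tokens).shape)  # torch.Size([4, 512])
```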

LFM2-Audio-1.5B
Liquid AI’s first end-to-end audio foundation model, built for real-time conversation at only 1.5B parameters. Competitive with much larger models, it unifies speech and text without separate ASR or TTS.
Architecture
- LFM2 multimodal backbone
- FastConformer…


