
John T Davies 🇪🇺
@jtdavies
Entrepreneur, CTO in AI & FinTech, investor, father to 3 grown boys, husband to Rachel, astrophysicist, keen photographer, cyclist, über-geek, travelled a lot.
I fed this image (below), from @Prince_Canuma's ongoing work on MLX-VLM to support Qwen3-VL and batch, to @Alibaba_Qwen's Qwen3-VL-30B-a3B-Instruct-4bit (MLX) with the following prompt. For me on an M4 it was... Prompt: 895 tokens, 711.575 tokens-per-sec Generation: 134 tokens,…

Travelling back from a great week at @Devoxx. I love travelling by train, this is the relatively new Frecciarossa service from Paris to Marseille. Working with a full meal service and an office at 320km/h (over 200mph), cheaper than a flight and very civilised!

John Davies giving an important talk on local LLMs. Currently describing how Incept5 has built a banking workflow on Embabel using only local models. @java @jtdavies @incept5

Tuesday at @Devoxx Belgium, @JamesWard, @springrod and I had a workshop on Agentic AI using Embabel. I have my last talk later this afternoon on local private LLMs.

This will be right up @Prince_Canuma’s street for MLX!
Today, we expand our LFM2 family to audio. 👂👄 LFM2-Audio is an end-to-end audio-text omni foundation model, and delivers responsive, real-time conversation on-device at just 1.5B parameters. One model. Seamless multimodal support. No chains. > Speech-to-speech >…
This is very impressive, on the leaderboard with a 15B, 128k context reasoning model. I shall be testing this today, nice work guys!
SLAM Labs presents Apriel-1.5-15B-Thinker 🚀 An open-weights multimodal reasoning model that hits frontier-level performance with just a fraction of the compute.

What are the chances of bumping into an old friend you haven’t seen for 18 years in the hotel lift at midnight? That was a WTF moment last night with @jstrachan. Of course we had to go for a drink!

We’re committing heavily to Embabel, on a global scale.
"We are now using Embabel in a global deployment with one of the world's largest banks" Private LLM Agents on the JVM : Lessons from GOAP with Embabel. Recent talk in Berlin by Pierre Davies and Sasha Saw of Incept5: @kotlin #genai #embabel @jtdavies youtube.com/watch?v=AFS3aY…
This is crazy performance, great work MLX team, just done a git pull and tried it, wow!
MLX 0.28 + Qwen3-Next = 🔥 Look at the speed even with large contexts! This is the future of Local AI: NOW! M4 Max really shines with just 3B active params. Details in 🧵

Well I finally downloaded the new @Alibaba_Qwen Qwen3-Next models but it seems Llama.cpp support is not there yet. Off to bed, hopefully it'll be fixed by the time I wake up 🤞

I'm sure Netflix streams have stopped all over the world as the geeks download this latest masterpiece from the @Alibaba_Qwen team. I am almost there with the 2x160GB downloads. Then off to Llama.cpp to quantise it, probably Q8, then to GGUF and MLX it, ready for my Mac laptop.
🚀 Introducing Qwen3-Next-80B-A3B — the FUTURE of efficient LLMs is here! 🔹 80B params, but only 3B activated per token → 10x cheaper training, 10x faster inference than Qwen3-32B (esp. @ 32K+ context!) 🔹 Hybrid Architecture: Gated DeltaNet + Gated Attention → best of speed &…

This is fantastic research, information like this is going to have a huge impact on local models, thanks Daniel.
DeepSeek V3.1 dynamic @UnslothAI quants on Aider Polyglot benchmarks are here! 1. 3-bit thinking gets 75.6% vs 76.1% un-quantized 2. Leaving attn_k_b in 8-bit gets +2% accuracy vs 4-bit 3. Dynamic quants beat other similar imatrix quants 4. AMA r/LocalLlama today 10AM PST!…

A superb evening with @starbuxman and his lovely partner in San Francisco. Between cocktails we managed to cover Spring AI and Embabel. It seems we’re both speaking at @Devoxx later this year!

Working perfectly on my Mac, albeit slow for the moment but very impressive. CC @Prince_Canuma for MLX.
🚀 Excited to introduce Qwen-Image-Edit! Built on 20B Qwen-Image, it brings precise bilingual text editing (Chinese & English) while preserving style, and supports both semantic and appearance-level editing. ✨ Key Features ✅ Accurate text editing with bilingual support ✅…
Two very enjoyable days in St. Pete, Florida with friends on the weekend. Just landed in San Francisco, back to work: KYB, compliance, cryptography and AI. Well, almost work 😎
I’m heading to the Bay Area! I’ve got a pretty busy start of the week but Thursday evening and Friday are looking good. Hoping to catch up with some colleagues, let me know if you’re around, would love to talk LLMs & AI (over🍺&🍸).

Just arrived in New York. One thing I really don’t miss is the 2+ hour immigration line and an officer with no sense of humour. Now in serious need of a negroni, or perhaps a margarita, no, a whisky sour… Watch this space!