
mconcat

@monoidconcat

Modernist

mconcat reposted

🫡 new paper: neurons can be a sparse and interpretable basis for circuit tracing, once you make the right decisions about which neurons and how you circuit trace! i'm excited for how this affects future progress on circuits + automating interp

Is your LM secretly an SAE? Most circuit-finding interpretability methods use learned features rather than raw activations, based on the belief that neurons do not cleanly decompose computation. In our new work, we show MLP neurons actually do support sparse, faithful circuits!
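A common primitive behind neuron-level circuit tracing is first-order attribution: score each MLP neuron by activation * gradient of some behavioral metric, then keep only the top scorers as a sparse candidate circuit. A minimal PyTorch sketch of that scoring step (the hook plumbing and the metric are assumptions for illustration, not details from the paper):

```python
import torch

def neuron_attributions(acts: torch.Tensor, grads: torch.Tensor, k: int = 50):
    """Score MLP neurons for circuit tracing via first-order attribution.

    acts:  (batch, seq, d_mlp) hidden activations of one MLP layer,
           captured with a forward hook.
    grads: gradient of a scalar metric (e.g. a logit difference) with
           respect to those activations, captured with a backward hook.
    activation * gradient approximates the metric change from ablating
    a neuron; the top-k scorers form a sparse candidate circuit.
    """
    scores = (acts * grads).sum(dim=(0, 1))      # (d_mlp,) per-neuron score
    top = scores.abs().topk(k)
    return top.indices, scores[top.indices]
```

Whether raw neurons scored this way yield circuits as faithful as learned SAE features is exactly the question the paper takes on.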


mconcat reposted

We present Olmo 3, our next family of fully open, leading language models. This family of 7B and 32B models represents:

1. The best 32B base model.
2. The best 7B Western thinking & instruct models.
3. The first 32B (or larger) fully open reasoning model.

This is a big…


mconcat reposted

New video: What happened with sparse autoencoders? SAEs were a big craze in mech interp, then suddenly weren't. In this talk I tell the story of SAEs as I experienced it, reflect on the mistakes I made and how I think about them now, cover the ways they're over AND under hyped, and lay out next steps.


mconcat reposted

RL LEARNING WITH LORA: A DIVERSE DEEP DIVE


mconcat reposted

update: wrote a triton kernel for this
- has correct tiled layout for scale factors
- uses inline ptx for conversion
- 4x faster than torch compiled version

triton is absolutely amazing for writing memory bound kernels tbh

simple NVFP4 quantization within 100 lines of pytorch
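For intuition, NVFP4 packs values as FP4 (E2M1, eight magnitudes up to 6.0) with a scale factor per small block. A fake-quantization sketch in PyTorch, under the simplification that block scales stay in fp32 (the real format uses FP8 E4M3 block scales plus one per-tensor fp32 scale):

```python
import torch

# the eight non-negative E2M1 (FP4) magnitudes; max representable |x| is 6.0
FP4_GRID = torch.tensor([0.0, 0.5, 1.0, 1.5, 2.0, 3.0, 4.0, 6.0])

def fake_quantize_nvfp4(x: torch.Tensor, block: int = 16):
    """NVFP4-style fake quantization: scale each block into [-6, 6],
    snap to the E2M1 grid, rescale. x.numel() must be divisible by block."""
    grid = FP4_GRID.to(x.device)
    xb = x.reshape(-1, block)                                # (n_blocks, block)
    scale = (xb.abs().amax(dim=1, keepdim=True) / 6.0).clamp(min=1e-12)
    xs = xb / scale
    idx = (xs.abs().unsqueeze(-1) - grid).abs().argmin(dim=-1)
    q = grid[idx] * xs.sign()                                # nearest FP4 value
    return (q * scale).reshape(x.shape), scale

# usage: xq, scales = fake_quantize_nvfp4(torch.randn(4, 64))
```

Each output element needs only a couple of loads (the value plus its block scale), which is why kernels like this end up bandwidth-bound rather than compute-bound.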



I like winter

mconcat reposted

try 4.6 very soon

GLM 4.5 Air running on 4x 3090 at 28 tokens/sec. Pipeline parallelism over PCIe (no NVLink).
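For reference, layer-wise pipeline parallelism has a simple shape: shard the transformer blocks into one stage per GPU and pass activations device to device. A naive PyTorch sketch (illustrative only; a real inference server also micro-batches so stages aren't idle waiting on each other):

```python
import torch
import torch.nn as nn

class PipelinedStack(nn.Module):
    """Naive pipeline parallelism: split a stack of blocks across GPUs;
    each forward hop copies activations over PCIe between stages."""
    def __init__(self, blocks: list[nn.Module], devices: list[str]):
        super().__init__()
        per_stage = (len(blocks) + len(devices) - 1) // len(devices)
        self.devices = devices
        self.stages = nn.ModuleList(
            nn.Sequential(*blocks[i * per_stage:(i + 1) * per_stage]).to(dev)
            for i, dev in enumerate(devices)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        for stage, dev in zip(self.stages, self.devices):
            x = stage(x.to(dev))    # PCIe copy, then compute on that GPU
        return x

# usage: model = PipelinedStack(blocks, ["cuda:0", "cuda:1", "cuda:2", "cuda:3"])
```

Only one activation tensor crosses PCIe per stage per batch, which is why the missing NVLink costs so little for this workload.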



We need a transcoder trained for GLM 4.5 Air, one of the best local models. It would be super practical to have a skip transcoder and be able to steer individual circuits.
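For context, a skip transcoder approximates an MLP layer's input-to-output map through a sparse feature bottleneck plus a linear skip path, which is what makes per-feature steering practical. A minimal sketch (dimensions and the L1 coefficient are illustrative, not a recipe tuned for GLM 4.5 Air):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class SkipTranscoder(nn.Module):
    """Skip transcoder sketch: predict an MLP layer's output from its
    input via sparse ReLU features plus an affine skip connection."""
    def __init__(self, d_model: int, n_features: int):
        super().__init__()
        self.encoder = nn.Linear(d_model, n_features)
        self.decoder = nn.Linear(n_features, d_model)
        self.skip = nn.Linear(d_model, d_model, bias=False)

    def forward(self, x: torch.Tensor):
        feats = F.relu(self.encoder(x))          # sparse feature activations
        return self.decoder(feats) + self.skip(x), feats

def transcoder_loss(tc, mlp_in, mlp_out, l1: float = 1e-3):
    """Regress onto the real MLP's outputs with a sparsity penalty."""
    pred, feats = tc(mlp_in)
    return F.mse_loss(pred, mlp_out) + l1 * feats.abs().sum(-1).mean()
```

Steering then amounts to clamping or scaling chosen feature activations before the decode step.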


GLM 4.5 Air running at ~60 tok/sec on 4x 3090!

3090s are still great cards to buy if you want to run inference with 100B models, locally, for your own use.

x.com/monoidconcat/s…


Upgraded my setup to 1x 5090 + 4x 3090, going to test run GLM 4.5 Air on this.

Considering adding another 5090 in the near future (whenever I can get one).



mconcat reposted

Due to popular demand, we've released more REAP models! @Zai_org
➡️ GLM4.6-FP8 REAP@25%
➡️ GLM4.6-FP8 REAP@30%
➡️ GLM4.6-FP8 REAP@40%

REAP is a one-shot pruning technique developed and open sourced by Cerebras. It compresses MoEs by up to 50% with minimal loss in coding ability.…
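The post doesn't spell out the saliency criterion, so here is a generic one-shot expert-pruning step for flavor (an assumed criterion, not the published REAP one): rank experts by router-weighted usage on calibration data, keep the top fraction, then slice the expert weights and router accordingly.

```python
import torch

def select_experts_to_keep(router_logits: torch.Tensor, keep_frac: float = 0.6):
    """One-shot MoE expert pruning sketch with an illustrative criterion:
    rank experts by total router probability mass over a calibration set.

    router_logits: (n_tokens, n_experts) collected from calibration runs
    through one MoE layer. Returns sorted indices of experts to keep;
    expert weights and router rows are then sliced to these indices.
    """
    probs = router_logits.softmax(dim=-1)        # per-token routing weights
    saliency = probs.sum(dim=0)                  # (n_experts,) usage mass
    n_keep = max(1, int(keep_frac * saliency.numel()))
    return saliency.topk(n_keep).indices.sort().values

# assuming REAP@40% means 40% of experts pruned, keep_frac = 0.6 per layer
```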



mconcat reposted

My notes on nanochat, including links to the training data it uses simonwillison.net/2025/Oct/13/na…


Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,…



mconcat reposted

nanochat day 1: sharing everything we have:

- we've got an org on the hub to share resources and discuss learning
- we've trained a tokenizer and published it on the hub
- integrated base training with trackio for free logging. curves!

if you're also working on this, join the…


