@Dev4YM

Software engineer, +4 years experience working on enterprise systems. Full-stack across TS, JS, SQL, Python, C# Flutter. Automation, system design, and using AI

Joined February 2024

12Posts 0Followers 7Following

@Dev4YM

Dec 28

LLMs currently regenerate output token-by-token even when the same content already exists in the conversation. Imagine a decoder that can emit references (EMIT_REF(chunk_id)) instead of re-generating identical tokens. Copy-on-write, but for language. Wouldn't that be cool? #LLM