Dev4YM's profile picture. Software engineer, +4 years experience working on enterprise systems. Full-stack across TS, JS, SQL, Python, C# Flutter. Automation, system design, and using AI

@Dev4YM

@Dev4YM

Software engineer, +4 years experience working on enterprise systems. Full-stack across TS, JS, SQL, Python, C# Flutter. Automation, system design, and using AI

LLMs currently regenerate output token-by-token even when the same content already exists in the conversation. Imagine a decoder that can emit references (EMIT_REF(chunk_id)) instead of re-generating identical tokens. Copy-on-write, but for language. Wouldn't that be cool? #LLM


This account does not have any followers
Loading...

Something went wrong.


Something went wrong.