dejavucoder's profile picture. llm and shitposting
into crafting ai products and evals
dm open to talk on ai engg/post-training/llm stuff

sankalp

@dejavucoder

llm and shitposting into crafting ai products and evals dm open to talk on ai engg/post-training/llm stuff

고정된 트윗

you can read my latest blogpost: my experience with claude code after 2 weeks of adventure now - some lore why i started using it - it's several features - my current workflow - must know commands almost like a beginner guide or log if you will sankalp.bearblog.dev/my-claude-code…

dejavucoder's tweet image. you can read my latest blogpost: my experience with claude code after 2 weeks of adventure now

- some lore why i started using it
- it's several features
- my current workflow
- must know commands

almost like a beginner guide or log if you will

sankalp.bearblog.dev/my-claude-code…

was in e/xperiments discord office hours with @tokenbender recently and he was advocating that we should seed the models with our own raw thoughts and build upon it - LLMs as augmentators. instead of outsourcing thinking and idea gen, be steve jobs and let llm be the wozniak.

steve jobs was the prompter, and woz, his llm the bitter engineers coming out of the woodwork to criticize llms are coming to the realization that no matter how good they were, they were ultimately replaceable this whole time but no tech is coming to replace jobs



i swear this feature is playing with me. i never see will brown's post. 50% of the times i would see the 2nd and 3rd person's posts.

dejavucoder's tweet image. i swear this feature is playing with me. i never see will brown's post. 50% of the times i would see the 2nd and 3rd person's posts.

1 day in normie world = 7 days ~ 1 week in ai world


people are still figuring out things about codex sydney sweeney and me have been saying since 1 month now...

dejavucoder's tweet image. people are still figuring out things about codex sydney sweeney and me have been saying since 1 month now...
dejavucoder's tweet image. people are still figuring out things about codex sydney sweeney and me have been saying since 1 month now...

how claude sonnet 4.5 feels while fucking around in my prod codebase

Joonas Virtanen님으로부터

gotta keep your mind open to learn faster


my buddy lost his job to AI. his job was to be absolutely right .


model be like:

dejavucoder's tweet image. model be like:

We're doing all kinds of stuff with these models that the public isn't even thinking of yet.



sankalp 님이 재게시함

one of the most interesting evals or attempts to test shape rotating powers of a language model i have seen in a while. so simple yet challenging the models in latent space.

can your llm rotate a shape inside it's head? i found out yes but it's a fucking idiot when it comes to the upper layer... why? non uniform spatial reasoning.... here's an eval to test the internal latent reasoning of your models.

OccupyingM's tweet image. can your llm rotate a shape inside it's head? 

i found out yes but it's a fucking idiot when it comes to the upper layer...

why? non uniform spatial reasoning....

here's an eval to test the internal latent reasoning of your models.


love karpathy sensei's koding koding koding energy

dejavucoder's tweet image. love karpathy sensei's koding koding koding energy

Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,…

karpathy's tweet image. Excited to release new repo: nanochat!
(it's among the most unhinged I've written).

Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,…


read this thread an year ago but i didn't fully understand it today lol. cursor codebase indexing is clever in the sense they separate the semantic search (cloud) from actual code access (done locally). they store your codebase embeddings on cloud with reference meta-data. they…

dejavucoder's tweet image. read this thread an year ago but i didn't fully understand it today lol. cursor codebase indexing is clever in the sense they separate the semantic search (cloud) from actual code access (done locally). they store your codebase embeddings on cloud with reference meta-data. they…
dejavucoder's tweet image. read this thread an year ago but i didn't fully understand it today lol. cursor codebase indexing is clever in the sense they separate the semantic search (cloud) from actual code access (done locally). they store your codebase embeddings on cloud with reference meta-data. they…

An underrated part of Cursor is our codebase indexing system. It provides efficient indexing/updating without storing any code on our servers. (1/9)



sankalp 님이 재게시함

Introducing nanosft, a clean single file implementation of finetuning for chat style model. Loads gpt2-124M weights on nanogpt and does supervised finetuning using just pytorch. a side project that I made recently for some prep. link below :) qts/rts appericiated

ChinmayKak's tweet image. Introducing nanosft, a clean single file implementation of finetuning for chat style model. Loads gpt2-124M weights on nanogpt and does supervised finetuning using just pytorch. 
a side project that I made recently for some prep. link below :) 
qts/rts appericiated

timeline cleanse

dejavucoder's tweet image. timeline cleanse

the 'you're absolutely right' problem is similar or worse with sonnet 4.5 and more than annoyance, i find it hard to trust the output when the model says so especially for more subjective tasks


asked a difficult technical question to a friend who got claude merch recently and he said wait a second... let me put on my thinking cap first


Loading...

Something went wrong.


Something went wrong.