sankalp

@dejavucoder

llm and shitposting into crafting ai products and evals dm open to talk on ai engg/post-training/llm stuff

sankalp.bearblog.dev/blog/

10월 2021에 가입

41천게시물 17천팔로워 600팔로우 중

내가 좋아할 만한 콘텐츠

@dprophecyguy

@cto_junior

@sand7one

@shrihacker

@shauseth

@fuckpoasting

@ankushdharkar

@IndraAdhikary7

@filterpapi

@Gravito841

@pragdua

@Parikshit_K_

@OtakuProcess

@NirantK

@ankitiscracked

고정된 트윗

sankalp

@dejavucoder

. 7. 17.

you can read my latest blogpost: my experience with claude code after 2 weeks of adventure now - some lore why i started using it - it's several features - my current workflow - must know commands almost like a beginner guide or log if you will sankalp.bearblog.dev/my-claude-code…

dejavucoder's tweet image. you can read my latest blogpost: my experience with claude code after 2 weeks of adventure now

- some lore why i started using it
- it's several features
- my current workflow
- must know commands

almost like a beginner guide or log if you will

sankalp.bearblog.dev/my-claude-code…

sankalp

@dejavucoder

35 분

was in e/xperiments discord office hours with @tokenbender recently and he was advocating that we should seed the models with our own raw thoughts and build upon it - LLMs as augmentators. instead of outsourcing thinking and idea gen, be steve jobs and let llm be the wozniak.

Air Katakana

@airkatakana

1 시간

steve jobs was the prompter, and woz, his llm the bitter engineers coming out of the woodwork to criticize llms are coming to the realization that no matter how good they were, they were ultimately replaceable this whole time but no tech is coming to replace jobs

sankalp

@dejavucoder

1 시간

i swear this feature is playing with me. i never see will brown's post. 50% of the times i would see the 2nd and 3rd person's posts.

dejavucoder's tweet image. i swear this feature is playing with me. i never see will brown's post. 50% of the times i would see the 2nd and 3rd person's posts.

sankalp

@dejavucoder

1 시간

1 day in normie world = 7 days ~ 1 week in ai world

sankalp

@dejavucoder

2 시간

people are still figuring out things about codex sydney sweeney and me have been saying since 1 month now...

sankalp

@dejavucoder

2 시간

how claude sonnet 4.5 feels while fucking around in my prod codebase

Joonas Virtanen님으로부터

sankalp

@dejavucoder

3 시간

gotta keep your mind open to learn faster

sankalp

@dejavucoder

5 시간

my buddy lost his job to AI. his job was to be absolutely right .

sankalp

@dejavucoder

17 시간

model be like:

Vedant Misra

@vedantmisra

. 10. 12.

We're doing all kinds of stuff with these models that the public isn't even thinking of yet.

sankalp 님이 재게시함

tokenbender

@tokenbender

19 시간

one of the most interesting evals or attempts to test shape rotating powers of a language model i have seen in a while. so simple yet challenging the models in latent space.

krishna

@OccupyingM

20 시간

can your llm rotate a shape inside it's head? i found out yes but it's a fucking idiot when it comes to the upper layer... why? non uniform spatial reasoning.... here's an eval to test the internal latent reasoning of your models.

OccupyingM's tweet image. can your llm rotate a shape inside it's head?

i found out yes but it's a fucking idiot when it comes to the upper layer...

why? non uniform spatial reasoning....

here's an eval to test the internal latent reasoning of your models.

sankalp

@dejavucoder

20 시간

love karpathy sensei's koding koding koding energy

Andrej Karpathy

@karpathy

22 시간

Excited to release new repo: nanochat! (it's among the most unhinged I've written). Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,…

karpathy's tweet image. Excited to release new repo: nanochat!
(it's among the most unhinged I've written).

Unlike my earlier similar repo nanoGPT which only covered pretraining, nanochat is a minimal, from scratch, full-stack training/inference pipeline of a simple ChatGPT clone in a single,…

sankalp

@dejavucoder

21 시간

read this thread an year ago but i didn't fully understand it today lol. cursor codebase indexing is clever in the sense they separate the semantic search (cloud) from actual code access (done locally). they store your codebase embeddings on cloud with reference meta-data. they…

dejavucoder's tweet image. read this thread an year ago but i didn't fully understand it today lol. cursor codebase indexing is clever in the sense they separate the semantic search (cloud) from actual code access (done locally). they store your codebase embeddings on cloud with reference meta-data. they…

Aman Sanger

@amanrsanger

2024. 1. 24.

An underrated part of Cursor is our codebase indexing system. It provides efficient indexing/updating without storing any code on our servers. (1/9)

sankalp 님이 재게시함

Chinmay Kak

@ChinmayKak

24 시간

Introducing nanosft, a clean single file implementation of finetuning for chat style model. Loads gpt2-124M weights on nanogpt and does supervised finetuning using just pytorch. a side project that I made recently for some prep. link below :) qts/rts appericiated

ChinmayKak's tweet image. Introducing nanosft, a clean single file implementation of finetuning for chat style model. Loads gpt2-124M weights on nanogpt and does supervised finetuning using just pytorch.
a side project that I made recently for some prep. link below :)
qts/rts appericiated

sankalp

@dejavucoder

. 10. 13.

timeline cleanse

sankalp

@dejavucoder

. 10. 13.

the 'you're absolutely right' problem is similar or worse with sonnet 4.5 and more than annoyance, i find it hard to trust the output when the model says so especially for more subjective tasks