Varun Kumar

@uberkumar

Independent Engineer. When LLMs start to feel things, I’ll know about it

Software developer/Programmer/Software engineer

revado.ai

Joined January 2022

54 Posts · 13 Followers · 98 Following

Varun Kumar reposted

AJ’s AI

@AJs_AI

Mar 9

Manus just blessed me AGAIN They saw how quickly we ran through the first 100 community codes… so they gave me 500 more…. If you missed out on the first round, here’s your chance: 1. Retweet this post 2. Follow 3. DM me “MANUS” If you got the access code in the first…

Varun Kumar reposted

Jenia Jitsev 🏳️‍🌈 🇺🇦 🇮🇱 🇮🇷

@JJitsev

Jan 25

We might wonder why releasing open-weights for DeepSeek R1 was not a problem, but the data used for training remains closed. Could it be we would see massive amount of both o1-mini and Claude 3.5 Sonnet traces in there? With the data closed, we can only guess.

Varun Kumar

@uberkumar

Jan 23

Sinners judging sinners for sinning differently

Varun Kumar reposted

Sergey Karayev

@sergeykarayev

Apr 11, 2024

Born too late for hand-coding Railroad Tycoon in assembly, born too early for not coding at all. Born just in time for writing "You are a helpful AI assistant who only returns RFC-compliant JSON"

Varun Kumar reposted

Elizabeth Yin 💛

@dunkhippo33

Mar 19, 2024

Last yr, I personally paid more in taxes than what I made (!!). I was completely shocked - I didn't think it was possible to *owe more* than you make. But it is. To be clear, this post isn't meant to ask for pity, but I think it can help a lot of ppl out. More >>

Varun Kumar reposted

roon

@tszzl

Mar 14, 2024

a single ounce of being honest with ones true motivations can save low self awareness mfs so much heartache

Varun Kumar reposted

Peter Yang

@petergyang

Mar 7, 2024

I thought Dune 2 was the best movie of 2024 until I watched this masterpiece (sound on).

Varun Kumar reposted

Gergely Orosz

@GergelyOrosz

Sep 27, 2021

Every promotion system has a downside. An example of one not incentivizing this is the founder personally approving/rejecting all promotions, and as the org grows, directors. The downside of this approach is it's full of biases and you need to be on the good side of the founder.

Varun Kumar reposted

Sergey Karayev

@sergeykarayev

Feb 5, 2024

Idea: “startup prison” for those who want to REALLY focus on building. You get a room with a bed, toilet, and desk. Internet is fast. Food is organic, no seed oils. Once a day you get courtyard time to lift and talk shop. Once a week there’s a lame party. Costs 1% equity.

Varun Kumar reposted

terminally onλine εngineer

@tekbog

Feb 2, 2024

uncle bob has played us for absolute fools

Varun Kumar

@uberkumar

Feb 2, 2024

An explosion of complexity often completely unnecessary and unjustified, but the truth is that things are what they are. And in such a context, an idiot who knows everything is a precious ally. -antirez

Varun Kumar

@uberkumar

Feb 1, 2024

Oh my goodness

Varun Kumar

@uberkumar

Feb 1, 2024

That it do

mattparlmer 🪐 🌷

@mattparlmer

Jan 31, 2024

The cursor crown goes extremely hard

Varun Kumar reposted

Delip Rao e/σ

@deliprao

Dec 13, 2023

I have been testing mistral-medium & GPT-4’s code generation abilities for non-trivial problems. These are problems even experienced engineers will take time to work out. I am summarizing some examples and overall impression in this thread: 🧶

Varun Kumar reposted

Andrej Karpathy

@karpathy

Dec 11, 2023

Official post on Mixtral 8x7B: mistral.ai/news/mixtral-o… Official PR into vLLM shows the inference code: github.com/vllm-project/v… New HuggingFace explainer on MoE very nice: huggingface.co/blog/moe In naive decoding, performance of a bit above 70B (Llama 2), at inference speed…

Guillaume Lample @ NeurIPS 2024

@GuillaumeLample

Dec 11, 2023

Very excited to release our second model, Mixtral 8x7B, an open weight mixture of experts model. Mixtral matches or outperforms Llama 2 70B and GPT3.5 on most benchmarks, and has the inference speed of a 12B dense model. It supports a context length of 32k tokens. (1/n)

Varun Kumar

@uberkumar

Nov 6, 2023

In response to all the recent developments from OpenAI dev day, what is your go-to UI template for prototyping AI chat experiences? I've heard good things about @huggingface's chat-ui. Anyone recommend anything else?

Varun Kumar reposted

Aravind Srinivas

@AravSrinivas

Sep 3, 2023

This actually happened to Evernote. They took the advice of “keep talking to your customers and ship whatever they want” as the only guiding principle for product development. And what ended up happening was paying users liked it, but the product became unintuitive and feature…