harshith

@theharshithh

parallel tensor splitter @simplismartHQ, prev @onfinance_ai

github.com/theharshithh

انضم في أبريل 2022

682المنشورات 330المتابعون 3ألفالمتابَعون

قد يعجبك

@Chandan_Perla

@YashRathi4304

@AyushmanGarg4

@iamrealshariff

@ItsLP1802

مثبتة

harshith

@theharshithh

٢٨ مايوم

had just shipped a repo for training speculative decoding heads to speed up inference of llms by ~3x. get any base model, train a few speculative heads, see the difference in throughput. 🧵on more details. 1/n

harshith

@theharshithh

6 س

how does cloudflare handle ~almost all cdn traffic and still shoot itself in the leg multiple times over. first react useEffect, now sql filtering.

swyx 🗽 @aidotengineer AIE CODE

@swyx

9 س

cloudflare outage was due to one bad SQL statement that baked in an assumption it shouldnt have can you spot the bug here? no. because SQL does not Make Wrong Code Look Wrong. sometimes i wonder how many SEVs, performance issues and privacy leaks happen because we took a query…

swyx's tweet image. cloudflare outage was due to one bad SQL statement that baked in an assumption it shouldnt have

can you spot the bug here? no. because SQL does not Make Wrong Code Look Wrong.

sometimes i wonder how many SEVs, performance issues and privacy leaks happen because we took a query…

harshith

@theharshithh

١٧ نوفمبرم

colab in vs-code/cursor should be that 100x outcome feature.

harshith

@theharshithh

١٥ نوفمبرم

please take this adivice from me and do not waste weeks on @LiteLLM. it JUST DOESNT WORK. 1. core repo is 200mb, where 180mb is images and gifs. 2. we had to contribute to our forks and install to get it working. how is so hard to write python transformation fns. just write…

harshith

@theharshithh

١٣ نوفمبرم

instagram's codebase is ~20mn lines of python. do what u will with information js devs, im sorry. python is really all you need.

theharshithh's tweet image. instagram's codebase is ~20mn lines of python. do what u will with information

js devs, im sorry.
python is really all you need.

harshith أعاد

Simran Arora

@simran_s_arora

١١ نوفمبرم

AI has been built on one vendor’s stack for too long. AMD’s GPUs now offer state-of-the-art peak compute and memory bandwidth — but the lack of mature software / the “CUDA moat” keeps that power locked away. Time to break it and ride into our multi-silicon future. 🌊 It's been a…

simran_s_arora's tweet image. AI has been built on one vendor’s stack for too long.
AMD’s GPUs now offer state-of-the-art peak compute and memory bandwidth — but the lack of mature software / the “CUDA moat” keeps that power locked away. Time to break it and ride into our multi-silicon future. 🌊

It's been a…

harshith

@theharshithh

١١ نوفمبرم

scaling rl is an infra problem than a ml problem. apart from the fact that training is just waiting for rollouts to be completed and updating.

harshith

@theharshithh

١١ نوفمبرم

and ladies and gentlemen, its all compute and data. cheetah was a previous checkpoint of composer. build your platform, collect data, train and win!

harshith

@theharshithh

١١ نوفمبرم

and ladies and gentlemen, its all compute and data. cheetah was a previous checkpoint of composer. build your platform, collect data, train and win!

harshith

@theharshithh

١١ نوفمبرم

testing something here. ignore

bud wiser

@w0rdgenerator

٢ نوفمبرم

Makeup ate today

harshith

@theharshithh

٩ نوفمبرم

here is something I worked on a few months before, you can train a few speculative decoding heads and infer out of that. end to end setup. can be modified to train for eagle spec heads. leave a star if found helpful! github.com/theharshith/sp…

theharshithh's tweet card. Contribute to theharshith/speculative-decoding development by creating an account on GitHub.

GitHub - theharshith/speculative-decoding

المصدر: github.com

Raj Dabre

@prajdabre

٧ نوفمبرم

Here's your weekend challenge: Implement speculative decoding. Step 1: Read the following paper and/or blog: arxiv.org/abs/2211.17192 galacodes.hashnode.dev/speculative-de… (cc @jaygala223) Step 2: Choose a family of models which come in various sizes. My choice would be the Gemma3 or Qwen…

prajdabre's tweet image. Here's your weekend challenge: Implement speculative decoding.

Step 1: Read the following paper and/or blog: arxiv.org/abs/2211.17192 galacodes.hashnode.dev/speculative-de… (cc @jaygala223)
Step 2: Choose a family of models which come in various sizes. My choice would be the Gemma3 or Qwen…

harshith

@theharshithh

٨ نوفمبرم

the only company i want to work for: @anyscalecompute

Archie Sengupta

@archiexzzz

٨ نوفمبرم

Companies i'd defo work for if my startup goes balls up: @atomicworkhq @PrimeIntellect @NousResearch @clonerobotics @PalantirTech @GroqInc @Starcloud_Inc1 @cursor_ai @brexHQ @ExaAILabs

harshith

@theharshithh

٨ نوفمبرم

bro is pitching the competitor of loom to the founder of loom lmao

harshith أعاد

Elliot Arledge

@elliotarledge

٥ نوفمبرم

John Wick of CUDA kernels.

Lisan al Gaib

@scaling01

٥ نوفمبرم

of course merged by the 500IQ Tsinghua GOAT himself

harshith

@theharshithh

٦ نوفمبرم

“win the internet for a day” as a service the more i think about this, the more it makes sense. so much value unlock for everyone in general.

harshith

@theharshithh

٥ نوفمبرم

i wonder how they plan moving data in and data out, latest nasa's latest number is 900mbps. assuming it would be for large scale pretraining, they would launch "space jobs". not sure if this is helpful for realtime inference. exciting time to be alive

Sundar Pichai

@sundarpichai

٤ نوفمبرم

Our TPUs are headed to space! Inspired by our history of moonshots, from quantum computing to autonomous driving, Project Suncatcher is exploring how we could one day build scalable ML compute systems in space, harnessing more of the sun’s power (which emits more power than 100…

sundarpichai's tweet image. Our TPUs are headed to space!

Inspired by our history of moonshots, from quantum computing to autonomous driving, Project Suncatcher is exploring how we could one day build scalable ML compute systems in space, harnessing more of the sun’s power (which emits more power than 100…

harshith

@theharshithh

٢ نوفمبرم

"While you’re crafting the perfect launch tweet, support tickets pile up. While you’re updating your bio with “Backed by [Famous Fund],” customers churn. While you’re performing success, someone boring is building" while I sort of agree of what @nikunj is telling here but also,…

harshith

@theharshithh

٢ نوفمبرم

no we will take care of inference, you just release those weights

Zephyr

@zephyr_z9

١ نوفمبرم

"Open-sourcing does not equal free; running inference servers comes with costs." U need to buy a computer to run Linux too "Open-sourcing the weights of large models is different from open-source software; there is no reverse contribution from the developer community."…