srk (@fastdaima), problem solver
little little goals:
- spending 2 hours practising new things
- spending 2 hours on kaggle every day
- publishing blogs twice a week
- small contributions to open source
- losing 2 kilos per month -> 24 kilos this year
- finding my optimum sleep cycle (through long experiments)
if this is you, i would take the following very seriously: what worked for me was teaching. you get obsessed with a paper or architecture or whatever it is, and hit the pain point of not being able to find good resources on it. whether it was that for LLMs 3 years ago, or CUDA one…
@elliotarledge Any suggestions about projects or papers I should study to give myself a good chance in the ML field? I have intermediate-level knowledge of most ML-related domains.
Don’t overthink AI agents.
> Learn Chain-of-Thought (CoT)
> Learn Tree of Thoughts (ToT)
> Learn ReAct Framework
> Learn Self-Correction / Reflection
> Learn Function Calling & Tool Use
> Learn Planning Algorithms (LLM+P)
> Learn Long-term Memory Architectures
> Learn…
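To make one item on that list concrete: a ReAct loop is just a harness that lets the model interleave Thought/Action/Observation, executes the action, and feeds the result back. A minimal sketch follows; it assumes an OpenAI-compatible client, and the `calculator` tool and model name are placeholders, not anything the tweet prescribes.

```python
# Minimal ReAct-style loop sketch. Assumes an OpenAI-compatible endpoint
# (OPENAI_API_KEY in the environment); tool and model are placeholders.
import re
from openai import OpenAI

client = OpenAI()

SYSTEM = (
    "Answer the question. Use this format:\n"
    "Thought: your reasoning\n"
    "Action: calculator[<arithmetic expression>]\n"
    "Observation: (filled in for you)\n"
    "Repeat Thought/Action/Observation as needed, then end with:\n"
    "Final Answer: <answer>"
)

def calculator(expr: str) -> str:
    # Toy tool: evaluate a simple arithmetic expression (sketch only).
    return str(eval(expr, {"__builtins__": {}}, {}))

def react(question: str, max_steps: int = 5) -> str:
    transcript = f"Question: {question}\n"
    for _ in range(max_steps):
        resp = client.chat.completions.create(
            model="gpt-4o-mini",
            messages=[{"role": "system", "content": SYSTEM},
                      {"role": "user", "content": transcript}],
            stop=["Observation:"],  # pause generation so we can run the tool
        )
        step = resp.choices[0].message.content
        transcript += step + "\n"
        if "Final Answer:" in step:
            return step.split("Final Answer:")[-1].strip()
        match = re.search(r"Action: calculator\[(.+?)\]", step)
        if match:  # execute the tool, append the observation, loop again
            transcript += f"Observation: {calculator(match.group(1))}\n"
    return transcript

print(react("What is 17 * 23 + 4?"))
```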
This mindset changed my life. Maybe it'll change yours too. Do it badly.
- build an autograd engine from scratch
- write a mini-GPT from scratch
- implement LoRA and fine-tune a model on real data
- hate CUDA at least once
- cry
- keep going

the roadmap - 5 phases
- if you already know something? skip
- if you're lost? rewatch
- if you’re stuck? use…
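The first item on that list is smaller than it sounds: a scalar autograd engine fits in a page. A hedged micrograd-style sketch, with only + and * implemented, not a full engine:

```python
# Scalar autograd engine sketch: each op records its inputs and a closure
# that applies the chain rule; backward() runs them in reverse topo order.
class Value:
    def __init__(self, data, _children=()):
        self.data = data
        self.grad = 0.0
        self._backward = lambda: None
        self._prev = set(_children)

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))
        def _backward():
            self.grad += out.grad           # d(a+b)/da = 1
            other.grad += out.grad          # d(a+b)/db = 1
        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))
        def _backward():
            self.grad += other.data * out.grad   # d(a*b)/da = b
            other.grad += self.data * out.grad   # d(a*b)/db = a
        out._backward = _backward
        return out

    def backward(self):
        # Topologically sort the graph, then propagate gradients backwards.
        topo, visited = [], set()
        def build(v):
            if v not in visited:
                visited.add(v)
                for child in v._prev:
                    build(child)
                topo.append(v)
        build(self)
        self.grad = 1.0
        for v in reversed(topo):
            v._backward()

# Usage: z = x*y + x, so dz/dx = y + 1 = 4 and dz/dy = x = 2.
x, y = Value(2.0), Value(3.0)
z = x * y + x
z.backward()
print(x.grad, y.grad)  # 4.0 2.0
```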
i built a simple tool that makes Claude Code work with any local LLM
full demo:
> vLLM serving GLM-4.5 Air on 4x RTX 3090s
> Claude Code generating code + docs via my proxy
> 1 Python file + .env handles all requests
> nvtop showing live GPU load
> how it all works
Buy a GPU
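The tweet doesn't include the file, but the general shape of such a proxy is: accept Anthropic Messages-API requests from Claude Code and forward them to vLLM's OpenAI-compatible endpoint. A hedged sketch under those assumptions, not the author's actual tool; streaming, tool use, and most Messages-API fields are omitted, and `VLLM_URL`/`MODEL` are placeholders:

```python
# Anthropic -> OpenAI proxy sketch. Point Claude Code at this server via
# ANTHROPIC_BASE_URL; it forwards to a local vLLM OpenAI-compatible server.
import os
import httpx
from fastapi import FastAPI, Request

VLLM_URL = os.environ.get("VLLM_URL", "http://localhost:8000/v1")
MODEL = os.environ.get("MODEL", "zai-org/GLM-4.5-Air")

app = FastAPI()

@app.post("/v1/messages")
async def messages(request: Request):
    body = await request.json()
    # Translate the Anthropic Messages request into OpenAI chat format
    # (handles only a string system prompt and text content blocks).
    msgs = []
    if isinstance(body.get("system"), str):
        msgs.append({"role": "system", "content": body["system"]})
    for m in body.get("messages", []):
        content = m["content"]
        if isinstance(content, list):  # flatten content blocks to text
            content = "".join(b.get("text", "") for b in content)
        msgs.append({"role": m["role"], "content": content})
    async with httpx.AsyncClient() as client:
        resp = await client.post(
            f"{VLLM_URL}/chat/completions",
            json={"model": MODEL, "messages": msgs,
                  "max_tokens": body.get("max_tokens", 1024)},
            timeout=120,
        )
    text = resp.json()["choices"][0]["message"]["content"]
    # Translate the OpenAI response back into Anthropic's response shape.
    return {"id": "msg_proxy", "type": "message", "role": "assistant",
            "model": MODEL, "content": [{"type": "text", "text": text}],
            "stop_reason": "end_turn"}
```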
Retrieval techniques I’d learn if I wanted to build RAG systems: Bookmark this.
1. BM25
2. Dense Retrieval
3. ColBERT
4. DPR (Dense Passage Retrieval)
5. ANN Indexes (FAISS, HNSW)
6. Vector Quantization
7. Re-ranking (Cross-Encoder)
8. Late Interaction Models
9. Embedding Normalization…
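Item 1 is simple enough to write from scratch. A minimal Okapi BM25 scorer with the usual k1/b defaults and naive whitespace tokenization; illustrative, not production code:

```python
# Okapi BM25 sketch: score docs against a query using term frequency,
# inverse document frequency, and document-length normalization.
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.5, b=0.75):
    tokenized = [d.lower().split() for d in docs]
    N = len(tokenized)
    avgdl = sum(len(d) for d in tokenized) / N
    df = Counter(t for d in tokenized for t in set(d))  # document frequency
    scores = []
    for doc in tokenized:
        tf = Counter(doc)
        score = 0.0
        for term in query.lower().split():
            if term not in tf:
                continue
            idf = math.log(1 + (N - df[term] + 0.5) / (df[term] + 0.5))
            score += idf * tf[term] * (k1 + 1) / (
                tf[term] + k1 * (1 - b + b * len(doc) / avgdl))
        scores.append(score)
    return scores

docs = ["the cat sat on the mat", "dogs chase cats", "vector search with FAISS"]
print(bm25_scores("cat", docs))  # highest score for the first doc
```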
Inference optimizations I’d study if I wanted sub-second LLM responses: Bookmark this.
1. KV-Caching
2. Speculative Decoding
3. FlashAttention
4. PagedAttention
5. Batch Inference
6. Early Exit Decoding
7. Parallel Decoding
8. Mixed Precision Inference
9. Quantized Kernels
10. Tensor…
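Item 1 in toy form: single-head attention where each decode step computes a key/value only for the newest token and appends it to a cache, so the prefix is never re-encoded. A sketch with random weights, not a real model:

```python
# KV-cache toy: O(1) new K/V work per decode step instead of re-encoding
# the whole prefix; the cache grows by one row per generated token.
import torch

d = 64
Wq, Wk, Wv = (torch.randn(d, d) for _ in range(3))

def decode_step(x_t, cache):
    # x_t: (1, d) embedding of the newest token only.
    q, k, v = x_t @ Wq, x_t @ Wk, x_t @ Wv
    # Append this step's key/value to the cache.
    cache["k"] = k if cache["k"] is None else torch.cat([cache["k"], k])
    cache["v"] = v if cache["v"] is None else torch.cat([cache["v"], v])
    # Attend over ALL cached keys/values (the full prefix so far).
    attn = torch.softmax(q @ cache["k"].T / d ** 0.5, dim=-1)
    return attn @ cache["v"], cache

cache = {"k": None, "v": None}
for t in range(5):  # 5 decode steps
    out, cache = decode_step(torch.randn(1, d), cache)
print(cache["k"].shape)  # torch.Size([5, 64]): one cached key per token
```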
You're in an ML Engineer interview at Databricks. The interviewer asks: "Your production chatbot's accuracy was 95% at launch. Six weeks later, user complaints are up and evals show 80%. What do you do?" You reply: "The model is wrong, we need to retrain it." Game over.…
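The tweet cuts off, but the implied better answer is to diagnose before retraining: check whether the inputs have drifted rather than assuming the model broke. One common diagnostic is the population stability index (PSI) between launch-time and current feature distributions. A hedged sketch with placeholder data; a PSI above roughly 0.2 is a common "investigate" threshold:

```python
# PSI drift check sketch: compare the distribution of a feature at launch
# vs. today. Large PSI suggests input drift, not a suddenly "wrong" model.
import numpy as np

def psi(expected, actual, bins=10):
    edges = np.quantile(expected, np.linspace(0, 1, bins + 1))
    edges[0], edges[-1] = -np.inf, np.inf       # catch out-of-range values
    e = np.histogram(expected, edges)[0] / len(expected)
    a = np.histogram(actual, edges)[0] / len(actual)
    e, a = np.clip(e, 1e-6, None), np.clip(a, 1e-6, None)
    return float(np.sum((a - e) * np.log(a / e)))

launch = np.random.normal(0.0, 1.0, 10_000)   # feature at launch (placeholder)
today = np.random.normal(0.5, 1.3, 10_000)    # same feature six weeks later
print(psi(launch, today))                     # well above 0.2 => investigate inputs
```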
Google's former CEO Eric Schmidt shares the weekend habit that led to a billion-dollar decision.
This has been the secret to my entire career
A very interesting share from my team (@ChrisDeotte @The0Viel @0verfit @Giba1): a lesson from competitive ML on @kaggle. Kaggling is not all you need to do to become a great data scientist or ML engineer, but it is extremely useful for improving your modeling skills. Hope that…
After hundreds of Kaggle competitions and years of trial, error, and wins, our team (@ChrisDeotte @The0Viel @0verfit @giba1) has compiled a playbook for tabular modeling, a system that's carried us to the top of leaderboards and held up on real-world data. developer.nvidia.com/blog/the-kaggl…
Some resources about GPUs I found good as a noob in GPU programming:
1. jax-ml.github.io/scaling-book/g…
2. modal.com/gpu-glossary/p…
3. multimodalai.substack.com/p/the-mlai-eng…
4. bytesofintelligence.substack.com/p/maximizing-g…
5. youtube.com/playlist?list=…
As software engineers, most of us have an itch to improve - to become better at coding, architecture, or leadership. But do you see the problem? It's all fuzzy and vague. I follow a simple flow of questions that helps me stay focused, improve consistently, and make an…
We spent the last year evaluating agents for HAL. My biggest learning: We live in the Windows 95 era of agent evaluation.
"HDM achieves competitive 1024x1024 generation quality while maintaining a remarkably low training cost of $535-620 using four RTX5090 GPUs"
Hugging Face 🤗 cooked again. They just dropped a new free course with certification on fine-tuning:
> instruction tuning
> RL and preference alignment
> evaluation
> creating synthetic data
By the end of 2025 you'll have learned these essential skills from the best.
A pure systems-programming talk on how deep you have to go to understand issues and solve them. Bliss to watch!
At @modal we've built every layer of the AI infra stack from scratch — from filesystems and networking to our own async queues and multi-cloud GPU orchestration. I sat down with @narayanarjun from @amplifypartners to go into depth on all of this, including the fun ways the…
You have all the time you need, you're just spending it poorly. Don't tell me you don't have time for Linux or kids OR BOTH. You have time for all of it once you stop filling your day with junk activities.
anyone else getting sick and disgusted by the state of social media / phone addiction and how we've let it infiltrate every single corner of society? when i take my daughter to the playground there are toddlers clutching phones bigger than their heads.. most of the time they sit on…