Stanford just dropped a 1:47-hour masterclass on LLM training on YouTube. This is the 4th lecture in their new Fall 2025 series on transformers and LLMs. In my opinion, this is the new go-to course on LLMs, and you should take it seriously. Subjects in this video: → Pretraining →…
Dear telecom companies, I would like to share two humble suggestions: many Indians today already have Wi-Fi connections at home, especially senior citizens who primarily rely on them. It would be great if companies could introduce a low-cost plan offering around 1 GB per day, with flexible…
This video explains how diffusion models are overtaking Large Language Models for generation tasks like: 1. Code Generation 2. Image Generation 3. Video Generation 00:00 Agenda 00:20 How are they different from LLMs? 05:09 Internal Mechanism 10:09 How are vectors generated?…
Amazon just laid off 14,000 people. Not a single minister has offered any support. > The government collects taxes from IT professionals > uses those same taxes to fund freebies > uses those freebies to win elections. But when these taxpayers are in trouble, the government is nowhere to be found.
An exciting new professional certificate: PyTorch for Deep Learning taught by @lmoroney is now available at DeepLearning.AI. This is the definitive program for learning PyTorch, which is one of the main frameworks researchers use to build breakthrough AI systems. If you…
krupadave.com
Everything About Transformers
Illustrated guide to how transformers actually work — and why they’re built this way.
🔗 Mixture of Experts (MoE): github.com/rasbt/LLMs-fro…
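The linked chapter covers Mixture of Experts. As a hedged illustration (a pure-Python sketch written for this note, not code from the linked repo), the core MoE step is a router that scores experts, keeps the top-k, renormalizes their gate values, and mixes the selected experts' outputs:

```python
import math
import random

random.seed(0)

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def moe_forward(x, experts, router_weights, top_k=2):
    """Route input x to the top_k experts by router score,
    then combine their outputs weighted by renormalized gates."""
    # Router logits: one dot product of x with each expert's router row.
    logits = [sum(w_i * x_i for w_i, x_i in zip(w, x)) for w in router_weights]
    gates = softmax(logits)
    # Keep only the top_k experts by gate value.
    top = sorted(range(len(experts)), key=lambda i: gates[i], reverse=True)[:top_k]
    norm = sum(gates[i] for i in top)
    # Weighted sum of the selected experts' outputs.
    out = [0.0] * len(x)
    for i in top:
        y = experts[i](x)
        out = [o + (gates[i] / norm) * y_j for o, y_j in zip(out, y)]
    return out

# Four toy "experts": each just scales the input by a different factor.
experts = [lambda v, s=s: [s * v_i for v_i in v] for s in (1.0, 2.0, 3.0, 4.0)]
router_weights = [[random.gauss(0, 1) for _ in range(3)] for _ in experts]
print(moe_forward([0.5, -0.2, 0.1], experts, router_weights, top_k=2))
```

The point of the top-k step is that only k expert forward passes run per token, which is how MoE models keep compute per token far below their total parameter count.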
Announcing my new course: Agentic AI! Building AI agents is one of the most in-demand skills in the job market. This course, available now at deeplearning.ai, teaches you how. You'll learn to implement four key agentic design patterns: - Reflection, in which an agent…
Our course recommendation of the day is "Post-training of LLMs," where you'll learn how to customize pre-trained language models using Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Online Reinforcement Learning (RL). You'll learn when to use each…
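Of the three techniques named above, DPO is the easiest to write down. As a hedged sketch (the standard DPO loss from the literature, not the course's own code), the loss for one preference pair compares how much more the policy prefers the chosen response over the rejected one, relative to a frozen reference model:

```python
import math

def dpo_loss(logp_chosen, logp_rejected,
             ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """DPO loss for one preference pair:
    -log sigmoid(beta * ((logpi_w - logpref_w) - (logpi_l - logpref_l)))."""
    margin = (logp_chosen - ref_logp_chosen) - (logp_rejected - ref_logp_rejected)
    return -math.log(1.0 / (1.0 + math.exp(-beta * margin)))

# If the policy prefers the chosen response more strongly than the
# reference does, the margin is positive and the loss drops below log(2).
loss = dpo_loss(-10.0, -14.0, -12.0, -13.0, beta=0.1)
assert loss < math.log(2)
```

Note that no reward model appears anywhere: the reference log-probabilities play that role implicitly, which is exactly why DPO is cheaper to run than online RL.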
Unsloth now has a Docker image! 🐳 Train LLMs locally with no setup: just run the image and go. It includes every pre-made Unsloth notebook and solves dependency and environment issues. Guide: docs.unsloth.ai/new/how-to-tra…
Still experimenting with LoRA based on the @thinkymachines configuration, and I just implemented it in Colab. In this notebook I set up a fine-tune of Qwen/Qwen3-0.6B on the OpenR1-Math dataset with a LoRA rank of 1. With this setup you can get the same reward accuracy as full…
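The structure behind the setup above can be sketched in a few lines of pure Python (an illustration of the standard LoRA update, not the actual notebook's code): the adapted weight is W + (alpha/r)·BA, where B is d_out×r and A is r×d_in, so at rank r=1 the update is an outer product of two vectors:

```python
import random

random.seed(42)

def matmul(A, B):
    """Plain list-of-lists matrix multiply."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def lora_delta(B, A, alpha, r):
    """LoRA weight update: (alpha / r) * (B @ A)."""
    scale = alpha / r
    return [[scale * v for v in row] for row in matmul(B, A)]

d_out, d_in, r, alpha = 4, 4, 1, 2  # rank-1 adapter, as in the setup above
W = [[random.gauss(0, 0.02) for _ in range(d_in)] for _ in range(d_out)]
# Standard LoRA init: B is zero, A is random, so the adapter
# contributes nothing until training updates B.
B = [[0.0] for _ in range(d_out)]                    # d_out x r
A = [[random.gauss(0, 0.02) for _ in range(d_in)]]   # r x d_in

delta = lora_delta(B, A, alpha, r)
W_adapted = [[w + d for w, d in zip(wr, dr)] for wr, dr in zip(W, delta)]
assert W_adapted == W  # zero-init B means no change before training
```

At rank 1 only d_out + d_in adapter parameters are trained per weight matrix instead of d_out·d_in, which is why such a small adapter can be surprisingly competitive with full fine-tuning.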
Introducing Tinker: a flexible API for fine-tuning language models. Write training loops in Python on your laptop; we'll run them on distributed GPUs. Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!…
Announcing our first product: Tinker! Tinker is a training API for everyone! It lets you focus on what matters in LLM training - your data and algorithms - while we handle the heavy lifting of distributed training. You can train your own models using Tinker even if you have no…
Red carpet welcome at the @Rakuten India office in Bengaluru! Kaadubesanahalli, Prestige Tech Park! Global companies trust you and set up shop here, and these are the civic amenities you offer! @GBAChiefComm and then our DCM @DKShivakumar say companies can't BLACKMAIL… does…
Some GPU resources I found useful as a newcomer to GPU programming: 1. jax-ml.github.io/scaling-book/g… 2. modal.com/gpu-glossary/p… 3. multimodalai.substack.com/p/the-mlai-eng… 4. bytesofintelligence.substack.com/p/maximizing-g… 5. youtube.com/playlist?list=…
The more charts I see, the more convinced I get: a big correction is due next year, #2026. Until then, enjoy the rally that's right in front of us for the next few months!! #Nifty #Nifty500 #NiftyMidcap100 #NiftySmallcap100
Solid, in-depth explanation of paged attention in this blog.
#Nifty 24741, 7th Sep 2024. And today, 7th Sep 2025. A year has passed but the story continues!! Back then we were about to complete the 3rd wave, and the rest is history: the 4th took over! Now we are running the 5th, which may complete in the next 3-4 months or by very early next year.…
You can now run gpt-oss-120b & 20b locally with our GGUFs! 🦥 Run OpenAI's 120b model on 66GB RAM & the 20b model on 14GB RAM. Both in original precision. Uploads include our chat template fixes. Guide: docs.unsloth.ai/basics/gpt-oss GGUF: huggingface.co/unsloth/gpt-os…