
PS

@pranabkumar_

PS reposted

Stanford just dropped a 1:47 hour masterclass about LLM training on YouTube. this is the 4th lecture in their new Fall 2025 series of transformers and LLM. this is the new go-to course on LLMs imo, and you need to take it seriously. subjects in this video: → Pretraining →…


PS reposted

Dear Telecom co, I would like to share two humble suggestions: Many Indians today already have Wi-Fi connections at home, especially senior citizens who primarily rely on it. It would be great if COs could introduce a low-cost plan offering around 1 GB per day, with flexible…


PS reposted

This video explains how diffusion models are overtaking Large Language Models for generation tasks like: 1. Code Generation 2. Image Generation 3. Video Generation 00:00 Agenda 00:20 How are they different from LLMs? 05:09 Internal Mechanism 10:09 How are vectors generated?…


PS reposted

Amazon just laid off 14000 people. Not a single minister has offered any support. > Govt collects taxes from IT professionals > Use same taxes to fund freebies > Use those freebies to win elections But when these taxpayers are in trouble, government is nowhere to be found.


PS reposted

An exciting new professional certificate: PyTorch for Deep Learning taught by @lmoroney is now available at DeepLearning.AI. This is the definitive program for learning PyTorch, which is one of the main frameworks researchers use to build breakthrough AI systems. If you…


PS reposted

🔗 Mixture of Experts (MoE): github.com/rasbt/LLMs-fro…


Sliding Window Attention 🔗 github.com/rasbt/LLMs-fro…



PS reposted

Announcing my new course: Agentic AI! Building AI agents is one of the most in-demand skills in the job market. This course, available now at deeplearning.ai, teaches you how. You'll learn to implement four key agentic design patterns: - Reflection, in which an agent…


PS reposted

Our course recommendation of the day is “Post-training of LLMs,” where you’ll learn how to customize pre-trained language models using Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Online Reinforcement Learning (RL). You'll learn when to use each…


PS reposted

Unsloth now has a Docker image! 🐳 Train LLMs locally with no setup: just run the image and go. Includes every pre-made Unsloth notebook. Solves dependency or environment issues. Guide: docs.unsloth.ai/new/how-to-tra…


PS reposted

still experimenting with LoRA based on the @thinkymachines configuration and just implemented it in colab. In this notebook I set up a fine tune of Qwen/Qwen3-0.6B on the OpenR1-Math dataset with lora rank of 1. with this setup you can get the same reward accuracy as full…
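For readers unfamiliar with what "lora rank of 1" means in the post above, here is a minimal sketch in plain Python. It is not the notebook's code: the function names, the toy 4×3 layer, and the 1024×1024 layer size are all hypothetical and chosen only for illustration. It assumes the standard LoRA formulation, in which a frozen weight matrix W is adapted as W + (alpha / r) · B·A, so that with r = 1 the update is a single outer product of two vectors.

```python
def lora_rank1_delta(b, a, alpha=1.0):
    """Rank-1 LoRA update: alpha * outer(b, a), returned as a list of rows.

    With rank r = 1 the usual alpha / r scaling reduces to alpha.
    """
    return [[alpha * bi * aj for aj in a] for bi in b]


def apply_lora(weight, b, a, alpha=1.0):
    """Return the adapted weight W + delta without modifying the frozen W."""
    delta = lora_rank1_delta(b, a, alpha)
    return [
        [w + d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


def trainable_params(d_out, d_in, rank=1):
    """Number of trainable LoRA parameters for one d_out x d_in layer."""
    return rank * (d_out + d_in)


# Toy example (hypothetical sizes): a frozen 4x3 weight of zeros.
d_out, d_in = 4, 3
weight = [[0.0] * d_in for _ in range(d_out)]
b = [1.0, 2.0, 3.0, 4.0]   # "B" factor, length d_out
a = [0.5, 0.0, -1.0]       # "A" factor, length d_in

adapted = apply_lora(weight, b, a)

# For a hypothetical 1024x1024 layer, rank 1 trains 2048 values
# instead of the 1,048,576 values in the full matrix.
full = 1024 * 1024
lora = trainable_params(1024, 1024, rank=1)
```

The point of the sketch is the parameter count: a rank-1 adapter trains only d_out + d_in values per layer, which is why such a setup can run in a free Colab session.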


PS reposted

Introducing Tinker: a flexible API for fine-tuning language models. Write training loops in Python on your laptop; we'll run them on distributed GPUs. Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!…


PS reposted

Announcing our first product: Tinker! Tinker is a training API for everyone! It lets you focus on what matters in LLM training - your data and algorithms - while we handle the heavy lifting of distributed training. You can train your own models using Tinker even if you have no…

Introducing Tinker: a flexible API for fine-tuning language models. Write training loops in Python on your laptop; we'll run them on distributed GPUs. Private beta starts today. We can't wait to see what researchers and developers build with cutting-edge open models!…



PS reposted

Red carpet welcome at @Rakuten India office in Bengaluru! Kaadubesanahalli, Prestige Tech Park! Global companies trust you and set up shop here, and this is the kind of civic amenities you offer! @GBAChiefComm and then our DCM @DKShivakumar say companies can’t BLACKMAIL… does…


PS reposted

The more charts I see the more I get convinced: A big big correction is due next year #2026. Till then enjoy the rally that's right in front of us for the next few months!! #Nifty #Nifty500 #NiftyMidcap100 #NiftySmallcap100


PS reposted

solid in-depth explanation of paged attention in this blog.


PS reposted

#Nifty 24741 7th Sep 2024 And today 7th Sep 2025 A year has passed but the story continues!! That time we were about to end the 3rd and rest is history. The 4th took over! Now, we are running that 5th and that may complete in the next 3-4 months or by very early next year.…


PS reposted

You can now run gpt-oss-120b & 20b locally with our GGUFs! 🦥 Run OpenAI's 120b model on 66GB RAM & 20b model on 14GB RAM. Both in original precision. Uploads includes our chat template fixes. Guide: docs.unsloth.ai/basics/gpt-oss GGUF: huggingface.co/unsloth/gpt-os…


Our open models are here. Both of them. openai.com/open-models


