
TraceOpt

@traceml_ai

Tracing and Optimizing ML workloads

Training models feels like flying blind: OOMs, idle GPUs, hidden bottlenecks. Would live observability actually help optimize training? Curious what you'd want to see live: multi-GPU view, throughput, gradient stability, cost. @PyTorch @huggingface @wandb @modal @NVIDIAAIDev


TraceOpt reposted

OpenAI shows how gpt-oss can autonomously beat 2048 using reinforcement learning (RL). Training was done locally with Unsloth on NVIDIA DGX Spark. You can also do it free on Colab. 🦥 OpenAI DevDay notebook: github.com/openai/gpt-oss…


Tired of “CUDA out of memory” while training? 😩 I built TraceML, a tiny open-source tool that shows GPU & memory usage live while fine-tuning PyTorch models. Now with ⏱️ step timing. github.com/traceopt-ai/tr… @PyTorch #MachineLearning #CUDA


TraceML: a lightweight tool for real-time PyTorch training memory visibility. View live in your terminal: ⚡ CPU, RAM, GPU usage ⚡ Layer-level allocations ⚡ Activation & gradient memory ⚡ Total forward/backward estimates github.com/traceopt-ai/tr… #PyTorch #DeepLearning #MLOps
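The core idea behind live memory visibility is simple: sample memory on a background thread while the training loop runs, instead of inspecting it after an OOM. The sketch below illustrates that sampling pattern only; it is not TraceML's actual implementation. TraceML tracks CUDA and process memory, but this sketch uses stdlib `tracemalloc` (Python-heap only) and a stand-in loop so it runs anywhere without a GPU. The `LiveMemorySampler` name and its interface are made up for illustration.

```python
import threading
import time
import tracemalloc

# Hypothetical sketch: a background thread records Python-heap usage at a
# fixed interval while a (stand-in) training loop runs. The real tool would
# sample torch.cuda.memory_allocated() / process RSS instead.
class LiveMemorySampler:
    def __init__(self, interval_s=0.01):
        self.interval_s = interval_s
        self.samples = []          # (timestamp, current_bytes) pairs
        self._stop = threading.Event()
        self._thread = threading.Thread(target=self._run, daemon=True)

    def _run(self):
        while not self._stop.is_set():
            current, _peak = tracemalloc.get_traced_memory()
            self.samples.append((time.monotonic(), current))
            self._stop.wait(self.interval_s)   # sleep, but wake early on stop

    def __enter__(self):
        tracemalloc.start()
        self._thread.start()
        return self

    def __exit__(self, *exc):
        self._stop.set()
        self._thread.join()
        tracemalloc.stop()

# Stand-in "training steps": allocate a transient buffer each iteration,
# mimicking activation memory that appears and disappears per step.
with LiveMemorySampler() as sampler:
    for _ in range(20):
        buf = [0.0] * 50_000
        time.sleep(0.005)
        del buf

print(f"collected {len(sampler.samples)} samples, "
      f"peak ~{max(b for _, b in sampler.samples)} bytes")
```

The same pattern extends naturally to per-layer visibility: PyTorch forward/backward hooks can take a memory reading around each module call, which is how layer-level attribution is typically done.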


🔥 My PyTorch training was running slower than expected, so I built a tiny CLI profiler to spot bottlenecks. It shows live: CPU util, GPU util + memory, RAM, activation memory, gradient memory. github.com/traceopt-ai/tr… Focus: answering "why is my training slow?" Would love feedback: what should I improve or add next?
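Answering "why is my training slow?" usually starts with step timing: wrap each phase of the loop and see which one dominates. Here is a minimal stdlib sketch of that idea, in the spirit of the step-timing feature above but not its actual API; the `timed` helper and the phase names are hypothetical.

```python
import time
from collections import defaultdict
from contextlib import contextmanager

# Hypothetical sketch: accumulate wall-clock time per named phase of a
# training step, then report phases sorted by total time spent.
timings = defaultdict(list)

@contextmanager
def timed(phase):
    start = time.perf_counter()
    try:
        yield
    finally:
        timings[phase].append(time.perf_counter() - start)

# Stand-in training loop; sleeps mimic the relative cost of each phase.
for step in range(5):
    with timed("data_loading"):
        time.sleep(0.002)
    with timed("forward_backward"):
        time.sleep(0.005)
    with timed("optimizer_step"):
        time.sleep(0.001)

for phase, ts in sorted(timings.items(), key=lambda kv: -sum(kv[1])):
    print(f"{phase:>16}: {sum(ts) * 1e3:6.1f} ms total over {len(ts)} steps")
```

Reading the report is the diagnosis: if `data_loading` dominates, the GPU is starving (more DataLoader workers or prefetching may help); if `forward_backward` dominates, the model itself is the bottleneck. Note that timing real CUDA work needs `torch.cuda.synchronize()` before each reading, since kernel launches are asynchronous.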

