Sharing knowledge and insights
Talks, blog posts, videos, and workshops on deep learning, LLM training, and building production ML systems.
Scaling LLM Training to 1000 GPUs
A deep dive into distributed training strategies, including data parallelism and pipeline parallelism, for large language models at scale.
Understanding Attention Mechanisms: A Visual Guide
An illustrated walkthrough of self-attention, multi-head attention, and cross-attention with interactive visualizations and PyTorch code.
Hands-on LLM Fine-tuning with Hugging Face
A practical workshop on fine-tuning open-source LLMs using LoRA, QLoRA, and the Hugging Face ecosystem for domain-specific tasks.
Building Production ML Pipelines with Kubeflow
An end-to-end tutorial on building automated ML pipelines with Kubeflow, from data ingestion through model serving with canary deployments.