Lectures
You can download the lectures here.
-
2. Introduction to Language Models
Lecture date: August 04, 2025
tl;dr: Introduction to language modelling, RNNs, backpropagation through time, LSTMs, GRUs (a toy BPTT sketch follows the readings).
[ slides ] [ recording ]
Suggested Readings:
- Chapter 3, Speech and Language Processing
- Backpropagation Through Time: What It Does and How to Do It
- The Unreasonable Effectiveness of Recurrent Neural Networks
- Learning long-term dependencies with gradient descent is difficult
- On the difficulty of training Recurrent Neural Networks
- Understanding LSTM Networks
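To make the recurrence and its failure mode concrete, here is a minimal NumPy sketch of a vanilla RNN forward pass and backpropagation through time; the dimensions, the random data, and the stand-in loss are illustrative, not taken from the lecture.

```python
import numpy as np

# Toy vanilla RNN: h_t = tanh(W_xh x_t + W_hh h_{t-1} + b).
rng = np.random.default_rng(0)
D, H, T = 4, 8, 5                      # input dim, hidden dim, sequence length
W_xh = rng.normal(0, 0.1, (H, D))
W_hh = rng.normal(0, 0.1, (H, H))
b = np.zeros(H)

xs = rng.normal(size=(T, D))           # a random input sequence
hs = [np.zeros(H)]                     # h_0 = 0
for t in range(T):                     # forward pass, keeping all states for BPTT
    hs.append(np.tanh(W_xh @ xs[t] + W_hh @ hs[-1] + b))

# BPTT for the stand-in loss L = sum_t ||h_t||^2 / 2.
dW_hh = np.zeros_like(W_hh)
dh_next = np.zeros(H)                  # gradient flowing in from step t+1
for t in reversed(range(T)):
    dh = hs[t + 1] + dh_next           # dL/dh_t: local term + carried term
    dz = dh * (1 - hs[t + 1] ** 2)     # through tanh
    dW_hh += np.outer(dz, hs[t])
    dh_next = W_hh.T @ dz              # repeated multiplication by W_hh^T is
                                       # why gradients vanish or explode over long T
print(dW_hh.shape)                     # (8, 8)
```

The last line of the backward loop is the crux of the Bengio et al. and Pascanu et al. readings: the carried gradient is multiplied by W_hh^T once per timestep and so shrinks or grows geometrically, which is what the gating in LSTMs and GRUs mitigates.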
-
5. Pre-training and Instruction Tuning
Lecture date: August 11, 2025
tl;dr: Pre-training strategies for encoder-only, encoder-decoder, and decoder-only models; instruction tuning and weighted instruction tuning (a toy loss sketch follows the readings)
[ slides ] [ recordingA ] [ recordingB ]
Suggested Readings:
- Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
- BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension
- The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
- Self-Instruct: Aligning Language Models with Self-Generated Instructions
- Orca: Progressive Learning from Complex Explanation Traces of GPT-4
- On the Effect of Instruction Tuning Loss on Generalization
- Instruction Tuning for Large Language Models: A Survey
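As a toy illustration of the loss design behind instruction tuning and its weighted variant (in the spirit of the loss-generalization reading above): compute the next-token cross-entropy only on response tokens, or include prompt tokens with a smaller weight. The tensors and the weight value below are random stand-ins for a real model and dataset.

```python
import torch
import torch.nn.functional as F

vocab, seq_len = 100, 8
logits = torch.randn(seq_len, vocab)               # model outputs per position
targets = torch.randint(0, vocab, (seq_len,))      # gold next-token ids
is_response = torch.tensor([0., 0., 0., 1., 1., 1., 1., 1.])  # 0 = prompt token

per_token = F.cross_entropy(logits, targets, reduction="none")

# Standard instruction tuning: average the loss over response tokens only.
it_loss = (per_token * is_response).sum() / is_response.sum()

# Weighted instruction tuning: prompt tokens also contribute, down-weighted.
w = 0.1                                            # illustrative prompt weight
weights = is_response + (1 - is_response) * w
wit_loss = (per_token * weights).sum() / weights.sum()
print(it_loss.item(), wit_loss.item())
```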
-
9. RLHF: Part 03
Lecture date: August 21, 2025
tl;dr: GRPO, PPO, TRPO (a toy objective sketch follows the readings)
[ slides ] [ recording ]
Suggested Readings:
- OpenAI Spinning Up (documentation and introductory blogs on Deep RL)
- Trust Region Policy Optimization
- High-Dimensional Continuous Control Using Generalized Advantage Estimation
- Proximal Policy Optimization Algorithms
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models (for GRPO)
- Training language models to follow instructions with human feedback
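A minimal sketch tying the readings together: GRPO-style advantages (rewards normalized within a group of samples for the same prompt, with no learned value function) plugged into the PPO clipped objective. The log-probabilities and rewards are random placeholders; real implementations work per token over batched rollouts.

```python
import torch

eps = 0.2                                 # PPO clipping range
G = 6                                     # group size: samples per prompt
logp_new = torch.randn(G)                 # log pi_theta(y|x), current policy
logp_old = logp_new.detach() + 0.1 * torch.randn(G)   # from the rollout policy

# GRPO: advantage = group-normalized reward (mean 0, unit variance).
rewards = torch.randn(G)                  # one scalar reward per sampled response
adv = (rewards - rewards.mean()) / (rewards.std() + 1e-8)

ratio = torch.exp(logp_new - logp_old)    # importance ratio pi_new / pi_old
clipped = torch.clamp(ratio, 1 - eps, 1 + eps)
ppo_loss = -torch.minimum(ratio * adv, clipped * adv).mean()
print(ppo_loss.item())
```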
-
12. Efficient LLMs: Part 02
Lecture date: August 28, 2025
tl;dr: Efficient distributed training: data parallelism, ZeRO (stages 1, 2, and 3), FSDP
[ slides ] [ recordings ]
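A rough memory-accounting sketch of why ZeRO helps, using the ZeRO paper's rule of thumb of 16 bytes per parameter for mixed-precision Adam (2P fp16 weights + 2P fp16 gradients + 12P fp32 master weights, momentum, and variance). The model size and GPU count are illustrative.

```python
def per_gpu_memory_gb(P, N):
    """P = parameter count, N = data-parallel degree; returns GB per GPU."""
    GB = 1024 ** 3
    weights, grads, optim = 2 * P, 2 * P, 12 * P      # bytes, mixed-precision Adam
    return {
        "plain DP": (weights + grads + optim) / GB,        # everything replicated
        "ZeRO-1":   (weights + grads + optim / N) / GB,    # shard optimizer state
        "ZeRO-2":   (weights + (grads + optim) / N) / GB,  # ...and gradients
        "ZeRO-3":   ((weights + grads + optim) / N) / GB,  # ...and weights (FSDP-style)
    }

print(per_gpu_memory_gb(P=7e9, N=8))      # a 7B model on 8 GPUs: ~104 GB -> ~13 GB
```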
-
14. Efficient LLMs: Part 04
Lecture date: September 3, 2025
tl;dr: Pipeline parallelism (AFAB, 1F1B), GPU basics, FlashAttention
[ slides ]
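A small sketch of the idea at the heart of FlashAttention: an online (streaming) softmax that consumes key/value blocks one at a time with a running max and normalizer, so the full T x T score matrix is never materialized. Single query, toy sizes, NumPy only; the real kernel also tiles over queries and fuses everything in on-chip SRAM.

```python
import numpy as np

rng = np.random.default_rng(0)
T, d, block = 64, 16, 8
q = rng.normal(size=d)
K = rng.normal(size=(T, d))
V = rng.normal(size=(T, d))

m = -np.inf          # running max of scores (numerical stability)
l = 0.0              # running softmax normalizer
acc = np.zeros(d)    # running weighted sum of values
for s in range(0, T, block):
    scores = K[s:s + block] @ q / np.sqrt(d)
    m_new = max(m, scores.max())
    scale = np.exp(m - m_new)            # rescale old stats to the new max
    p = np.exp(scores - m_new)
    l = l * scale + p.sum()
    acc = acc * scale + p @ V[s:s + block]
    m = m_new
out = acc / l

# Matches the naive full-matrix computation.
full = K @ q / np.sqrt(d)
ref = np.exp(full - full.max())
assert np.allclose(out, (ref / ref.sum()) @ V)
```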
-
15. Efficient LLMs: Part 05
Lecture date: September 4, 2025
tl;dr: Training vs. inference: forward pass, inference, KV cache usage and management (vLLM, KV blocks, paged attention)
[ slides ]
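A minimal sketch of why the KV cache pays off at decode time: past tokens' keys and values never change, so each step appends one row instead of recomputing all of them. The weights and sizes are illustrative; paged attention (vLLM) additionally stores these rows in fixed-size KV blocks so the cache can grow without fragmentation.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 16
Wq, Wk, Wv = (rng.normal(0, 0.1, (d, d)) for _ in range(3))
K_cache = np.empty((0, d))     # grows by one row per generated token
V_cache = np.empty((0, d))

def decode_step(x):
    """One decoding step for the current token embedding x, shape (d,)."""
    global K_cache, V_cache
    K_cache = np.vstack([K_cache, x @ Wk])   # append: O(1) new K/V rows per step
    V_cache = np.vstack([V_cache, x @ Wv])
    q = x @ Wq
    scores = K_cache @ q / np.sqrt(d)        # attend over all cached tokens
    probs = np.exp(scores - scores.max())
    probs /= probs.sum()
    return probs @ V_cache                   # attention output for this step

for _ in range(5):                           # "generate" five tokens
    out = decode_step(rng.normal(size=d))
print(out.shape, K_cache.shape)              # (16,) (5, 16)
```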
-
16. Efficient LLMs: Part 06
Lecture date: September 8, 2025
tl;dr: Training vs. inference (in code), Mixture-of-Experts architecture, recap of efficient LLMs
[ slides ]
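A toy top-k Mixture-of-Experts layer under the usual assumptions (softmax router, renormalized top-k gates, MLP experts); the sizes and the per-expert Python loop are illustrative, and real systems dispatch tokens with batched scatter/gather plus load-balancing losses.

```python
import torch
import torch.nn.functional as F

class MoE(torch.nn.Module):
    def __init__(self, d=32, n_experts=4, k=2):
        super().__init__()
        self.k = k
        self.router = torch.nn.Linear(d, n_experts)
        self.experts = torch.nn.ModuleList(
            torch.nn.Sequential(torch.nn.Linear(d, 4 * d), torch.nn.GELU(),
                                torch.nn.Linear(4 * d, d))
            for _ in range(n_experts))

    def forward(self, x):                    # x: (tokens, d)
        gate = F.softmax(self.router(x), dim=-1)
        w, idx = gate.topk(self.k, dim=-1)   # pick top-k experts per token
        w = w / w.sum(dim=-1, keepdim=True)  # renormalize the kept gate weights
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = idx == e                  # (tokens, k): where expert e is chosen
            rows = mask.any(dim=-1)
            if rows.any():                   # only chosen tokens visit expert e
                out[rows] += expert(x[rows]) * w[mask][:, None]
        return out

x = torch.randn(10, 32)
print(MoE()(x).shape)                        # torch.Size([10, 32])
```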
-
17. Parameter-Efficient Fine-Tuning (PEFT)
Lecture date: September 10, 2025
tl;dr: Additive, selective, and re-parameterization PEFT techniques (a minimal LoRA sketch follows the readings)
[ slides ]
Suggested Readings:
- Parameter-Efficient Transfer Learning for NLP
- Prefix-Tuning: Optimizing Continuous Prompts for Generation
- The Power of Scale for Parameter-Efficient Prompt Tuning
- BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models
- Training Neural Networks with Fixed Sparse Masks
- Parameter-Efficient Fine-Tuning without Introducing New Latency
- LoRA: Low-Rank Adaptation of Large Language Models
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning
- DoRA: Weight-Decomposed Low-Rank Adaptation
- QLoRA: Efficient Finetuning of Quantized LLMs
- Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation
- Step-by-Step Unmasking for Parameter-Efficient Fine-tuning of Large Language Models
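Among the methods listed above, LoRA is compact enough to sketch in full: freeze the pretrained weight W and learn a low-rank update BA scaled by alpha/r, so the effective weight is W + (alpha/r)BA. The wrapped layer, rank, and alpha below are illustrative.

```python
import torch

class LoRALinear(torch.nn.Module):
    def __init__(self, base: torch.nn.Linear, r=8, alpha=16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():     # pretrained weights stay frozen
            p.requires_grad = False
        # Gaussian A, zero B (as in the paper), so BA = 0 at initialization.
        self.A = torch.nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = torch.nn.Parameter(torch.zeros(base.out_features, r))
        self.scale = alpha / r

    def forward(self, x):
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

layer = LoRALinear(torch.nn.Linear(64, 64))
trainable = sum(p.numel() for p in layer.parameters() if p.requires_grad)
print(trainable)     # 1024 trainable params vs. 4160 in the frozen base layer
```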
-
18. Model Compression
Lecture date: September 11, 2025
tl;dr: Different model pruning techniques
[ slides ]
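As a minimal example of the simplest member of this family, unstructured magnitude pruning zeroes the weights with the smallest absolute values; the sparsity level and the toy weight matrix below are illustrative.

```python
import torch

def magnitude_prune(W: torch.Tensor, sparsity: float) -> torch.Tensor:
    """Zero out the `sparsity` fraction of weights with the smallest |w|."""
    k = max(1, int(W.numel() * sparsity))          # number of weights to drop
    threshold = W.abs().flatten().kthvalue(k).values
    return W * (W.abs() > threshold)               # binary mask keeps the rest

W = torch.randn(8, 8)
W_pruned = magnitude_prune(W, sparsity=0.5)
print((W_pruned == 0).float().mean().item())       # ~0.5
```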
-
19. Knowledge Distillation
Lecture date: September 18, 2025
tl;dr: Different techniques for knowledge distillation in LLMs (a minimal loss sketch follows the readings)
[ slides ]
Suggested Readings:
- Distilling the Knowledge in a Neural Network
- Sequence-Level Knowledge Distillation
- MiniLLM: Knowledge Distillation of Large Language Models
- On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes
- BERT Learns to Teach: Knowledge Distillation with Meta Learning
- A Good Learner can Teach Better: Teacher-Student Collaborative Knowledge Distillation
- On the Generalization vs Fidelity Paradox in Knowledge Distillation
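The classic formulation from the first reading fits in a few lines: match the student's temperature-softened distribution to the teacher's with a KL term (scaled by T^2 to keep gradient magnitudes comparable) and mix in the usual cross-entropy on gold labels. The logits and labels below are random stand-ins.

```python
import torch
import torch.nn.functional as F

T, alpha = 2.0, 0.5                               # temperature and mixing weight
teacher_logits = torch.randn(4, 100)              # (batch, vocab); stand-ins
student_logits = torch.randn(4, 100, requires_grad=True)
labels = torch.randint(0, 100, (4,))

kd = F.kl_div(F.log_softmax(student_logits / T, dim=-1),
              F.log_softmax(teacher_logits / T, dim=-1),
              reduction="batchmean", log_target=True) * T * T
ce = F.cross_entropy(student_logits, labels)
loss = alpha * kd + (1 - alpha) * ce              # distillation + supervised term
print(loss.item())
```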
-
20. Retrieval-based LMs: Part 01
Lecture date: September 22, 2025
tl;dr: Motivation behind retrieval-augmented LMs, Retriever pipeline, different retrieval methods (sparse and dense)
[ slides ]
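A toy contrast of the two retrieval families: a sparse TF-IDF-style bag-of-words scorer versus a dense inner-product scorer. The dense "embeddings" here are random stand-ins for a trained bi-encoder, so only the sparse ranking is meaningful in this example.

```python
import numpy as np

docs = ["the cat sat on the mat", "dogs chase cats", "stock markets fell today"]
query = "cat on a mat"

# Sparse retrieval: term-frequency vectors weighted by inverse document frequency.
vocab = sorted({w for text in docs + [query] for w in text.split()})
def bow(text):
    v = np.zeros(len(vocab))
    for w in text.split():
        v[vocab.index(w)] += 1
    return v

df = sum((bow(d) > 0).astype(float) for d in docs)     # document frequency
idf = np.log(len(docs) / np.maximum(df, 1))            # rare terms score higher
sparse_scores = [bow(d) @ (bow(query) * idf) for d in docs]
print("sparse best:", int(np.argmax(sparse_scores)))   # doc 0: exact word overlap

# Dense retrieval: embed query and docs, score by inner product.
rng = np.random.default_rng(0)
embed = {t: rng.normal(size=16) for t in docs + [query]}  # stand-in encoder
dense_scores = [embed[d] @ embed[query] for d in docs]
print("dense best:", int(np.argmax(dense_scores)))     # arbitrary with random vectors
```

A trained dense encoder would place "cats" near "cat", which is exactly the lexical-mismatch problem sparse methods cannot solve.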
-
21. Retrieval-based LMs: Part 02
Lecture date: September 24, 2025
tl;dr: Cross-encoder reranking, token-level dense retrieval (ColBERT), GraphRAG, HippoRAG
[ slides ]
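ColBERT's late-interaction score is simple to sketch: embed query and document at the token level, take each query token's maximum similarity over document tokens, and sum over query tokens. The random unit vectors below stand in for the trained encoder; a cross-encoder reranker would instead feed the concatenated query-document pair through the full model.

```python
import torch
import torch.nn.functional as F

def maxsim(Q, D):
    """ColBERT score. Q: (q_tokens, dim), D: (d_tokens, dim), L2-normalized."""
    return (Q @ D.T).max(dim=1).values.sum()   # max over doc tokens, sum over query

Q = F.normalize(torch.randn(5, 32), dim=-1)                    # stand-in query
docs = [F.normalize(torch.randn(n, 32), dim=-1) for n in (7, 12, 9)]
scores = torch.stack([maxsim(Q, D) for D in docs])
print(scores.argsort(descending=True))         # rerank candidates by these scores
```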
-
22. LLM Agents
Lecture date: TBA
tl;dr: Discussion on how we can teach LLMs to use APIs and call appropriate functions when required, and on design decisions and protocols (e.g., MCP) for developing LLM-based agents.
-
23. Large Reasoning Models
Lecture date: TBA
tl;dr: Discussion on post-training techniques for enhancing the reasoning capabilities of LLMs and developing reasoning models, and on the paradigm of test-time scaling.
-
24. Multimodal Models
Lecture date: TBA
tl;dr: Discussion on the architecture and pre-training strategies of multimodal models mainly involving text and image modalities.
-
25. Alternative LLM Architectures
Lecture date: TBA
tl;dr: Discussion on the non-transformer-based LLM architectures: state space models, diffusion-based models, etc.
-
26. Physics of Language Models
Lecture date: TBA
tl;dr: Discussion on how LLMs store, extract, and manipulate knowledge, how this scales, why diverse pre-training makes facts extractable, and cases where models excel and struggle.
-
27. Interpreting the Inner Workings of LLMs
Lecture date: TBA
tl;dr: Discussion on various interpretability techniques to decipher the inner workings of LLMs.
-
28. Conclusion
Lecture date: TBA
tl;dr: Summary and discussion of the current state of LLM research.