Featured Results

PT2H15M13S
Reinforcement Learning from Human Feedback explained with math derivations and the PyTorch code.
Umar Jamil · Feb 27, 2024
PT1H26M21S
Mistral / Mixtral Explained: Sliding Window Attention, Sparse Mixture of Experts, Rolling Buffer
Umar Jamil · Dec 27, 2023
PT1H12M53S
Distributed Training with PyTorch: complete tutorial with cloud infrastructure and code
Umar Jamil · Dec 19, 2023
PT49M24S
Retrieval Augmented Generation (RAG) Explained: Embedding, Sentence BERT, Vector Database (HNSW)
Umar Jamil · Nov 27, 2023
PT54M52S
BERT explained: Training, Inference, BERT vs GPT/LLamA, Fine tuning, [CLS] token
Umar Jamil · Oct 26, 2023
PT1H10M55S
LLaMA explained: KV-Cache, Rotary Positional Embedding, RMS Norm, Grouped Query Attention, SwiGLU
Umar Jamil · Aug 24, 2023
PT26M55S