Let's pretrain a 3B LLM from scratch: on 16+ H100 GPUs, no detail skipped.

william falcon February 13, 2024
Video Thumbnail

You May Also Like

william falcon

View Channel

About

No channel description available.

AI Assistant

Loading...