-
Find the induction heads in GPT-2
I tried to replicating the Decoder-only transformer by following "Attention is all you need" paper and trained it on William Shakespeare's work.
-
Replicating Decoder-only Transformer by using William Shakespeare Corpus
I tried to replicating the Decoder-only transformer by following "Attention is all you need" paper and trained it on William Shakespeare's work.
-
Replicating Diffusion Models on MNIST
I tried to replicating diffusion model (DDPM & DDIM) by following DeepLearningAI course and trained it on MNIST.