ARENA learning experience
I found ARENA quite helpful for self-study AI safety related topics and it can work well in together with Deep Learning Curriculum. It offers colab choice for the exercise part, which is great since I don’t have much GPU support for my own computer.
Here is how I spent my time on various topics in ARENA and hope it can work as a reference for someone also interested in self-studying this material.
Total Hours: 42.7. Split into the following:
- 8.7h Chapter 0: Fundamentals
- skipped exercise 0.1 Ray Tracing since I found it too advanced to be necessary.
- 2.3h exercise 0.2 CNN. I learnt about
torch.as_strided
related stuff. - 2.4h exercise 0.3 Resnet. I learnt a more detailed view about resnet, pytorch_lighting, batchnorm.
- 2.0h exercise 0.4 Optimization. I learnt about details of various optimizer,
Weight & Bias
related usage. - 2.0h exercise 0.5 Backprop (skipped part 3 & 4 & 5). I learnt about details of back propagation & Autograd.
- 8.0h Chapter 1: Transformers & Mech Interp
- 3.0h exercise 1.1 transformer (I skipped part of sampling.). I learnt about details of transformer, sampling, training and inference sampling.
- 5.0h exercise 1.2 mechanistic interpretability. I learnt about induction circuits, transformerLens, induction heads, hooks, reverse-engineering induction circuits. These material opened a new view for me about how to understand LLM. I find some part hard to understand though and skipped some of the exercise as I don’t want to spend too much time on this topic for now.
- 16.0h Chapter 2: Reinforcement Learning
- 2.5h exercise 2.1 Introduction to RL. it works like a memory refresher about some RL concepts. It is nice to check the detail of some RL environments.
- 7.0h exercise 2.2 Deep Q Learning
- 4.0h exercise 2.3 PPO. The Atari Breakout game result is shown here.
- 2.5h exercise 2.4 RLHF.
- 10.0h Chapter 3 Training at Scale
- I spent most of my time on doing the exercise in Data Parallelism part and skipped most of the exercise in other sections.
Enjoy Reading This Article?
Here are some more articles you might like to read next: