I
Board @ Berkeley AI Safety Initiative
Co-Host of IBAR, intercollegiate AI safety retreat
Growth @ Palisade Research
Evals Pedagogy @ AI-Plans
Communications @ AI Safety Founders
Sparse Autoencoders
Activation Patching
Alignment Faking
Emergent Misalignment
Transformer Circuits (coming!)
Superposition & Polysemanticity (coming!)