Ady’s World     Contact




    

            
Experience

Board @  Berkeley AI Safety Initiative

Co-Host of IBAR, intercollegiate AI safety retreat

Growth @  Palisade Research

Evals Pedagogy @ AI-Plans

Communications @ AI Safety Founders


Notes

Sparse Autoencoders 

Activation Patching

Alignment Faking

Emergent Misalignment

Transformer Circuits (coming!)

Superposition & Polysemanticity (coming!)