Papers

For a full list of publications, see my Google Scholar.

Selected papers

Oscar Obeso*, Andy Arditi*, Javier Ferrando, Joshua Freeman, Cameron Holmes, Neel Nanda
arXiv preprint arXiv:2509.03531, 2025
Andy Arditi*, Oscar Obeso*, Aaquib Syed, Daniel Paleka, Nina Panickssery, Wes Gurnee, Neel Nanda
Advances in Neural Information Processing Systems, 2024

Other papers

Runjin Chen, Andy Arditi, Henry Sleight, Owain Evans, Jack Lindsey
arXiv preprint arXiv:2507.21509, 2025
Aryo Pradipta Gema, Alexander Hägele, Runjin Chen, Andy Arditi, Jacob Goldman-Wetzler, Kit Fraser-Taliente, Henry Sleight, Linda Petrini, Julian Michael, Beatrice Alex, Pasquale Minervini, Yanda Chen, Joe Benton, Ethan Perez
arXiv preprint arXiv:2507.14417, 2025
Kureha Yamaguchi, Benjamin Etheridge, Andy Arditi
arXiv preprint arXiv:2507.03167, 2025
Miles Turpin, Andy Arditi, Marvin Li, Joe Benton, Julian Michael
arXiv preprint arXiv:2506.22777, 2025
Min Woo Park, Andy Arditi, Elias Bareinboim, Sanghack Lee
Advances in Neural Information Processing Systems, 2025

Blog posts

AI & interpretability

Andy Arditi, Runjin Chen
LessWrong, 2025
Andy Arditi, Marvin Li, Joe Benton, Miles Turpin
LessWrong, 2025
Daniel Lee, Eric Breck, Andy Arditi
LessWrong, 2025
Andy Arditi
LessWrong, 2024
Andy Arditi, Bilal Chughtai
LessWrong, 2024
Andy Arditi, Oscar Obeso, Aaquib Syed, Wes Gurnee, Neel Nanda
LessWrong, 2024
Andy Arditi, Oscar Obeso
LessWrong, 2023

Cryptography & blockchain

Andy Arditi, Ye Zhang
Scroll blog, 2022
Andy Arditi
Personal blog, 2022
Andy Arditi
Personal blog, 2022
Andy Arditi
Personal blog, 2022