AI & ML interests

Principled evaluation of mechanistic interpretability methods.

Recent Activity

amueller  updated a Space 20 days ago
mib-bench/leaderboard
amueller  published a Space 20 days ago
mib-bench/leaderboard
amueller  updated a Space 20 days ago
mib-bench/leaderboard
View all activity