🐤 BERT pre-training checkpoints used for analyzing early learning dynamics in "The Subspace Chronicles" (Müller-Eberstein et al., 2023).
Max
personads
AI & ML interests
Natural Language Processing | Representation Learning | Learning Dynamics