The Fragility of Chain-of-Thought Monitoring Across Typologically Diverse Languages
Paper • 2605.27901 • Published • 8
None defined yet.
The Fragility of Chain-of-Thought Monitoring Across Typologically Diverse Languages
Towards Understanding the Robustness of Sparse Autoencoders