CoT Red-Handed: Stress Testing Chain-of-Thought Monitoring
Byte by Byte: Optimizing Tokenization for Large Language Models
Do image classifiers deepdream of electric sheep?