Designing deceptively simple math problems to challenge (or troll) students and LLMs, and why it is crucial to include enough atypical variations in training datasets and textbooks.