A classroom debate about Java and Python is also a test of whether AI helps students reason about code, or only helps them produce something that looks right.