On the Worst-Case Analysis of Temporal-Difference Learning Algorithms