Stable Models and Temporal Difference Learning