Evaluating Flexibility Metrics on Simple Temporal Networks with Reinforcement Learning