Memory-Based Modeling And Prioritized Sweeping In Reinforcement Learning