Offline Policy-Search in Bayesian Reinforcement Learning