Market Timing strategy through Reinforcement Learning