Regret Analysis for Discounted Reinforcement Learning