Machine learning methods are traditionally divided into three broad categories, which correspond to learning paradigms, depending on the nature of the "signal" or "feedback" available to the learning system: supervised learning, unsupervised learning, and reinforcement learning.

In reinforcement learning, the environment is typically represented as a Markov decision process (MDP). Many reinforcement learning