Q-learning policies for a single agent foraging tasks

Y. Mohan; Ponnambalam S G

Profiles Research Units Publications

Conferences

Q-learning policies for a single agent foraging tasks

Y. Mohan,

Published in

2010

Abstract

Policies play an important role in balancing the trade-off between exploration and exploitation problem in q-learning. Pure exploration degrades the performance of the q-learning but increases the flexibility to adapt in a dynamic environment. On the other hand pure exploitation drives the learning process to locally optimal solutions. In this paper, a single agent foraging task has been modeled incorporating the available policies reported in the open literature to address the exploration and exploitation issues. Policies namely greedy, e-greedy, Boltzmann distribution, Simulated Annealing(SA)algorithm and random search are used to study their performances in the foraging task and the results are presented.

About the journal

Journal	ISMA'10 - 7th International Symposium on Mechatronics and its Applications

Authors (1)

Ponnambalam S G

ABOUT US

ACADEMICS

INTERNATIONAL RELATIONS

RESEARCH

RANKINGS & PLACEMENT

ABOUT US

ACADEMICS

INTERNATIONAL RELATIONS

RESEARCH

RANKINGS & PLACEMENT