Mostrar el registro sencillo del ítem
dc.contributor.author | Karanik, Marcelo J. | |
dc.contributor.author | Gramajo, Sergio D. | |
dc.date.accessioned | 2009-09-17T19:26:43Z | |
dc.date.available | 2009-09-17T19:26:43Z | |
dc.date.issued | 2009-09-17T19:26:43Z | |
dc.identifier.isbn | 978-987-24967-3-9 | |
dc.identifier.uri | http://hdl.handle.net/10226/473 | |
dc.description.abstract | The software agents are programs that can perceive from their environment and they act to reach their design goals. In most cases the selected agent architecture determines its behaviour in response to different problem states. However, there are some problem domains in which it is desirable that the agent learns a good action execution policy by interacting with its environment. This kind of learning is called Reinforcement Learning (RL) and is useful in the process control area. Given a problem state, the agent selects the adequate action to do and receives an immediate reward. Then it actualizes its estimations about every action and, after a certain period of time, the agent learns which the best action to execute is. Most RL algorithms execute simple actions even if two o more can be executed. This work involves the use of RL algorithms to find an optimal policy in a gridworld problem and proposes a mechanism to combine actions of different types. | en |
dc.language.iso | en | en |
dc.relation.ispartofseries | Karanik, M. y Gramajo, S., (2009, julio). Actions Combination Method for Reinforcement Learning. Trabajo presentado en el Congreso de Inteligencia Computacional Aplicada (CICA), realizado en Buenos Aires del 23 al 24 de julio de 2009. | |
dc.subject | Reinforcement Learning | en |
dc.subject | Actions Combination | en |
dc.subject | SARSA | en |
dc.subject | Optimal Policy | en |
dc.title | Actions Combination Method for Reinforcement Learning | en |
dc.type | Article | en |