What does this research mean for the field?

The HI-MAT algorithm automatically discovers compact MAXQ task hierarchies from successful source trajectories. The discovered hierarchies are comparable to manually engineered ones and significantly accelerate learning when transferred to new target tasks. Novelty: methodological. Consensus alignment: neutral.

January 1, 2008

Automatic discovery and transfer of MAXQ hierarchies

Abstract

We present an algorithm, HI-MAT (Hierarchy Induction via Models And Trajectories), that discovers MAXQ task hierarchies by applying dynamic Bayesian network models to a successful trajectory from a source reinforcement learning task. HI-MAT discovers subtasks by analyzing the causal and temporal relationships among the actions in the trajectory. Under appropriate assumptions, HI-MAT induces hierarchies that are consistent with the observed trajectory and have compact value-function tables employing safe state abstractions. We demonstrate empirically that HI-MAT constructs compact hierarchies that are comparable to manually engineered hierarchies and facilitate a significant speedup in learning when transferred to a target task.
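
For readers unfamiliar with the kind of structure HI-MAT outputs, the following is a minimal sketch of a MAXQ task hierarchy and the standard MAXQ value decomposition, Q(i, s, a) = V(a, s) + C(i, s, a). It is not the HI-MAT algorithm itself and does not perform any hierarchy discovery. The `Task` class, the task names, the state label `"s0"`, and all numeric values are hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

State = str  # assumption: states are plain labels in this toy example


@dataclass
class Task:
    """A node in a MAXQ task hierarchy (primitive if it has no children)."""
    name: str
    children: List["Task"] = field(default_factory=list)
    # Primitive tasks: expected one-step reward V(a, s), keyed by state.
    reward: Dict[State, float] = field(default_factory=dict)
    # Composite tasks: completion values C(i, s, a), keyed by (state, child name).
    completion: Dict[Tuple[State, str], float] = field(default_factory=dict)

    @property
    def is_primitive(self) -> bool:
        return not self.children


def q_value(parent: Task, state: State, child: Task) -> float:
    """MAXQ decomposition: Q(i, s, a) = V(a, s) + C(i, s, a)."""
    return value(child, state) + parent.completion.get((state, child.name), 0.0)


def value(task: Task, state: State) -> float:
    """V(i, s): stored reward for primitives, max over child Q-values otherwise."""
    if task.is_primitive:
        return task.reward.get(state, 0.0)
    return max(q_value(task, state, child) for child in task.children)


# Hypothetical two-level hierarchy with made-up values.
north = Task("north", reward={"s0": -1.0})
pickup = Task("pickup", reward={"s0": -10.0})
navigate = Task("navigate", children=[north],
                completion={("s0", "north"): -2.0})
root = Task("root", children=[navigate, pickup],
            completion={("s0", "navigate"): -4.0, ("s0", "pickup"): 0.0})

print(value(navigate, "s0"))  # V(north) + C(navigate, s0, north)
print(value(root, "s0"))      # best of Q(root, s0, navigate) and Q(root, s0, pickup)
```

Each composite task stores only its own completion values, so a compact hierarchy in this sense is one where those per-task tables stay small. In a real MAXQ learner the completion values would be learned from experience rather than written by hand.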
