March 4, 2024Open Access

Tsallis Entropy Regularization for Linearly Solvable MDP and Linear Quadratic Regulator

Puntos clave

Los puntos clave no están disponibles para este artículo en este momento.

Resumen

Shannon entropy regularization is widely adopted in optimal control due to its ability to promote exploration and enhance robustness, e.g., maximum entropy reinforcement learning known as Soft Actor-Critic. In this paper, Tsallis entropy, which is a one-parameter extension of Shannon entropy, is used for the regularization of linearly solvable MDP and linear quadratic regulators. We derive the solution for these problems and demonstrate its usefulness in balancing between exploration and sparsity of the obtained control law.

Leer artículo completoexternamente

Me gusta

Guardar

Ver artículo completo

Cite This Study

Hashizume et al. (Mon,) studied this question.

synapsesocial.com/papers/68e75ddfb6db6435876d508d https://doi.org/https://doi.org/10.48550/arxiv.2403.01805

Me gusta

Guardar

Ver artículo completo