Volume-weighted Bellman error method for adaptive meshing in approximate dynamic programming

Optimal control and reinforcement learning have an associate “value function” which must be suitably approximated. Value function approximation problems usually have different precision requirements in different regions of the state space. An uniform gridding wastes resources in regions in which the...

Full description

Saved in:
Bibliographic Details
Main Authors: Leopoldo Armesto, Antonio Sala
Format: Article
Language:Spanish
Published: Universitat Politècnica de València 2021-12-01
Series:Revista Iberoamericana de Automática e Informática Industrial RIAI
Subjects:
Online Access:https://polipapers.upv.es/index.php/RIAI/article/view/15698
Tags: Add Tag
No Tags, Be the first to tag this record!