Volume-weighted Bellman error method for adaptive meshing in approximate dynamic programming
Optimal control and reinforcement learning problems have an associated “value function” that must be suitably approximated. Value function approximation problems usually have different precision requirements in different regions of the state space. A uniform grid wastes resources in regions in which the...
Main Authors: | , |
---|---|
Format: | Article |
Language: | Spanish |
Published: | Universitat Politècnica de València, 2021-12-01 |
Series: | Revista Iberoamericana de Automática e Informática Industrial RIAI |
Subjects: | |
Online Access: | https://polipapers.upv.es/index.php/RIAI/article/view/15698 |