title

abstract

layout

series

publisher

issn

id

month

tex_title

firstpage

lastpage

page

order

cycles

bibtex_editor

editor

bibtex_author

author

date

note

address

container-title

volume

genre

issued

pdf

extras

Hierarchical POMDP controller optimization by likelihood maximization

Planning can often be simplified by decomposing the task into smaller tasks arranged hierarchically. Charlin et al. [4] recently showed that the hierarchy discovery problem can be framed as a non-convex optimization problem. However, the inherent computational difficulty of solving such an optimization problem makes it hard to scale to real-world problems. In another line of research, Toussaint et al. [18] developed a method to solve planning problems by maximum-likelihood estimation. In this paper, we show how the hierarchy discovery problem in partially observable domains can be tackled using a similar maximum likelihood approach. Our technique first transforms the problem into a dynamic Bayesian network through which a hierarchical structure can naturally be discovered while optimizing the policy. Experimental results demonstrate that this approach scales better than previous techniques based on non-convex optimization.

inproceedings

Proceedings of Machine Learning Research

PMLR

2640-3498

toussaint08a

0

Hierarchical POMDP controller optimization by likelihood maximization

562

570

562-570

562

false

McAllester, David A. and Myllym{"a}ki, Petri

given	family
David A.	McAllester

given	family
Petri	Myllymäki

Toussaint, Marc and Charlin, Laurent and Poupart, Pascal

given	family
Marc	Toussaint

given	family
Laurent	Charlin

given	family
Pascal	Poupart

2008-07-09

Reissued by PMLR on 30 October 2024.

Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence

R6

inproceedings

date-parts

2008

7

9

http://proceedings.mlr.press/r6/toussaint08a/toussaint08a.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

2008-07-09-toussaint08a.md

2008-07-09-toussaint08a.md

Files

2008-07-09-toussaint08a.md

Latest commit

History

2008-07-09-toussaint08a.md

File metadata and controls