Nonapproximability Results for Partially Observable Markov Decision Processes

C. Lusena; J. Goldsmith; M. Mundhenk

doi:10.1613/jair.714


Published: Mar 1, 2001



Abstract

We show that for several variations of partially observable Markov decision processes, polynomial-time algorithms for finding control policies are unlikely to have, or simply do not have, guarantees of finding policies within a constant factor or a constant summand of optimal. Here "unlikely" means "unless some complexity classes collapse," where the collapses considered are P=NP, P=PSPACE, or P=EXP. Until or unless these collapses are shown to hold, any control-policy designer must choose between such performance guarantees and efficient computation.

Issue

Vol. 14 (2001)

Section

Articles


JAIR is published by AI Access Foundation, a nonprofit public charity whose purpose is to facilitate the dissemination of scientific results in artificial intelligence. JAIR, established in 1993, was one of the first open-access scientific journals on the Web, and has been a leading publication venue since its inception.
