The Computational Complexity of Understanding Binary Classifier Decisions | Journal of Artificial Intelligence Research

PDF

Published: Jan 21, 2021

DOI: https://doi.org/10.1613/jair.1.12359

Keywords:

neural networks, probabilistic reasoning, satisfiability, machine learning

Stephan Waeldchen

TU Berlin

Jan Macdonald

TU Berlin

Sascha Hauch

TU Berlin

Gitta Kutyniok

TU Berlin

Abstract

For a d-ary Boolean function Φ: {0, 1}^d → {0, 1} and an assignment to its variables x = (x₁, x_2,. . . , x_d) we consider the problem of finding those subsets of the variables that are sufficient to determine the function value with a given probability δ. This is motivated by the task of interpreting predictions of binary classifiers described as Boolean circuits, which can be seen as special cases of neural networks. We show that the problem of deciding whether such subsets of relevant variables of limited size k ≤ d exist is complete for the complexity class NP^PP and thus, generally, unfeasible to solve. We then introduce a variant, in which it suffices to check whether a subset determines the function value with probability at least δ or at most δ − γ for 0 < γ < δ. This promise of a probability gap reduces the complexity to the class NP^BPP. Finally, we show that finding the minimal set of relevant variables cannot be reasonably approximated, i.e. with an approximation factor d^1−α for α > 0, by a polynomial time algorithm unless P = NP. This holds even with the promise of a probability gap.

Issue

Vol. 70 (2021)

Section

Articles

afiliatedsites

JAIR is published by AI Access Foundation, a nonprofit public charity whose purpose is to facilitate the dissemination of scientific results in artificial intelligence. JAIR, established in 1993, was one of the first open-access scientific journals on the Web, and has been a leading publication venue since its inception. We invite you to check out our other initiatives.

Learn more

Article Sidebar

Main Article Content

Abstract

Article Details