Learning Discrete Bayesian Networks from Continuous Data

Yi-Chun Chen; Tim A. Wheeler; Mykel J. Kochenderfer

doi:10.1613/jair.5371

Published: Jun 22, 2017

Abstract

Learning Bayesian networks from raw data can provide insight into the relationships between variables. While real data often contains a mixture of discrete and continuous-valued variables, many Bayesian network structure learning algorithms assume that all random variables are discrete. Continuous variables are therefore often discretized when learning a Bayesian network. However, the choice of discretization policy has a significant impact on the accuracy, speed, and interpretability of the resulting models. This paper introduces a principled Bayesian discretization method for continuous variables in Bayesian networks that has quadratic complexity, rather than the cubic complexity of other standard techniques. Empirical demonstrations show that the proposed method is superior to the established minimum description length algorithm. In addition, this paper shows how to incorporate existing methods into the structure learning process to discretize all continuous variables and simultaneously learn Bayesian network structures.
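
To make the abstract's point about discretization policy concrete, the following is a minimal sketch comparing two generic baseline policies, equal-width and equal-frequency binning, on the same samples. It is not the Bayesian method proposed in the paper, nor the minimum description length algorithm; the function names and the sample values are hypothetical and chosen only for illustration.

```python
from bisect import bisect_right


def equal_width_edges(values, k):
    """Interior cut points that split [min, max] into k equal-width bins."""
    lo, hi = min(values), max(values)
    step = (hi - lo) / k
    return [lo + i * step for i in range(1, k)]


def equal_frequency_edges(values, k):
    """Interior cut points placing roughly len(values) / k samples per bin."""
    ordered = sorted(values)
    n = len(ordered)
    return [ordered[(i * n) // k] for i in range(1, k)]


def discretize(values, edges):
    """Map each value to the index of the bin it falls in."""
    return [bisect_right(edges, v) for v in values]


# Hypothetical samples: five clustered values and one outlier.
samples = [0.1, 0.2, 0.3, 0.4, 0.5, 9.0]

width_bins = discretize(samples, equal_width_edges(samples, 3))
freq_bins = discretize(samples, equal_frequency_edges(samples, 3))

print(width_bins)  # [0, 0, 0, 0, 0, 2]
print(freq_bins)   # [0, 0, 1, 1, 2, 2]
```

With equal-width bins the single outlier stretches the range, so the five clustered samples collapse into one bin and the middle bin stays empty, while equal-frequency bins spread the samples evenly. A network learned from either discretized column would see a different discrete variable, which is one simple way in which the policy affects the resulting model.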

Issue

Vol. 59 (2017)

Section

Articles
