Bayesian Regression Tree Ensembles that Adapt to Smoothness and Sparsity

Abstract

Ensembles of decision trees are a useful tool for obtaining flexible estimates of regression functions. Examples of these methods include gradient-boosted decision trees, random forests and Bayesian classification and regression trees. Two potential shortcomings of tree ensembles are their lack of smoothness and their vulnerability to the curse of dimensionality. We show that these issues can be overcome by instead considering sparsity inducing soft decision trees in which the decisions are treated as probabilistic. We implement this in the context of the Bayesian additive regression trees framework and illustrate its promising performance through testing on benchmark data sets. We provide strong theoretical support for our methodology by showing that the posterior distribution concentrates at the minimax rate (up to a logarithmic factor) for sparse functions and functions with additive structures in the high dimensional regime where the dimensionality of the covariate space is allowed to grow nearly exponentially in the sample size. Our method also adapts to the unknown smoothness and sparsity levels, and can be implemented by making minimal modifications to existing Bayesian additive regression tree algorithms.

Document Details

Document Type
Pub Defense Publication
Publication Date
Sep 21, 2018
Source ID
10.1111/rssb.12293

Entities

People

  • Antonio R. Linero
  • Yun Yang

Organizations

  • Florida State University
  • National Science Foundation
  • United States Department of Defense
  • University of Illinois Urbana–Champaign

Tags

Fields of Study

  • Computer science

Readers

  • Neural Network Machine Learning.
  • Statistical inference.

Technology Areas

  • AI & ML
  • AI & ML - Bayesian Inference
  • AI & ML - Machine Learning Algorithms
  • Space