Abstract: Isotonic regression is a shape-constrained nonparametric regression in which the ordinate is a nondecreasing function of the abscissa. The regression outcome is an increasing step function. For an initial set of n points, the number of steps m in the isotonic regression may be as large as n. As a result, the full isotonic regression has been criticized as overfitting the data or making the representation too complicated. So-called "reduced" isotonic regression constrains the outcome to have a specified number of steps, b. The fastest previous algorithm for determining an optimal reduced isotonic regression takes Θ(n + bm²) time for the L₂ metric. However, researchers have found this to be too slow and have instead used approximations. Here we reduce the time to Θ(n + bm log m) for the L₁ and L₂ metrics. In contrast, the fastest known algorithm for b-step approximation of arbitrary data takes Θ(bn²) time for L₂ and Θ((b + log n)n²) time for L₁.
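To make the setting concrete, below is a minimal Python sketch (my own illustration, not code from the paper): pool adjacent violators (PAVA) produces the full m-step isotonic regression, and then a straightforward dynamic program over those m weighted steps computes an optimal b-step reduction under L₂. This DP is the Θ(n + bm²) approach the abstract cites as the previous best; the paper's contribution is a faster way to evaluate the minimization, reaching Θ(n + bm log m), which is not shown here. It relies on the standard L₂ fact that an optimal reduction can be computed from the weighted levels of the full isotonic regression rather than the raw points. The function names `pava` and `reduced_isotonic` are hypothetical.

    def pava(y, w=None):
        """L2 isotonic regression by pool adjacent violators.
        Returns step levels, step weights, and the (inclusive) data
        index ending each step."""
        if w is None:
            w = [1.0] * len(y)
        levels, weights, ends = [], [], []
        for i, (yi, wi) in enumerate(zip(y, w)):
            levels.append(float(yi)); weights.append(float(wi)); ends.append(i)
            # merge backward while monotonicity is violated
            while len(levels) > 1 and levels[-2] >= levels[-1]:
                wtot = weights[-2] + weights[-1]
                levels[-2] = (weights[-2]*levels[-2] + weights[-1]*levels[-1]) / wtot
                weights[-2] = wtot
                ends[-2] = ends[-1]
                levels.pop(); weights.pop(); ends.pop()
        return levels, weights, ends

    def reduced_isotonic(y, b):
        """Optimal b-step reduced isotonic regression under L2, via the
        standard Theta(n + b m^2) dynamic program over the m PAVA steps.
        (The paper improves this DP to Theta(n + b m log m).)"""
        levels, weights, ends = pava(y)
        m = len(levels)
        if b >= m:
            return levels, ends
        # prefix sums of w, w*mu, w*mu^2 give O(1) interval costs
        W = [0.0]*(m+1); S1 = [0.0]*(m+1); S2 = [0.0]*(m+1)
        for j in range(m):
            W[j+1]  = W[j]  + weights[j]
            S1[j+1] = S1[j] + weights[j]*levels[j]
            S2[j+1] = S2[j] + weights[j]*levels[j]**2

        def cost(i, j):
            # weighted SSE of pooling steps i..j-1 to their weighted mean
            w = W[j] - W[i]; s1 = S1[j] - S1[i]
            return (S2[j] - S2[i]) - s1*s1/w

        INF = float('inf')
        E = [[INF]*(m+1) for _ in range(b+1)]   # E[k][j]: best k-step fit of steps 0..j-1
        back = [[0]*(m+1) for _ in range(b+1)]
        E[0][0] = 0.0
        for k in range(1, b+1):
            for j in range(k, m+1):
                for i in range(k-1, j):
                    c = E[k-1][i] + cost(i, j)
                    if c < E[k][j]:
                        E[k][j] = c; back[k][j] = i
        # recover the b intervals, their pooled levels, and data end indices
        cuts, j = [], m
        for k in range(b, 0, -1):
            i = back[k][j]; cuts.append((i, j)); j = i
        cuts.reverse()
        out_levels = [(S1[j]-S1[i]) / (W[j]-W[i]) for i, j in cuts]
        out_ends = [ends[j-1] for _, j in cuts]
        return out_levels, out_ends

Because the PAVA levels are strictly increasing, pooling contiguous runs of them keeps the reduced levels increasing as well, so the DP never needs an explicit monotonicity check. The L₁ case follows the same outline with weighted medians in place of weighted means, which is where the extra log n factor in the arbitrary-data bound arises.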
Keywords: isotonic regression algorithms, reduced isotonic regression, histogram, segmentation, monotonic, piecewise constant approximation, step function, isotonic data, dynamic programming
Complete paper. Proc. Interface 2012: The Future of Statistical Computing
NOTE: The work in this paper on L₂ was improved upon here, though that paper does not discuss L₁.
Here is an overview of my work on shape-constrained regression (isotonic, unimodal, step).
Copyright © 2012-2021 Quentin F. Stout