Isotonic Regression for Multiple Independent Variables

Quentin F. Stout
Computer Science and Engineering, University of Michigan

 

Abstract: This paper gives an approach for computing isotonic regressions at points in multidimensional space, with the ordering given by domination (also known as dimensional ordering, majorization, or matrix ordering). That is, given points p=(p1,...,pd) and q=(q1,...,qd) in d-dimensional space, q is said to dominate p if pi ≤ qi for all 1 ≤i ≤ d. Given a set of n d-dimensional points V = {v1...vn} where point vi has data value yi and weight wi, the Lp isotonic regression of the data is a set of values ýi that minimizes

(Σ {wi | yi - ýi | p)1/p : 1 ≤ i ≤ n }   if 1 ≤ p < ∞
max { wi|yi - ýi| : 1 ≤ i ≤ n}   if p = ∞
subject to the isotonic constraint that if vi dominates vj, then ýi ≥ ýj. Typically people are interested in L2 (least squares), L1 (median regression), or L (mini-max error, or the uniform metric).

Note that each dimension need only be a linearly ordered set. For example, one may be height, another shoe size, and a third S < M < L < XL shirt size, where y may correspond to weight. The isotonic requirement is based on the reasonable assumption that if any two of the dimensions are held constant then an increase in the third will increase the expected weight, or, at least, not decrease it.

The paper shows how to efficiently compute the isotonic regression by embedding V into a larger directed acyclic graph (dag) G=(V∪R, E). G is bipartitie, and for all u, v ∈ V, v dominates u iff there is an r ∈ R such that (u,r), (r,v) ∈ E. Further, R and E are only slightly larger than V. Using G, algorithms are given for the L1 and L2 metrics which are n/poly-log n faster than previous algorithms. A yet faster algorithm is given for L1 isotonic regression with unweighted data.

L is not unique, so algorithms are given for several different variants that have been studied. This includes the strict L isotonic regression, which is the limit, as p→∞, of Lp isotonic regression. These too are n/poly-log n faster than previous algorithms.

Some of the algorithms exploit the fact that G is not much larger than V, which others use its structure in a more central way. By using the rendezvous vertices R, operations are performed over the transitive closure more efficiently than they can be performed over the transitive closure itself.

This embedding approach had first been used for L isotonic regression. I've significantly revised that paper, and moved the approach to this paper, extending it to other metrics and to the strict L isotonic regression.

Keywords: weighted isotonic regression, monotonic, multidimensional, nonparametric, DAG, poset, order-embedding, shape-constrained

Complete paper

 

Related work: Here is information about the fastest isotonic regression algorithms known to date, for a variety of orderings and metrics.

Here is a paper on L isotonic regression, also known as minimax regression, one on strict L isotonic regression, one on L1isotonic regression, also known as median regression, and one on unimodal regression.


Quentin's Home Copyright © 2009-2012 Quentin F. Stout.