This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder. |
If you use code found here in a publication or presentation, I would appreciate it if you would acknowledge the source.
Click the link to download.
G. Blanchard, A. Deshmukh, U. Dogan, G. Lee, and C. Scott, ``Domain Generalization by Marginal Transfer Learning."
H. Ramaswamy, C. Scott, and A. Tewari, "Mixture Proportion Estimation via Kernel Embedding of Distributions," avXiv:1603.02501.
The code is in python 2.7 and requires the scipy, numpy, matplotlib, and cvxopt packages.
C. Scott, ``A Rate of Convergence for Mixture Proportion Estimation, with Application to Learning from Noisy Labels," AISTATS 2015.
Under the hood this code contains a scalable implementation (programmed by Daniel LeJeune) of kernel logistic regression using random Fourier features, which should be useful in a number of other contexts.
E. Cruz Cortes and C. Scott, ``Sparse approximation of a kernel mean."
J. Kim and C. Scott, ``Robust kernel density estimation, Journal of Machine Learning Research, vol. 13, pp. 2529-2565, 2012.
C. Scott, "Calibrated Surrogate Losses for Classification with Label-Dependent Costs," Electronic Journal of Statistics, vol. 6, pp. 958-992, 2012.
G. Lee, W. Finn, and C. Scott, "Statistical file matching of flow cytometry data," J. Biomedical Informatics, vol. 44, no. 4., pp. 663-676, 2011.
G. Lee and C. Scott, ``EM algorithms for multivariate Gaussian mixture models with truncated and censored data," Computational Statistics and Data Analysis, vol. 56, no. 9, pp. 2816-2829, 2012.
G. Lee and C. Scott, ``Nested
support vector machines," to be published
in IEEE Trans. Signal Processing.
G. Lee and C. Scott, ``The one
class support vector
machine solution path," Proc. IEEE International
Conference on Acoustics, Speech, and Signal Processing
(ICASSP 2007), vol. 2, II-521--II-524, Honolulu, USA,
April 2007.
The CS-SVM algorithm is different from the one developed by Bach et al.
in
that we capture the cost asymmetry in a single parameter. This algorithm
first finds the path of the regularization parameter when the cost
asymmetry parameter is set to a specific value (the negative sample size
divided by the total sample size). Then, for any fixed value of the
regularization parameter, it finds the solution path as the cost asymmetry
parameter varies. the first of these two path algorithms is detailed in
the following class project report by Gyemin.
G. Lee, ``The Solution Path
for the Balanced 2C-SVM," EECS 559 Class Project Report,
University of Michigan, Fall 2006. The second path algorithm has no documentation, but follows similar
principles to the other algorithms.
J. Kim and C. Scott, ``$L_2$
kernel classification," IEEE Trans. Pattern Analysis and
Machine Intelligence, vol. 32, no. 10, Oct. 2010, 1822 - 1831.
C. Scott and E. Kolaczyk, ``Nonparametric
assessment of
contamination in multivariate data using generalized quantile
sets and FDR," J. Computational and
Graphical Statistics, June 1, 2010, 19(2): 439-456.
D. Rossell, R. Guerra and C. Scott, ``Semi-parametric
differential expression analysis via partial mixture
estimation," Statistical
Applications in Genetics and Molecular Biology, vol. 7, no. 1,
article 15, 2008.
M. Davenport, R. Baraniuk, and C. Scott, ``Tuning
support vector machines for minimax and Neyman-Pearson
classification," IEEE Trans. Pattern Analysis and Machine
Intelligence, vol. 32, no. 10, Oct. 2010, 1888-1898.
C. Scott and R. Nowak, ``Minimax-optimal
classification
with dyadic decision trees," IEEE Transactions on
Information Theory, vol. 52, no. 4, pp. 1335--1353,
April 2006.
C. Scott and R. Nowak, ``Learning minimum volume
sets," Journal of Machine Learning Research,
vol. 7, pp. 665--704, April 2006.
C. Scott and R. Nowak, ``Robust
contour matching via the
order preserving assignment problem," IEEE
Transactions on Image Processing, vol. 15, no. 7, pp.
1831-1838, July 2006.
This work was supported in part by NSF Awards 0830490 and 0953135.