This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder. |
If you use code found here in a publication or presentation, I would appreciate it if you would acknowledge the source.
Click the link to download.
G. Blanchard, A. Deshmukh, U. Dogan, G. Lee, and C. Scott, ``Domain Generalization by Marginal Transfer Learning."
H. Ramaswamy, C. Scott, and A. Tewari, "Mixture Proportion Estimation via Kernel Embedding of Distributions," avXiv:1603.02501.
The code is in python 2.7 and requires the scipy, numpy, matplotlib, and cvxopt packages.
C. Scott, ``A Rate of Convergence for Mixture Proportion Estimation, with Application to Learning from Noisy Labels," AISTATS 2015.
Under the hood this code contains a scalable implementation (programmed by Daniel LeJeune) of kernel logistic regression using random Fourier features, which should be useful in a number of other contexts.
E. Cruz Cortes and C. Scott, ``Sparse approximation of a kernel mean."
J. Kim and C. Scott, ``Robust kernel density estimation, Journal of Machine Learning Research, vol. 13, pp. 2529-2565, 2012.
C. Scott, "Calibrated Surrogate Losses for Classification with Label-Dependent Costs," Electronic Journal of Statistics, vol. 6, pp. 958-992, 2012.
G. Lee, W. Finn, and C. Scott, "Statistical file matching of flow cytometry data," J. Biomedical Informatics, vol. 44, no. 4., pp. 663-676, 2011.
G. Lee and C. Scott, ``EM algorithms for multivariate Gaussian mixture models with truncated and censored data," Computational Statistics and Data Analysis, vol. 56, no. 9, pp. 2816-2829, 2012.
G. Lee and C. Scott, ``Nested support vector machines," to be published in IEEE Trans. Signal Processing.
G. Lee and C. Scott, ``The one class support vector machine solution path," Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2007), vol. 2, II-521--II-524, Honolulu, USA, April 2007.
The CS-SVM algorithm is different from the one developed by Bach et al. in that we capture the cost asymmetry in a single parameter. This algorithm first finds the path of the regularization parameter when the cost asymmetry parameter is set to a specific value (the negative sample size divided by the total sample size). Then, for any fixed value of the regularization parameter, it finds the solution path as the cost asymmetry parameter varies. the first of these two path algorithms is detailed in the following class project report by Gyemin.
G. Lee, ``The Solution Path for the Balanced 2C-SVM," EECS 559 Class Project Report, University of Michigan, Fall 2006.
The second path algorithm has no documentation, but follows similar principles to the other algorithms.
J. Kim and C. Scott, ``$L_2$ kernel classification," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 10, Oct. 2010, 1822 - 1831.
C. Scott and E. Kolaczyk, ``Nonparametric assessment of contamination in multivariate data using generalized quantile sets and FDR," J. Computational and Graphical Statistics, June 1, 2010, 19(2): 439-456.
D. Rossell, R. Guerra and C. Scott, ``Semi-parametric differential expression analysis via partial mixture estimation," Statistical Applications in Genetics and Molecular Biology, vol. 7, no. 1, article 15, 2008.
M. Davenport, R. Baraniuk, and C. Scott, ``Tuning support vector machines for minimax and Neyman-Pearson classification," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 10, Oct. 2010, 1888-1898.
C. Scott and R. Nowak, ``Minimax-optimal classification with dyadic decision trees," IEEE Transactions on Information Theory, vol. 52, no. 4, pp. 1335--1353, April 2006.
C. Scott and R. Nowak, ``Learning minimum volume sets," Journal of Machine Learning Research, vol. 7, pp. 665--704, April 2006.
C. Scott and R. Nowak, ``Robust contour matching via the order preserving assignment problem," IEEE Transactions on Image Processing, vol. 15, no. 7, pp. 1831-1838, July 2006.
This work was supported in part by NSF Awards 0830490 and 0953135.