Conformalization of Sparse Generalized Linear Models
In collaboration with Georgia Institute of Technology
AuthorsEtash Kumar Guha, Eugene Ndiaye, Xiaoming Huo
Conformalization of Sparse Generalized Linear Models
In collaboration with Georgia Institute of Technology
AuthorsEtash Kumar Guha, Eugene Ndiaye, Xiaoming Huo
Given a sequence of observable variables , the conformal prediction method estimates a confidence set for given that is valid for any finite sample size by merely assuming that the joint distribution of the data is permutation invariant. Although attractive, computing such a set is computationally infeasible in most regression problems. Indeed, in these cases, the unknown variable can take an infinite number of possible candidate values, and generating conformal sets requires retraining a predictive model for each candidate. In this paper, we focus on a sparse linear model with only a subset of variables for prediction and use numerical continuation techniques to approximate the solution path efficiently. The critical property we exploit is that the set of selected variables is invariant under a small perturbation of the input data. Therefore, it is sufficient to enumerate and refit the model only at the change points of the set of active features and smoothly interpolate the rest of the solution via a Predictor-Corrector mechanism. We show how our path-following algorithm accurately approximates conformal prediction sets and illustrate its performance using synthetic and real data examples.
Multivariate Conformal Prediction using Optimal Transport
January 12, 2026research area Methods and Algorithms
Conformal prediction (CP) quantifies the uncertainty of machine learning models by constructing sets of plausible outputs. These sets are constructed by leveraging a so-called conformity score, a quantity computed using the input point of interest, a prediction model, and past observations. CP sets are then obtained by evaluating the conformity score of all possible outputs, and selecting them according to the rank of their scores. Due to this…
Conformal Prediction via Regression-as-Classification
May 3, 2024research area Methods and Algorithmsconference ICLR
Conformal prediction (CP) for regression can be challenging, especially when the output distribution is heteroscedastic, multimodal, or skewed. Some of the issues can be addressed by estimating a distribution over the output, but in reality, such approaches can be sensitive to estimation error and yield unstable intervals. Here, we circumvent the challenges by converting regression to a classification problem and then use CP for classification to…