Stability
There is a close connection in ML and statistics between stability under perturbation and generalization to unseen data.
Literature
Statistics
- Meinshausen & Bühlmann, 2010: Stability selection (doi, arxiv)
- Yu, 2013: Stability (doi, arxiv, pdf)
- Start here: Nice overview and review of literature
- Lim & Yu, 2016: Estimation stability with cross-validation (doi, arxiv)
- Sec 2.7: Stability of (unstable in high dim. with collinearity) vs (more stable)
Machine Learning
- Bousquet & Elisseeff, 2002: Stability and generalization (pdf)
- Mukherjee et al, 2006: Learning theory: stability is sufficient for generalization and necessary and sufficient for consistency of empirical risk minimization (doi, preprint )
- Shalev-Shwartz, Shamir, Srebro, Sridharan, 2010: Learnability, stability, and
uniform convergence (pdf)
- Textbook treatment in Shalev-Shwartz & Ben-David, 2014, Understanding Machine Learning, Ch. 13: Regularization and stability
- Important reference for (Dwork et al, 2015) (arxiv)
- Xu et al, 2012: Sparse algorithms are not stable (doi, pdf)
- Thakurta & Smith, 2013: Differentially private feature selection via stability
arguments, and the robustness of the lasso (pdf)
- Textbook treatment in Dwork & Roth, 2014, Algorithmic Foundations of Differential Privacy, Ch. 7: When worst-case sensitivity is atypical
- Motivated by differential privacy, but provides independently interesting sufficient conditions for stability of lasso
- Sec 1.2: Nice survey of literature on stability