Abstract
We provide a unified framework for characterizing pure and approximate differentially private (DP) learnability. The framework uses the language of graph theory: for a concept class H, we define the contradiction graph G of H. Its vertices are realizable datasets and two datasets S, S′ are connected by an edge if they contradict each other (i.e., there is a point x that is labeled differently in S and S′). Our main finding is that the combinatorial structure of G is deeply related to learning H under DP. Learning H under pure DP is captured by the fractional clique number of G. Learning H under approximate DP is captured by the clique number of G. Consequently, we identify graph-theoretic dimensions that characterize DP learnability: the clique dimension and fractional clique dimension. Along the way, we reveal properties of the contradiction graph which may be of independent interest. We also suggest several open questions and directions for future research.
Original language | English |
---|---|
Pages (from-to) | 94-129 |
Number of pages | 36 |
Journal | Proceedings of Machine Learning Research |
Volume | 247 |
State | Published - 2024 |
Externally published | Yes |
Event | 37th Annual Conference on Learning Theory, COLT 2024 - Edmonton, Canada Duration: 30 Jun 2024 → 3 Jul 2024 |
Funding
Funders | Funder number |
---|---|
European Research Council Executive Agency | |
Technion Center for Machine Learning and Intelligent Systems | |
Engineering Research Centers | |
MLIS | |
European Commission | |
National Science Foundation | DMS-2154082 |
National Science Foundation | |
Iowa Science Foundation | 1225/20 |
Iowa Science Foundation | |
Bloom's Syndrome Foundation | 2018385 |
Bloom's Syndrome Foundation | |
GENERALIZATION | 101039692 |
Keywords
- Chromatic number
- Clique number
- Contradiction graph
- Differential privacy
- Fractional chromatic number
- Fractional clique number
- LP duality
- PAC learning