TY - JOUR
T1 - Detrimental network effects in privacy
T2 - A graph-theoretic model for node-based intrusions
AU - Houssiau, Florimond
AU - Sapieżyński, Piotr
AU - Radaelli, Laura
AU - Shmueli, Erez
AU - de Montjoye, Yves Alexandre
N1 - Publisher Copyright:
© 2022 The Author(s)
PY - 2023/1/13
Y1 - 2023/1/13
N2 - Despite proportionality being one of the tenets of data protection laws, we currently lack a robust analytical framework to evaluate the reach of modern data collections and the network effects at play. Here, we propose a graph-theoretic model and notions of node- and edge-observability to quantify the reach of networked data collections. We first prove closed-form expressions for our metrics and quantify the impact of the graph's structure on observability. Second, using our model, we quantify how (1) from 270,000 compromised accounts, Cambridge Analytica collected 68.0M Facebook profiles; (2) from surveilling 0.01% of the nodes in a mobile phone network, a law enforcement agency could observe 18.6% of all communications; and (3) an app installed on 1% of smartphones could monitor the location of half of the London population through close proximity tracing. Better quantifying the reach of data collection mechanisms is essential to evaluate their proportionality.
AB - Despite proportionality being one of the tenets of data protection laws, we currently lack a robust analytical framework to evaluate the reach of modern data collections and the network effects at play. Here, we propose a graph-theoretic model and notions of node- and edge-observability to quantify the reach of networked data collections. We first prove closed-form expressions for our metrics and quantify the impact of the graph's structure on observability. Second, using our model, we quantify how (1) from 270,000 compromised accounts, Cambridge Analytica collected 68.0M Facebook profiles; (2) from surveilling 0.01% of the nodes in a mobile phone network, a law enforcement agency could observe 18.6% of all communications; and (3) an app installed on 1% of smartphones could monitor the location of half of the London population through close proximity tracing. Better quantifying the reach of data collection mechanisms is essential to evaluate their proportionality.
KW - DSML 2: Proof-of-concept: Data science output has been formulated, implemented, and tested for one domain/problem
KW - networks
KW - privacy
KW - social networks
KW - surveillance
UR - http://www.scopus.com/inward/record.url?scp=85146236245&partnerID=8YFLogxK
U2 - 10.1016/j.patter.2022.100662
DO - 10.1016/j.patter.2022.100662
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 36699738
AN - SCOPUS:85146236245
SN - 2666-3899
VL - 4
JO - Patterns
JF - Patterns
IS - 1
M1 - 100662
ER -