TY  - JOUR
AU  - Dulhanty, Chris
AU  - Wong, Alexander
PY  - 2020/01/02
Y2  - 2025/07/02
TI  - Investigating the Impact of Inclusion in Face Recognition Training Data
JF  - Journal of Computational Vision and Imaging Systems
JA  - J. Comp. Vis. Imag. Sys.
VL  - 5
IS  - 1
SE  - Articles
DO  - 
UR  - https://openjournals.uwaterloo.ca/index.php/vsl/article/view/1657
SP  - 1
AB  - Modern face recognition systems leverage datasets containing images of hundreds of thousands of individuals’ faces. Recently, there has been significant public scrutiny into the privacy implications of large-scale training datasets such as MS-Celeb-1M, as many people are uncomfortable with their face being used to train dual-use technologies that can enable mass surveillance. However, the impact of an individual’s inclusion in training data on a derived system’s ability to recognize them has not previously been studied. In this work, we audit ArcFace, a state-of-the-art, open-source face recognition system, in a large-scale face identification experiment. We find Rank-1 identification accuracy of 79.71% for individuals present in training data and 75.73% for those not present. These results demonstrate that modern face recognition systems work better for individuals they are trained on, which has serious privacy implications as all large-scale, open-source training datasets do not gather informed consent from individuals during their collection.
ER  -