Privacy-Preserving Algorithms for Multiple Sensitive Attributes Satisfying t-Closeness

(整期优先)网络出版时间:2018-06-16
/ 1
Althoughk-anonymityisagoodwayofpublishingmicrodataforresearchpurposes,itcannotresistseveralcommonattacks,suchasattributedisclosureandthesimilarityattack.Toresisttheseattacks,manyrefinementsofk-anonymityhavebeenproposedwitht-closenessbeingoneofthestrictestprivacymodels.Whilemostexistingt-closenessmodelsaddressthecaseinwhichtheoriginaldatahaveonlyonesinglesensitiveattribute,datawithmultiplesensitiveattributesaremorecommoninpractice.Inthispaper,wecoverthisgapwithtwoproposedalgorithmsformultiplesensitiveattributesandmakethepublisheddatasatisfyt-closeness.Basedontheobservationthatthevaluesofthesensitiveattributesinanyequivalenceclassmustbeasspreadaspossibleovertheentiredatatomakethepublisheddatasatisfyt-closeness,bothofthealgorithmsusedifferentmethodstopartitionrecordsintogroupsintermsofsensitiveattributes.Oneusesaclusteringmethod,whiletheotherleveragestheprincipalcomponentanalysis.Then,accordingtothesimilarityofquasi-identifierattributes,recordsareselectedfromdifferentgroupstoconstructanequivalenceclass,whichwillreducethelossofinformationasmuchaspossibleduringanonymization.Ourproposedalgorithmsareevaluatedusingarealdataset.Theresultsshowthattheaveragespeedofthefirstproposedalgorithmisslowerthanthatofthesecondproposedalgorithmbuttheformercanpreservemoreoriginalinformation.Inaddition,comparedwithrelatedapproaches,bothproposedalgorithmscanachievestrongerprotectionofprivacyandreduceless.