Guide to Best Practices for Research Data Anonymization

Authors

Caterina Groposo Pavão, PPGCIN UFRGS; Letícia Guarany Bonetti, Ibict; Marcel Garcia de Souza, Ibict; Rene Faustino Gabriel Junior, Universidade Federal do Rio Grande do Sul; Samile Andrea de Souza Vanz, Universidade Federal do Rio Grande do Sul; Tatyane Guedes Martins da Silva, Ibict; Washington Luís Ribeiro de Carvalho Segundo, Ibict

Keywords:

Data anonymization, Research data, Open Science, Personal data protection

Synopsis

This Guide explains how to anonymize research data. It aims to help researchers understand the meaning and importance of anonymization, presents techniques for protecting personal information, and contributes to Open Science in compliance with Brazil’s General Personal Data Protection Law (LGPD), Federal Law No. 13,709/2018. The LGPD was published on August 14, 2018, and came into force on September 18, 2020, from which date compliance became mandatory throughout the national territory (Brasil, 2018). The law bears on open research data because such data may contain personal data that identifies individuals.

Author Biographies

Caterina Groposo Pavão, PPGCIN UFRGS

Bachelor’s degree in Library and Information Science from the Federal University of Rio Grande do Sul, with a master’s degree and a PhD in Communication and Information from the Graduate Program in Communication and Information (PPGCOM UFRGS), including a doctoral sandwich period at the Complutense University of Madrid. Professor in the Department of Information Science at the School of Library and Information Science and Communication of the Federal University of Rio Grande do Sul, and in the Graduate Program in Information Science (PPGCIN UFRGS). Member of the UFRGS Research Group on Scholarly Communication and of the Center for Studies in Science, Innovation and Technology (NECIT).

Letícia Guarany Bonetti, Ibict

Librarian with a degree from the University of Brasília (2019) and a Master’s degree in Information Science from UFSCar (2023). PhD candidate in the Graduate Program in Information Science at the Brazilian Institute of Information in Science and Technology (Ibict). Works as a Technologist at Ibict, developing services in the area of Open Science and data repositories. Deputy Coordinator of the Brazilian Network of Digital Repositories (RBRD). Conducts research in Information Science, with a focus on research data management, repositories, metadata, and FAIR principles. Held a scholarship from the São Paulo Research Foundation (FAPESP) during her master’s studies.

Marcel Garcia de Souza, Ibict

PhD candidate in Information Science at the Brazilian Institute of Information in Science and Technology (Ibict). Holds a Master’s degree in Science Education from the Federal University of Rio Grande do Sul (2016) and a degree in Psychology from the Catholic University of Brasília (2005). Federal public servant and Science and Technology Analyst at Ibict, where he serves as Coordinator of Scientific Information Processing, Analysis, and Dissemination and coordinates applied research focused on Information Science, Open Science, Information for Sustainability, and Technological Information.

Rene Faustino Gabriel Junior, Universidade Federal do Rio Grande do Sul

Bachelor’s degree in Library Science and Documentation from the Pontifical Catholic University of Paraná (2008), master’s degree in Information Science, Management and Technology from the Federal University of Paraná (2011), and PhD in Information Science from São Paulo State University Júlio de Mesquita Filho (2014). Currently an Adjunct Professor at the Federal University of Rio Grande do Sul. Has experience in the field of Information Science, with an emphasis on Library Science, working mainly on the following topics: bibliometrics, BRAPCI, information science, scholarly communication, and scientific output. Established and coordinates the Base de Dados de Periódicos em Ciência da Informação (BRAPCI). Member of the UFRGS Research Group on Scholarly Communication and of the Center for Studies in Science, Innovation and Technology (NECIT).

Samile Andrea de Souza Vanz, Universidade Federal do Rio Grande do Sul

Full Professor in the Department of Information Sciences, in the Graduate Program in Communication (PPGCOM UFRGS), and in the Graduate Program in Information Science at the Federal University of Rio Grande do Sul (PPGCIN UFRGS). Holds a degree in Library and Information Science from the Federal University of Rio Grande do Sul (1999), and a Master’s and PhD in Communication and Information from PPGCOM UFRGS (2004 and 2009), including a doctoral sandwich period at Dalian University of Technology (China, 2007-2008). Completed postdoctoral studies at Universidad Carlos III de Madrid (Spain, 2016). Conducts research in the field of Scholarly Communication, with emphasis on the production of scientific indicators, bibliometrics, scientific collaboration, citation analysis, co-citation analysis, and university rankings. Has academic and professional experience in planning, management, and library architecture.

Tatyane Guedes Martins da Silva, Ibict

Bachelor’s degree in Library and Information Science from the University of Brasília (2019). Has experience in the field of Information Science, with an emphasis on Library and Information Science. Was a scholarship holder in the Institutional Capacity Building Program at the Brazilian Institute of Information in Science and Technology (Ibict), developing services in the area of Open Science and data repositories. Main areas of interest include free software for libraries, Open Science, open data, research data management, repositories, metadata, and FAIR principles.

Washington Luís Ribeiro de Carvalho Segundo, Ibict

PhD in Computer Science from the University of Brasília (UnB), including a sandwich period at King’s College London, and a Master’s degree in the same field from UnB. He also holds both a bachelor’s degree and a teaching degree in Mathematics from the same institution. He currently serves as General Coordinator of Scientific and Technological Information at the Brazilian Institute of Information in Science and Technology (Ibict), where he leads projects focused on Open Science, digital repositories, systems interoperability, and scientific data management. He is a Permanent Faculty Member of the Graduate Program in Information Science at Ibict. His contributions at the Institute include coordinating Oasisbr and the Brazilian Digital Library of Theses and Dissertations (BDTD). He also leads efforts related to Rede dARK and has worked on the development of BrCris and the Laguna project.

References

A GUIDE to Confidentiality in Health and Social Care. London: NHS England, 2013. Available at: https://digital.nhs.uk/data-and-information/looking-after-information/data-security-and-information-governance/codes-of-practice-for-handling-information-in-health-and-care/a-guide-to-confidentiality-in-health-and-social-care/a-guide-to-confidentiality. Accessed: 8 Mar. 2026.

ALBAGLI, S.; MACIEL, M. L.; ABDO, A. H. (ed.). Ciência aberta, questões abertas. Brasília: Ibict; Rio de Janeiro: Unirio, 2015. Available at: https://livroaberto.Ibict.br/bitstream/1/1060/1/Ciencia%20aberta_questoes%20abertas_PORTUGUES_DIGITAL%20(5).pdf. Accessed: 10 Jul. 2025.

BRASIL. Lei nº 10.406, de 10 de janeiro de 2002. Institui o Código Civil. Diário Oficial da União: seção 1, Brasília, DF, 11 Jan. 2002.

BRASIL. Lei nº 13.709, de 14 de agosto de 2018. Lei Geral de Proteção de Dados Pessoais (LGPD). Brasília: Presidência da República, 2018. Available at: https://www.planalto.gov.br/ccivil_03/_ato2015-2018/2018/lei/l13709.htm. Accessed: 17 Dec. 2024.

CURTY, R. Abordagens de reuso e a questão da reusabilidade dos dados científicos. Liinc em Revista, Rio de Janeiro, v. 15, n. 2, 2019. Available at: https://doi.org/10.18617/liinc.v15i2.4777. Accessed: 5 Jul. 2025.

GABRIEL JÚNIOR, R. F. et al. Acesso aberto a dados de pesquisa no Brasil: mapeamento de repositórios, práticas e percepções dos pesquisadores e tecnologias. Ciência da Informação, Brasília, DF, v. 48, n. 3, p. 87-101, Sep./Dec. 2019. Supplement. Available at: https://lume.ufrgs.br/handle/10183/212266. Accessed: 7 Jul. 2025.

GIOUROUKOU, K. et al. Rethinking privacy in medical imaging AI: from metadata and pixel-level identification risks to federated learning and synthetic data challenges. Radiology Artificial Intelligence, v. 8, n. 1, 2025. DOI: 10.1148/ryai.250273. Available at: https://pubmed.ncbi.nlm.nih.gov/41295085/. Accessed: 1 Mar. 2026.

GUEDES, M. S.; MACHADO, D. C.; COSTA, A. F. J. Estudo técnico sobre anonimização de dados na LGPD: uma visão de processo baseado em risco e técnicas computacionais versão 1.0. Brasília: ANPD, 2023. Available at: https://www.gov.br/anpd/pt-br/centrais-de-conteudo/documentos-tecnicos-orientativos/estudo_tecnico_sobre_anonimizacao_de_dados_na_lgpd_uma_visao_de_processo_baseado_em_risco_e_tecnicas_computacionais.pdf. Accessed: 7 Jul. 2025.

HURD, J. M. The transformation of scientific communication: A model for 2020. Journal of the American Society for Information Science, v. 51, n. 14, p. 1279-1283, 2000. Available at: http://www.ou.edu/ap/lis5703/sessions/hurd.pdf. Accessed: 20 Aug. 2025.

INTERNATIONAL HOUSEHOLD SURVEY NETWORK. Anonymization Principles. [2025]. Available at: https://ihsn.org/. Accessed: 7 Aug. 2025.

LAROBINA, M.; MURINO, L. Medical image file formats. Journal of Digital Imaging, v. 27, p. 200-206, 2014. DOI: 10.1007/s10278-013-9657-9. Available at: https://pubmed.ncbi.nlm.nih.gov/24338090/. Accessed: 3 Mar. 2026.

LI, N.; LI, T.; VENKATASUBRAMANIAN, S. t-Closeness: privacy beyond k-anonymity and ℓ-diversity. In: IEEE 23rd International Conference on Data Engineering. Proceedings… Istanbul: IEEE, 2007. p. 106-115. DOI: 10.1109/ICDE.2007.367856. Available at: https://ieeexplore.ieee.org/document/4221659. Accessed: 3 Mar. 2026.

MADELEINE, S. Ferramentas gratuitas para anonimização de imagens médicas. Blog IMAIOS. 2022. Available at: https://www.imaios.com/br/recursos/blog/5-melhores-ferramentas-de-desidentificacao-dicom. Accessed: 3 Mar. 2026.

MEADOWS, A. J. A comunicação científica. Brasília: Briquet de Lemos, 1999.

MEDEN, B. et al. K-Same-Net: K-Anonymity with generative deep neural networks for face deidentification. Entropy, v. 20, n. 1, 2018. DOI: 10.3390/e20010060. Available at: https://www.mdpi.com/1099-4300/20/1/60. Accessed: 5 Mar. 2026.

MENG, L.; SHENOY, A. Retaining expression on De-identified faces. In: Speech and Computer: Lecture Notes in Computer Science book series. LNCS, v. 10458, p. 651-661, 2017. DOI: 10.1007/978-3-319-66429-3_65. Available at: https://uhra.herts.ac.uk/id/eprint/14040/. Accessed: 3 Mar. 2026.

NEWTON, E.; SWEENEY, L.; MALIN, B. Preserving Privacy by De-identifying Facial Images. IEEE Transactions on Knowledge and Data Engineering, 2005. Available at: https://www.researchgate.net/publication/3297373_Preserving_privacy_by_de-identifying_face_images. Accessed: 7 Apr. 2025.

PERSONAL DATA PROTECTION COMMISSION SINGAPORE. Guide to basic data anonymisation techniques. Singapore: PDPCS, 2018. Available at: https://www.pdpc.gov.sg/-/media/Files/PDPC/PDF-Files/Other-Guides/Guide-to-Anonymisation_v1-(250118).pdf. Accessed: 7 Apr. 2025.

RACHEL, B. et al. Medical imaging privacy: a systematic scoping review of key parameters in dataset construction and data protection. Journal of Medical Imaging and Radiation Sciences, v. 56, n. 5, 2025. DOI: 10.1016/j.jmir.2025.101914. Available at: https://pubmed-ncbi-nlm-nih-gov.ez45.periodicos.capes.gov.br/40288182/. Accessed: 5 Mar. 2026.

REMPE, M. et al. De-identification of medical imaging data: a comprehensive tool for ensuring patient privacy. European Radiology, v. 5, n. 12, 2025. DOI: 10.1007/s00330-025-11695-x. Available at: https://arxiv.org/pdf/2410.12402. Accessed: 5 Mar. 2026.

SANCHEZ, F. A.; VIDOTTI, S. A. B. G.; VECHIATO, F. L. A contribuição da curadoria digital em repositórios digitais. Revista Informação na Sociedade Contemporânea, Natal, p. 11-17, 2017. Special issue. Available at: https://periodicos.ufrn.br/informacao/article/download/12280/8508. Accessed: 26 Sep. 2023.

SKLOOT, R. A vida imortal de Henrietta Lacks. São Paulo: Companhia das Letras, 2011. 454 p.

SWEENEY, L. k-anonymity: a model for protecting privacy. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, v. 10, n. 5, p. 557-570, 2002. Available at: https://epic.org/wp-content/uploads/privacy/reidentification/Sweeney_Article.pdf. Accessed: 3 Mar. 2026.

TARGINO, M. G. Comunicação científica: uma revisão de seus elementos básicos. Informação e Sociedade Estudos, João Pessoa, v. 10, n. 2, p. 37-85, 2000. Available at: https://periodicos.ufpb.br/ojs/index.php/ies/article/view/326. Accessed: 7 Jul. 2025.

UNIÃO EUROPEIA. Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016 on the protection of natural persons with regard to the processing of personal data and on the free movement of such data, and repealing Directive 95/46/EC. União Europeia, 2016. Available at: http://data.europa.eu/eli/reg/2016/679/oj. Accessed: 8 Mar. 2026.

VARGAS, A. G. et al. Tratamento de dados pessoais para fins acadêmicos e para a realização de estudos e pesquisas: guia orientativo. Brasília: ANPD, 2023. 58 p. Available at: https://www.gov.br/anpd/pt-br/centrais-de-conteudo/materiais-educativos-e-publicacoes/web-guia-anpd-tratamento-de-dados-para-fins-academicos.pdf. Accessed: 3 Mar. 2026.

YANMING, Z. et al. Privacy-Preserving in Medical Image Analysis: A Review of Methods and Applications. In: Li, Y., Zhang, Y., Xu, J. (ed.) Parallel and distributed computing, applications and technologies. PDCAT 2024. Lecture Notes in Computer Science, v. 15502. Springer, Singapore. DOI: 10.1007/978-981-96-4207-6_15. Available at: https://link.springer.com/chapter/10.1007/978-981-96-4207-6_15. Accessed: 2 Mar. 2026.

Published

April 15, 2026

License

This work is licensed under a Creative Commons Attribution 4.0 International License.