TransWikia.com

How to identify a field as holding personal identifiable information from the name of the field itself using ML model in python?

Data Science Asked on January 15, 2021

Is it possible to automatically detect fields holding personal information (name, phone, address, SSN, passport, gov ID…) from its names, using python in order to upload datasets into the cloud after encrypting or anonymizing the PII fields?

I am open to do my own model by training it on a dataset that holds thousands of fields and each one is classified whether personal or not. But apparently I can’t find any related datasets.

One Answer

In such cases, where the data is not availible, it will be better to make the dataset on our own. Just create a google form and sent to a few friends and family and hence you will end up with a dataset. Apart from this it you can check this article on anonymizing information on kaggle

Answered by user108025 on January 15, 2021

Add your own answers!

Ask a Question

Get help from others!

© 2024 TransWikia.com. All rights reserved. Sites we Love: PCI Database, UKBizDB, Menu Kuliner, Sharing RPP