PII Detection and Reporting

While doing a document integration or synchronization, it may be prudent to check the document or its metadata for PII. You can also crawl your content purely for PII by using our reporting output connector with any repository connector that we support.


Configuring a Job for PII Detection

  1. First, add the PII Detection Job Task to your Job. Help in setting up this Job Task can be found here.
  2. Next, add the Field Mapping for the output PII field, and then set up auditing. You can find information on Auditing Field Mappings here.

Now that your Job is set up, you can run it. All PII detected will be recorded in the auditFields.pii.pii field of the tsRecordProcessed collection in MongoDB.


Reporting

Using mongoexport, we can generate reports and export them to a CSV file.

Mongo Export of ALL records with any PII

mongoexport --db simflofy --collection tsRecordProcessed \
  --fields doc_id,doc_name,auditFields.pii.pii \
  --query '{"auditFields.pii.pii": {"$exists": true}, "$where": "this.auditFields.pii.pii.length > 0"}' \
  --type=csv --out pii.csv
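
A quick sanity check on the export is to count its data rows. The sketch below is an illustrative helper, not part of mongoexport or Simflofy: the name pii_row_count is hypothetical, and it assumes the CSV starts with the single header line that mongoexport writes by default and that no field contains an embedded newline.

```shell
# Hypothetical helper: count the data rows in a mongoexport CSV file.
# Skips the first line, which is assumed to be the header row.
pii_row_count() {
  tail -n +2 "$1" | wc -l | tr -d ' '
}
```

Running pii_row_count pii.csv should then print the number of exported records, under the assumptions above.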

Mongo Export of only those records with Social Security + Name PII

mongoexport --db simflofy --collection tsRecordProcessed \
  --fields doc_id,doc_name,auditFields.pii.pii \
  --query '{"$and": [{"auditFields.pii.pii": "SocialSecurity"}, {"auditFields.pii.pii": "Name"}]}' \
  --type=csv --out ssn_name_pii.csv
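
When the same kind of report is needed for other combinations of PII types, the query can be generated rather than typed by hand. The following is a minimal shell sketch, not part of Simflofy: the function name pii_all_query is hypothetical, and it assumes auditFields.pii.pii holds PII type names such as SocialSecurity and Name, as the query above suggests.

```shell
# Hypothetical helper: print a mongoexport --query value that matches
# records containing ALL of the PII types given as arguments.
# Assumes the type names contain no characters that need JSON escaping.
pii_all_query() {
  if [ "$#" -eq 0 ]; then
    echo "usage: pii_all_query TYPE [TYPE...]" >&2
    return 1
  fi
  clauses=""
  for pii_type in "$@"; do
    # Join clauses with commas; the first clause gets no leading comma.
    clauses="${clauses:+$clauses,}{\"auditFields.pii.pii\":\"$pii_type\"}"
  done
  printf '{"$and":[%s]}\n' "$clauses"
}
```

For example, passing --query "$(pii_all_query SocialSecurity Name)" to mongoexport should produce the same filter as the hand-written query for Social Security + Name.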

Related Articles:
Discovery Analytics and Reporting
Simflofy Integration Jobs
Integration Job Tasks
Reporting with Apache Zeppelin and MongoDB Compass