cancel
Showing results for 
Search instead for 
Did you mean: 

Classification - UK PII

TTree2007
Level 4

Hi Rod

As briefly mentioned, having a lot of msg's being returned running classification for 'UK PII'.  The customer is treating them as "false positives" as it does appear to be practically capturing all email msgs!  

Firstly, can you clear up how the UK PII Policy should trigger the tag please?  I'm reading it as it needs to have...

1) 'Any of' - 2 or more - DOB, email address, UK postal address.

2) 'Any of' - CC, UK Driving License, Electoral Reg, NHS no, Nation Insurance no, passport no, Tax ref, etc...

My understanding is that BOTH these conditions need to be met for it to tag it as "PII" but it appears by reviewing a few tagged files and testing an msg file it does seem to trigger PII with just 1) - e.g. my test msg have 3 email addresses and 2 UK postal addresses.

There was NO elements from 2).

If so, this seems a bit simplistic as practically every email that's been forwarded or replied to has these elements!  I'm hoping you have a simple fix or I'm missing something. 

I haven't put the latest CHF on these environments (it's using base 6.1).

Thanks

1 ACCEPTED SOLUTION

Accepted Solutions

Rod_p1
Level 6
Employee Accredited Certified

We have released the 6.1RP1 / 6.1.1 Rolling Patch which includes the previous hotfixes for the PII information templates and other Veritas Information Classifer (VIC)  enhancements. I suggest you do your modifications and changes on that release for the newest available version.

 

Rod

View solution in original post

4 REPLIES 4

andrew_mcmullan
Level 3

It's 1 OR 2 (not 1 AND 2).  Note the root 'Any of' that joins 1 and 2.

But also note that the email address condition requires 10 or more email addresses.

So a document with 3 email addresses and 2 postal addresses should not get a match.

 

andrew_mcmullan
Level 3

I suggest changing the confidence on the email condition to (very high to very high) instead of the default (high to very high) as this will then require extra keywords to be present intended to identify email addresses used in documents (non email) vs just email addresses in email (too common).

and/or increase the number of email addresses required to 1000. 

Thanks.  I have been playing with confidence level.  We decided to add some further conditions and a reg ex and have it working quite nicely now!

Regards

Rod_p1
Level 6
Employee Accredited Certified

We have released the 6.1RP1 / 6.1.1 Rolling Patch which includes the previous hotfixes for the PII information templates and other Veritas Information Classifer (VIC)  enhancements. I suggest you do your modifications and changes on that release for the newest available version.

 

Rod