You don't need to use the 'language is' condition as well as the text for matching content - just the text condition is sufficient.You would only use 'language is' if you want to restrict results to documents whose primary language is Japanese. It's...
I suggest changing the confidence on the email condition to (very high to very high) instead of the default (high to very high) as this will then require extra keywords to be present intended to identify email addresses used in documents (non email) ...
It's 1 OR 2 (not 1 AND 2). Note the root 'Any of' that joins 1 and 2.But also note that the email address condition requires 10 or more email addresses.So a document with 3 email addresses and 2 postal addresses should not get a match.
I don't think you can get the confidence detected for files classified by DI, but you can see it if you test a sample file through the VIC UI.The PCI/DSS policy is slightly unusual in that it requires only medium confidence for the Credit/Debit card ...