Challenge
- Urgent need to isolate data sets containing Personally Identifying Information (PII) and Protected Health Information (PHI)
- Unforgiving deadlines for legally-required notification of individuals with PII/PHI potentially exposed in the attack
- Project required identifying and eliminating redundant material concerning PII/PHI of individuals and entities
- Requirement to limit costs, from initial data analysis through completion of a ‘clean’ spreadsheet without redundant material (e.g., duplicative entries for an individual because multiple datasets contained their PII/PHI)
Solution
- Extensive data mining process to identify PII and PHI using Canopy’s artificial intelligence and machine learning capabilities, its built-in, data-driven regular expression searches, and Elevate’s searches tailored to project-specific requirements
- Creation of custom coding layout that increased the efficiency of Canopy’s automation tools (e.g., its mapping feature to automatically capture dense PII and PHI in spreadsheets, charts, and tables within minutes)
- Comprehensive, optimised processes for consolidating relevant information about each individual across all sources into a single notification list entry that preserved information on the sources of any duplicative material