sidebar | permalink | keywords | summary |
---|---|---|---|
sidebar |
reference_private_data_categories.html |
personal files, personal data, sensitive personal files, sensitive personal data, categories, health data, ethnic origin, political opinions, gdpr, email address, credit card, ip address, iban, national ID, identifier, sex, criminal, compliance, privacy, |
There are many types of private data that Cloud Compliance can identify in your volumes, Amazon S3 buckets, and databases. See the categories below. |
There are many types of private data that Cloud Compliance can identify in your volumes, Amazon S3 buckets, databases, and OneDrive folders. See the categories below.
Tip
|
If you need Cloud Compliance to identify other private data types, such as additional national ID numbers or healthcare identifiers, email [email protected] with your request. |
The personal data found in files can be general personal data or national identifiers. The third column identifies whether Cloud Compliance uses proximity validation to validate its findings for the identifier.
Note that you can add to the list of personal data that is found in your files if you are scanning a database server. The Data Fusion feature allows you to choose the additional identifiers that Cloud Compliance will look for in its' scans by selecting columns in a database table. See Adding personal data identifiers using Data Fusion for details.
Type | Identifier | Proximity validation? |
---|---|---|
General |
Email address |
No |
Credit card number |
No |
|
IBAN number (International Bank Account Number) |
No |
|
IP address |
No |
|
National Identifiers |
Belgian ID (Numero National) |
Yes |
Brazilian ID (CPF) |
Yes |
|
Bulgarian ID (UCN) |
Yes |
|
California Driver’s License |
Yes |
|
Croatian ID (OIB) |
Yes |
|
Cyprus Tax Identification Number (TIC) |
Yes |
|
Czech/Slovak ID |
Yes |
|
Danish ID (CPR) |
Yes |
|
Dutch ID (BSN) |
Yes |
|
Estonian ID |
Yes |
|
Finnish ID (HETU) |
Yes |
|
French Tax Identification Number (SPI) |
Yes |
|
German Tax Identification Number (Steuerliche Identifikationsnummer) |
Yes |
|
Greek ID |
Yes |
|
Hungarian Tax Identification Number |
Yes |
|
Irish ID (PPS) |
Yes |
|
Israeli ID |
Yes |
|
Italian Tax Identification Number |
Yes |
|
Latvian ID |
Yes |
|
Lithuanian ID |
Yes |
|
Luxembourg ID |
Yes |
|
Maltese ID |
Yes |
|
Polish ID (PESEL) |
Yes |
|
Portuguese Tax Identification Number (NIF) |
Yes |
|
Romanian ID (CNP) |
Yes |
|
Slovenian ID (EMSO) |
Yes |
|
South African ID |
Yes |
|
Spanish Tax Identification Number |
Yes |
|
Swedish ID |
Yes |
|
U.K. ID (NINO) |
Yes |
|
USA Social Security Number (SSN) |
Yes |
The sensitive personal data that Cloud Compliance can find in files includes the following:
- Criminal Procedures Reference
-
Data concerning a natural person’s criminal convictions and offenses.
- Ethnicity Reference
-
Data concerning a natural person’s racial or ethnic origin.
- Health Reference
-
Data concerning a natural person’s health.
- ICD-9-CM Medical Codes
-
Codes used in the medical and health industry.
- ICD-10-CM Medical Codes
-
Codes used in the medical and health industry.
- Philosophical Beliefs Reference
-
Data concerning a natural person’s philosophical beliefs.
- Political Opinions Reference
-
Data concerning a natural person’s political opinions.
- Religious Beliefs Reference
-
Data concerning a natural person’s religious beliefs.
- Sex Life or Orientation Reference
-
Data concerning a natural person’s sex life or sexual orientation.
Cloud Compliance categorizes your data as follows:
- Finance
-
-
Balance Sheets
-
Purchase Orders
-
Invoices
-
Quarterly Reports
-
- HR
-
-
Background Checks
-
Compensation Plans
-
Employee Contracts
-
Employee Reviews
-
Health
-
Resumes
-
- Legal
-
-
NDAs
-
Vendor-Customer contracts
-
- Marketing
-
-
Campaigns
-
Conferences
-
- Operations
-
-
Audit Reports
-
- Sales
-
-
Sales Orders
-
- Services
-
-
RFI
-
RFP
-
SOW
-
Training
-
- Support
-
-
Complaints and Tickets
-
- Metadata categories
-
-
Application Data
-
Archive Files
-
Audio
-
Business Application Data
-
CAD Files
-
Code
-
Database and index files
-
Design Files
-
Email Application Data
-
Executables
-
Financial Application Data
-
Health Application Data
-
Images
-
Logs
-
Miscellaneous Documents
-
Miscellaneous Presentations
-
Miscellaneous Spreadsheets
-
Videos
-
Cloud Compliance scans all files for category and metadata insights and displays all file types in the file types section of the dashboard.
But when Cloud Compliance detects Personal Identifiable Information (PII), or when it performs a DSAR search, only the following file formats are supported:
.PDF, .DOCX, .DOC, .PPTX, .XLS, .XLSX, .CSV, .TXT, .RTF, and .JSON.
NetApp can’t guarantee 100% accuracy of the personal data and sensitive personal data that Cloud Compliance identifies. You should always validate the information by reviewing the data.
Based on our testing, the table below shows the accuracy of the information that Cloud Compliance finds. We break it down by precision and recall:
- Precision
-
The probability that what Cloud Compliance finds has been identified correctly. For example, a precision rate of 90% for personal data means that 9 out of 10 files identified as containing personal information, actually contain personal information. 1 out of 10 files would be a false positive.
- Recall
-
The probability for Cloud Compliance to find what it should. For example, a recall rate of 70% for personal data means that Cloud Compliance can identify 7 out of 10 files that actually contain personal information in your organization. Cloud Compliance would miss 30% of the data and it won’t appear in the dashboard.
Cloud Compliance is in a Controlled Availability release and we are constantly improving the accuracy of our results. Those improvements will be automatically available in future Cloud Compliance releases.
Type | Precision | Recall |
---|---|---|
Personal data - General |
90%-95% |
60%-80% |
Personal data - Country identifiers |
30%-60% |
40%-60% |
Sensitive personal data |
80%-95% |
20%-30% |
Categories |
90%-97% |
60%-80% |