Common Data Quality Problems

| Data Quality Issue | Sample Data Problem |
| --- | --- |
| Standard: Are data elements consistently defined and understood? | Gender code = M, F, U in one system and gender code = 0, 1, 2 in another system |
| Complete: Is all necessary data present? | 20% of customers' last names are blank; 50% of zip codes are 99999 |
| Accurate: Does the data accurately represent reality or a verifiable source? | A supplier is listed as 'Active' but went out of business six years ago |
| Valid: Do data values fall within acceptable ranges? | Salary values should be between 60,000 and 120,000 |
| Unique: Does the same data appear several times? | Both John Ryan and Jack Ryan appear in the system. Are they the same person? |
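
As a rough illustration of how some of these checks could be automated, here is a minimal Python sketch. The sample records, field names, code set, placeholder zip code, and salary bounds are all assumptions made up for this example and are not taken from any real system. The Accurate dimension is left out because it needs an external, verifiable source to compare against.

```python
from collections import Counter

# Hypothetical sample records; field names and values are illustrative only.
customers = [
    {"first": "John", "last": "Ryan", "gender": "M", "zip": "98052", "salary": 75000},
    {"first": "Jack", "last": "Ryan", "gender": "1", "zip": "99999", "salary": 130000},
    {"first": "Ann", "last": "", "gender": "F", "zip": "10001", "salary": 90000},
]

GENDER_CODES = {"M", "F", "U"}    # assumed standard code set
PLACEHOLDER_ZIPS = {"99999"}      # assumed "unknown" placeholder value
SALARY_RANGE = (60_000, 120_000)  # assumed acceptable range


def check_record(rec):
    """Return the names of the quality dimensions a single record violates."""
    issues = []
    # Standard: the gender code must come from the agreed code set.
    if rec["gender"] not in GENDER_CODES:
        issues.append("standard")
    # Complete: a blank last name or a placeholder zip counts as missing data.
    if not rec["last"].strip() or rec["zip"] in PLACEHOLDER_ZIPS:
        issues.append("complete")
    # Valid: the salary must fall inside the acceptable range.
    low, high = SALARY_RANGE
    if not low <= rec["salary"] <= high:
        issues.append("valid")
    return issues


def possible_duplicates(records):
    """Unique: group by (last name, first initial) as a crude duplicate hint."""
    keys = Counter(
        (r["last"].lower(), r["first"][:1].lower()) for r in records if r["last"]
    )
    return [key for key, count in keys.items() if count > 1]


report = [check_record(r) for r in customers]
duplicates = possible_duplicates(customers)
```

Grouping by last name and first initial only flags candidates such as John Ryan and Jack Ryan for review. Deciding whether they really are the same person still takes a stronger matching rule or a human check.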

DQS…

Original article: https://www.cnblogs.com/aot/p/2260426.html