Data Analytics

The term "data analytics" generally relates to processes used to inspect data — usually large quantities of data — and to transform that data into useful information. In the case of fraud, data analytics may be used to identify and isolate patterns of fraudulent activity. Data analytics includes a number of techniques, approaches and practices, such as data mining, predictive analytics and business intelligence.

In the fight against fraud, certain types of digital analysis, particularly the application of Benford's Law, described below, can be helpful in identifying fraudulent activities. For example, data analytics may be used to identify fraudulent prescription and money laundering schemes. While a number of commercial tools exist specifically to support data analytics, widely available database and spreadsheet packages can be used for smaller data sets.

Data mining is a powerful tool in the fight against fraud, and advances in information technology have made it more powerful still. It is, in essence, the process of identifying patterns in large data sets, and it incorporates aspects of statistics, database management and artificial intelligence. Given the power of current technology, entire data sets, not merely samples, can be evaluated. Data mining can be said to involve five types of activities: gathering data and establishing relationships between variables in the data; amalgamating data sets by common characteristics; isolating patterns within the data sets; deriving functions that govern the patterns and that can be used predictively; and validating the derived functions.

In the fraud arena, data can be analyzed to see whether it fits one or more known patterns of fraudulent activity. An example is the application of Benford's Law to large data sets. Benford's Law is a form of digital analysis that, among other things, predicts that the initial digits of a naturally occurring set of numbers will not be uniformly distributed: lower digits appear as the first digit more often than higher digits. Otherwise unexplained variances from the predicted distribution of initial digits may be an indication of fraudulent activity. Spatial data mining, as another example, has been used to identify geographic patterns of fraud, such as the pattern of physicians, pharmacies and patients involved in specious drug prescriptions.

While data analytics has existed as long as data, only recent advances in technology, particularly the development of expansive data warehouses, allow vast amounts of data to be mined efficiently and effectively. A number of tools, an inventory that changes almost daily, exist to help management and auditors mine their data in the effort to combat fraud.
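
As an illustration of the Benford's Law check discussed above, the following is a minimal Python sketch, not a definitive implementation. It relies on the standard statement of the law, under which the expected proportion of values whose first digit is d equals log10(1 + 1/d), and it compares that expectation with the proportions observed in a list of amounts. The function names (benford_expected, leading_digit, first_digit_deviation) are hypothetical and chosen only for this example. How large a deviation warrants follow-up is a judgment call that the sketch does not attempt to make.

```python
import math
from collections import Counter


def benford_expected():
    """Expected proportion of each leading digit 1-9 under Benford's Law."""
    return {d: math.log10(1 + 1 / d) for d in range(1, 10)}


def leading_digit(value):
    """Return the first significant digit of a number, or None for zero."""
    # Scientific notation places the first significant digit at position 0;
    # 15 decimal places avoids rounding a value such as 999999.99 up to 1e6.
    digit = int(f"{abs(value):.15e}"[0])
    return digit if digit != 0 else None


def first_digit_deviation(amounts):
    """Observed minus expected proportion for each leading digit 1-9."""
    counts = Counter(
        d for d in (leading_digit(a) for a in amounts) if d is not None
    )
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no non-zero amounts to analyze")
    expected = benford_expected()
    return {d: counts.get(d, 0) / total - expected[d] for d in range(1, 10)}
```

A reviewer might call first_digit_deviation on a column of invoice or claim amounts and look first at the digits with the largest positive deviations. A large deviation is only a prompt for further inquiry: some legitimate data sets, such as amounts clustered around a fixed price or capped by an approval limit, do not follow Benford's Law.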