1. Census Data
2. National Drug Codes
3. Providers excluded by the OIG
4. Bureau of Labor Statistics
5. Made Up Data
Census Data
Certain census data has been obtained from the North Carolina Child Advocacy
Institute. The amount of data and the
number of columns of data is fairly extensive, making it ideal for simulating analyses
using PROC MEANS, PROC UNIVARIATE, PROC OUTLIER, PROC SORT, PROC SQL,
PROC DOMAIN, etc. This is somewhat similar to the type of data that the
IT Auditor can expect to encounter frequently. Data files are available from the
North Carolina Child Advocacy Institute .
National Drug Codes
Manufacturing data for prescription drugs has been included here from the FDA website.
National Drug Code Directory - Center for Drug Evaluation and
Research . The data is available for a variety of drugs and their measure and classification.
Each of the data fields can be linked together by sequence number, hence this provides
a good illustration for loading data, doing database queries which can employ, etc. Data is available from
This data is ideal for the use of PROC SQL to load the data and to run various queries
which are typical of those situations that could be encountered during either a fraud
investigation, IT Audit or marketing analysis.
Providers Excluded by the OIG
The Office of Inspector General (OIG) performs investigations for the Federal Government.
The OIG reviews for both errors and fraud. In certain cases where fraud is
encountered or suspected, a provider of Medicare or Medicaid, if convicted of
criminal activity, may be barred (excluded) from billing for services for a period of time.
This information is available at the website of the OIG - HHS .
For the typical analytical tasks performed by many users, SQLite is more than adequate.
Further details are available at the SQLite site.
Bureau of Labor Statistics
The Bureau of Labor Statistics publishes monthly data which
has been summarized from the data submitted to them by the various States.
This data is also an ideal example of the type data which the IT auditor may
encounter during financial audits, investigations or reviews for fraud. Although
there is an extensive amount of data, this data can be loaded into an
SQLite (or other database) for further investigation. Data is available
from Bureau of Labor Statistics .
Made Up Data
Some of the information used for the testing scripts has simply
been made up. This is especially true for employee expense reports,
gaps in check numbers, etc. Any resemblance with any other data (living or dead)
is purely coincidence.
Comments, suggestions and user experiences are welcomed and can be sent to support AT ezrstats.com.
