Data

1. Census Data

2. National Drug Codes

3. Providers excluded by the OIG

4. Bureau of Labor Statistics

5. Made Up Data

Census Data

Certain census data has been obtained from the North Carolina Child Advocacy Institute. The amount of data and the number of columns of data is fairly extensive, making it ideal for simulating analyses using PROC MEANS, PROC UNIVARIATE, PROC OUTLIER, PROC SORT, PROC SQL, PROC DOMAIN, etc. This is somewhat similar to the type of data that the IT Auditor can expect to encounter frequently. Data files are available from the North Carolina Child Advocacy Institute .

National Drug Codes

Manufacturing data for prescription drugs has been included here from the FDA website. National Drug Code Directory - Center for Drug Evaluation and Research . The data is available for a variety of drugs and their measure and classification. Each of the data fields can be linked together by sequence number, hence this provides a good illustration for loading data, doing database queries which can employ, etc. Data is available from This data is ideal for the use of PROC SQL to load the data and to run various queries which are typical of those situations that could be encountered during either a fraud investigation, IT Audit or marketing analysis.

Providers Excluded by the OIG

The Office of Inspector General (OIG) performs investigations for the Federal Government. The OIG reviews for both errors and fraud. In certain cases where fraud is encountered or suspected, a provider of Medicare or Medicaid, if convicted of criminal activity, may be barred (excluded) from billing for services for a period of time. This information is available at the website of the OIG - HHS . For the typical analytical tasks performed by many users, SQLite is more than adequate. Further details are available at the SQLite site.

Bureau of Labor Statistics

The Bureau of Labor Statistics publishes monthly data which has been summarized from the data submitted to them by the various States. This data is also an ideal example of the type data which the IT auditor may encounter during financial audits, investigations or reviews for fraud. Although there is an extensive amount of data, this data can be loaded into an SQLite (or other database) for further investigation. Data is available from Bureau of Labor Statistics .

Made Up Data

Some of the information used for the testing scripts has simply been made up. This is especially true for employee expense reports, gaps in check numbers, etc. Any resemblance with any other data (living or dead) is purely coincidence.

Comments, suggestions and user experiences are welcomed and can be sent to support AT ezrstats.com.


Web Page last updated on 03-15-2007
© EZ-R Stats, LLC 2005-2007

© EZ_R Stats

Visit EZ_R Stats on the web at:

www.ezrstats.com