|
Overview/Narrative When surveys are taken, the situation can arise where one or more of the persons taking the survey do not actually interview a person, but instead, make up the data. There are a number of reasons why this may happen - e.g. time pressure to complete the results, inconvenient to visit, too far to drive, too late in the day, etc. The industry term for this is "curb stoning", inspired by an image of the surveyor completing the survey while sitting on a curb stone. It is essential to detect when instances of this may be happening, as the data is made up, it can invalidate the survey study results. One means of detecting potential instances of curb stoning is through the application of Benford's Law, which provides an expected distribution for the frequency of first digits. This law has been found to apply to many numerical results, such as lengths of rivers, number of people residing in a household, etc. Audit Objective The objective is to identify potential instances of curb stoning based upon a comparison of the survey data completed compared with that which would be expected, if the data distribution conformed with Benford's Law. Note that some elements of a survey will NOT comply with Benford's law where they are of a predetermined range. However, in this case, all the data elements should conform - i.e. street address number, number of people in the household, and the distance to the nearest church. Audit Procedures / Audit Program 1. Obtain survey data in electronic format, note data elements and state whether they would be expected to conform with Benford's law. Example Data (Excel and Flat File) Excel WorkBook Sheets BenSurvey100 - BenSurvey104 Scripts to do the Data Extraction Description of Analytical Procedures Performed First sort the file provided in Surveyor Sequence Using Ez_R Stats for Excel, choose the menu option Tools | Benford. Screen Shots of Analysis
Screen Shots of Output
Discussion of Results A visual review of the charts below indicates that the survey results obtained by the first three surveyors very closely approximate what would be expected. However, the chart of the results for the fourth surveyor, number 103, are strikingly different. Thus, it is desirable to further review and verify the results obtained by surveyor 103. Alternative Approach - Batch Commands / Scripts The analysis can also be done using a command line: sort /ezs/temp/sort.ezs ben.tab bensort.tab Surveyor then run the benford analysis (lines wrap around) CtrlBrk.exe -e/ezs/temp/ben.ezs -iben.tab -o/ezs/temp/benresultspeople.tab -vNumpeople -bSurveyor -mMAX type=F1
The analysis can also be performed using scripts: options obs=max proc benford data=sortben type=F1; proc benford data=sortben type=F1;
|
![]() |
|
Web Page last updated on 07-18-2007
© EZ-R Stats, LLC 2005-2007 |
© EZ_R Stats
Visit EZ_R Stats on the web at:
www.ezrstats.com