Proc summary can be used to obtain basic statistics about variables in a file. These statistics include the sum, minimum value, maximum value and record count for each variable. Output is to tab separated file which can then be processed further, if desired.
Use Excel for auditing and analysis ? Why not try our Analytic software for the the most common functions - Benford's Law, outliers, univariate statistics, CMA sampling, interval sampling, sample size calculation, stratification, classification, holiday date checking, duplicates, gaps, analytic review, trend analysis, credit card validation (luhn-10) etc. This desktop version operates from within Excel (an add-in). Want to learn more? Visit the Analytics home page (or view the link Operation Guide for a full description and screen shots).
Only a single variable is summarized. However, up to five sort columns may be specified, in order to obtain statistics at the control break level, i.e. “BY” values. Note that the file must be sorted in the BY variable sequence before proc summary can be run.
The many uses for PROC SUMMARY include verifying file totals, obtaining basic population statistics, identifying sampling parameters, establishing control totals, etc.
Sample screenshot

Syntax
proc summary data=INPUTFILE ;
out = OUTPUTFILE;
var VARNAME;
by BYLIST;
Parameters Used
There are four parameters for the SUMMARY procedure:
INPUTFILE - the name of the file to be analyzed
OUTPUTFILE - filename where the results are to be stored
VARNAME - name of the variable to be summarized
by BYLIST;
Example Script
*;
* summary1.ezs;
* summarize data with proc summary;
* analyze both invoice counts and claim amounts;
libname test '{%libin}';
libname vend '{%libout}';
options obs=5000;
proc summary data=test.vndspk1 ;
var total;
by vendor;
out = vend.sum1;
run;
The Proc Summary procedure is part of EZ-R Stats for Windows. View screen shots of the system.
Analysts interested in Proc Summary may also be interested in Proc Means, Proc Outlier, Proc Benford and Proc Univariate.
There is also a web analytics version which can be run directly from the Internet using Excel or other data source (in tab separated value format). View Web Analytics details. All software is provided at no cost.

