KDnuggets : News : 2003 : n03 : item11

Courses

From: Connie Lester
Date: 20 Jan 2003
Subject: New Data Cleaning Techniques course coming to San Francisco, Mar 3-4

Register now for the March 3-4 course in San Francisco, CA.

SAS has teamed with Ron Cody, author of Cody's Data Cleaning Techniques Using SAS Software, to offer Data Cleaning Techniques, a new course based on his book. Course details: www.sas.com/service/edu/courses/bdct.html

This two-day course provides participants with techniques to find data errors in raw data or SAS data sets. These techniques involve using DATA step programming and SAS procedures such as FREQ, MEANS, and UNIVARIATE to detect invalid values for categorical variables or invalid numeric values, based on user-defined criteria or automatic outlier detection. Programs that look for subject IDs in multiple files and that detect duplicate records are also described. Content covered in the course includes:

  • Making sure that the raw data were accurately entered into a computer-readable file.
  • Checking that character variables contain only valid values.
  • Checking that numeric values are within predetermined ranges.
  • Checking for missing values in variables where complete data are necessary.
  • Checking for and eliminating duplicate data entries.
  • Checking for uniqueness of certain values, such as patient IDs.
  • Checking for invalid date values.
  • Checking that an ID number is present in each of "n" files.
  • Verifying that more complex multi-file rules have been followed.
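
Several of the checks listed above can be illustrated with a minimal sketch. This is not SAS code and is not taken from the course or the book; it uses plain Python only to show the shape of the checks. The sample records, the field names (id, gender, heart_rate, visit), the valid gender codes, the 40-200 heart rate range, and the date format are all assumptions made up for illustration.

```python
from collections import Counter
from datetime import datetime

# Hypothetical sample records; field names and values are illustrative only.
patients = [
    {"id": "001", "gender": "M", "heart_rate": "72", "visit": "2003-01-15"},
    {"id": "002", "gender": "X", "heart_rate": "410", "visit": "2003-02-30"},
    {"id": "003", "gender": "F", "heart_rate": "", "visit": "2003-01-20"},
    {"id": "001", "gender": "M", "heart_rate": "72", "visit": "2003-01-15"},
]
labs = [{"id": "001"}, {"id": "003"}, {"id": "004"}]

VALID_GENDERS = {"M", "F"}   # assumed list of valid category codes
HR_RANGE = (40, 200)         # assumed user-defined numeric range


def is_valid_date(text, fmt="%Y-%m-%d"):
    """Return True if text parses as a real calendar date in the given format."""
    try:
        datetime.strptime(text, fmt)
        return True
    except ValueError:
        return False


def find_errors(records):
    """Return (row number, field, problem) tuples for invalid field values."""
    errors = []
    for row_number, rec in enumerate(records, start=1):
        if rec["gender"] not in VALID_GENDERS:
            errors.append((row_number, "gender", "invalid value"))
        if rec["heart_rate"] == "":
            errors.append((row_number, "heart_rate", "missing"))
        elif not HR_RANGE[0] <= float(rec["heart_rate"]) <= HR_RANGE[1]:
            errors.append((row_number, "heart_rate", "out of range"))
        if not is_valid_date(rec["visit"]):
            errors.append((row_number, "visit", "invalid date"))
    return errors


def duplicate_ids(records):
    """Return the sorted IDs that appear on more than one record."""
    counts = Counter(rec["id"] for rec in records)
    return sorted(i for i, n in counts.items() if n > 1)


def ids_missing_from(files):
    """For each named file, list the IDs seen in any file but absent from it."""
    ids_by_file = {name: {r["id"] for r in recs} for name, recs in files.items()}
    all_ids = set().union(*ids_by_file.values())
    return {name: sorted(all_ids - ids) for name, ids in ids_by_file.items()}


if __name__ == "__main__":
    for error in find_errors(patients):
        print(error)
    print("duplicate IDs:", duplicate_ids(patients))
    print("missing IDs:", ids_missing_from({"patients": patients, "labs": labs}))
```

Each function reports problems rather than correcting them, so the person cleaning the data decides what to fix. Records are numbered from 1 so that a reported row can be traced back to the raw file.
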
For a complete listing of all Business Knowledge Series Program courses, visit http://www.sas.com/bks or contact us via e-mail at training@sas.com.



Copyright © 2003 KDnuggets.