KDnuggets 99:07, item 4, Publications:

Date: Fri, 12 Mar 1999 14:01:34 -0800
From: Brent Emerson, bemerson@mkp.com
Subject: New Book: Data Preparation For Data Mining, by Dorian Pyle

DATA PREPARATION FOR DATA MINING
by Dorian Pyle
Morgan Kaufmann Publishers, March 1999, ISBN 1-55860-529-0

Data Preparation for Data Mining addresses an issue unfortunately
ignored by most authorities on data mining: data preparation. Thanks
largely to its perceived difficulty, data preparation has
traditionally taken a backseat to the more alluring question of how
best to extract meaningful knowledge. But without adequate preparation
of your data, the return on the resources invested in mining is
certain to be disappointing.

Dorian Pyle corrects this imbalance. A twenty-five-year veteran of
what has become the data mining industry, Pyle shares his own
successful data preparation methodology, offering both a conceptual
overview for managers and complete technical details for IT
professionals. Apply his techniques and watch your mining efforts pay
off in the form of improved performance, reduced distortion, and more
valuable results.

 -- Offers in-depth coverage of an essential but largely ignored subject. 
 -- Goes far beyond theory, leading you, step by step, through the author's own data preparation techniques.
 -- Provides practical illustrations of the author's methodology using realistic sample data sets. 
 -- Includes algorithms you can apply directly to your own project, along with instructions for understanding when automation is possible and when greater intervention is required. 
 -- Explains how to identify and correct data problems that may be present in your application. 
 -- Helps miners head into preparation with a better understanding of data sets and their limitations.

CONTENTS

Preface -- Introduction -- Data Exploration As a Process -- The Nature
of the World and Its Impact on Data Preparation -- Data Preparation as
a Process -- Getting the Data: Basic Preparation -- Sampling,
Variability and Confidence -- Handling Non-Numerical Variables --
Normalizing and Redistributing Variables -- Replacing Missing and
Empty Values -- Series Variables -- Preparing the Data Set -- The Data
Survey -- Using Prepared Data -- Using the Demonstration Code on the
CD -- Further Reading -- Index

MORE INFORMATION
http://www.mkp.com/books_catalog/1-55860-529-0.asp

Morgan Kaufmann Publishers
San Francisco, California
http://www.mkp.com
orders@mkp.com

Copyright © 1999 KDnuggets