Dr. Gregory Piatetsky-Shapiro KDnuggets |
Prof. Gary Parker Connecticut College |
Introductions | Course materials | Data Mining Course Modules | Assignments & Datasets | Extra Publications | Additional Lectures | Acknowledgments
Introductions
Course introduction | For prospective students | For facultyCourse materials
- Syllabus
- Detailed Course Outline
- Course Notes (PDF, 59 pages, 492KB)
KDnuggets.com/data_mining_course/course_notes.pdf
Please be patient and wait for the entire file to load ! - References
Data Mining Course Modules
To get the presentations, add www.kdnuggets.com/data_mining_course/ in front of ppt files below- DM1: Introduction: Machine Learning and Data Mining, updated May 31, 2006.
dm1-introduction-ml-data-mining.ppt - DM2: Machine Learning and Classification, updated June 7, 2006.
dm2-intro-machine-learning-classification.ppt -
DM3: Input: Concepts, Instances, Attributes.
dm3-input-concepts.ppt - DM4: Output: Knowledge Representation, updated June 7, 2006.
dm4-output-representation.ppt - DM5: Classification - Basic Methods.
dm5-classification-basic.ppt - DM6: DM6: Classification: Decision Trees.
dm6-decision-tree-intro.ppt - DM7: Classification: C4.5.
dm7-decision-tree-c45.ppt - DM8: Classification: CART.
dm8-decision-tree-cart.ppt - DM9: Classification: Rules, Regression, K-Nearest Neighbour.
dm9-rules-regression-knn.ppt - DM10: Evaluation and Credibility, updated May 31, 2006.
dm10-evaluation.ppt - DM11: Evaluation - Lift and Costs, updated May 31, 2006.
dm11-evaluation-lift-cost.ppt - DM12: Data Preparation for Knowledge Discovery, updated June 7, 2006.
dm12-data-preparation.ppt - DM13: Clustering, updated May 31, 2006.
dm13-clustering.ppt - DM14: Associations Rules, updated May 31, 2006.
dm14-association-rules.ppt - DM15: Data Mining and Visualization, (3.2MB), updated May 31, 2006.
dm15-visualization-data-mining.ppt - DM16: Summarization and Deviation Detection.
dm16-summarization-deviation-detection.ppt - DM17: Applications: Targeted Marketing, KDD Cup, and Customer Modeling, updated Oct 18, 2004.
dm17-targeted-marketing-kdd-cup.ppt - DM18: Applications: Genomic Microarray Data Mining.
dm18-microarray-data-mining.ppt - DM19: Data Mining and Society; Future Directions
dm19-data-mining-and-society.ppt
Assignments, Mid-term Quiz, and Final Exam
Datasets
Additional Publications
- KEFIR Summarization system for health-care data:
sample HTML report and
KEFIR book chapter (PDF, 357KB, 19 pages),
kefir/kefir-chapter.pdf
Used in module 16. - Capturing Best Practice for Microarray Gene Expression Data Analysis (PDF, 850KB, 9 pages), G. Piatetsky-Shapiro, T. Khabaza, S. Ramaswamy, KDD-03 Conference. Used in Module 18 and in final project.
microarray-best-practice.pdf
Additional Lectures
- Introductory Data Mining Tutorial, (90 slides).
data-mining-tutorial.ppt
-
Introduction to Data Mining (notes) a 30-minute unit, appropriate for a "Introduction to Computer Science" or a similar course.
x1-intro-to-data-mining.ppt
- Data Mining Module for a course on Artificial Intelligence: Decision Trees,
appropriate for one or two classes.
(See Data Mining course notes for Decision Tree modules.)
x2-data-mining-for-ai.ppt
- Data Mining Module for a course on Algorithms: Decision Trees,
appropriate for one or two classes.
See also
data mining algorithms introduction and
Data Mining Course notes (Decision Tree modules).
x3-algorithms-decision-trees.ppt
- From Data Mining to Knowledge Discovery: an Introduction, Connecticut College, Oct 2003.
x4-data-mining-to-knowledge-discovery.ppt
- Data Mining in Genomics: The Dawn of Personalized Medicine, Connecticut College, Oct 2003.
x5-data-mining-in-genomics.ppt