- weather.arff, for Assignment 1: Using the Weka Workbench. This dataset should also be available under WEKAHOME/data.
- genes-leukemia.csv
(description), for
Assignment 2: Preparing the data and mining it (beginner version)
- Compressed file ALL_AML_original_data.zip, for Assignment 3: Data Cleaning and Preparing for Modeling (intermediate version)
- Compressed file ALL_AML_train_processed.zip, ALL_AML train data, thresholded, for
Assignment 4: Feature Reduction.
It should be generated as part of Assignment 3.
- genes-leukemia.csv (description), also used for Assignment 5: Predicting treatment outcome (1 week)
- Compressed file final_project_data.zip for Final Project: Predict disease classes using genetic microarray data.
Data Mining Course Datasets |
This directory contains the following datasets.
To get the datasets, add www.kdnuggets.com/data_mining_course/data/ in front of data files below: