KDnuggets
:
Data Mining Course
: Data
Data Mining Course Datasets
This directory contains the following datasets: (Use mouse right click and "Save as" to download them)
weather.arff
, for
Assignment 1: Using the Weka Workbench
. This dataset should also be available under WEKAHOME/data.
genes-leukemia.csv
(
description
), for
Assignment 2: Preparing the data and mining it (beginner version)
Compressed file
ALL_AML_original_data.zip
, for
Assignment 3: Data Cleaning and Preparing for Modeling (intermediate version)
Compressed file
ALL_AML_train_processed.zip
, ALL_AML train data, thresholded, for
Assignment 4: Feature Reduction
. It should be generated as part of
Assignment 3
.
genes-leukemia.csv
(
description
), also used for
Assignment 5: Predicting treatment outcome
(1 week)
Compressed file
final_project_data.zip
for
Final Project: Predict disease classes using genetic microarray data
.