KDnuggets : News : 2002 : n04 : item3

Features


From: Gregory Piatetsky-Shapiro

Date: 18 Feb, 2002

Subject: Poll Results: favorite dataset format

The previous KDnuggets poll asked:

What dataset format do you use the most when data mining?

Based on 238 votes, about half of the respondents chose either comma-separated or tab-separated formats. Another quarter of the respondents indicated that they kept the data in a database. Only 11% kept the data in a commercial data mining tool format such as SAS.

Full results:

Comma-separated (.csv) file		24%
Tab or Space-separated (.txt) file	26%
Commercial data mining tool format	11%
Weka format (.ARFF)			 5%
in a database				27%
in a spreadsheet			 7%
Other					 1%
 
See www.kdnuggets.com/polls/2002/dataset_format.htm

Copyright © 2002 KDnuggets.