KDnuggets » Forums

Question about data processing

 
Perter_Be



Joined: 02 Jun 2013
Posts: 2

Posted: Sun Jun 02, 2013 10:47 am    Post subject: Question about data processing

Hi guys,
I am writing a term paper where data mining is used to improve the security of a power grid, and I have a question about data processing.

Various kinds of data are collected from a power grid, like outages, transformer data, power quality, system loads, etc.
This data is then processed for a machine learning application. It is said that the following things are done with the collected data:
- Parse and scale features
- Aggregation and binning
- Convert categorical to boolean features

I don't have the foggiest clue what is meant by "Convert categorical to boolean features". Could any of you explain what this could mean?
editor
Site Admin


Joined: 04 Oct 2005
Posts: 120
Location: Boston, MA

Posted: Mon Jun 03, 2013 6:24 am    Post subject: Converting categorical (nominal) features to boolean

This conversion is standard when applying many types of machine learning algorithms, such as neural nets, which cannot easily deal with nominal values.

For example, say you have a feature COLOR that can have the values
RED, GREEN, and BLUE. Then you would replace it with 3 new attributes:

COLOR_RED (if COLOR="RED" then 1 else 0),
COLOR_GREEN (if COLOR="GREEN" then 1 else 0),
COLOR_BLUE (if COLOR="BLUE" then 1 else 0)
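
A minimal Python sketch of this replacement, assuming the COLOR values are held in a plain list. The helper name `one_hot` and the sample data are made up for illustration, and this is only one way to do it:

```python
def one_hot(values, feature_name):
    """Replace one categorical column with one 0/1 column per distinct value."""
    # Sort the distinct values so the new columns come out in a stable order.
    categories = sorted(set(values))
    return {
        f"{feature_name}_{cat}": [1 if v == cat else 0 for v in values]
        for cat in categories
    }


# Hypothetical sample data: four records of the COLOR feature.
colors = ["RED", "GREEN", "BLUE", "GREEN"]
encoded = one_hot(colors, "COLOR")

for name, column in encoded.items():
    print(name, column)
```

Each record ends up with exactly one 1 across the three new attributes, which is why this is often called one-hot encoding.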
Perter_Be



Joined: 02 Jun 2013
Posts: 2

Posted: Wed Jun 05, 2013 3:13 pm    Post subject: Further questions

Thank you for your answer. But now I have 2 questions with regard to your answer:
1) Why can many machine learning algorithms not deal with nominal values?
2) If we use boolean values, why do we have to parse and scale features?
All times are GMT - 5 Hours