KDnuggets » Forums

computing brief..
madnormski
Contributor


Joined: 06 Dec 2006
Posts: 6

Posted: Fri Dec 08, 2006 9:05 am    Post subject: computing brief..

The brief: build and compare 2 different predictive models using a decision tree, with the Wisconsin breast cancer dataset.

I am seriously lost on this.
editor
Site Admin


Joined: 04 Oct 2005
Posts: 120
Location: Boston, MA

Posted: Fri Dec 08, 2006 2:19 pm    Post subject: Wisconsin Breast Cancer Data

You can get this data from the UCI repository; see
http://www.ics.uci.edu/~mlearn/MLSummary.html

Look at
http://www.ailab.si/orange/doc/datasets/breast-cancer-wisconsin.htm
for notes and analysis
madnormski
Contributor


Joined: 06 Dec 2006
Posts: 6

Posted: Sat Dec 09, 2006 6:27 am

I have the dataset. How do I create 2 predictive models using decision trees in Clementine? I'm more into multimedia projects and I am lost on this.
TimManns
Data Mining Guru


Joined: 25 Sep 2006
Posts: 37
Location: Sydney

Posted: Sun Dec 10, 2006 6:04 pm

Hi,

When installing Clementine you have the choice to include small demo files and demo Clementine stream files that provide examples of a simple medical drug prescription scenario. This may be similar to your breast cancer project. Following a simple example from the appendix of the User Guide (installed as a PDF when you install Clementine) will show you how to build simple predictive models. It covers the basics such as balancing the data, sampling, and simple data transformations prior to building models.
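
As a rough idea of what "balancing the data" means, here is a minimal sketch in plain Python, outside Clementine. It is not how Clementine does it internally; the function name `balance_by_downsampling` and the sample records are hypothetical, and downsampling the larger class is only one of several ways to balance.

```python
import random
from collections import defaultdict


def balance_by_downsampling(records, label_of, seed=0):
    """Randomly drop records from larger classes until every class
    has as many records as the smallest class."""
    by_label = defaultdict(list)
    for record in records:
        by_label[label_of(record)].append(record)

    target = min(len(group) for group in by_label.values())
    rng = random.Random(seed)  # fixed seed so the sample is repeatable

    balanced = []
    for group in by_label.values():
        balanced.extend(rng.sample(group, target))
    rng.shuffle(balanced)
    return balanced


# Made-up records: (value, label) pairs with a 6:2 class imbalance.
records = [(i, "benign") for i in range(6)] + [(i, "malignant") for i in range(2)]
balanced = balance_by_downsampling(records, label_of=lambda r: r[1])
print(len(balanced))  # prints 4 (two records per class)
```

Downsampling throws data away, so on a small dataset it may be better to leave the classes as they are and just check the error rate per class.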

Clementine is easy to use, but you should still expect some hands-on training from a teacher or colleague, at least for an hour or so.

Cheers

Tim
TimManns
Data Mining Guru


Joined: 25 Sep 2006
Posts: 37
Location: Sydney

Posted: Sun Dec 10, 2006 6:07 pm

I should have mentioned: if you know the data and are familiar with Clementine, it could take you only 30 minutes to create a basic predictive model.
madnormski
Contributor


Joined: 06 Dec 2006
Posts: 6

Posted: Tue Dec 12, 2006 9:41 am

Do you happen to know what the output attribute is for the Wisconsin dataset?
editor
Site Admin


Joined: 04 Oct 2005
Posts: 120
Location: Boston, MA

Posted: Wed Dec 13, 2006 6:37 am    Post subject: Wisconsin Breast Cancer Data - Output attribute

[quote="madnormski"]Do you happen to know what the output attribute is for the Wisconsin dataset?[/quote]

It is the class, the last attribute (benign or malignant).
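
For anyone loading the file outside Clementine, here is a minimal Python sketch of separating the inputs from the output. It assumes the raw UCI file layout (comma-separated: a sample ID, nine attributes scored 1-10, then the class coded 2 for benign and 4 for malignant, with `?` for missing values); other copies of the dataset may be laid out differently. The sample rows are made up for illustration and the helper name `split_row` is hypothetical.

```python
# Assumed layout of each row in the raw UCI file:
# sample id, nine attributes scored 1-10, then the class (2 = benign, 4 = malignant).
# Missing values are assumed to appear as "?".
CLASS_LABELS = {"2": "benign", "4": "malignant"}


def split_row(line):
    """Split one comma-separated row into (inputs, output).

    The sample id is dropped (it is an identifier, not a predictor) and
    "?" entries become None so they can be cleaned later.
    """
    fields = line.strip().split(",")
    attributes, class_code = fields[1:-1], fields[-1]
    inputs = [None if value == "?" else int(value) for value in attributes]
    return inputs, CLASS_LABELS[class_code]


# Illustrative rows in the assumed layout (made up, not real records).
rows = [
    "1000001,5,1,1,1,2,1,3,1,1,2",
    "1000002,8,10,10,8,7,?,9,7,1,4",
]

for row in rows:
    inputs, output = split_row(row)
    print(inputs, "->", output)
```

In a report, the sample ID would be listed as an excluded variable, the nine scored attributes as inputs, and the class as the output.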
madnormski
Contributor


Joined: 06 Dec 2006
Posts: 6

Posted: Wed Dec 13, 2006 6:57 am

I'm 30 years old, from Derry in Northern Ireland. If I sent you the breast cancer dataset, could you design 2 predictive models for me? Please, I'm completely lost.
TimManns
Data Mining Guru


Joined: 25 Sep 2006
Posts: 37
Location: Sydney

Posted: Wed Dec 13, 2006 5:21 pm

Sure, I can spare half an hour to hack something together.

It would help if you could supply information about the data, the business case, or any background information. I'd definitely need a description of the variables/fields involved. Simply having field names isn't going to be enough; I'll need to know what the data actually is.

Cheers

Tim
timmanns at bigpond dot net dot au
madnormski
Contributor


Joined: 06 Dec 2006
Posts: 6

Posted: Thu Dec 14, 2006 4:35 am

Tim, could I possibly email you so I can send you the small dataset?

I have to build and compare 2 different predictive models using a decision tree, a neural network, or a Kohonen network node in Clementine.

I have to pre-process and clean the data and then properly define the input and output variables to achieve a satisfactory solution to the problem.

Finally, I have to write up a report:
- including the problem domain
- input, output and excluded variables (and reasons to designate them as such)
- explain the choice of algorithms and of data pre-processing activity
- give an analysis of results, including an assessment of the effectiveness of the mining

my email is ...normanshongo2003@yahoo.co.uk

Tim, I will never be using this program again as I am a multimedia student and have no idea what to do for this small assignment.

Can I send you the dataset files, please?

Could you include the aspects of the project brief, maybe screenshots from Clementine with small notes included? Anything at all would be greatly appreciated, as I have to hand this in tomorrow evening. As long as the topics in the brief are covered, it does not have to be a very in-depth investigation.

Tim, you would be really helping me out, as I don't know who to turn to now and I want to get the brief completed in time.

Thank you so much. If you could do a half hour's piece I would be so happy.
All times are GMT - 5 Hours
Page 1 of 2

 


Copyright © 2012 KDnuggets.