KDnuggets » Forums
Latest News



 FAQFAQ    SearchSearch    MemberlistMemberlist     RegisterRegister   ProfileProfile    Log inLog in 

How to deal with choice of population of study?

 
Post new topic   Reply to topic    www.kdnuggets.com Forum Index -> Data Mining Beginners
View previous topic :: View next topic  
Author Message
KaliHD



Joined: 02 Feb 2013
Posts: 1

PostPosted: Sat Feb 02, 2013 1:16 am    Post subject: How to deal with choice of population of study? Reply with quote

Hi everyone,

I am glad to find a forum like that.

I am currently looking for doing classification or clustering of my big data set on web application.
I would like to do cluster on websites visitors who are identified by their cookie ... which they can drop whenever they want.

The problem I am to deal with is the life time cookie! To get an idea I have about 3 millions "unique" (if they drop their cookie a customer can be identified as new another time) visitor and 85% where the information related to the "unique" visitor is just for one visit and 15% where the information can be collected over more than 1 day (typically about 12 days).

I don't know if it is a good idea to take into account in my clustering (or classification) the 85% of visitors and to do my model on "just" the other 15% which seems more relevant?
Or is this dummy? Because yes they can drop the cookie and yes if I take into account the 85% maybe I will see the same visitor 20 times but with 20 different "unique" key because he has drop the cookie each time.
But an another side the visitor can also come only one time and never come back again?

Have you maybe an idea also on which articles I can find information and ideas about that?

All the best,

KaliHD
Back to top
View user's profile Send private message
editor
Site Admin


Joined: 04 Oct 2005
Posts: 120
Location: Boston, MA

PostPosted: Wed Feb 27, 2013 8:26 am    Post subject: Web cookies Reply with quote

It is a tricky question, but I would start by building 2 separate models - one for multi-time visitors and one for 1-time visitors.
If 1-time visitors visit similar types of pages, then perhaps you can apply multi-visit model to one-time visitors
Back to top
View user's profile Send private message Send e-mail Visit poster's website
Display posts from previous:   
Post new topic   Reply to topic    www.kdnuggets.com Forum Index -> Data Mining Beginners All times are GMT - 5 Hours
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum


Powered by phpBB © 2001, 2005 phpBB Group

KDnuggets » Forums

Copyright © 2012 KDnuggets.   Subscribe to KDnuggets News! Tweet Twitter | facebook Facebook | RSS RSS | About KDnuggets