KDnuggets » Forums
Latest News



 FAQFAQ    SearchSearch    MemberlistMemberlist     RegisterRegister   ProfileProfile    Log inLog in 

Information retrieval

 
Post new topic   Reply to topic    www.kdnuggets.com Forum Index -> Text Mining, Web Mining, Association Rules, and Other Algorithms
View previous topic :: View next topic  
Author Message
ArmMiner



Joined: 23 Jan 2013
Posts: 1

PostPosted: Wed Jan 23, 2013 11:01 am    Post subject: Information retrieval Reply with quote

Hi all,

I am writing my Master thesis and would like to know some opinions about my task (some hints maybe). So, my project description:

The product information is in excel file. There are thousands of products in the list (columns: name, productId, description, imageUrl...). The first part of the problem is to compare all products to each other (i.e. compare the descriptions). After first comparison of first two products, a mutual string has to be created, like fingerprint, so it would be compared to the third product. So, after each comparison the mutual string is updated. Also, by comparing the products the most similar ones have to be grouped. So, in result we will have grouped products. Last process is attribute extraction. This means from each group the attributes have to be extracted (color, size...).
So, I'm thinking about the ways to solve this problem. Shall I use some tools or write programme in java? Any ideas or hints?

Thanks!

Best regards
Armen
Back to top
View user's profile Send private message
editor
Site Admin


Joined: 04 Oct 2005
Posts: 120
Location: Boston, MA

PostPosted: Fri Feb 01, 2013 10:53 am    Post subject: Product comparison Reply with quote

You may want to check some tools for recommendations and associations.
Also Java is not a good language for such tasks - try to use python.

There are lots of python packages for machine learning, for example
see

http://www.kdnuggets.com/2012/11/best-python-modules-for-data-mining.html
Back to top
View user's profile Send private message Send e-mail Visit poster's website
Display posts from previous:   
Post new topic   Reply to topic    www.kdnuggets.com Forum Index -> Text Mining, Web Mining, Association Rules, and Other Algorithms All times are GMT - 5 Hours
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum


Powered by phpBB © 2001, 2005 phpBB Group

KDnuggets » Forums

Copyright © 2012 KDnuggets.   Subscribe to KDnuggets News! Tweet Twitter | facebook Facebook | RSS RSS | About KDnuggets