KDnuggets Data Mining Community's Top Resource since 1997
for Data Mining and Analytics Software, Jobs, Consulting, Courses, and more
 
advanced search              help


You are here: KDnuggets Home » Web Mining Course » Assignment 2

Web Mining Course: Assignment 2 - Global log analysis

For the "toy" log file d100.log (or another log file), compute
  1. the total number of hits
  2. number of different IP addresses
  3. What methods were used (GET/HEAD, etc), and how many hits used each method?
  4. number of different files requested
  5. number of HTML pages requested.
    Count files ending in .html, .htm, and /, (directories)
What interesting observations can you make ?

For extra credit, compute the same values for the large log file kdlog.zip.

Current KDnuggets News

Follow KDnuggets on Twitter

SUBSCRIBE
Subscribe to KDnuggets News, the leading data mining and analytics newsletter.

Get the latest news, software, jobs, courses, and more (free).



You are here: KDnuggets Home » Web Mining Course » Assignment 2

Copyright © 2009 KDnuggets.  | SUBSCRIBE to KDnuggets News (free)  | About KDnuggets | Contact us