KDnuggets :
Web Mining Course :
Assignment 2
Web Mining Course: Assignment 2 - Global log analysis
For the "toy" log file d100.log (or another log file), compute
- the total number of hits
- number of different IP addresses
- What methods were used (GET/HEAD, etc), and how many hits used each method?
- number of different files requested
- number of HTML pages requested.
Count files ending in
.html,
.htm,
and
/, (directories)
What interesting observations can you make ?
For extra credit, compute the same values for the large log file kdlog.zip.
KDnuggets :
Web Mining Course :
Assignment 2