KDnuggets : News : 2001 : n16 : item14    (previous | next)

Software


From: Mohammed Zaki
Date: Thu, 26 Jul 2001 12:09:58 -0400
Subject: LOGML: Log Markup Language Schema and DTD Available
LOGML (Log Markup Language) Draft Specification and Schema
-----------------------------------------------------------
We are glad to announce the draft specification of LOGML (Log Markup
Language) and Schema at: http://www.cs.rpi.edu/~puninj/LOGML/

Log Markup Language (LOGML) is an XML 1.0 application designed to
describe log reports of web servers. Web-data mining is one of the
current hot topics in computer science. Mining data that has been
collected from web server logfiles, is not only useful for studying
customer choices, but also helps in organizing web pages. This is
accomplished by knowing which web pages are most frequently accessed
by the web surfers. The structure of a web site is represented as a
web graph using XGMML (eXtensible Graph Markup and Modeling Language;
see below).

In mining the data from the log statistics, we use the web graph in
annotating the log information. Further we give summary reports,
comprising of information such as client sites, types of browsers and
the usage time statistics. We also gather the client activity in a web
site as a subgraph of the web site graph. This subgraph can be used to
get better understanding of general user activity in the web site.  In
LOGML, we create a new XML vocabulary to structurally express the
contents of the logfile information.  We provide with a LOGML dtd and
LOGML Schema (based on XML Schema W3C Recommendation 2 May 2001).

Application of LOGML for Web Usage Mining is described in detail in
the following paper (to appear in WEBKDD01):
http://www.cs.rpi.edu/~zaki/papers.html#WEBKDD01

XGMML (eXtensible Graph Markup and Modeling Language) Schema Update
-------------------------------------------------------------------
We are also glad to announce the update of XGMML schema, based on XML
Schema W3C recommendation, at: http://www.cs.rpi.edu/~puninj/XGMML/

XGMML (eXtensible Graph Markup and Modeling Language) is an XML
application based on GML which is used for graph description. XGMML
uses tags to describe nodes and edges of a graph. The purpose of XGMML
is to make possible the exchange of graphs between differents
authoring and browsing tools for graphs. The conversion of graphs
written in GML to XGMML is trivial.  Using XSL with XGMML allows the
translation of graphs to different formats. XGMML was created to be
used for the WWWPAL System that visualizes web sites as a graph. Web
Robots can navigate through a web site and save the graph information
as an XGMML file. XGMML, as any other XML application, can be mixed
with other markup languages to describe additional graph, node and/or
edge information.

Questions and comments are welcome.
John Punin, puninj@cs.rpi.edu
Mukkai Krishnamoorthy, moorthy@cs.rpi.edu
Mohammed J. Zaki, zaki@cs.rpi.edu

KDnuggets : News : 2001 : n16 : item14    (previous | next)

Copyright © 2001 KDnuggets.   Subscribe to KDnuggets News!