KDnuggets : Newsletter : 1999 Issues : 99:02 Contents :

KDnuggets 99:02, item 5, Tools and Services:

Date: Wed, 6 Jan 1999 15:31:44 +0100 (MET)
From: Jakub.Zavrel@kub.nl
Subject: Software release: Timbl 2.0
Web: http://ilk.kub.nl/software.html
----------------------------------------------------------------------
 Software release:      TiMBL 2.0
                        Tilburg Memory Based Learner
ILK Research Group, http://ilk.kub.nl/
----------------------------------------------------------------------
The ILK (Induction of Linguistic Knowledge) Research Group at Tilburg
University, The Netherlands, announces the release of a new version of
TiMBL, Tilburg Memory Based Learner (version 2.0).

TiMBL is a machine learning program implementing a family of
Memory-Based Learning techniques. TiMBL stores a representation of the
training set explicitly in memory (hence `Memory Based'), and
classifies new cases by extrapolating from the most similar stored
cases. 
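
As an illustration of the idea only (not TiMBL's own code), the following
Python sketch stores training instances verbatim and classifies a new
case by majority vote among the stored cases that differ from it in the
fewest feature positions. The names overlap_distance, classify, and
memory are hypothetical, and the toy data is made up for the example.

```python
from collections import Counter


def overlap_distance(a, b):
    """Count the feature positions where two instances disagree."""
    return sum(1 for x, y in zip(a, b) if x != y)


def classify(memory, query, k=1):
    """Majority vote among the k stored instances closest to the query.

    memory is a list of (features, label) pairs kept verbatim in memory.
    """
    ranked = sorted(memory, key=lambda item: overlap_distance(item[0], query))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Toy instance memory (illustrative data, not from the announcement).
memory = [
    (("a", "b", "c"), "X"),
    (("a", "b", "d"), "X"),
    (("e", "f", "g"), "Y"),
]
```

Calling classify(memory, ("e", "f", "z")) compares the query against
every stored instance, which is the cost that the tree-based
optimizations listed below are meant to reduce.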

TiMBL features the following (optional) metrics and speed-up
optimizations that enhance the underlying k-nearest neighbor
classifier engine:

- Information Gain weighting for dealing with features of differing
  importance (the IB1-IG learning algorithm).
- Stanfill & Waltz's / Cost & Salzberg's (Modified) Value Difference
  metric for making graded guesses of the match between two
  different symbolic values.
- Conversion of the flat instance memory into a decision tree, 
  and inverted indexing of the instance memory, both yielding
  faster classification.
- Further compression and pruning of the decision tree, guided 
  by feature information gain differences, for an even larger 
  speed-up (the IGTREE learning algorithm).
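
The first two items in the list above can be sketched in a few lines of
Python. This is a rough illustration under stated assumptions, not the
TiMBL implementation: information gain is computed as class entropy
minus the expected entropy after splitting on a feature, and the value
difference is computed as the summed absolute difference between the
class distributions of two values. All function names and the toy data
are hypothetical, and mvdm assumes both values occur in the data.

```python
import math
from collections import Counter, defaultdict


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(instances, labels, feature):
    """Class entropy minus the expected entropy after splitting on a feature."""
    by_value = defaultdict(list)
    for inst, label in zip(instances, labels):
        by_value[inst[feature]].append(label)
    remainder = sum(len(sub) / len(labels) * entropy(sub)
                    for sub in by_value.values())
    return entropy(labels) - remainder


def weighted_overlap(a, b, weights):
    """Sum the weights of the features on which two instances disagree."""
    return sum(w for x, y, w in zip(a, b, weights) if x != y)


def mvdm(instances, labels, feature, v1, v2):
    """Sum over classes of |P(class | v1) - P(class | v2)| for one feature.

    Assumes both v1 and v2 occur at least once for this feature.
    """
    counts = {v1: Counter(), v2: Counter()}
    for inst, label in zip(instances, labels):
        if inst[feature] in counts:
            counts[inst[feature]][label] += 1
    n1, n2 = sum(counts[v1].values()), sum(counts[v2].values())
    return sum(abs(counts[v1][c] / n1 - counts[v2][c] / n2)
               for c in set(labels))


# Toy data: feature 0 determines the class, feature 1 is uninformative.
instances = [("a", "x"), ("a", "y"), ("b", "x"), ("b", "y")]
labels = ["P", "P", "Q", "Q"]
weights = [information_gain(instances, labels, f) for f in range(2)]
```

With these weights, a mismatch on the informative feature dominates the
distance, while a mismatch on the uninformative one contributes nothing.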

The current version is a complete rewrite of the software, and
offers a number of new features:

- Support for numeric features.
- The TRIBL algorithm, a hybrid between decision tree and nearest
  neighbor search.
- An API to access the functionality of TiMBL from your own C++ 
  programs.
- Increased ability to monitor the process of extrapolation from 
  nearest neighbors.
- Many bug-fixes and small improvements.

TiMBL accepts command-line arguments by which these metrics and
optimizations can be selected and combined. TiMBL can read the C4.5
and WEKA ARFF data file formats, as well as column files and compact
(fixed-width, delimiter-less) data.
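
To make the difference between these layouts concrete, here is a small
Python sketch of how one line in each of three of them might be split
into features and a class label. It assumes the class is the last field,
that C4.5-style lines are comma-separated with an optional trailing
period, that column files are whitespace-separated, and that compact
lines use one fixed field width. These are assumptions for illustration;
the details TiMBL actually expects may differ, and the function names
are hypothetical.

```python
def parse_c45_line(line):
    """Split a comma-separated line; the last field is taken as the class."""
    fields = [f.strip() for f in line.strip().rstrip(".").split(",")]
    return tuple(fields[:-1]), fields[-1]


def parse_columns_line(line):
    """Split a whitespace-separated line; the last field is the class."""
    fields = line.split()
    return tuple(fields[:-1]), fields[-1]


def parse_compact_line(line, width):
    """Cut a delimiter-less line into fields of `width` characters each."""
    line = line.rstrip("\n")
    fields = [line[i:i + width] for i in range(0, len(line), width)]
    return tuple(fields[:-1]), fields[-1]
```

Under these assumptions, the lines "a,b,c,X.", "a b c X", and "abcX"
(with a width of 1) all describe the same instance.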

To download, go to http://ilk.kub.nl/software.html
   (registration may be required)

For information on the ILK Research Group, visit our site at
http://ilk.kub.nl/
(edited GPS)

Copyright © 1999 KDnuggets