KDnuggets Home » News » 2011 » Jul » Software » Mulan: an open-source Java library for learning from multi-label data  ( < Prev | 11:n17 | Next > )

Mulan: an open-source Java library for learning from multi-label data


 
  
includes algorithms for Classification, Ranking, Feature Selection, Evaluation, and more. Each item of a multi-label dataset can be a member of multiple categories.


Date: Jul 8, 2011

Mulan Mulan is an open-source Java library for learning from multi-label data. Multi-label data consist of training examples of a target function that has multiple binary target variables. This means that each item of a multi-label dataset can be a member of multiple categories or annotated by many labels (classes). This is actually the nature of many real world problems such as semantic annotation of images and video, web page categorization, direct marketing, functional genomics and music categorization into genres and emotions.

Currently, the library includes a variety of state-of-the-art algorithms for performing the following major multi-label learning tasks:

  • Classification. This task is concerned with outputting a bipartition of the labels into relevant and irrelevant ones for a given input instance.
  • Ranking. This task is concerned with outputting an ordering of the labels, according to their relevance for a given data item
  • Classification and ranking. A combination of the two tasks mentioned-above.
In addition, the library offers the following features:
  • Feature selection. Simple baseline methods are currently supported.
  • Evaluation. Classes that calculate a large variety of evaluation measures through hold-out evaluation and cross-validation.
This is the website of Mulan: mulan.sourceforge.net/


 
Related
Data Mining Software

KDnuggets Home » News » 2011 » Jul » Software » Mulan: an open-source Java library for learning from multi-label data  ( < Prev | 11:n17 | Next > )