KDnuggets : Software : Web Content Mining, Screen scraping
See also Web Usage Mining and Web Server Log analysis software and Text Mining software.

Web Content Mining Software

commercial | free and open source
  • Metafy Anthracite Web Mining Software, visually construct spiders and scrapers without scripts (requires MacOS X 10.4 or newer).
  • Megaputer WebAnalyst, integrates the data and text mining capabilities of Megaputer's analytical software directly into your website.
  • Screen Scraper, allows users to scrape structured and unstructured data from websites and format it (free download).
  • WebQL, for creating turnkey web extraction applications, such as price collector, patent information aggregator, etc.
  • XML Miner, XML Miner is a system and class library for mining data and text expressed in XML, extracting knowledge and re-using that knowledge in products and applications in the form of fuzzy logic expert system rules.
free and open source
  • GNU Wget, command line tool for retrieving files using HTTP, HTTPS and FTP.

KDnuggets : Software : Web Content Mining

Copyright © 2008 KDnuggets.   Subscribe to KDnuggets News!