KDnuggets : News : 2003 : n12 : item9 < PREVIOUS | NEXT >

Software

From: Normand Peladeau
Date: June 6, 2003
Subject: New improved version of WordStat and pre-released version of CodeMiner

RELEASE OF WORDSTAT 4.0

Provalis Research is pleased to announce the release of WordStat version 4.0.

New features and improvements include:

  • Dictionary moderated lemmatization and stemming (English only for now, other languages will be added later).
  • Heatmap plot and dual-clustering to examining relationship between keywords and categorical or numerical variables.
  • Clustering and multidimensional scaling of cases (records or documents) based on content similarity.
  • Proximity plot to examine keywords co-occurrence or documents similarity in descending order of similarity (limit of 5000 documents).
  • Phrases finder to easily identify recurring phrases and idioms.
  • New drag'n drop dictionary editor for easy and quick assignment of words or phrases to dictionaries (may also be used to merge two dictionaries)
  • Support for RTF documents (up to 10Mb per document) and larger memo fields (beyond the previous 64Kb limit)
  • Direct importation of Word, WordPerfect and HTML documents from the editor.
  • Customization of 2D and 3D multidimensional scaling charts, correspondence plots and bar charts (title, color, data point, data labels, etc.)
  • Option to animate 3D multidimensional scaling charts and 3D correspondence plots (allows better visualization of all 3 dimensions).
  • Descriptions may now be assigned to dictionary categories.
  • 3D barcharts and line charts.
  • Weighting of words or phrases in content categories may now be performed using floating-point numbers (positive values).
  • Improved spell-checking with support for more than 10 different languages such as English, French, Spanish, etc.
  • All tables may now be saved in Excel, text or html format.
  • All changes made to the categorization dictionary or exclusion list may be canceled during a session.
  • Automatic selection of inflected forms by suffix or WordNet relationship (dictionary builder tool).
  • Improve text editing (with undo).
  • New categorization dictionaries based on WordNet and Roget ontology systems.
  • Miscellaneous changes, speed improvements, etc.

About WordStat:

WordStat is a content analysis and text mining module specifically designed to analyze textual information such as responses to open-ended questions, interviews, titles, journal articles, public speeches, electronic communications, etc. It provides numerous statistical and graphical tools to visualize text and examine the relationship between words or categories of words and the values of categorical and numerical variables (cluster analysis, multidimensional scaling, proximity plot, correspondence plot, heatmap, bar chart, line chart, crosstabulation, statistical association measures, etc.). Such tools allow users to compare how different groups differ in vocabulary usage, discussed topic, or content category. They also may be used to uncover existing relationships between specific categories of words and numeric variables such as age of the respondent, year of publication, etc.WordStat can analyze text directly or apply categorization dictionaries to group words with similar meanings . The program also provides numerous tools to facilitate the development and validation of new categorization dictionaries.

For more information on WordStat and to obtain evaluation copy please visit our Web Site under:

www.simstat.com/wordstat.htm

PRE-RELEASED VERSION OF CODEMINER 0.9 (BETA VERSION)

We are also working on a new text management and qualitative analysis software. This new software will be offered as an optional base module for WordStat. It will provide an easy way to manage documents and perform common qualitative analysis tasks such as:

  • Storing of documents in Rich Text Format with support for graphics, tables, etc..
  • Easy creation and edition of qualitative analysis coding systems.
  • Drag and drop assignment of codes to text segments.
  • Code retrieval, code comparison statistics, and text searching tools.
  • Numerous visualization tools such as bar charts, line charts and correspondence analysis plots, clustering, concept mapping (using multidimensional scaling), and heatmaps.
  • Code sequences analysis.
  • Computation of 4 levels of inter-raters agreement (occurrence, frequency, code importance and coded segments).
  • Ability to perform content analysis / text mining with the optional WordStat addon module on entire documents or on selected coded segments.
  • Ability to perform statistical analysis on project files using Simstat statistical software.
  • Saving of segments and comparison tables to disk (text, comma separated value, Excel, etc.)

While CodeMinder is still under beta testing, some people have expressed the wish to purchase the product immediately. We have thus decided to offer a pre-released version of CodeMiner at a special price (almost 50% off the planned retail price). Customers who will purchase this pre-released version will receive by email an electronic license file to unlock the beta version. When CodeMiner will be officially released, they will receive the final version for free (installation CD and manual). They will also be entitled to the same benefits as users who will purchase the official version 1.0.

For more information on CodeMiner and to download a trial version, please visit our web site at: www.simstat.com/CodeMiner.htm


KDnuggets : News : 2003 : n12 : item9 < PREVIOUS | NEXT >

Copyright © 2003 KDnuggets.   Subscribe to KDnuggets News!