PublicationsFrom: Kirk.Borne@gsfc.nasa.govDate: Thu, 17 May 2001 12:59:21 -0400 (EDT) Subject: NASA Astronomical Data Center's Data Mining and XML Resources NASA's Astronomical Data Center (ADC) has established a compilation of "Data Mining Resources for Space Science" : http://adc.gsfc.nasa.gov/adc/adc_datamining.html The list includes numerous items of general data mining interest. It is not intended to be a comprehensive data mining resource, but is primarily focussed on the new Virtual Observatory initiative for astronomical data. This Virtual Observatory will require significant use of techniques from the data mining and knowledge discovery communities. Links to the Virtual Observatory are provided in the above resource listing. A short description is also available here : http://adc.gsfc.nasa.gov/adc/adc_enews/jul00.enews.html#2 In addition to these efforts, the ADC is involved in research into the application of eXtensible Markup Language (XML) for the repository of published astronomical data. Automated pipelines have been developed for the flow of data from scientists and journal publication presses into XML documents. The goal of these pipelines is to greatly reduce the human effort required to transform electronic human-readable tables into machine-readable and database-ready documents. An important benefit is that the resulting XML documents can be searched and mined via detailed or complex queries using the latest XML-based search tools. ADC's XML web site : http://tarantella.gsfc.nasa.gov/xml The ADC has also developed a new general data format designed to take full advantage of the XML hierarchical structure. The mathematical core of the format is XDF, the eXtensible Data Format. ADC's XDF web site : http://tarantella.gsfc.nasa.gov/xml/XDF_home.html Kirk Borne Raytheon Information Technology and Scientific Services NASA Goddard Space Flight Center Astrophysics Data Facility, Code 631 Greenbelt, MD 20771 ADC web page: http://adc.gsfc.nasa.gov/ |
Copyright © 2001 KDnuggets. Subscribe to KDnuggets News!