mineXML

As industrial-strength XML databases emerge, the amount of available XML data is growing tremendously. What is needed is an interactive system that can automatically analyze XML data and find meaningful, useful information within particular application domains.

Goal

The mineXML project is designed to extract high-level information from repositories of Web Service descriptions (XML documents) in order to establish associations, build classifications, and extract meaningful information of various types (e.g., business related, configuration related).
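
As a rough illustration of what "establishing associations" over a repository of Web Service descriptions could look like, the sketch below counts how often pairs of operation names are declared together in the same document. It is not the mineXML implementation. It assumes the descriptions are WSDL 1.1 documents, and the function names operation_names and co_occurrence_counts are hypothetical, introduced only for this example.

```python
import xml.etree.ElementTree as ET
from collections import Counter
from itertools import combinations

# Assumption: the repository holds WSDL 1.1 documents, which use this namespace.
WSDL_NS = "http://schemas.xmlsoap.org/wsdl/"


def operation_names(wsdl_text):
    """Return the set of operation names declared under portType elements."""
    root = ET.fromstring(wsdl_text)
    path = f".//{{{WSDL_NS}}}portType/{{{WSDL_NS}}}operation"
    return {op.get("name") for op in root.iterfind(path) if op.get("name")}


def co_occurrence_counts(documents):
    """Count, for each unordered pair of operation names, how many documents declare both."""
    counts = Counter()
    for text in documents:
        # Sort so that each pair is always stored in the same order.
        names = sorted(operation_names(text))
        counts.update(combinations(names, 2))
    return counts
```

Pairs with high counts are candidates for association rules (for example, services that declare one operation tend to declare the other). A real system would also need support and confidence thresholds, which this sketch leaves out.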

Following are some techniques and issues we are investigating:

Collaboration

This project is a joint effort by Cindy Chen, Judah Diament, George Mihaila, Haixun Wang and Isabelle Rouvellou at the IBM T. J. Watson Research Center, Hawthorne, New York, USA.

For further information on mineXML, please contact Cindy Chen.