ACM SIGIR 2000 Workshop On
July 28, 2000
Call for Papers and Participation
Ricardo Baeza-Yates, Depto. de Ciencias de la Computacion, Universidad de Chile, Chile
Norbert Fuhr, Dortmund University, Germany
Ron Sacks-Davis, RMIT Multimedia Database Systems (MDS), Australia
Ross Wilkinson, CSIRO, Division of Mathematical and Information Science, Australia
XML - the eXtensible Markup Language has recently emerged as a new standard for data representation and exchange on the Internet. It is widely believed that it will become a universal format for data exchange on the Web and that in the near future we will find vast amounts of documents in XML format on the Web. As a result, it has become crucial to address the question of how can we efficiently query and search large corpora of XML documents.
To date, most work on storing, indexing, querying, and searching documents in XML has stemmed from the database community's work on semi-structured data. An alternative approach, which has received less attention to date, is to view XML documents as a collection of text documents with additional tags and relations between these tags. IR techniques have traditionally been applied to search large sets of textual data and should thus be extended to encode the structure and semantics inherent in XML documents. Integrating IR and XML search techniques will enable more sophisticated search on the structure as well as the content of these documents, while leveraging the success of IR techniques in document similarity ranking and keyword search.
In this workshop, we will explore both approaches and investigate the relationship between IR and XML. The question that we will address is twofold: what IR can do for XML and vice versa. Topics may include:
Send by email to David Carmel-
A short vita and a position paper along with the intended track (tutorial, research, demo, standards). Length should be no more than 2000 words (HTML, Postscript, or PDF format). Final versions should be submitted in HTML for pre-conference publication, and additionally in PDF or Postscript for the printed version of the workshop materials.
For participation only:
A statement of interest, not to exceed 500 words (HTML, Postscript, or PDF format).
Working notes of the workshop which contain all research and position
papers will be distributed to participants at the workshop and, with the
permission of the authors, made available on the Web. If there is interest
among the attendees, we may publish an edited volume after the
|Submission:||June 5, 2000|
|Notification:||June 19, 2000|
|Final version:||July 3, 2000|
|SIGIR technical conference:||July 24-27, 2000|
|SIGIR workshop:||July 28, 2000|
Additional information is available at XML-IR Workshop Web page and questions can be sent directly to any of the organizers above. Information about the workshop venue and local arrangements (hotel reservation etc.) as well as the sponsoring conference can be found at SIGIR Conference main Web page.