Who’s Who? Identifying Concepts and Entities across Multiple Documents Zunaid Kazi and Yael Ravin T. J. Watson Research Center, IBM POB 704, Yorktown Heights, NY 10598 zhkazi, ravin @ us.ibm.com Abstract A number of research and software development groups have developed technology for identifying terms and names in documents and associating them with concepts and named entities, but few have addressed coreference of concepts and entities across multiple documents in a collection. Cross-document coreference is challenging, since a collection of documents consists of multiple discourse contexts, with a many-to-many correspondence between terms and names on one hand and the concepts and entities they refer to on the other. In this paper we describe an algorithm we devised for handling terms and names across documents and for automatically mapping them to the intended concepts and entities.