Making ontologies work for resolving redundancies across documents
This paper discusses issues in designing ontologies that support the assessment of similarity across documents. Knowledge management efforts have produced many document collections focused on particular domains. As such systems scale up, they become unwieldy and even unusable unless obsolete and redundant content is continually identified and removed. We worked with the Eureka knowledge-sharing system at Xerox, which contains about 40,000 technician-authored tips that do not appear in official manuals. The goal was to assess the similarity of documents in the collection and to enable better maintenance by identifying redundancies and inconsistencies.
Everett, J. O.; Bobrow, D. G.; Stolle, R.; Crouch, R. S.; de Paiva, V.; Condoravdi, C.; van den Berg, M.; Polanyi, L. Making ontologies work for resolving redundancies across documents. Communications of the ACM. 2002 February; 45 (2): 55-60.