Inventors:
David A. Brooks - Providence RI, US
Niklas Heidloff - Salzkotten, DE
Hong Dai - Westford MA, US
Craig R. Wolpert - Holliston MA, US
Igor L. Belakovskiy - Cambridge MA, US
Assignee:
International Business Machines Corporation - Armonk NY
International Classification:
G06F 7/00
G06F 17/30
G06F 15/16
US Classification:
707/3, 707/4, 707/5, 707/6
Abstract:
A method and system for sharing full-text index entries across application boundaries, in which documents are obtained by a shared, platform-level indexing service and a determination is made as to whether each received document is a duplicate of a previously indexed document. If a document is determined to be a duplicate, the index representation of the previously indexed copy is modified to indicate that the document is also associated with another application or context. If a document is not a duplicate of a previously indexed document, it is indexed to support future searches and/or other processing. The index representation of a document includes application category identifiers associating one or more applications or contexts with the document. When a document is indexed, one or more category identifiers are generated and stored in association with that document. The category identifiers for an indexed document may, for example, represent an application that received, stored, or otherwise processed that document.
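The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the names SharedIndex, IndexEntry, add, and search are hypothetical, duplicate detection is assumed to be an exact match on a SHA-256 digest of the document text (the abstract does not say how duplicates are determined), and a category identifier is assumed to be a plain string naming the submitting application.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class IndexEntry:
    """Index representation of one document (hypothetical structure)."""
    text: str
    categories: set = field(default_factory=set)  # application category identifiers


class SharedIndex:
    """Toy platform-level full-text index shared by several applications."""

    def __init__(self) -> None:
        self._entries = {}   # content digest -> IndexEntry
        self._postings = {}  # lowercase term -> set of content digests

    def add(self, text: str, category: str) -> bool:
        """Submit a document for `category`; return True if it was a duplicate."""
        # Assumption: two documents are duplicates when their text hashes match.
        digest = hashlib.sha256(text.encode("utf-8")).hexdigest()
        entry = self._entries.get(digest)
        if entry is not None:
            # Duplicate: only record the additional application or context.
            entry.categories.add(category)
            return True
        # New document: index its terms and store the first category identifier.
        self._entries[digest] = IndexEntry(text, {category})
        for term in set(text.lower().split()):
            self._postings.setdefault(term, set()).add(digest)
        return False

    def search(self, term: str, category: Optional[str] = None) -> list:
        """Return matching document texts, optionally limited to one category."""
        hits = self._postings.get(term.lower(), set())
        return sorted(
            self._entries[d].text
            for d in hits
            if category is None or category in self._entries[d].categories
        )
```

In this sketch, submitting the same text first under a "mail" category and then under a "team-room" category leaves a single index entry carrying both identifiers, so a search scoped to either category finds the document while a search scoped to an unrelated category does not.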