Inventors:
Alexandru Marius Pasca - Sunnyvale CA, US
Peter Szabolcs Dienes - Mountain View CA, US
Assignee:
Google Inc. - Mountain View CA
International Classification:
G06F 17/00
US Classification:
707673, 704 9, 704 1, 704 7, 704 10, 704257
Abstract:
Methods and apparatus, including systems and computer program products, to acquire potential paraphrases from textual input. In one aspect, textual input is received, a first map is generated, where the key of the first map is an ngram identified in the textual input and the value associated with the key of the first map is a unique identifier, a second map is generated, where the key of the second map is an anchor identified from the ngram and the value associated with the key of the second map is one or more middle portions associated with the anchor, and a third map is generated, where the key of the third map is a potential paraphrase pair identified from the middle portions and the value associated with the key of the third map is the one or more unique anchors associated with the potential paraphrase pair.