US Patent:
20210256115, Aug 19, 2021
Inventors:
- SAN JOSE CA, US
Bonnie Arogyam Varghese - Milpitas CA, US
Shankar Subramaniam - Cupertino CA, US
Karthik Krishnan - San Jose CA, US
Rency Joseph - Santa Clara CA, US
International Classification:
G06F 21/55
G06F 40/35
Abstract:
A method and an electronic device () are disclosed for generating semantic representation of a document to determine data security risk associated with the document. The method includes receiving, by a document semantics controller () of the electronic device (), a document in an electronic form and determining, by the document semantics controller (), raw text. Further, the method includes generating, by the document semantics controller (), a plurality of sentence blocks using the raw text and determining, by the document semantics controller (), embeddings for the plurality of sentence blocks. Further, the method includes determining, by the document semantics controller (), the semantic representation of the document based on the embeddings for each of the sentence blocks; and generating, by the document semantics controller (), the semantic representation of the document to determine the data security risk associated with the document.