Search

Taufik Abidin Phones & Addresses

  • New Brighton, MN
  • Piscataway, NJ
  • Fargo, ND

Publications

Us Patents

Method And System For Data Mining Of Very Large Spatial Datasets Using Vertical Set Inner Products

View page
US Patent:
20080109437, May 8, 2008
Filed:
Nov 17, 2005
Appl. No.:
11/791004
Inventors:
William K. Perrizo - Fargo ND, US
Taufik Fuadi Abidin - Piscataway NJ, US
Amal Shehan Perera - Knoxville TN, US
Masum Serazi - Edison NJ, US
International Classification:
G06F 7/08
G06F 17/30
US Classification:
707 7, 7071041, 707E17009, 707E17046
Abstract:
A system and method for performing and accelerating cluster analysis of large data sets is presented. The data set is formatted into binary bit Sequential (bSQ) format and then structured into a Peano Count tree (P-tree) format which represents a lossless tree representation of the original data. A P-tree algebra is defined and used to formulate a vertical set inner product (VSIP) technique that can be used to efficiently and scalably measure the mean value and total variation of a set about a fixed point in the large dataset. The set can be any projected subspace of any vector space, including oblique sub spaces. The VSIPs are used to determine the closeness of a point to a set of points in the large dataset making the VSIPs very useful in classification, clustering and outlier detection. One advantage is that the number of centroids (k) need not be pre-specified but are effectively determined. The high quality of the centroids makes them useful in partitioning clustering methods such as the k-means and the k-medoids clustering. The present invention also identifies the outliers.
Taufik F Abidin from New Brighton, MN, age ~54 Get Report