US Patent:
20220207391, Jun 30, 2022
Inventors:
- Irvine CA, US
James Max Kanter - Boston MA, US
Kalyan Kumar Veeramachaneni - Watertown MA, US
International Classification:
G06N 5/04
G06N 20/00
Abstract:
A feature engineering application receives a plurality of data sets from different data sources for training a model for making a prediction based on new data. The feature engineering application generates primitives based on the data sets. A primitive is to be applied to a variable in the data sets to synthesize a feature. The feature engineering application also receives a temporal parameter that specifies a temporal value for generating time-based features. After the primitives are generated and the temporal parameter is received, the feature engineering application aggregates the plurality of data entities based on primary variables in the plurality of data entities and generates an entity set based on the aggregation. The feature engineering application then synthesizes features, including the time-based features, based on the entity set, at least some of the primitives, and the temporal parameter.
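The flow described in the abstract can be illustrated with a minimal sketch. It is not the claimed implementation: the table names, the `customer_id` primary variable, the `synthesize_features` helper, and the reading of the temporal parameter as a cutoff time plus a look-back window are all assumptions made for illustration. It uses only the Python standard library.

```python
from datetime import datetime, timedelta
from statistics import mean

# Hypothetical data sets from two sources, linked by the primary variable "customer_id".
customers = [
    {"customer_id": 1, "region": "west"},
    {"customer_id": 2, "region": "east"},
]
transactions = [
    {"customer_id": 1, "amount": 20.0, "time": datetime(2022, 1, 5)},
    {"customer_id": 1, "amount": 30.0, "time": datetime(2022, 2, 10)},
    {"customer_id": 2, "amount": 10.0, "time": datetime(2022, 2, 20)},
    {"customer_id": 1, "amount": 99.0, "time": datetime(2022, 3, 15)},
]

# Primitives: reusable functions applied to a variable to synthesize a feature.
PRIMITIVES = {"sum": sum, "mean": mean, "count": len}


def synthesize_features(parents, children, key, variable, primitives, cutoff, window):
    """Aggregate child rows onto parent rows, using only rows inside the time window.

    `cutoff` and `window` stand in for the temporal parameter (an assumption):
    only child rows with start < time <= cutoff contribute to a feature.
    """
    start = cutoff - window
    rows = []
    for parent in parents:
        # Join on the primary variable and apply the temporal filter.
        values = [
            child[variable]
            for child in children
            if child[key] == parent[key] and start < child["time"] <= cutoff
        ]
        features = dict(parent)
        for name, fn in primitives.items():
            # No rows in the window: leave the feature undefined.
            features[f"{name}({variable})"] = fn(values) if values else None
        rows.append(features)
    return rows


features = synthesize_features(
    customers,
    transactions,
    key="customer_id",
    variable="amount",
    primitives=PRIMITIVES,
    cutoff=datetime(2022, 3, 1),
    window=timedelta(days=60),
)
for row in features:
    print(row)
```

In this sketch the transaction dated after the cutoff is excluded, so the features for a training example reflect only data that would have been available at prediction time.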