Inventors:
- San Jose CA, US
Kushal Kafle - Boston MA, US
Zhe Lin - Fremont CA, US
Zhihong Ding - Fremont CA, US
Scott Cohen - Sunnyvale CA, US
Quan Tran - San Jose CA, US
International Classification:
G06K 9/62
G06N 3/08
Abstract:
This disclosure describes one or more implementations of systems, non-transitory computer-readable media, and methods that extract multiple attributes from an object portrayed in a digital image utilizing a multi-attribute contrastive classification neural network. For example, the disclosed systems utilize a multi-attribute contrastive classification neural network that includes an embedding neural network, a localizer neural network, a multi-attention neural network, and a classifier neural network. In some cases, the disclosed systems train the multi-attribute contrastive classification neural network utilizing a multi-attribute, supervised-contrastive loss. In some embodiments, the disclosed systems generate negative attribute training labels for labeled digital images utilizing positive attribute labels that correspond to the labeled digital images.