Inventors:
- Suwon-si, KR
Joseph H. HASSOUN - Los Gatos CA, US
Ali SHAFIEE ARDESTANI - San Jose CA, US
Hamzah Ahmed Ali ABDELAZIZ - San Jose CA, US
Georgios GEORGIADIS - Porter Ranch CA, US
Hui CHEN - Irvine CA, US
David Philip Lloyd THORSLEY - Morgan Hill CA, US
International Classification:
G06F 17/18
G06N 3/08
G06N 3/04
Abstract:
A method of quantizing an artificial neural network may include dividing a quantization range for a tensor of the artificial neural network into a first region and a second region, and quantizing values of the tensor in the first region separately from values of the tensor in the second region. Linear or nonlinear quantization may be applied to values of the tensor in the first region and the second region. The method may include locating a breakpoint between the first region and the second region by substantially minimizing an expected quantization error over at least a portion of the quantization range. The expected quantization error may be minimized by solving for the breakpoint analytically and/or by searching for it numerically.
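The two-region scheme described in the abstract can be illustrated with a minimal sketch. It is not the claimed method; it only shows one way the idea could look under several assumptions that do not come from the abstract: the regions are symmetric about zero and split on magnitude, each region uses its own uniform (linear) grid, the expected quantization error is approximated by the empirical mean squared error over sample values, and the breakpoint is located by a plain grid search. All function names and parameters (`quantize_uniform`, `quantize_piecewise`, `search_breakpoint`, `levels`, `candidates`) are hypothetical.

```python
import random


def quantize_uniform(x, lo, hi, levels):
    """Clamp x to [lo, hi] and round it to the nearest of `levels`
    evenly spaced points (assumes levels >= 2 and hi > lo)."""
    step = (hi - lo) / (levels - 1)
    x = min(max(x, lo), hi)
    return lo + round((x - lo) / step) * step


def quantize_piecewise(x, p, m, levels):
    """Quantize |x| <= p on one uniform grid (first region) and
    p < |x| <= m on a separate uniform grid (second region).
    p is the breakpoint, m the edge of the quantization range."""
    mag = abs(x)
    sign = 1.0 if x >= 0 else -1.0
    if mag <= p:
        return sign * quantize_uniform(mag, 0.0, p, levels)
    return sign * quantize_uniform(mag, p, m, levels)


def mean_squared_error(values, p, m, levels):
    """Empirical stand-in for the expected quantization error."""
    total = sum((v - quantize_piecewise(v, p, m, levels)) ** 2 for v in values)
    return total / len(values)


def search_breakpoint(values, levels, candidates=50):
    """Numerical search: try evenly spaced breakpoints strictly inside
    (0, m) and keep the one with the smallest empirical error.
    Assumes at least one nonzero value, so that m > 0."""
    m = max(abs(v) for v in values)
    best_p, best_err = None, float("inf")
    for i in range(1, candidates):
        p = m * i / candidates
        err = mean_squared_error(values, p, m, levels)
        if err < best_err:
            best_p, best_err = p, err
    return best_p, best_err, m


if __name__ == "__main__":
    random.seed(0)
    # Bell-shaped sample values, used here as a stand-in for a weight tensor.
    weights = [random.gauss(0.0, 1.0) for _ in range(2000)]
    p, err, m = search_breakpoint(weights, levels=8)
    print(f"range edge m={m:.3f}, breakpoint p={p:.3f}, mse={err:.6f}")
```

For bell-shaped value distributions, a search of this kind would be expected to place the breakpoint well inside the range, so that the densely populated region near zero receives a finer grid than the sparse tails. An analytic solution, which the abstract also mentions, would replace the loop in `search_breakpoint` with a closed-form expression derived from an assumed distribution of the tensor values.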