US Patent:
20210109761, Apr 15, 2021
Inventors:
- Cupertino CA, US
Robert D. Kenney - Austin TX, US
Terence M. Potter - Austin TX, US
Vinod Reddy Nalamalapu - Austin TX, US
Sivayya V. Ayinala - Austin TX, US
International Classification:
G06F 9/38
G06F 9/30
G06F 17/16
Abstract:
Techniques are disclosed relating to sharing operands among SIMD threads for a larger arithmetic operation. In some embodiments, a set of multiple hardware pipelines is configured to execute single-instruction multiple-data (SIMD) instructions for multiple threads in parallel, where ones of the hardware pipelines include execution circuitry configured to perform floating-point operations using one or more pipeline stages of the pipeline and first routing circuitry configured to select, from among thread-specific operands stored for the hardware pipeline and from one or more other pipelines in the set, a first input operand for an operation by the execution circuitry. In some embodiments, a device is configured to perform a mathematical operation on source input data structures stored across thread-specific storage for the set of hardware pipelines, by executing multiple SIMD floating-point operations using the execution circuitry and the first routing circuitry. This may improve performance and reduce power consumption for matrix multiply and reduction operations, for example.