- Mountain View CA, US Norman Paul Jouppi - Palo Alto CA, US Andrew Everett Phelps - Middleton WI, US Reginald Clifford Young - Palo Alto CA, US Thomas Norrie - San Jose CA, US Gregory Michael Thorson - Waunakee WI, US Dan Luu - Madison WI, US
International Classification:
G06N 3/08 G06F 15/80 G06N 3/063 G06N 5/04
Abstract:
A circuit for performing neural network computations for a neural network comprising a plurality of neural network layers, the circuit comprising: a matrix computation unit configured to, for each of the plurality of neural network layers: receive a plurality of weight inputs and a plurality of activation inputs for the neural network layer, and generate a plurality of accumulated values based on the plurality of weight inputs and the plurality of activation inputs; and a vector computation unit communicatively coupled to the matrix computation unit and configured to, for each of the plurality of neural network layers: apply an activation function to each accumulated value generated by the matrix computation unit to generate a plurality of activated values for the neural network layer.
- Mountain View CA, US Reginald Clifford Young - Palo Alto CA, US
International Classification:
G06N 3/08 G06N 3/063 G06N 5/04
Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a respective neural network output for each of a plurality of inputs, the method comprising, for each of the neural network layers: receiving a plurality of inputs to be processed at the neural network layer; forming one or more batches of inputs from the plurality of inputs, each batch having a number of inputs up to the respective batch size for the neural network layer; selecting a number of the one or more batches of inputs to process, where a count of the inputs in the number of the one or more batches is greater than or equal to the respective associated batch size of a subsequent layer in the sequence; and processing the number of the one or more batches of inputs to generate the respective neural network layer output.
Depth Concatenation Using A Matrix Computation Unit
- Mountain View CA, US Reginald Clifford Young - Palo Alto CA, US
International Classification:
G06N 3/04 G06F 17/16 G06N 3/063
Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for depth concatenation using a matrix computation unit. One of the methods includes: receiving a request to process network inputs to a neural network using an integrated circuit, the neural network comprising a depth concatenation neural network layer; and generating instructions that, when executed by the integrated circuit, cause the integrated circuit to perform operations comprising: for each spatial location in a first input tensor to the depth concatenation layer and a second input tensor to the depth concatenation layer: multiplying, using the matrix computation unit, a second depth vector for the spatial location by a shift weight matrix for the depth concatenation layer to generate a shifted second depth vector; and adding the shifted second depth vector and a first input depth vector for the spatial location to generate a concatenated depth vector.
- Mountain View CA, US Norman Paul Jouppi - Palo Alto CA, US Andrew Everett Phelps - Middleton WI, US Reginald Clifford Young - Palo Alto CA, US Thomas Norrie - Mountain View CA, US Gregory Michael Thorson - Waunakee WI, US Dan Luu - Madison WI, US
International Classification:
G06N 3/08 G06F 15/80 G06N 3/063 G06N 5/04
Abstract:
A circuit for performing neural network computations for a neural network comprising a plurality of neural network layers, the circuit comprising: a matrix computation unit configured to, for each of the plurality of neural network layers: receive a plurality of weight inputs and a plurality of activation inputs for the neural network layer, and generate a plurality of accumulated values based on the plurality of weight inputs and the plurality of activation inputs; and a vector computation unit communicatively coupled to the matrix computation unit and configured to, for each of the plurality of neural network layers: apply an activation function to each accumulated value generated by the matrix computation unit to generate a plurality of activated values for the neural network layer.
- Mountain View CA, US Norman Paul Jouppi - Palo Alto CA, US Andrew Everett Phelps - Middleton WI, US Reginald Clifford Young - Palo Alto CA, US Thomas Norrie - Mountain View CA, US Gregory Michael Thorson - Waunakee WI, US Dan Luu - Madison WI, US
International Classification:
G06N 3/08 G06N 5/04 G06N 3/063 G06F 15/80
Abstract:
A circuit for performing neural network computations for a neural network comprising a plurality of neural network layers, the circuit comprising: a matrix computation unit configured to, for each of the plurality of neural network layers: receive a plurality of weight inputs and a plurality of activation inputs for the neural network layer, and generate a plurality of accumulated values based on the plurality of weight inputs and the plurality of activation inputs; and a vector computation unit communicatively coupled to the matrix computation unit and configured to, for each of the plurality of neural network layers: apply an activation function to each accumulated value generated by the matrix computation unit to generate a plurality of activated values for the neural network layer.
- Mountain View CA, US Norman Paul Jouppi - Palo Alto CA, US Andrew Everett Phelps - Middleton WI, US Reginald Clifford Young - Palo Alto CA, US Thomas Norrie - Mountain View CA, US Gregory Michael Thorson - Waunakee WI, US Dan Luu - Madison WI, US
International Classification:
G06N 3/08 G06N 5/04 G06N 3/063 G06F 15/80
Abstract:
A circuit for performing neural network computations for a neural network comprising a plurality of neural network layers, the circuit comprising: a matrix computation unit configured to, for each of the plurality of neural network layers: receive a plurality of weight inputs and a plurality of activation inputs for the neural network layer, and generate a plurality of accumulated values based on the plurality of weight inputs and the plurality of activation inputs; and a vector computation unit communicatively coupled to the matrix computation unit and configured to, for each of the plurality of neural network layers: apply an activation function to each accumulated value generated by the matrix computation unit to generate a plurality of activated values for the neural network layer.
Depth Concatenation Using A Matrix Computation Unit
- Mountain View CA, US Reginald Clifford Young - Palo Alto CA, US
International Classification:
G06N 3/04 G06N 3/063 G06F 17/16
Abstract:
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for depth concatenation using a matrix computation unit. One of the methods includes: receiving a request to process network inputs to a neural network using an integrated circuit, the neural network comprising a depth concatenation neural network layer; and generating instructions that, when executed by the integrated circuit, cause the integrated circuit to performing operations comprising: for each spatial location in a first input tensor to the depth concatenation layer and a second input tensor to the depth concatenation layer: multiplying, using the matrix computation unit, a second depth vector for the spatial location by a shift weight matrix for the depth concatenation layer to generate a shifted second depth vector; and adding the shifted second depth vector and a first input depth vector for the spatial location to generate a concatenated depth vector.
- Mountain View CA, US Norman Paul Jouppi - Palo Alto CA, US Andrew Everett Phelps - Middleton WI, US Reginald Clifford Young - Palo Alto CA, US Thomas Norrie - Mountain View CA, US Gregory Michael Thorson - Waunakee WI, US Dan Luu - Madison WI, US
International Classification:
G06N 3/08 G06N 5/04 G06F 15/80 G06N 3/063
Abstract:
A circuit for performing neural network computations for a neural network comprising a plurality of neural network layers, the circuit comprising: a matrix computation unit configured to, for each of the plurality of neural network layers: receive a plurality of weight inputs and a plurality of activation inputs for the neural network layer, and generate a plurality of accumulated values based on the plurality of weight inputs and the plurality of activation inputs; and a vector computation unit communicatively coupled to the matrix computation unit and configured to, for each of the plurality of neural network layers: apply an activation function to each accumulated value generated by the matrix computation unit to generate a plurality of activated values for the neural network layer.
Alt. Dispute Resolution Appellate Practice Civil Litigation Criminal Law Domestic and Family Law Juvenile Law Nonprofit Workers' Compensation Guardianships/Conservatorships Government/Administrative Law Alt. Dispute Resolution Criminal Law Nonprofit