MPSCNNFullyConnected(3)
NAME
MPSCNNFullyConnected
SYNOPSIS
#import <MPSCNNConvolution.h> Inherits MPSCNNConvolution. Instance Methods (nonnull instancetype) - initWithDevice:convolutionDescriptor:kernelWeights:biasTerms:flags: (nonnull instancetype) - initWithDevice:weights: (nullable instancetype) - initWithCoder:device: (nonnull instancetype) - initWithDevice: Additional Inherited Members
Detailed Description
This depends on Metal.framework The MPSCNNFullyConnected specifies a fully connected convolution layer a.k.a. Inner product layer. A fully connected CNN layer is one where every input channel is connected to every output channel. The kernel width is equal to width of source image and the kernel height is equal to the height of source image. Width and height of the output is 1x1. Thus, it takes a srcW x srcH x Ni MPSCNNImage, convolves it with Weights[No][SrcW][srcH][Ni] and produces a 1 x 1 x No output. The following must be true: kernelWidth == source.width kernelHeight == source.height clipRect.size.width == 1 clipRect.size.height == 1 One can think of a fully connected layer as a matrix multiplication that flattens an image into a vector of length srcW*srcH*Ni. The weights are arragned in a matrix of dimension No x (srcW*srcH*Ni) for product output vectors of length No. The strideInPixelsX, strideInPixelsY, and group must be 1. Offset is not applicable and is ignored. Since clipRect is clamped to the destination image bounds, if the destination is 1x1, one doesn't need to set the clipRect. Note that one can implement an inner product using MPSCNNConvolution by setting offset = (kernelWidth/2,kernelHeight/2) clipRect.origin = (ox,oy), clipRect.size = (1,1) strideX = strideY = group = 1 However, using the MPSCNNFullyConnected for this is better for performance as it lets us choose the most performant method which may not be possible when using a general convolution. For example, we may internally use matrix multiplication or special reduction kernels for a specific platform.
Method Documentation
- (nullable instancetype) initWithCoder: (NSCoder *__nonnull) aDecoder(nonnull id< MTLDevice >) device NSSecureCoding compatability While the standard NSSecureCoding/NSCoding method -initWithCoder: should work, since the file can't know which device your data is allocated on, we have to guess and may guess incorrectly. To avoid that problem, use initWithCoder:device instead. Parameters: aDecoder The NSCoder subclass with your serialized MPSKernel device The MTLDevice on which to make the MPSKernel Returns: A new MPSKernel object, or nil if failure. Reimplemented from MPSCNNConvolution. - (nonnull instancetype) initWithDevice: (nonnull id< MTLDevice >) device Standard init with default properties per filter type Parameters: device The device that the filter will be used on. May not be NULL. Returns: A pointer to the newly initialized object. This will fail, returning nil if the device is not supported. Devices must be MTLFeatureSet_iOS_GPUFamily2_v1 or later. Reimplemented from MPSCNNConvolution. - (nonnull instancetype) initWithDevice: (nonnull id< MTLDevice >) device(const MPSCNNConvolutionDescriptor *__nonnull) fullyConnectedDescriptor(const float *__nonnull) kernelWeights(const float *__nullable) biasTerms(MPSCNNConvolutionFlags) flags Initializes a fully connected kernel. Parameters: device The MTLDevice on which this MPSCNNFullyConnected filter will be used fullyConnectedDescriptor A pointer to a MPSCNNConvolutionDescriptor. strideInPixelsX, strideInPixelsY and group properties of fullyConnectedDescriptor must be set to 1 (default). kernelWeights A pointer to a weights array. Each entry is a float value. The number of entries is = inputFeatureChannels * outputFeatureChannels * kernelHeight * kernelWidth The layout of filter weight is so that it can be reinterpreted as 4D tensor (array) weight[ outputChannels ][ kernelHeight ][ kernelWidth ][ inputChannels / groups ] Weights are converted to half float (fp16) internally for best performance. biasTerms A pointer to bias terms to be applied to the convolution output. Each entry is a float value. The number of entries is = numberOfOutputFeatureMaps flags Currently unused. Pass MPSCNNConvolutionFlagsNone Returns: A valid MPSCNNConvolution object or nil, if failure. Reimplemented from MPSCNNConvolution. - (nonnull instancetype) initWithDevice: (nonnull id< MTLDevice >) device(nonnull id< MPSCNNConvolutionDataSource >) weights Initializes a fully connected kernel Parameters: device The MTLDevice on which this MPSCNNFullyConnected filter will be used weights A pointer to a object that conforms to the MPSCNNConvolutionDataSource protocol. The MPSCNNConvolutionDataSource protocol declares the methods that an instance of MPSCNNFullyConnected uses to obtain the weights and bias terms for the CNN fully connected filter. Returns: A valid MPSCNNFullyConnected object or nil, if failure. Reimplemented from MPSCNNConvolution.
Author
Generated automatically by Doxygen for MetalPerformanceShaders.framework from the source code. Version MetalPerformanceShaders-Thu2Jul 13 2017 MPSCNNFullyConnected(3)
Mac OS X 10.13.1 - Generated Mon Nov 6 16:26:24 CST 2017