manpagez: man pages & more
man MPSCNNBinaryConvolution(3)
Home | html | info | man
MPSCNNBinaryConvolution(3)




NAME

       MPSCNNBinaryConvolution


SYNOPSIS

       #import <MPSCNNConvolution.h>

       Inherits MPSCNNKernel.

       Inherited by MPSCNNBinaryFullyConnected.

   Instance Methods
       (nonnull instancetype) -
           initWithDevice:convolutionData:scaleValue:type:flags:
       (nonnull instancetype) -
           initWithDevice:convolutionData:outputBiasTerms:outputScaleTerms:inputBiasTerms:inputScaleTerms:type:flags:
       (nullable instancetype) - initWithCoder:device:
       (nonnull instancetype) - initWithDevice:

   Properties
       NSUInteger inputFeatureChannels
       NSUInteger outputFeatureChannels

   Additional Inherited Members

Detailed Description

       This depends on Metal.framework  The MPSCNNBinaryConvolution specifies
       a convolution with binary weights and an input image using binary
       approximations. The MPSCNNBinaryConvolution optionally first binarizes
       the input image and then convolves the result with a set of binary-
       valued filters, each producing one feature map in the output image
       (which is a normal image)

       The output is computed as follows:

       out[i, x, y, c] = ( sum_{dx,dy,f} in[i,x+dx, y+dy, f] x B[c,dx,dy,f] )
                           * scale[c] * beta[i,x,y] + bias[c], where



       the sum over dx,dy is over the spatial filter kernel window defined by
       'kernelWidth' and 'KernelHeight', sum over 'f' is over the input
       feature channel indices within group, 'B' contains the binary weights,
       interpreted as {-1,1} or { 0, 1 } and scale[c] is the
       'outputScaleTerms' array and bias is the 'outputBiasTerms' array. Above
       'i' is the image index in batch the sum over input channels 'f' runs
       through the group indices.

       The convolution operator 'x' is defined by MPSCNNBinaryConvolutionType
       passed in at initialization time of the filter (

       See also:
           initWithDevice). In case 'type' =
           MPSCNNBinaryConvolutionTypeBinaryWeights, the input image is not
           binarized at all and the convolution is computed interpreting the
           weights as [ 0, 1 ] -> { -1, 1 } with the given scaling terms. In
           case 'type' = MPSCNNBinaryConvolutionTypeXNOR the convolution is
           computed by first binarizing the input image using the sign
           function 'bin(x) = x < 0 ? -1 : 1' and the convolution
           multiplication is done with the XNOR-operator !(x ^ y) = delta_xy =
           { (x==y) ? 1 : 0 }, and scaled according to the optional scaling
           operations. Note that we output the values of the bitwise
           convolutions to interval { -1, 1 }, which means that the output of
           the XNOR-operator is scaled implicitly as follows: r = 2 * ( !(x ^
           y) ) - 1 = { -1, 1 }. This means that for a dot-product of two
           32-bit words the result is: r = 2 * popcount(!(x ^ y) ) - 32 = 32 -
           2 * popcount( x ^ y ) = { -32, -30, ..., 30, 32 }. In case 'type' =
           MPSCNNBinaryConvolutionTypeAND the convolution is computed by first
           binarizing the input image using the sign function 'bin(x) = x < 0
           ? -1 : 1' and the convolution multiplication is done with the AND-
           operator (x & y) = delta_xy * delta_x1 = { (x==y==1) ? 1 : 0 }. and
           scaled according to the optional scaling operations. Note that we
           output the values of the AND-operation is assumed to lie in { 0, 1
           } interval and hence no more implicit scaling takes place. This
           means that for a dot-product of two 32-bit words the result is: r =
           popcount(x & y) = { 0, ..., 31, 32 }.

       The input data can be pre-offset and scaled by providing the
       'inputBiasTerms' and 'inputScaleTerms' parameters for the
       initialization functions and this can be used for example to accomplish
       batch normalization of the data. The scaling of input values happens
       before possible beta-image computation.

       The parameter 'beta' above is an optional image which is used to
       compute scaling factors for each spatial position and image index. For
       the XNOR-Net based networks this is computed as follows: beta[i,x,y] =
       sum_{dx,dy} A[i, x+dx, y+dy] / (kx * ky), where (dx,dy) are summed over
       the convolution filter window [ -kx/2, (kx-1)/2], [ -ky/2, (ky-1)/2 ]
       and A[i,x,y] = sum_{c} abs( in[i,x,y,c] ) / Nc, where 'in' is the
       original input image (in full precision) and Nc is the number of input
       channels in the input image. Parameter 'beta' is not passed as input
       and to enable beta-scaling the user can provide
       'MPSCNNBinaryConvolutionFlagsUseBetaScaling' in the flags parameter in
       the initialization functions.

       Finally the normal activation neuron is applied and the result is
       written to the output image.

       NOTE: MPSCNNBinaryConvolution does not currently support groups > 1.


Method Documentation

   - (nullable instancetype) initWithCoder: (NSCoder *__nonnull)
       aDecoder(nonnull id< MTLDevice >) device
       NSSecureCoding compatability  While the standard
       NSSecureCoding/NSCoding method -initWithCoder: should work, since the
       file can't know which device your data is allocated on, we have to
       guess and may guess incorrectly. To avoid that problem, use
       initWithCoder:device instead.

       Parameters:
           aDecoder The NSCoder subclass with your serialized MPSKernel
           device The MTLDevice on which to make the MPSKernel

       Returns:
           A new MPSKernel object, or nil if failure.



       Reimplemented from MPSCNNKernel.

       Reimplemented in MPSCNNBinaryFullyConnected.

   - (nonnull instancetype) initWithDevice: (nonnull id< MTLDevice >) device
       Standard init with default properties per filter type

       Parameters:
           device The device that the filter will be used on. May not be NULL.

       Returns:
           A pointer to the newly initialized object. This will fail,
           returning nil if the device is not supported. Devices must be
           MTLFeatureSet_iOS_GPUFamily2_v1 or later.



       Reimplemented from MPSCNNKernel.

       Reimplemented in MPSCNNBinaryFullyConnected.

   - (nonnull instancetype) initWithDevice: (nonnull id< MTLDevice >)
       device(nonnull id< MPSCNNConvolutionDataSource >) convolutionData(const
       float *__nullable) outputBiasTerms(const float *__nullable)
       outputScaleTerms(const float *__nullable) inputBiasTerms(const float
       *__nullable) inputScaleTerms(MPSCNNBinaryConvolutionType)
       type(MPSCNNBinaryConvolutionFlags) flags
       Initializes a binary convolution kernel with binary weights as well as
       both pre and post scaling terms.

       Parameters:
           device The MTLDevice on which this MPSCNNBinaryConvolution filter
           will be used
           convolutionData A pointer to a object that conforms to the
           MPSCNNConvolutionDataSource protocol. The
           MPSCNNConvolutionDataSource protocol declares the methods that an
           instance of MPSCNNBinaryConvolution uses to obtain the weights and
           the convolution descriptor. Each entry in the
           convolutionData:weights array is a 32-bit unsigned integer value
           and each bit represents one filter weight (given in machine byte
           order). The featurechannel indices increase from the least
           significant bit within the 32-bits. The number of entries is =
           ceil( inputFeatureChannels/32.0 ) * outputFeatureChannels *
           kernelHeight * kernelWidth The layout of filter weight is so that
           it can be reinterpreted as a 4D tensor (array) weight[
           outputChannels ][ kernelHeight ][ kernelWidth ][ ceil(
           inputChannels / 32.0 ) ] (The ordering of the reduction from 4D
           tensor to 1D is per C convention. The index based on inputchannels
           varies most rapidly, followed by kernelWidth, then kernelHeight and
           finally outputChannels varies least rapidly.)
           outputBiasTerms A pointer to bias terms to be applied to the
           convolution output. Each entry is a float value. The number of
           entries is = numberOfOutputFeatureMaps. If nil then 0.0 is used for
           bias. The values stored in the pointer are copied in and the array
           can be freed after this function returns.
           outputScaleTerms A pointer to scale terms to be applied to binary
           convolution results per output feature channel. Each entry is a
           float value. The number of entries is = numberOfOutputFeatureMaps.
           If nil then 1.0 is used. The values stored in the pointer are
           copied in and the array can be freed after this function returns.
           inputBiasTerms A pointer to offset terms to be applied to the input
           before convolution and before input scaling. Each entry is a float
           value. The number of entries is 'inputFeatureChannels'. If NULL
           then 0.0 is used for bias. The values stored in the pointer are
           copied in and the array can be freed after this function returns.
           inputScaleTerms A pointer to scale terms to be applied to the input
           before convolution, but after input biasing. Each entry is a float
           value. The number of entries is 'inputFeatureChannels'. If nil then
           1.0 is used. The values stored in the pointer are copied in and the
           array can be freed after this function returns.
           type What kind of binarization strategy is to be used.
           flags See documentation above and documentation of
           MPSCNNBinaryConvolutionFlags.

       Returns:
           A valid MPSCNNBinaryConvolution object or nil, if failure.



       Reimplemented in MPSCNNBinaryFullyConnected.

   - (nonnull instancetype) initWithDevice: (nonnull id< MTLDevice >)
       device(nonnull id< MPSCNNConvolutionDataSource >)
       convolutionData(float) scaleValue(MPSCNNBinaryConvolutionType)
       type(MPSCNNBinaryConvolutionFlags) flags
       Initializes a binary convolution kernel with binary weights and a
       single scaling term.

       Parameters:
           device The MTLDevice on which this MPSCNNBinaryConvolution filter
           will be used
           convolutionData A pointer to a object that conforms to the
           MPSCNNConvolutionDataSource protocol. The
           MPSCNNConvolutionDataSource protocol declares the methods that an
           instance of MPSCNNBinaryConvolution uses to obtain the weights and
           bias terms as well as the convolution descriptor. Each entry in the
           convolutionData:weights array is a 32-bit unsigned integer value
           and each bit represents one filter weight (given in machine byte
           order). The featurechannel indices increase from the least
           significant bit within the 32-bits. The number of entries is =
           ceil( inputFeatureChannels/32.0 ) * outputFeatureChannels *
           kernelHeight * kernelWidth The layout of filter weight is so that
           it can be reinterpreted as a 4D tensor (array) weight[
           outputChannels ][ kernelHeight ][ kernelWidth ][ ceil(
           inputChannels / 32.0 ) ] (The ordering of the reduction from 4D
           tensor to 1D is per C convention. The index based on inputchannels
           varies most rapidly, followed by kernelWidth, then kernelHeight and
           finally outputChannels varies least rapidly.)
           scaleValue A floating point value used to scale the entire
           convolution.
           type What kind of binarization strategy is to be used.
           flags See documentation above and documentation of
           MPSCNNBinaryConvolutionFlags.

       Returns:
           A valid MPSCNNBinaryConvolution object or nil, if failure.



       Reimplemented in MPSCNNBinaryFullyConnected.


Property Documentation

   - (NSUInteger) inputFeatureChannels [read],  [nonatomic],  [assign]
   - outputFeatureChannels [read],  [nonatomic],  [assign]
       The number of feature channels per pixel in the output image.



Author

       Generated automatically by Doxygen for
       MetalPerformanceShaders.framework from the source code.





Version MetalPerformanceShaders-Thu2Jul 13 2017     MPSCNNBinaryConvolution(3)


Mac OS X 10.12.6 - Generated Sun Oct 29 14:49:38 CDT 2017
© manpagez.com 2000-2025
Individual documents may contain additional copyright information.