A descriptor of a Layer Normalization operation.
Source and destination memory descriptor.
Source and destination gradient memory descriptor.
Scale and shift data and gradient memory descriptors.
Scaleshift memory descriptor uses 2D dnnl_ab
format[2, normalized_dim] where 1-st dimension contains gamma parameter, 2-nd dimension contains beta parameter. Normalized_dim is equal to the last logical dimension of the data tensor across which normalization is performed.
Mean and variance data memory descriptors.
Statistics (mean and variance) memory descriptor is the k-dimensional tensor where k is equal to data_tensor_ndims - 1 and may have any plain (stride[last_dim] == 1) user-provided format.
Layer normalization epsilon parameter.