Implicit CPU Vectorization
The implicit CPU vectorization aims to merge together the execution
of several work-items using the Intel® vector instruction set and extends
the use of the vector unit when moving from one generation to another.