Visible to Intel only — GUID: GUID-8DB25031-C4EC-4E60-BF28-8A6AF01E05E7
Visible to Intel only — GUID: GUID-8DB25031-C4EC-4E60-BF28-8A6AF01E05E7
Intrinsics for Integer Expand and Load Operations
The prototypes for Intel® Advanced Vector Extensions 512 (Intel® AVX-512) intrinsics are located in the zmmintrin.h header file.
To use these intrinsics, include the immintrin.h file as follows:
#include <immintrin.h>
Intrinsic Name |
Operation |
Corresponding |
---|---|---|
_mm512_mask_expandloadu_epi32, _mm512_maskz_expandloadu_epi32 _mm512_mask_expand_epi32, _mm512_maskz_expand_epi32 |
Load packed int32 values from dense memory or register. |
VPEXPANDD |
_mm512_mask_expandloadu_epi64, _mm512_maskz_expandloadu_epi64 _mm512_mask_expand_epi64, _mm512_maskz_expand_epi64 |
Load packed int64 values from dense memory or register. |
VPEXPANDQ |
variable | definition |
---|---|
k | writemask used as a selector |
a | first source vector element |
src | source element to use based on writemask result |
mem_addr | pointer to base address in memory |
_mm512_mask_expand_epi32
extern __m512i __cdecl _mm512_mask_expand_epi32(__m512i src, __mmask16 k, __m512i a);
Loads contiguous active int32 elements from a (those with their respective bit set in mask k), and stores the result using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_maskz_expand_epi32
extern __m512i __cdecl _mm512_maskz_expand_epi32(__mmask16 k, __m512i a);
Loads contiguous active int32 elements from a (those with their respective bit set in mask k), and stores the result using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_expandloadu_epi32
extern __m512i __cdecl _mm512_mask_expandloadu_epi32(__m512i src, __mmask16 k, void * mem_addr);
Loads contiguous active int32 elements from unaligned memory at mem_addr (those with their respective bit set in mask k), and stores the result using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_maskz_expandloadu_epi32
extern __m512i __cdecl _mm512_maskz_expandloadu_epi32( __mmask16 k, void * mem_addr);
Loads contiguous active int32 elements from unaligned memory at mem_addr (those with their respective bit set in mask k), and stores the result using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_expandloadu_epi64
extern __m512i __cdecl _mm512_mask_expandloadu_epi64(__m512i src, __mmask8 k, void * mem_addr);
Loads contiguous active int64 elements from unaligned memory at mem_addr (those with their respective bit set in mask k), and stores the result using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_maskz_expandloadu_epi64
extern __m512i __cdecl _mm512_maskz_expandloadu_epi64(__mmask8 k, void * mem_addr);
Loads contiguous active int64 elements from unaligned memory at mem_addr (those with their respective bit set in mask k), and stores the result using zeromask k (elements are zeroed out when the corresponding mask bit is not set).
_mm512_mask_expand_epi64
extern __m512i __cdecl _mm512_mask_expand_epi64(__m512i src, __mmask8 k, __m512i a);
Loads contiguous active int64 elements from a (those with their respective bit set in mask k), and stores the result using writemask k (elements are copied from src when the corresponding mask bit is not set).
_mm512_maskz_expand_epi64
extern __m512i __cdecl _mm512_maskz_expand_epi64(__mmask8 k, __m512i a);
Loads contiguous active int64 elements from a (those with their respective bit set in mask k), and stores the result using zeromask k (elements are zeroed out when the corresponding mask bit is not set).