Intel® C++ Compiler Classic Developer Guide and Reference

ID 767249
Date 12/16/2022
Public

A newer version of this document is available. Customers should click here to go to the newest version.

Document Table of Contents

Intrinsics for Conversion Operations

The prototypes for Intel® Advanced Vector Extensions 512 (Intel® AVX-512) intrinsics are located in the zmmintrin.h header file.

To use these intrinsics, include the immintrin.h file as follows:

#include <immintrin.h>


variable definition
src

source element to use based on writemask result

k

writemask used as a selector

a

first source vector element

rounding

Rounding control values; these can be one of the following (along with the sae suppress all exceptions flag):

  • _MM_FROUND_TO_NEAREST_INT - rounds to nearest even
  • _MM_FROUND_TO_NEG_INF - rounds to negative infinity
  • _MM_FROUND_TO_POS_INF - rounds to positive infinity
  • _MM_FROUND_TO_ZERO - rounds to zero
  • _MM_FROUND_CUR_DIRECTION - rounds using default from MXCSR register


_mm_mask_cvtpd_ps

__m128 _mm_mask_cvtpd_ps(__m128 src, __mmask8 k, __m128d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2ps

Convert packed double-precision (64-bit) floating-point elements in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtpd_ps

__m128 _mm_maskz_cvtpd_ps(__mmask8 k, __m128d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2ps

Convert packed double-precision (64-bit) floating-point elements in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtpd_ps

__m128 _mm256_mask_cvtpd_ps(__m128 src, __mmask8 k, __m256d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2ps

Convert packed double-precision (64-bit) floating-point elements in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtpd_ps

__m128 _mm256_maskz_cvtpd_ps(__mmask8 k, __m256d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2ps

Convert packed double-precision (64-bit) floating-point elements in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtph_ps

__m128 _mm_mask_cvtph_ps(__m128 src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtph2ps

Convert packed half-precision (16-bit) floating-point elements in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtph_ps

__m128 _mm_maskz_cvtph_ps(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtph2ps

Convert packed half-precision (16-bit) floating-point elements in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtph_ps

__m256 _mm256_mask_cvtph_ps(__m256 src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtph2ps

Convert packed half-precision (16-bit) floating-point elements in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtph_ps

__m256 _mm256_maskz_cvtph_ps(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtph2ps

Convert packed half-precision (16-bit) floating-point elements in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvt_roundps_ph

__m128i _mm_mask_cvt_roundps_ph(__m128i src, __mmask8 k, __m128 a, int rounding)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2ph

Convert packed single-precision (32-bit) floating-point elements in a to packed half-precision (16-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_mask_cvtps_ph

__m128i _mm_mask_cvtps_ph(__m128i src, __mmask8 k, __m128 a, int rounding)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2ph

Convert packed single-precision (32-bit) floating-point elements in a to packed half-precision (16-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvt_roundps_ph

__m128i _mm_maskz_cvt_roundps_ph(__mmask8 k, __m128 a, int rounding)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2ph

Convert packed single-precision (32-bit) floating-point elements in a to packed half-precision (16-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_maskz_cvtps_ph

__m128i _mm_maskz_cvtps_ph(__mmask8 k, __m128 a, int rounding)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2ph

Convert packed single-precision (32-bit) floating-point elements in a to packed half-precision (16-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvt_roundps_ph

__m128i _mm256_mask_cvt_roundps_ph(__m128i src, __mmask8 k, __m256 a, int rounding)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2ph

Convert packed single-precision (32-bit) floating-point elements in a to packed half-precision (16-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_mask_cvtps_ph

__m128i _mm256_mask_cvtps_ph(__m128i src, __mmask8 k, __m256 a, int rounding)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2ph

Convert packed single-precision (32-bit) floating-point elements in a to packed half-precision (16-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvt_roundps_ph

__m128i _mm256_maskz_cvt_roundps_ph(__mmask8 k, __m256 a, int rounding)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2ph

Convert packed single-precision (32-bit) floating-point elements in a to packed half-precision (16-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_maskz_cvtps_ph

__m128i _mm256_maskz_cvtps_ph(__mmask8 k, __m256 a, int rounding)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2ph

Convert packed single-precision (32-bit) floating-point elements in a to packed half-precision (16-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepi32_pd

__m128d _mm_mask_cvtepi32_pd(__m128d src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtdq2pd

Convert packed 32-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi32_pd

__m128d _mm_maskz_cvtepi32_pd(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtdq2pd

Convert packed 32-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepi32_pd

__m256d _mm256_mask_cvtepi32_pd(__m256d src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtdq2pd

Convert packed 32-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi32_pd

__m256d _mm256_maskz_cvtepi32_pd(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtdq2pd

Convert packed 32-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepi32_ps

__m128 _mm_mask_cvtepi32_ps(__m128 src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtdq2ps

Convert packed 32-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi32_ps

__m128 _mm_maskz_cvtepi32_ps(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtdq2ps

Convert packed 32-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepi32_ps

__m256 _mm256_mask_cvtepi32_ps(__m256 src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtdq2ps

Convert packed 32-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi32_ps

__m256 _mm256_maskz_cvtepi32_ps(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtdq2ps

Convert packed 32-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtpd_epi32

__m128i _mm_mask_cvtpd_epi32(__m128i src, __mmask8 k, __m128d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2dq

Convert packed double-precision (64-bit) floating-point elements in a to packed 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtpd_epi32

__m128i _mm_maskz_cvtpd_epi32(__mmask8 k, __m128d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2dq

Convert packed double-precision (64-bit) floating-point elements in a to packed 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtpd_epi32

__m128i _mm256_mask_cvtpd_epi32(__m128i src, __mmask8 k, __m256d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2dq

Convert packed double-precision (64-bit) floating-point elements in a to packed 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtpd_epi32

__m128i _mm256_maskz_cvtpd_epi32(__mmask8 k, __m256d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2dq

Convert packed double-precision (64-bit) floating-point elements in a to packed 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtpd_epi64

__m128i _mm_cvtpd_epi64(__m128d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers, and return the results.



_mm_mask_cvtpd_epi64

__m128i _mm_mask_cvtpd_epi64(__m128i src, __mmask8 k, __m128d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtpd_epi64

__m128i _mm_maskz_cvtpd_epi64(__mmask8 k, __m128d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtpd_epi64

__m256i _mm256_cvtpd_epi64(__m256d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers, and return the results.



_mm256_mask_cvtpd_epi64

__m256i _mm256_mask_cvtpd_epi64(__m256i src, __mmask8 k, __m256d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtpd_epi64

__m256i _mm256_maskz_cvtpd_epi64(__mmask8 k, __m256d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvt_roundpd_epi64

__m512i _mm512_cvt_roundpd_epi64(__m512d a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers, and return the results.



_mm512_cvtpd_epi64

__m512i _mm512_cvtpd_epi64(__m512d a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers, and return the results.



_mm512_mask_cvt_roundpd_epi64

__m512i _mm512_mask_cvt_roundpd_epi64(__m512i src, __mmask8 k, __m512d a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_mask_cvtpd_epi64

__m512i _mm512_mask_cvtpd_epi64(__m512i src, __mmask8 k, __m512d a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvt_roundpd_epi64

__m512i _mm512_maskz_cvt_roundpd_epi64(__mmask8 k, __m512d a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_maskz_cvtpd_epi64

__m512i _mm512_maskz_cvtpd_epi64(__mmask8 k, __m512d a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtpd_epu32

__m128i _mm_cvtpd_epu32(__m128d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2udq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 32-bit integers, and return the results.



_mm_mask_cvtpd_epu32

__m128i _mm_mask_cvtpd_epu32(__m128i src, __mmask8 k, __m128d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2udq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtpd_epu32

__m128i _mm_maskz_cvtpd_epu32(__mmask8 k, __m128d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2udq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtpd_epu32

__m128i _mm256_cvtpd_epu32(__m256d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2udq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 32-bit integers, and return the results.



_mm256_mask_cvtpd_epu32

__m128i _mm256_mask_cvtpd_epu32(__m128i src, __mmask8 k, __m256d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2udq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtpd_epu32

__m128i _mm256_maskz_cvtpd_epu32(__mmask8 k, __m256d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtpd2udq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtpd_epu64

__m128i _mm_cvtpd_epu64(__m128d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results.



_mm_mask_cvtpd_epu64

__m128i _mm_mask_cvtpd_epu64(__m128i src, __mmask8 k, __m128d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtpd_epu64

__m128i _mm_maskz_cvtpd_epu64(__mmask8 k, __m128d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtpd_epu64

__m256i _mm256_cvtpd_epu64(__m256d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results.



_mm256_mask_cvtpd_epu64

__m256i _mm256_mask_cvtpd_epu64(__m256i src, __mmask8 k, __m256d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtpd_epu64

__m256i _mm256_maskz_cvtpd_epu64(__mmask8 k, __m256d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvt_roundpd_epu64

__m512i _mm512_cvt_roundpd_epu64(__m512d a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results.



_mm512_cvtpd_epu64

__m512i _mm512_cvtpd_epu64(__m512d a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results.



_mm512_mask_cvt_roundpd_epu64

__m512i _mm512_mask_cvt_roundpd_epu64(__m512i src, __mmask8 k, __m512d a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_mask_cvtpd_epu64

__m512i _mm512_mask_cvtpd_epu64(__m512i src, __mmask8 k, __m512d a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvt_roundpd_epu64

__m512i _mm512_maskz_cvt_roundpd_epu64(__mmask8 k, __m512d a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_maskz_cvtpd_epu64

__m512i _mm512_maskz_cvtpd_epu64(__mmask8 k, __m512d a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtps_epi32

__m128i _mm_mask_cvtps_epi32(__m128i src, __mmask8 k, __m128 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2dq

Convert packed single-precision (32-bit) floating-point elements in a to packed 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtps_epi32

__m128i _mm_maskz_cvtps_epi32(__mmask8 k, __m128 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2dq

Convert packed single-precision (32-bit) floating-point elements in a to packed 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtps_epi32

__m256i _mm256_mask_cvtps_epi32(__m256i src, __mmask8 k, __m256 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2dq

Convert packed single-precision (32-bit) floating-point elements in a to packed 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtps_epi32

__m256i _mm256_maskz_cvtps_epi32(__mmask8 k, __m256 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2dq

Convert packed single-precision (32-bit) floating-point elements in a to packed 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtps_epi64

__m128i _mm_cvtps_epi64(__m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers, and return the results.



_mm_mask_cvtps_epi64

__m128i _mm_mask_cvtps_epi64(__m128i src, __mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtps_epi64

__m128i _mm_maskz_cvtps_epi64(__mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtps_epi64

__m256i _mm256_cvtps_epi64(__m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers, and return the results.



_mm256_mask_cvtps_epi64

__m256i _mm256_mask_cvtps_epi64(__m256i src, __mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtps_epi64

__m256i _mm256_maskz_cvtps_epi64(__mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvt_roundps_epi64

__m512i _mm512_cvt_roundps_epi64(__m256 a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers, and return the results.



_mm512_cvtps_epi64

__m512i _mm512_cvtps_epi64(__m256 a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers, and return the results.



_mm512_mask_cvt_roundps_epi64

__m512i _mm512_mask_cvt_roundps_epi64(__m512i src, __mmask8 k, __m256 a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_mask_cvtps_epi64

__m512i _mm512_mask_cvtps_epi64(__m512i src, __mmask8 k, __m256 a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvt_roundps_epi64

__m512i _mm512_maskz_cvt_roundps_epi64(__mmask8 k, __m256 a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_maskz_cvtps_epi64

__m512i _mm512_maskz_cvtps_epi64(__mmask8 k, __m256 a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtps_epu32

__m128i _mm_cvtps_epu32(__m128 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2udq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 32-bit integers, and return the results.



_mm_mask_cvtps_epu32

__m128i _mm_mask_cvtps_epu32(__m128i src, __mmask8 k, __m128 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2udq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtps_epu32

__m128i _mm_maskz_cvtps_epu32(__mmask8 k, __m128 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2udq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtps_epu32

__m256i _mm256_cvtps_epu32(__m256 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2udq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 32-bit integers, and return the results.



_mm256_mask_cvtps_epu32

__m256i _mm256_mask_cvtps_epu32(__m256i src, __mmask8 k, __m256 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2udq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtps_epu32

__m256i _mm256_maskz_cvtps_epu32(__mmask8 k, __m256 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtps2udq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtps_epu64

__m128i _mm_cvtps_epu64(__m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results.



_mm_mask_cvtps_epu64

__m128i _mm_mask_cvtps_epu64(__m128i src, __mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtps_epu64

__m128i _mm_maskz_cvtps_epu64(__mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtps_epu64

__m256i _mm256_cvtps_epu64(__m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results.



_mm256_mask_cvtps_epu64

__m256i _mm256_mask_cvtps_epu64(__m256i src, __mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtps_epu64

__m256i _mm256_maskz_cvtps_epu64(__mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvt_roundps_epu64

__m512i _mm512_cvt_roundps_epu64(__m256 a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results.



_mm512_cvtps_epu64

__m512i _mm512_cvtps_epu64(__m256 a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results.



_mm512_mask_cvt_roundps_epu64

__m512i _mm512_mask_cvt_roundps_epu64(__m512i src, __mmask8 k, __m256 a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_mask_cvtps_epu64

__m512i _mm512_mask_cvtps_epu64(__m512i src, __mmask8 k, __m256 a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvt_roundps_epu64

__m512i _mm512_maskz_cvt_roundps_epu64(__mmask8 k, __m256 a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_maskz_cvtps_epu64

__m512i _mm512_maskz_cvtps_epu64(__mmask8 k, __m256 a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtepi64_pd

__m128d _mm_cvtepi64_pd(__m128i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtqq2pd

Convert packed 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results.



_mm_mask_cvtepi64_pd

__m128d _mm_mask_cvtepi64_pd(__m128d src, __mmask8 k, __m128i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtqq2pd

Convert packed 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi64_pd

__m128d _mm_maskz_cvtepi64_pd(__mmask8 k, __m128i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtqq2pd

Convert packed 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtepi64_pd

__m256d _mm256_cvtepi64_pd(__m256i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtqq2pd

Convert packed 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results.



_mm256_mask_cvtepi64_pd

__m256d _mm256_mask_cvtepi64_pd(__m256d src, __mmask8 k, __m256i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtqq2pd

Convert packed 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi64_pd

__m256d _mm256_maskz_cvtepi64_pd(__mmask8 k, __m256i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtqq2pd

Convert packed 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvt_roundepi64_pd

__m512d _mm512_cvt_roundepi64_pd(__m512i a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtqq2pd

Convert packed 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results.



_mm512_cvtepi64_pd

__m512d _mm512_cvtepi64_pd(__m512i a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtqq2pd

Convert packed 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results.



_mm512_mask_cvt_roundepi64_pd

__m512d _mm512_mask_cvt_roundepi64_pd(__m512d src, __mmask8 k, __m512i a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtqq2pd

Convert packed 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_mask_cvtepi64_pd

__m512d _mm512_mask_cvtepi64_pd(__m512d src, __mmask8 k, __m512i a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtqq2pd

Convert packed 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvt_roundepi64_pd

__m512d _mm512_maskz_cvt_roundepi64_pd(__mmask8 k, __m512i a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtqq2pd

Convert packed 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_maskz_cvtepi64_pd

__m512d _mm512_maskz_cvtepi64_pd(__mmask8 k, __m512i a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtqq2pd

Convert packed 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtepi64_ps

__m128 _mm_cvtepi64_ps(__m128i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtqq2ps

Convert packed 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results.



_mm_mask_cvtepi64_ps

__m128 _mm_mask_cvtepi64_ps(__m128 src, __mmask8 k, __m128i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtqq2ps

Convert packed 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi64_ps

__m128 _mm_maskz_cvtepi64_ps(__mmask8 k, __m128i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtqq2ps

Convert packed 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtepi64_ps

__m128 _mm256_cvtepi64_ps(__m256i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtqq2ps

Convert packed 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results.



_mm256_mask_cvtepi64_ps

__m128 _mm256_mask_cvtepi64_ps(__m128 src, __mmask8 k, __m256i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtqq2ps

Convert packed 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi64_ps

__m128 _mm256_maskz_cvtepi64_ps(__mmask8 k, __m256i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtqq2ps

Convert packed 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvt_roundepi64_ps

__m256 _mm512_cvt_roundepi64_ps(__m512i a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtqq2ps

Convert packed 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results.



_mm512_cvtepi64_ps

__m256 _mm512_cvtepi64_ps(__m512i a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtqq2ps

Convert packed 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results.



_mm512_mask_cvt_roundepi64_ps

__m256 _mm512_mask_cvt_roundepi64_ps(__m256 src, __mmask8 k, __m512i a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtqq2ps

Convert packed 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_mask_cvtepi64_ps

__m256 _mm512_mask_cvtepi64_ps(__m256 src, __mmask8 k, __m512i a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtqq2ps

Convert packed 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvt_roundepi64_ps

__m256 _mm512_maskz_cvt_roundepi64_ps(__mmask8 k, __m512i a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtqq2ps

Convert packed 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_maskz_cvtepi64_ps

__m256 _mm512_maskz_cvtepi64_ps(__mmask8 k, __m512i a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtqq2ps

Convert packed 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvttpd_epi32

__m128i _mm_mask_cvttpd_epi32(__m128i src, __mmask8 k, __m128d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttpd2dq

Convert packed double-precision (64-bit) floating-point elements in a to packed 32-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvttpd_epi32

__m128i _mm_maskz_cvttpd_epi32(__mmask8 k, __m128d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttpd2dq

Convert packed double-precision (64-bit) floating-point elements in a to packed 32-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvttpd_epi32

__m128i _mm256_mask_cvttpd_epi32(__m128i src, __mmask8 k, __m256d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttpd2dq

Convert packed double-precision (64-bit) floating-point elements in a to packed 32-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvttpd_epi32

__m128i _mm256_maskz_cvttpd_epi32(__mmask8 k, __m256d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttpd2dq

Convert packed double-precision (64-bit) floating-point elements in a to packed 32-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvttpd_epi64

__m128i _mm_cvttpd_epi64(__m128d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results.



_mm_mask_cvttpd_epi64

__m128i _mm_mask_cvttpd_epi64(__m128i src, __mmask8 k, __m128d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvttpd_epi64

__m128i _mm_maskz_cvttpd_epi64(__mmask8 k, __m128d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvttpd_epi64

__m256i _mm256_cvttpd_epi64(__m256d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results.



_mm256_mask_cvttpd_epi64

__m256i _mm256_mask_cvttpd_epi64(__m256i src, __mmask8 k, __m256d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvttpd_epi64

__m256i _mm256_maskz_cvttpd_epi64(__mmask8 k, __m256d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvtt_roundpd_epi64

__m512i _mm512_cvtt_roundpd_epi64(__m512d a, int sae)

CPUID Flags: AVX512DQ

Instruction(s): vcvttpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results. Pass __MM_FROUND_NO_EXC to sae to suppress all exceptions.



_mm512_cvttpd_epi64

__m512i _mm512_cvttpd_epi64(__m512d a)

CPUID Flags: AVX512DQ

Instruction(s): vcvttpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results.



_mm512_mask_cvtt_roundpd_epi64

__m512i _mm512_mask_cvtt_roundpd_epi64(__m512i src, __mmask8 k, __m512d a, int sae)

CPUID Flags: AVX512DQ

Instruction(s): vcvttpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_mask_cvttpd_epi64

__m512i _mm512_mask_cvttpd_epi64(__m512i src, __mmask8 k, __m512d a)

CPUID Flags: AVX512DQ

Instruction(s): vcvttpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvtt_roundpd_epi64

__m512i _mm512_maskz_cvtt_roundpd_epi64(__mmask8 k, __m512d a, int sae)

CPUID Flags: AVX512DQ

Instruction(s): vcvttpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set). Pass __MM_FROUND_NO_EXC to sae to suppress all exceptions.



_mm512_maskz_cvttpd_epi64

__m512i _mm512_maskz_cvttpd_epi64(__mmask8 k, __m512d a)

CPUID Flags: AVX512DQ

Instruction(s): vcvttpd2qq

Convert packed double-precision (64-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvttpd_epu32

__m128i _mm_cvttpd_epu32(__m128d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttpd2udq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 32-bit integers with truncation, and return the results.



_mm_mask_cvttpd_epu32

__m128i _mm_mask_cvttpd_epu32(__m128i src, __mmask8 k, __m128d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttpd2udq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 32-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvttpd_epu32

__m128i _mm_maskz_cvttpd_epu32(__mmask8 k, __m128d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttpd2udq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 32-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvttpd_epu32

__m128i _mm256_cvttpd_epu32(__m256d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttpd2udq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 32-bit integers with truncation, and return the results.



_mm256_mask_cvttpd_epu32

__m128i _mm256_mask_cvttpd_epu32(__m128i src, __mmask8 k, __m256d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttpd2udq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 32-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvttpd_epu32

__m128i _mm256_maskz_cvttpd_epu32(__mmask8 k, __m256d a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttpd2udq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 32-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvttpd_epu64

__m128i _mm_cvttpd_epu64(__m128d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results.



_mm_mask_cvttpd_epu64

__m128i _mm_mask_cvttpd_epu64(__m128i src, __mmask8 k, __m128d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvttpd_epu64

__m128i _mm_maskz_cvttpd_epu64(__mmask8 k, __m128d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvttpd_epu64

__m256i _mm256_cvttpd_epu64(__m256d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results.



_mm256_mask_cvttpd_epu64

__m256i _mm256_mask_cvttpd_epu64(__m256i src, __mmask8 k, __m256d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvttpd_epu64

__m256i _mm256_maskz_cvttpd_epu64(__mmask8 k, __m256d a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvtt_roundpd_epu64

__m512i _mm512_cvtt_roundpd_epu64(__m512d a, int sae)

CPUID Flags: AVX512DQ

Instruction(s): vcvttpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results. Pass __MM_FROUND_NO_EXC to sae to suppress all exceptions.



_mm512_cvttpd_epu64

__m512i _mm512_cvttpd_epu64(__m512d a)

CPUID Flags: AVX512DQ

Instruction(s): vcvttpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results.



_mm512_mask_cvtt_roundpd_epu64

__m512i _mm512_mask_cvtt_roundpd_epu64(__m512i src, __mmask8 k, __m512d a, int sae)

CPUID Flags: AVX512DQ

Instruction(s): vcvttpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_mask_cvttpd_epu64

__m512i _mm512_mask_cvttpd_epu64(__m512i src, __mmask8 k, __m512d a)

CPUID Flags: AVX512DQ

Instruction(s): vcvttpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvtt_roundpd_epu64

__m512i _mm512_maskz_cvtt_roundpd_epu64(__mmask8 k, __m512d a, int sae)

CPUID Flags: AVX512DQ

Instruction(s): vcvttpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set). Pass __MM_FROUND_NO_EXC to sae to suppress all exceptions.



_mm512_maskz_cvttpd_epu64

__m512i _mm512_maskz_cvttpd_epu64(__mmask8 k, __m512d a)

CPUID Flags: AVX512DQ

Instruction(s): vcvttpd2uqq

Convert packed double-precision (64-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvttps_epi32

__m128i _mm_mask_cvttps_epi32(__m128i src, __mmask8 k, __m128 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttps2dq

Convert packed single-precision (32-bit) floating-point elements in a to packed 32-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvttps_epi32

__m128i _mm_maskz_cvttps_epi32(__mmask8 k, __m128 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttps2dq

Convert packed single-precision (32-bit) floating-point elements in a to packed 32-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvttps_epi32

__m256i _mm256_mask_cvttps_epi32(__m256i src, __mmask8 k, __m256 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttps2dq

Convert packed single-precision (32-bit) floating-point elements in a to packed 32-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvttps_epi32

__m256i _mm256_maskz_cvttps_epi32(__mmask8 k, __m256 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttps2dq

Convert packed single-precision (32-bit) floating-point elements in a to packed 32-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvttps_epi64

__m128i _mm_cvttps_epi64(__m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results.



_mm_mask_cvttps_epi64

__m128i _mm_mask_cvttps_epi64(__m128i src, __mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvttps_epi64

__m128i _mm_maskz_cvttps_epi64(__mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvttps_epi64

__m256i _mm256_cvttps_epi64(__m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results.



_mm256_mask_cvttps_epi64

__m256i _mm256_mask_cvttps_epi64(__m256i src, __mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvttps_epi64

__m256i _mm256_maskz_cvttps_epi64(__mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvtt_roundps_epi64

__m512i _mm512_cvtt_roundps_epi64(__m256 a, int sae)

CPUID Flags: AVX512DQ

Instruction(s): vcvttps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results. Pass __MM_FROUND_NO_EXC to sae to suppress all exceptions.



_mm512_cvttps_epi64

__m512i _mm512_cvttps_epi64(__m256 a)

CPUID Flags: AVX512DQ

Instruction(s): vcvttps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results.



_mm512_mask_cvtt_roundps_epi64

__m512i _mm512_mask_cvtt_roundps_epi64(__m512i src, __mmask8 k, __m256 a, int sae)

CPUID Flags: AVX512DQ

Instruction(s): vcvttps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_mask_cvttps_epi64

__m512i _mm512_mask_cvttps_epi64(__m512i src, __mmask8 k, __m256 a)

CPUID Flags: AVX512DQ

Instruction(s): vcvttps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvtt_roundps_epi64

__m512i _mm512_maskz_cvtt_roundps_epi64(__mmask8 k, __m256 a, int sae)

CPUID Flags: AVX512DQ

Instruction(s): vcvttps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set). Pass __MM_FROUND_NO_EXC to sae to suppress all exceptions.



_mm512_maskz_cvttps_epi64

__m512i _mm512_maskz_cvttps_epi64(__mmask8 k, __m256 a)

CPUID Flags: AVX512DQ

Instruction(s): vcvttps2qq

Convert packed single-precision (32-bit) floating-point elements in a to packed 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvttps_epu32

__m128i _mm_cvttps_epu32(__m128 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttps2udq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 32-bit integers with truncation, and return the results.



_mm_mask_cvttps_epu32

__m128i _mm_mask_cvttps_epu32(__m128i src, __mmask8 k, __m128 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttps2udq

Convert packed double-precision (32-bit) floating-point elements in a to packed unsigned 32-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvttps_epu32

__m128i _mm_maskz_cvttps_epu32(__mmask8 k, __m128 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttps2udq

Convert packed double-precision (32-bit) floating-point elements in a to packed unsigned 32-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvttps_epu32

__m256i _mm256_cvttps_epu32(__m256 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttps2udq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 32-bit integers with truncation, and return the results.



_mm256_mask_cvttps_epu32

__m256i _mm256_mask_cvttps_epu32(__m256i src, __mmask8 k, __m256 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttps2udq

Convert packed double-precision (32-bit) floating-point elements in a to packed unsigned 32-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvttps_epu32

__m256i _mm256_maskz_cvttps_epu32(__mmask8 k, __m256 a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvttps2udq

Convert packed double-precision (32-bit) floating-point elements in a to packed unsigned 32-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvttps_epu64

__m128i _mm_cvttps_epu64(__m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results.



_mm_mask_cvttps_epu64

__m128i _mm_mask_cvttps_epu64(__m128i src, __mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvttps_epu64

__m128i _mm_maskz_cvttps_epu64(__mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvttps_epu64

__m256i _mm256_cvttps_epu64(__m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results.



_mm256_mask_cvttps_epu64

__m256i _mm256_mask_cvttps_epu64(__m256i src, __mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvttps_epu64

__m256i _mm256_maskz_cvttps_epu64(__mmask8 k, __m128 a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvttps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvtt_roundps_epu64

__m512i _mm512_cvtt_roundps_epu64(__m256 a, int sae)

CPUID Flags: AVX512DQ

Instruction(s): vcvttps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results. Pass __MM_FROUND_NO_EXC to sae to suppress all exceptions.



_mm512_cvttps_epu64

__m512i _mm512_cvttps_epu64(__m256 a)

CPUID Flags: AVX512DQ

Instruction(s): vcvttps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results.



_mm512_mask_cvtt_roundps_epu64

__m512i _mm512_mask_cvtt_roundps_epu64(__m512i src, __mmask8 k, __m256 a, int sae)

CPUID Flags: AVX512DQ

Instruction(s): vcvttps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_mask_cvttps_epu64

__m512i _mm512_mask_cvttps_epu64(__m512i src, __mmask8 k, __m256 a)

CPUID Flags: AVX512DQ

Instruction(s): vcvttps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvtt_roundps_epu64

__m512i _mm512_maskz_cvtt_roundps_epu64(__mmask8 k, __m256 a, int sae)

CPUID Flags: AVX512DQ

Instruction(s): vcvttps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set). Pass __MM_FROUND_NO_EXC to sae to suppress all exceptions.



_mm512_maskz_cvttps_epu64

__m512i _mm512_maskz_cvttps_epu64(__mmask8 k, __m256 a)

CPUID Flags: AVX512DQ

Instruction(s): vcvttps2uqq

Convert packed single-precision (32-bit) floating-point elements in a to packed unsigned 64-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtepu32_pd

__m128d _mm_cvtepu32_pd(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtudq2pd

Convert packed unsigned 32-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results.



_mm_mask_cvtepu32_pd

__m128d _mm_mask_cvtepu32_pd(__m128d src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtudq2pd

Convert packed unsigned 32-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepu32_pd

__m128d _mm_maskz_cvtepu32_pd(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtudq2pd

Convert packed unsigned 32-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtepu32_pd

__m256d _mm256_cvtepu32_pd(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtudq2pd

Convert packed unsigned 32-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results.



_mm256_mask_cvtepu32_pd

__m256d _mm256_mask_cvtepu32_pd(__m256d src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtudq2pd

Convert packed unsigned 32-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepu32_pd

__m256d _mm256_maskz_cvtepu32_pd(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vcvtudq2pd

Convert packed unsigned 32-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtepu64_pd

__m128d _mm_cvtepu64_pd(__m128i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtuqq2pd

Convert packed unsigned 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results.



_mm_mask_cvtepu64_pd

__m128d _mm_mask_cvtepu64_pd(__m128d src, __mmask8 k, __m128i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtuqq2pd

Convert packed unsigned 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepu64_pd

__m128d _mm_maskz_cvtepu64_pd(__mmask8 k, __m128i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtuqq2pd

Convert packed unsigned 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtepu64_pd

__m256d _mm256_cvtepu64_pd(__m256i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtuqq2pd

Convert packed unsigned 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results.



_mm256_mask_cvtepu64_pd

__m256d _mm256_mask_cvtepu64_pd(__m256d src, __mmask8 k, __m256i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtuqq2pd

Convert packed unsigned 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepu64_pd

__m256d _mm256_maskz_cvtepu64_pd(__mmask8 k, __m256i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtuqq2pd

Convert packed unsigned 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvt_roundepu64_pd

__m512d _mm512_cvt_roundepu64_pd(__m512i a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtuqq2pd

Convert packed unsigned 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results.



_mm512_cvtepu64_pd

__m512d _mm512_cvtepu64_pd(__m512i a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtuqq2pd

Convert packed unsigned 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results.



_mm512_mask_cvt_roundepu64_pd

__m512d _mm512_mask_cvt_roundepu64_pd(__m512d src, __mmask8 k, __m512i a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtuqq2pd

Convert packed unsigned 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_mask_cvtepu64_pd

__m512d _mm512_mask_cvtepu64_pd(__m512d src, __mmask8 k, __m512i a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtuqq2pd

Convert packed unsigned 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvt_roundepu64_pd

__m512d _mm512_maskz_cvt_roundepu64_pd(__mmask8 k, __m512i a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtuqq2pd

Convert packed unsigned 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_maskz_cvtepu64_pd

__m512d _mm512_maskz_cvtepu64_pd(__mmask8 k, __m512i a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtuqq2pd

Convert packed unsigned 64-bit integers in a to packed double-precision (64-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtepu64_ps

__m128 _mm_cvtepu64_ps(__m128i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtuqq2ps

Convert packed unsigned 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results.



_mm_mask_cvtepu64_ps

__m128 _mm_mask_cvtepu64_ps(__m128 src, __mmask8 k, __m128i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtuqq2ps

Convert packed unsigned 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepu64_ps

__m128 _mm_maskz_cvtepu64_ps(__mmask8 k, __m128i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtuqq2ps

Convert packed unsigned 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtepu64_ps

__m128 _mm256_cvtepu64_ps(__m256i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtuqq2ps

Convert packed unsigned 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results.



_mm256_mask_cvtepu64_ps

__m128 _mm256_mask_cvtepu64_ps(__m128 src, __mmask8 k, __m256i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtuqq2ps

Convert packed unsigned 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepu64_ps

__m128 _mm256_maskz_cvtepu64_ps(__mmask8 k, __m256i a)

CPUID Flags: AVX512DQ, AVX512VL

Instruction(s): vcvtuqq2ps

Convert packed unsigned 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvt_roundepu64_ps

__m256 _mm512_cvt_roundepu64_ps(__m512i a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtuqq2ps

Convert packed unsigned 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results.



_mm512_cvtepu64_ps

__m256 _mm512_cvtepu64_ps(__m512i a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtuqq2ps

Convert packed unsigned 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results.



_mm512_mask_cvt_roundepu64_ps

__m256 _mm512_mask_cvt_roundepu64_ps(__m256 src, __mmask8 k, __m512i a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtuqq2ps

Convert packed unsigned 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_mask_cvtepu64_ps

__m256 _mm512_mask_cvtepu64_ps(__m256 src, __mmask8 k, __m512i a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtuqq2ps

Convert packed unsigned 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvt_roundepu64_ps

__m256 _mm512_maskz_cvt_roundepu64_ps(__mmask8 k, __m512i a, int rounding)

CPUID Flags: AVX512DQ

Instruction(s): vcvtuqq2ps

Convert packed unsigned 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_maskz_cvtepu64_ps

__m256 _mm512_maskz_cvtepu64_ps(__mmask8 k, __m512i a)

CPUID Flags: AVX512DQ

Instruction(s): vcvtuqq2ps

Convert packed unsigned 64-bit integers in a to packed single-precision (32-bit) floating-point elements, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtepi32_epi8

__m128i _mm_cvtepi32_epi8(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdb

Convert packed 32-bit integers in a to packed 8-bit integers with truncation, and return the results.



_mm_mask_cvtepi32_epi8

__m128i _mm_mask_cvtepi32_epi8(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdb

Convert packed 32-bit integers in a to packed 8-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi32_epi8

__m128i _mm_maskz_cvtepi32_epi8(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdb

Convert packed 32-bit integers in a to packed 8-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtepi32_epi8

__m128i _mm256_cvtepi32_epi8(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdb

Convert packed 32-bit integers in a to packed 8-bit integers with truncation, and return the results.



_mm256_mask_cvtepi32_epi8

__m128i _mm256_mask_cvtepi32_epi8(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdb

Convert packed 32-bit integers in a to packed 8-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi32_epi8

__m128i _mm256_maskz_cvtepi32_epi8(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdb

Convert packed 32-bit integers in a to packed 8-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtepi32_epi16

__m128i _mm_cvtepi32_epi16(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdw

Convert packed 32-bit integers in a to packed 16-bit integers with truncation, and return the results.



_mm_mask_cvtepi32_epi16

__m128i _mm_mask_cvtepi32_epi16(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdw

Convert packed 32-bit integers in a to packed 16-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi32_epi16

__m128i _mm_maskz_cvtepi32_epi16(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdw

Convert packed 32-bit integers in a to packed 16-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtepi32_epi16

__m128i _mm256_cvtepi32_epi16(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdw

Convert packed 32-bit integers in a to packed 16-bit integers with truncation, and return the results.



_mm256_mask_cvtepi32_epi16

__m128i _mm256_mask_cvtepi32_epi16(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdw

Convert packed 32-bit integers in a to packed 16-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi32_epi16

__m128i _mm256_maskz_cvtepi32_epi16(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdw

Convert packed 32-bit integers in a to packed 16-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtepi64_epi8

__m128i _mm_cvtepi64_epi8(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqb

Convert packed 64-bit integers in a to packed 8-bit integers with truncation, and return the results.



_mm_mask_cvtepi64_epi8

__m128i _mm_mask_cvtepi64_epi8(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqb

Convert packed 64-bit integers in a to packed 8-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi64_epi8

__m128i _mm_maskz_cvtepi64_epi8(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqb

Convert packed 64-bit integers in a to packed 8-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtepi64_epi8

__m128i _mm256_cvtepi64_epi8(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqb

Convert packed 64-bit integers in a to packed 8-bit integers with truncation, and return the results.



_mm256_mask_cvtepi64_epi8

__m128i _mm256_mask_cvtepi64_epi8(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqb

Convert packed 64-bit integers in a to packed 8-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi64_epi8

__m128i _mm256_maskz_cvtepi64_epi8(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqb

Convert packed 64-bit integers in a to packed 8-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtepi64_epi32

__m128i _mm_cvtepi64_epi32(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqd

Convert packed 64-bit integers in a to packed 32-bit integers with truncation, and return the results.



_mm_mask_cvtepi64_epi32

__m128i _mm_mask_cvtepi64_epi32(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqd

Convert packed 64-bit integers in a to packed 32-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi64_epi32

__m128i _mm_maskz_cvtepi64_epi32(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqd

Convert packed 64-bit integers in a to packed 32-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtepi64_epi32

__m128i _mm256_cvtepi64_epi32(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqd

Convert packed 64-bit integers in a to packed 32-bit integers with truncation, and return the results.



_mm256_mask_cvtepi64_epi32

__m128i _mm256_mask_cvtepi64_epi32(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqd

Convert packed 64-bit integers in a to packed 32-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi64_epi32

__m128i _mm256_maskz_cvtepi64_epi32(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqd

Convert packed 64-bit integers in a to packed 32-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtepi64_epi16

__m128i _mm_cvtepi64_epi16(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqw

Convert packed 64-bit integers in a to packed 16-bit integers with truncation, and return the results.



_mm_mask_cvtepi64_epi16

__m128i _mm_mask_cvtepi64_epi16(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqw

Convert packed 64-bit integers in a to packed 16-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi64_epi16

__m128i _mm_maskz_cvtepi64_epi16(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqw

Convert packed 64-bit integers in a to packed 16-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtepi64_epi16

__m128i _mm256_cvtepi64_epi16(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqw

Convert packed 64-bit integers in a to packed 16-bit integers with truncation, and return the results.



_mm256_mask_cvtepi64_epi16

__m128i _mm256_mask_cvtepi64_epi16(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqw

Convert packed 64-bit integers in a to packed 16-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi64_epi16

__m128i _mm256_maskz_cvtepi64_epi16(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqw

Convert packed 64-bit integers in a to packed 16-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtsepi32_epi8

__m128i _mm_cvtsepi32_epi8(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdb

Convert packed 32-bit integers in a to packed 8-bit integers with signed saturation, and return the results.



_mm_mask_cvtsepi32_epi8

__m128i _mm_mask_cvtsepi32_epi8(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdb

Convert packed 32-bit integers in a to packed 8-bit integers with signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtsepi32_epi8

__m128i _mm_maskz_cvtsepi32_epi8(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdb

Convert packed 32-bit integers in a to packed 8-bit integers with signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtsepi32_epi8

__m128i _mm256_cvtsepi32_epi8(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdb

Convert packed 32-bit integers in a to packed 8-bit integers with signed saturation, and return the results.



_mm256_mask_cvtsepi32_epi8

__m128i _mm256_mask_cvtsepi32_epi8(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdb

Convert packed 32-bit integers in a to packed 8-bit integers with signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtsepi32_epi8

__m128i _mm256_maskz_cvtsepi32_epi8(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdb

Convert packed 32-bit integers in a to packed 8-bit integers with signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtsepi32_epi16

__m128i _mm_cvtsepi32_epi16(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdw

Convert packed 32-bit integers in a to packed 16-bit integers with signed saturation, and return the results.



_mm_mask_cvtsepi32_epi16

__m128i _mm_mask_cvtsepi32_epi16(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdw

Convert packed 32-bit integers in a to packed 16-bit integers with signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtsepi32_epi16

__m128i _mm_maskz_cvtsepi32_epi16(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdw

Convert packed 32-bit integers in a to packed 16-bit integers with signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtsepi32_epi16

__m128i _mm256_cvtsepi32_epi16(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdw

Convert packed 32-bit integers in a to packed 16-bit integers with signed saturation, and return the results.



_mm256_mask_cvtsepi32_epi16

__m128i _mm256_mask_cvtsepi32_epi16(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdw

Convert packed 32-bit integers in a to packed 16-bit integers with signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtsepi32_epi16

__m128i _mm256_maskz_cvtsepi32_epi16(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdw

Convert packed 32-bit integers in a to packed 16-bit integers with signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtsepi64_epi8

__m128i _mm_cvtsepi64_epi8(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqb

Convert packed 64-bit integers in a to packed 8-bit integers with signed saturation, and return the results.



_mm_mask_cvtsepi64_epi8

__m128i _mm_mask_cvtsepi64_epi8(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqb

Convert packed 64-bit integers in a to packed 8-bit integers with signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtsepi64_epi8

__m128i _mm_maskz_cvtsepi64_epi8(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqb

Convert packed 64-bit integers in a to packed 8-bit integers with signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtsepi64_epi8

__m128i _mm256_cvtsepi64_epi8(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqb

Convert packed 64-bit integers in a to packed 8-bit integers with signed saturation, and return the results.



_mm256_mask_cvtsepi64_epi8

__m128i _mm256_mask_cvtsepi64_epi8(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqb

Convert packed 64-bit integers in a to packed 8-bit integers with signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtsepi64_epi8

__m128i _mm256_maskz_cvtsepi64_epi8(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqb

Convert packed 64-bit integers in a to packed 8-bit integers with signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtsepi64_epi32

__m128i _mm_cvtsepi64_epi32(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqd

Convert packed 64-bit integers in a to packed 32-bit integers with signed saturation, and return the results.



_mm_mask_cvtsepi64_epi32

__m128i _mm_mask_cvtsepi64_epi32(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqd

Convert packed 64-bit integers in a to packed 32-bit integers with signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtsepi64_epi32

__m128i _mm_maskz_cvtsepi64_epi32(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqd

Convert packed 64-bit integers in a to packed 32-bit integers with signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtsepi64_epi32

__m128i _mm256_cvtsepi64_epi32(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqd

Convert packed 64-bit integers in a to packed 32-bit integers with signed saturation, and return the results.



_mm256_mask_cvtsepi64_epi32

__m128i _mm256_mask_cvtsepi64_epi32(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqd

Convert packed 64-bit integers in a to packed 32-bit integers with signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtsepi64_epi32

__m128i _mm256_maskz_cvtsepi64_epi32(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqd

Convert packed 64-bit integers in a to packed 32-bit integers with signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtsepi64_epi16

__m128i _mm_cvtsepi64_epi16(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqw

Convert packed 64-bit integers in a to packed 16-bit integers with signed saturation, and return the results.



_mm_mask_cvtsepi64_epi16

__m128i _mm_mask_cvtsepi64_epi16(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqw

Convert packed 64-bit integers in a to packed 16-bit integers with signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtsepi64_epi16

__m128i _mm_maskz_cvtsepi64_epi16(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqw

Convert packed 64-bit integers in a to packed 16-bit integers with signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtsepi64_epi16

__m128i _mm256_cvtsepi64_epi16(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqw

Convert packed 64-bit integers in a to packed 16-bit integers with signed saturation, and return the results.



_mm256_mask_cvtsepi64_epi16

__m128i _mm256_mask_cvtsepi64_epi16(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqw

Convert packed 64-bit integers in a to packed 16-bit integers with signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtsepi64_epi16

__m128i _mm256_maskz_cvtsepi64_epi16(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqw

Convert packed 64-bit integers in a to packed 16-bit integers with signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtsepi16_epi8

__m128i _mm_cvtsepi16_epi8(__m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovswb

Convert packed 16-bit integers in a to packed 8-bit integers with signed saturation, and return the results.



_mm_mask_cvtsepi16_epi8

__m128i _mm_mask_cvtsepi16_epi8(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovswb

Convert packed 16-bit integers in a to packed 8-bit integers with signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtsepi16_epi8

__m128i _mm_maskz_cvtsepi16_epi8(__mmask8 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovswb

Convert packed 16-bit integers in a to packed 8-bit integers with signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtsepi16_epi8

__m128i _mm256_cvtsepi16_epi8(__m256i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovswb

Convert packed 16-bit integers in a to packed 8-bit integers with signed saturation, and return the results.



_mm256_mask_cvtsepi16_epi8

__m128i _mm256_mask_cvtsepi16_epi8(__m128i src, __mmask16 k, __m256i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovswb

Convert packed 16-bit integers in a to packed 8-bit integers with signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtsepi16_epi8

__m128i _mm256_maskz_cvtsepi16_epi8(__mmask16 k, __m256i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovswb

Convert packed 16-bit integers in a to packed 8-bit integers with signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvtsepi16_epi8

__m256i _mm512_cvtsepi16_epi8(__m512i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovswb

Convert packed 16-bit integers in a to packed 8-bit integers with signed saturation, and return the results.



_mm512_mask_cvtsepi16_epi8

__m256i _mm512_mask_cvtsepi16_epi8(__m256i src, __mmask32 k, __m512i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovswb

Convert packed 16-bit integers in a to packed 8-bit integers with signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvtsepi16_epi8

__m256i _mm512_maskz_cvtsepi16_epi8(__mmask32 k, __m512i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovswb

Convert packed 16-bit integers in a to packed 8-bit integers with signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepi8_epi32

__m128i _mm_mask_cvtepi8_epi32(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxbd

Sign extend packed 8-bit integers in the low 4 bytes of a to packed 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi8_epi32

__m128i _mm_maskz_cvtepi8_epi32(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxbd

Sign extend packed 8-bit integers in the low 4 bytes of a to packed 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepi8_epi32

__m256i _mm256_mask_cvtepi8_epi32(__m256i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxbd

Sign extend packed 8-bit integers in the low 8 bytes of a to packed 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi8_epi32

__m256i _mm256_maskz_cvtepi8_epi32(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxbd

Sign extend packed 8-bit integers in the low 8 bytes of a to packed 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepi8_epi64

__m128i _mm_mask_cvtepi8_epi64(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxbq

Sign extend packed 8-bit integers in the low 2 bytes of a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi8_epi64

__m128i _mm_maskz_cvtepi8_epi64(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxbq

Sign extend packed 8-bit integers in the low 2 bytes of a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepi8_epi64

__m256i _mm256_mask_cvtepi8_epi64(__m256i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxbq

Sign extend packed 8-bit integers in the low 4 bytes of a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi8_epi64

__m256i _mm256_maskz_cvtepi8_epi64(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxbq

Sign extend packed 8-bit integers in the low 4 bytes of a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepi8_epi16

__m128i _mm_mask_cvtepi8_epi16(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovsxbw

Sign extend packed 8-bit integers in a to packed 16-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi8_epi16

__m128i _mm_maskz_cvtepi8_epi16(__mmask8 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovsxbw

Sign extend packed 8-bit integers in a to packed 16-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepi8_epi16

__m256i _mm256_mask_cvtepi8_epi16(__m256i src, __mmask16 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovsxbw

Sign extend packed 8-bit integers in a to packed 16-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi8_epi16

__m256i _mm256_maskz_cvtepi8_epi16(__mmask16 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovsxbw

Sign extend packed 8-bit integers in a to packed 16-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvtepi8_epi16

__m512i _mm512_cvtepi8_epi16(__m256i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovsxbw

Sign extend packed 8-bit integers in a to packed 16-bit integers, and return the results.



_mm512_mask_cvtepi8_epi16

__m512i _mm512_mask_cvtepi8_epi16(__m512i src, __mmask32 k, __m256i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovsxbw

Sign extend packed 8-bit integers in a to packed 16-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvtepi8_epi16

__m512i _mm512_maskz_cvtepi8_epi16(__mmask32 k, __m256i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovsxbw

Sign extend packed 8-bit integers in a to packed 16-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepi32_epi64

__m128i _mm_mask_cvtepi32_epi64(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxdq

Sign extend packed 32-bit integers in a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi32_epi64

__m128i _mm_maskz_cvtepi32_epi64(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxdq

Sign extend packed 32-bit integers in a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepi32_epi64

__m256i _mm256_mask_cvtepi32_epi64(__m256i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxdq

Sign extend packed 32-bit integers in a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi32_epi64

__m256i _mm256_maskz_cvtepi32_epi64(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxdq

Sign extend packed 32-bit integers in a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepi16_epi32

__m128i _mm_mask_cvtepi16_epi32(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxwd

Sign extend packed 16-bit integers in a to packed 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi16_epi32

__m128i _mm_maskz_cvtepi16_epi32(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxwd

Sign extend packed 16-bit integers in a to packed 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepi16_epi32

__m256i _mm256_mask_cvtepi16_epi32(__m256i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxwd

Sign extend packed 16-bit integers in a to packed 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi16_epi32

__m256i _mm256_maskz_cvtepi16_epi32(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxwd

Sign extend packed 16-bit integers in a to packed 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepi16_epi64

__m128i _mm_mask_cvtepi16_epi64(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxwq

Sign extend packed 16-bit integers in the low 4 bytes of a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi16_epi64

__m128i _mm_maskz_cvtepi16_epi64(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxwq

Sign extend packed 16-bit integers in the low 4 bytes of a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepi16_epi64

__m256i _mm256_mask_cvtepi16_epi64(__m256i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxwq

Sign extend packed 16-bit integers in the low 8 bytes of a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi16_epi64

__m256i _mm256_maskz_cvtepi16_epi64(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsxwq

Sign extend packed 16-bit integers in the low 8 bytes of a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtusepi32_epi8

__m128i _mm_cvtusepi32_epi8(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdb

Convert packed unsigned 32-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results.



_mm_mask_cvtusepi32_epi8

__m128i _mm_mask_cvtusepi32_epi8(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdb

Convert packed unsigned 32-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtusepi32_epi8

__m128i _mm_maskz_cvtusepi32_epi8(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdb

Convert packed unsigned 32-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtusepi32_epi8

__m128i _mm256_cvtusepi32_epi8(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdb

Convert packed unsigned 32-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results.



_mm256_mask_cvtusepi32_epi8

__m128i _mm256_mask_cvtusepi32_epi8(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdb

Convert packed unsigned 32-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtusepi32_epi8

__m128i _mm256_maskz_cvtusepi32_epi8(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdb

Convert packed unsigned 32-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtusepi32_epi16

__m128i _mm_cvtusepi32_epi16(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdw

Convert packed unsigned 32-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and return the results.



_mm_mask_cvtusepi32_epi16

__m128i _mm_mask_cvtusepi32_epi16(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdw

Convert packed unsigned 32-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtusepi32_epi16

__m128i _mm_maskz_cvtusepi32_epi16(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdw

Convert packed unsigned 32-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtusepi32_epi16

__m128i _mm256_cvtusepi32_epi16(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdw

Convert packed unsigned 32-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and return the results.



_mm256_mask_cvtusepi32_epi16

__m128i _mm256_mask_cvtusepi32_epi16(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdw

Convert packed unsigned 32-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtusepi32_epi16

__m128i _mm256_maskz_cvtusepi32_epi16(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdw

Convert packed unsigned 32-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtusepi64_epi8

__m128i _mm_cvtusepi64_epi8(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqb

Convert packed unsigned 64-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results.



_mm_mask_cvtusepi64_epi8

__m128i _mm_mask_cvtusepi64_epi8(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqb

Convert packed unsigned 64-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtusepi64_epi8

__m128i _mm_maskz_cvtusepi64_epi8(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqb

Convert packed unsigned 64-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtusepi64_epi8

__m128i _mm256_cvtusepi64_epi8(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqb

Convert packed unsigned 64-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results.



_mm256_mask_cvtusepi64_epi8

__m128i _mm256_mask_cvtusepi64_epi8(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqb

Convert packed unsigned 64-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtusepi64_epi8

__m128i _mm256_maskz_cvtusepi64_epi8(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqb

Convert packed unsigned 64-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtusepi64_epi32

__m128i _mm_cvtusepi64_epi32(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqd

Convert packed unsigned 64-bit integers in a to packed unsigned 32-bit integers with unsigned saturation, and return the results.



_mm_mask_cvtusepi64_epi32

__m128i _mm_mask_cvtusepi64_epi32(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqd

Convert packed unsigned 64-bit integers in a to packed unsigned 32-bit integers with unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtusepi64_epi32

__m128i _mm_maskz_cvtusepi64_epi32(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqd

Convert packed unsigned 64-bit integers in a to packed unsigned 32-bit integers with unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtusepi64_epi32

__m128i _mm256_cvtusepi64_epi32(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqd

Convert packed unsigned 64-bit integers in a to packed unsigned 32-bit integers with unsigned saturation, and return the results.



_mm256_mask_cvtusepi64_epi32

__m128i _mm256_mask_cvtusepi64_epi32(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqd

Convert packed unsigned 64-bit integers in a to packed unsigned 32-bit integers with unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtusepi64_epi32

__m128i _mm256_maskz_cvtusepi64_epi32(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqd

Convert packed unsigned 64-bit integers in a to packed unsigned 32-bit integers with unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtusepi64_epi16

__m128i _mm_cvtusepi64_epi16(__m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqw

Convert packed unsigned 64-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and return the results.



_mm_mask_cvtusepi64_epi16

__m128i _mm_mask_cvtusepi64_epi16(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqw

Convert packed unsigned 64-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtusepi64_epi16

__m128i _mm_maskz_cvtusepi64_epi16(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqw

Convert packed unsigned 64-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtusepi64_epi16

__m128i _mm256_cvtusepi64_epi16(__m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqw

Convert packed unsigned 64-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and return the results.



_mm256_mask_cvtusepi64_epi16

__m128i _mm256_mask_cvtusepi64_epi16(__m128i src, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqw

Convert packed unsigned 64-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtusepi64_epi16

__m128i _mm256_maskz_cvtusepi64_epi16(__mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqw

Convert packed unsigned 64-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtusepi16_epi8

__m128i _mm_cvtusepi16_epi8(__m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovuswb

Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results.



_mm_mask_cvtusepi16_epi8

__m128i _mm_mask_cvtusepi16_epi8(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovuswb

Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtusepi16_epi8

__m128i _mm_maskz_cvtusepi16_epi8(__mmask8 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovuswb

Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtusepi16_epi8

__m128i _mm256_cvtusepi16_epi8(__m256i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovuswb

Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results.



_mm256_mask_cvtusepi16_epi8

__m128i _mm256_mask_cvtusepi16_epi8(__m128i src, __mmask16 k, __m256i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovuswb

Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtusepi16_epi8

__m128i _mm256_maskz_cvtusepi16_epi8(__mmask16 k, __m256i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovuswb

Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvtusepi16_epi8

__m256i _mm512_cvtusepi16_epi8(__m512i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovuswb

Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results.



_mm512_mask_cvtusepi16_epi8

__m256i _mm512_mask_cvtusepi16_epi8(__m256i src, __mmask32 k, __m512i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovuswb

Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvtusepi16_epi8

__m256i _mm512_maskz_cvtusepi16_epi8(__mmask32 k, __m512i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovuswb

Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_cvtepi16_epi8

__m128i _mm_cvtepi16_epi8(__m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovwb

Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and return the results.



_mm_mask_cvtepi16_epi8

__m128i _mm_mask_cvtepi16_epi8(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovwb

Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepi16_epi8

__m128i _mm_maskz_cvtepi16_epi8(__mmask8 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovwb

Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_cvtepi16_epi8

__m128i _mm256_cvtepi16_epi8(__m256i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovwb

Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and return the results.



_mm256_mask_cvtepi16_epi8

__m128i _mm256_mask_cvtepi16_epi8(__m128i src, __mmask16 k, __m256i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovwb

Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepi16_epi8

__m128i _mm256_maskz_cvtepi16_epi8(__mmask16 k, __m256i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovwb

Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvtepi16_epi8

__m256i _mm512_cvtepi16_epi8(__m512i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovwb

Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and return the results.



_mm512_mask_cvtepi16_epi8

__m256i _mm512_mask_cvtepi16_epi8(__m256i src, __mmask32 k, __m512i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovwb

Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvtepi16_epi8

__m256i _mm512_maskz_cvtepi16_epi8(__mmask32 k, __m512i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovwb

Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepu8_epi32

__m128i _mm_mask_cvtepu8_epi32(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxbd

Zero extend packed unsigned 8-bit integers in the low 4 bytes of a to packed 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepu8_epi32

__m128i _mm_maskz_cvtepu8_epi32(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxbd

Zero extend packed unsigned 8-bit integers in th elow 4 bytes of a to packed 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepu8_epi32

__m256i _mm256_mask_cvtepu8_epi32(__m256i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxbd

Zero extend packed unsigned 8-bit integers in the low 8 bytes of a to packed 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepu8_epi32

__m256i _mm256_maskz_cvtepu8_epi32(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxbd

Zero extend packed unsigned 8-bit integers in the low 8 bytes of a to packed 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepu8_epi64

__m128i _mm_mask_cvtepu8_epi64(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxbq

Zero extend packed unsigned 8-bit integers in the low 2 bytes of a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepu8_epi64

__m128i _mm_maskz_cvtepu8_epi64(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxbq

Zero extend packed unsigned 8-bit integers in the low 2 bytes of a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepu8_epi64

__m256i _mm256_mask_cvtepu8_epi64(__m256i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxbq

Zero extend packed unsigned 8-bit integers in the low 4 bytes of a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepu8_epi64

__m256i _mm256_maskz_cvtepu8_epi64(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxbq

Zero extend packed unsigned 8-bit integers in the low 4 bytes of a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepu8_epi16

__m128i _mm_mask_cvtepu8_epi16(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovzxbw

Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepu8_epi16

__m128i _mm_maskz_cvtepu8_epi16(__mmask8 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovzxbw

Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepu8_epi16

__m256i _mm256_mask_cvtepu8_epi16(__m256i src, __mmask16 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovzxbw

Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepu8_epi16

__m256i _mm256_maskz_cvtepu8_epi16(__mmask16 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovzxbw

Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_cvtepu8_epi16

__m512i _mm512_cvtepu8_epi16(__m256i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovzxbw

Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and return the results.



_mm512_mask_cvtepu8_epi16

__m512i _mm512_mask_cvtepu8_epi16(__m512i src, __mmask32 k, __m256i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovzxbw

Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_cvtepu8_epi16

__m512i _mm512_maskz_cvtepu8_epi16(__mmask32 k, __m256i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovzxbw

Zero extend packed unsigned 8-bit integers in a to packed 16-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepu32_epi64

__m128i _mm_mask_cvtepu32_epi64(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxdq

Zero extend packed unsigned 32-bit integers in a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepu32_epi64

__m128i _mm_maskz_cvtepu32_epi64(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxdq

Zero extend packed unsigned 32-bit integers in a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepu32_epi64

__m256i _mm256_mask_cvtepu32_epi64(__m256i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxdq

Zero extend packed unsigned 32-bit integers in a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepu32_epi64

__m256i _mm256_maskz_cvtepu32_epi64(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxdq

Zero extend packed unsigned 32-bit integers in a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepu16_epi32

__m128i _mm_mask_cvtepu16_epi32(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxwd

Zero extend packed unsigned 16-bit integers in a to packed 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepu16_epi32

__m128i _mm_maskz_cvtepu16_epi32(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxwd

Zero extend packed unsigned 16-bit integers in a to packed 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepu16_epi32

__m256i _mm256_mask_cvtepu16_epi32(__m256i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxwd

Zero extend packed unsigned 16-bit integers in a to packed 32-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepu16_epi32

__m256i _mm256_maskz_cvtepu16_epi32(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxwd

Zero extend packed unsigned 16-bit integers in a to packed 32-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_cvtepu16_epi64

__m128i _mm_mask_cvtepu16_epi64(__m128i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxwq

Zero extend packed unsigned 16-bit integers in the low 4 bytes of a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_cvtepu16_epi64

__m128i _mm_maskz_cvtepu16_epi64(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxwq

Zero extend packed unsigned 16-bit integers in the low 4 bytes of a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_cvtepu16_epi64

__m256i _mm256_mask_cvtepu16_epi64(__m256i src, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxwq

Zero extend packed unsigned 16-bit integers in the low 8 bytes of a to packed 64-bit integers, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_cvtepu16_epi64

__m256i _mm256_maskz_cvtepu16_epi64(__mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovzxwq

Zero extend packed unsigned 16-bit integers in the low 8 bytes of a to packed 64-bit integers, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm_mask_packs_epi32

__m128i _mm_mask_packs_epi32(__m128i src, __mmask8 k, __m128i a, __m128i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpackssdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_packs_epi32

__m128i _mm_maskz_packs_epi32(__mmask8 k, __m128i a, __m128i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpackssdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_packs_epi32

__m256i _mm256_mask_packs_epi32(__m256i src, __mmask16 k, __m256i a, __m256i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpackssdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_packs_epi32

__m256i _mm256_maskz_packs_epi32(__mmask16 k, __m256i a, __m256i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpackssdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_mask_packs_epi32

__m512i _mm512_mask_packs_epi32(__m512i src, __mmask32 k, __m512i a, __m512i b)

CPUID Flags: AVX512BW

Instruction(s): vpackssdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_packs_epi32

__m512i _mm512_maskz_packs_epi32(__mmask32 k, __m512i a, __m512i b)

CPUID Flags: AVX512BW

Instruction(s): vpackssdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_packs_epi32

__m512i _mm512_packs_epi32(__m512i a, __m512i b)

CPUID Flags: AVX512BW

Instruction(s): vpackssdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using signed saturation, and return the results.



_mm_mask_packs_epi16

__m128i _mm_mask_packs_epi16(__m128i src, __mmask16 k, __m128i a, __m128i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpacksswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_packs_epi16

__m128i _mm_maskz_packs_epi16(__mmask16 k, __m128i a, __m128i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpacksswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_packs_epi16

__m256i _mm256_mask_packs_epi16(__m256i src, __mmask32 k, __m256i a, __m256i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpacksswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_packs_epi16

__m256i _mm256_maskz_packs_epi16(__mmask32 k, __m256i a, __m256i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpacksswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_mask_packs_epi16

__m512i _mm512_mask_packs_epi16(__m512i src, __mmask64 k, __m512i a, __m512i b)

CPUID Flags: AVX512BW

Instruction(s): vpacksswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_packs_epi16

__m512i _mm512_maskz_packs_epi16(__mmask64 k, __m512i a, __m512i b)

CPUID Flags: AVX512BW

Instruction(s): vpacksswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_packs_epi16

__m512i _mm512_packs_epi16(__m512i a, __m512i b)

CPUID Flags: AVX512BW

Instruction(s): vpacksswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using signed saturation, and return the results.



_mm_mask_packus_epi32

__m128i _mm_mask_packus_epi32(__m128i src, __mmask8 k, __m128i a, __m128i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpackusdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_packus_epi32

__m128i _mm_maskz_packus_epi32(__mmask8 k, __m128i a, __m128i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpackusdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_packus_epi32

__m256i _mm256_mask_packus_epi32(__m256i src, __mmask16 k, __m256i a, __m256i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpackusdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_packus_epi32

__m256i _mm256_maskz_packus_epi32(__mmask16 k, __m256i a, __m256i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpackusdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_mask_packus_epi32

__m512i _mm512_mask_packus_epi32(__m512i src, __mmask32 k, __m512i a, __m512i b)

CPUID Flags: AVX512BW

Instruction(s): vpackusdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_packus_epi32

__m512i _mm512_maskz_packus_epi32(__mmask32 k, __m512i a, __m512i b)

CPUID Flags: AVX512BW

Instruction(s): vpackusdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_packus_epi32

__m512i _mm512_packus_epi32(__m512i a, __m512i b)

CPUID Flags: AVX512BW

Instruction(s): vpackusdw

Convert packed 32-bit integers from a and b to packed 16-bit integers using unsigned saturation, and return the results.



_mm_mask_packus_epi16

__m128i _mm_mask_packus_epi16(__m128i src, __mmask16 k, __m128i a, __m128i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpackuswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm_maskz_packus_epi16

__m128i _mm_maskz_packus_epi16(__mmask16 k, __m128i a, __m128i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpackuswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm256_mask_packus_epi16

__m256i _mm256_mask_packus_epi16(__m256i src, __mmask32 k, __m256i a, __m256i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpackuswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm256_maskz_packus_epi16

__m256i _mm256_maskz_packus_epi16(__mmask32 k, __m256i a, __m256i b)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpackuswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_mask_packus_epi16

__m512i _mm512_mask_packus_epi16(__m512i src, __mmask64 k, __m512i a, __m512i b)

CPUID Flags: AVX512BW

Instruction(s): vpackuswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and return the results using writemask k (elements are copied from src when the corresponding mask bit is not set).



_mm512_maskz_packus_epi16

__m512i _mm512_maskz_packus_epi16(__mmask64 k, __m512i a, __m512i b)

CPUID Flags: AVX512BW

Instruction(s): vpackuswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and return the results using zeromask k (elements are zeroed out when the corresponding mask bit is not set).



_mm512_packus_epi16

__m512i _mm512_packus_epi16(__m512i a, __m512i b)

CPUID Flags: AVX512BW

Instruction(s): vpackuswb

Convert packed 16-bit integers from a and b to packed 8-bit integers using unsigned saturation, and return the results.



_mm_mask_cvtepi32_storeu_epi8

void _mm_mask_cvtepi32_storeu_epi8(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdb

Convert packed 32-bit integers in a to packed 8-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtepi32_storeu_epi8

void _mm256_mask_cvtepi32_storeu_epi8(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdb

Convert packed 32-bit integers in a to packed 8-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtepi32_storeu_epi16

void _mm_mask_cvtepi32_storeu_epi16(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdw

Convert packed 32-bit integers in a to packed 16-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtepi32_storeu_epi16

void _mm256_mask_cvtepi32_storeu_epi16(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovdw

Convert packed 32-bit integers in a to packed 16-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtepi64_storeu_epi8

void _mm_mask_cvtepi64_storeu_epi8(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqb

Convert packed 64-bit integers in a to packed 8-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtepi64_storeu_epi8

void _mm256_mask_cvtepi64_storeu_epi8(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqb

Convert packed 64-bit integers in a to packed 8-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtepi64_storeu_epi32

void _mm_mask_cvtepi64_storeu_epi32(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqd

Convert packed 64-bit integers in a to packed 32-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtepi64_storeu_epi32

void _mm256_mask_cvtepi64_storeu_epi32(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqd

Convert packed 64-bit integers in a to packed 32-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtepi64_storeu_epi16

void _mm_mask_cvtepi64_storeu_epi16(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqw

Convert packed 64-bit integers in a to packed 16-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtepi64_storeu_epi16

void _mm256_mask_cvtepi64_storeu_epi16(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovqw

Convert packed 64-bit integers in a to packed 16-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtsepi32_storeu_epi8

void _mm_mask_cvtsepi32_storeu_epi8(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdb

Convert packed 32-bit integers in a to packed 8-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtsepi32_storeu_epi8

void _mm256_mask_cvtsepi32_storeu_epi8(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdb

Convert packed 32-bit integers in a to packed 8-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtsepi32_storeu_epi16

void _mm_mask_cvtsepi32_storeu_epi16(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdw

Convert packed 32-bit integers in a to packed 16-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtsepi32_storeu_epi16

void _mm256_mask_cvtsepi32_storeu_epi16(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsdw

Convert packed 32-bit integers in a to packed 16-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtsepi64_storeu_epi8

void _mm_mask_cvtsepi64_storeu_epi8(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqb

Convert packed 64-bit integers in a to packed 8-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtsepi64_storeu_epi8

void _mm256_mask_cvtsepi64_storeu_epi8(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqb

Convert packed 64-bit integers in a to packed 8-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtsepi64_storeu_epi32

void _mm_mask_cvtsepi64_storeu_epi32(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqd

Convert packed 64-bit integers in a to packed 32-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtsepi64_storeu_epi32

void _mm256_mask_cvtsepi64_storeu_epi32(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqd

Convert packed 64-bit integers in a to packed 32-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtsepi64_storeu_epi16

void _mm_mask_cvtsepi64_storeu_epi16(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqw

Convert packed 64-bit integers in a to packed 16-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtsepi64_storeu_epi16

void _mm256_mask_cvtsepi64_storeu_epi16(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovsqw

Convert packed 64-bit integers in a to packed 16-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtsepi16_storeu_epi8

void _mm_mask_cvtsepi16_storeu_epi8(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovswb

Convert packed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtsepi16_storeu_epi8

void _mm256_mask_cvtsepi16_storeu_epi8(void* base_addr, __mmask16 k, __m256i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovswb

Convert packed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm512_mask_cvtsepi16_storeu_epi8

void _mm512_mask_cvtsepi16_storeu_epi8(void* base_addr, __mmask32 k, __m512i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovswb

Convert packed 16-bit integers in a to packed 8-bit integers with signed saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtusepi32_storeu_epi8

void _mm_mask_cvtusepi32_storeu_epi8(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdb

Convert packed unsigned 32-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtusepi32_storeu_epi8

void _mm256_mask_cvtusepi32_storeu_epi8(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdb

Convert packed unsigned 32-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtusepi32_storeu_epi16

void _mm_mask_cvtusepi32_storeu_epi16(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdw

Convert packed unsigned 32-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtusepi32_storeu_epi16

void _mm256_mask_cvtusepi32_storeu_epi16(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusdw

Convert packed unsigned 32-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtusepi64_storeu_epi8

void _mm_mask_cvtusepi64_storeu_epi8(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqb

Convert packed unsigned 64-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtusepi64_storeu_epi8

void _mm256_mask_cvtusepi64_storeu_epi8(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqb

Convert packed unsigned 64-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtusepi64_storeu_epi32

void _mm_mask_cvtusepi64_storeu_epi32(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqd

Convert packed unsigned 64-bit integers in a to packed unsigned 32-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtusepi64_storeu_epi32

void _mm256_mask_cvtusepi64_storeu_epi32(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqd

Convert packed unsigned 64-bit integers in a to packed unsigned 32-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtusepi64_storeu_epi16

void _mm_mask_cvtusepi64_storeu_epi16(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqw

Convert packed unsigned 64-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtusepi64_storeu_epi16

void _mm256_mask_cvtusepi64_storeu_epi16(void* base_addr, __mmask8 k, __m256i a)

CPUID Flags: AVX512F, AVX512VL

Instruction(s): vpmovusqw

Convert packed unsigned 64-bit integers in a to packed unsigned 16-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtusepi16_storeu_epi8

void _mm_mask_cvtusepi16_storeu_epi8(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovuswb

Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtusepi16_storeu_epi8

void _mm256_mask_cvtusepi16_storeu_epi8(void* base_addr, __mmask16 k, __m256i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovuswb

Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm512_mask_cvtusepi16_storeu_epi8

void _mm512_mask_cvtusepi16_storeu_epi8(void* base_addr, __mmask32 k, __m512i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovuswb

Convert packed unsigned 16-bit integers in a to packed unsigned 8-bit integers with unsigned saturation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm_mask_cvtepi16_storeu_epi8

void _mm_mask_cvtepi16_storeu_epi8(void* base_addr, __mmask8 k, __m128i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovwb

Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm256_mask_cvtepi16_storeu_epi8

void _mm256_mask_cvtepi16_storeu_epi8(void* base_addr, __mmask16 k, __m256i a)

CPUID Flags: AVX512BW, AVX512VL

Instruction(s): vpmovwb

Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.



_mm512_mask_cvtepi16_storeu_epi8

void _mm512_mask_cvtepi16_storeu_epi8(void* base_addr, __mmask32 k, __m512i a)

CPUID Flags: AVX512BW

Instruction(s): vpmovwb

Convert packed 16-bit integers in a to packed 8-bit integers with truncation, and store the active results (those with their respective bit set in writemask k) to unaligned memory at base_addr.