Accelerate TensorFlow Model Inference on CPUs

Video Series: 2022 oneAPI DevSummit China

Accelerate TensorFlow* Model Inference on CPUs with Intel® AI Technology

Overview

This training session focuses on:

Intel® Optimization for TensorFlow* on an Intel® Xeon® platform
Intel® Neural Compressor, a quantization tool for AI model optimization

A demo shows the following process:

Train an FP32 TensorFlow model.
Use Intel Neural Compressor to quantize the FP32 model into an optimized int8 model.
Measure the performance gain and accuracy loss of the int8 model relative to the FP32 model on an Intel Xeon platform with Intel® Deep Learning Boost technology in the Intel® Tiber™ AI Cloud.
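
To make the quantization step more concrete, the following is a minimal sketch of symmetric per-tensor int8 quantization in plain Python. It is an illustration of the general idea only and does not use or reproduce the Intel Neural Compressor API or its actual algorithm. The function names (quantize_int8, dot_fp32, dot_int8) and the sample values are hypothetical and chosen for this example.

```python
# Illustrative sketch only: symmetric per-tensor int8 quantization.
# This is NOT the Intel Neural Compressor API; every name below is hypothetical.

def quantize_int8(values):
    """Map floats to integer codes in [-127, 127] using one shared scale."""
    max_abs = max(abs(v) for v in values)
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale

def dot_fp32(a, b):
    """Reference dot product in floating point."""
    return sum(x * y for x, y in zip(a, b))

def dot_int8(a_codes, a_scale, b_codes, b_scale):
    """Accumulate in integers, then rescale once to a float result."""
    acc = sum(x * y for x, y in zip(a_codes, b_codes))
    return acc * a_scale * b_scale

# Assumed sample data standing in for one layer's weights and activations.
weights = [0.42, -1.30, 0.07, 0.88]
activations = [1.5, 0.25, -2.0, 0.6]

w_q, w_s = quantize_int8(weights)
a_q, a_s = quantize_int8(activations)

reference = dot_fp32(weights, activations)
approx = dot_int8(w_q, w_s, a_q, a_s)
abs_error = abs(reference - approx)

print("FP32 result:", reference)
print("int8 result:", approx)
print("absolute error:", abs_error)
```

The integer accumulation in dot_int8 is the part that low-precision CPU instructions can speed up, while abs_error gives a small-scale view of the accuracy loss that the demo measures on a full model.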

Speaker

Zhang (Neo) Jianyu is a senior software engineer working on Intel® AI software solutions. He focuses on AI solutions and performance optimization on Intel® platforms (CPUs and GPUs). He holds a master's degree in pattern recognition and AI from Northwestern Polytechnical University and has experience in AI, virtualization, communication, and embedded software development.


Product and Performance Information

Performance varies by use, configuration and other factors. Learn more at www.Intel.com/PerformanceIndex.
