Description
In this whitepaper, we demonstrate how to perform hardware platform-specific optimizations to improve the inference speed of a LLaMA2 model with llama.cpp (an open-source inference framework for LLaMA models) running on the Intel® CPU platform.