Skip To Main Content
Intel logo - Return to the home page
My Tools

Select Your Language

  • Bahasa Indonesia
  • Deutsch
  • English
  • Español
  • Français
  • Português
  • Tiếng Việt
  • ไทย
  • 한국어
  • 日本語
  • 简体中文
  • 繁體中文
Sign In to access restricted content

Using Intel.com Search

You can easily search the entire Intel.com site in several ways.

  • Brand Name: Core i9
  • Document Number: 123456
  • Code Name: Emerald Rapids
  • Special Operators: “Ice Lake”, Ice AND Lake, Ice OR Lake, Ice*

Quick Links

You can also try the quick links below to see results for most popular searches.

  • Product Information
  • Support
  • Drivers & Software

Recent Searches

Sign In to access restricted content

Advanced Search

Only search in

Sign in to access restricted content.

The browser version you are using is not recommended for this site.
Please consider upgrading to the latest version of your browser by clicking one of the following links.

  • Safari
  • Chrome
  • Edge
  • Firefox

Vectorize Data for RAG and Enhance Inference with IBM watsonx*

@IntelDevTools

Subscribe Now

Stay in the know on all things CODE. Updates are delivered to your inbox.

Sign Up

Overview

The combined scalability and efficiency of Intel® hardware and the flexible power of the IBM watsonx* AI portfolio make an ideal platform for gaining the most from retrieval augmented generation (RAG). This session guides developers to create and launch models that use the advanced capabilities of these hardware and software platforms. It also highlights the advanced optimizations that make Intel® Xeon® processors an ideal choice for hosting embedding models and efficiently handling embedding queries within a RAG pipeline at scale.

Other highlights of the session include a demonstration of the RAG pipeline with an embedding server, a Milvus* vector data store, and an LLM inference with examples illustrating the capabilities of the 4th gen Intel® Xeon® Scalable processor when used with popular Llama models.

The webinar is designed for intermediate-level AI researchers, data scientists, and machine learning engineers who work with RAG pipelines, optimized context-aware LLMs, and vector searches. Technology architects and enterprise developers may also gain new techniques for optimizing AI workloads on Intel hardware.

The following topics are covered in this session:

  • Optimizing AI workloads on 4th gen Intel Xeon Scalable processors to capitalize on Intel® Advanced Matrix Extensions (Intel® AMX) and Intel® Extension for PyTorch* to optimize and improve the embedding model inference with AI.
  • Designing and deploying a RAG pipeline, which includes embedding generation, vector searches using Milvus, and LLM inference.
  • Integrating watsonx for AI applications. Explore techniques for building and scaling context-aware AI solutions with optimized embedding models.
  • Maximizing performance of Llama models, which covers enterprise-level techniques for optimizing Llama models on a RAG-based pipeline.
  • Gaining hands-on experience with the RAG pipeline and 4th gen Intel Xeon Scalable processors. 
  • Company Overview
  • Contact Intel
  • Newsroom
  • Investors
  • Careers
  • Corporate Responsibility
  • Inclusion
  • Public Policy
  • © Intel Corporation
  • Terms of Use
  • *Trademarks
  • Cookies
  • Privacy
  • Supply Chain Transparency
  • Site Map
  • Recycling
  • Your Privacy Choices California Consumer Privacy Act (CCPA) Opt-Out Icon
  • Notice at Collection

Intel technologies may require enabled hardware, software or service activation. // No product or component can be absolutely secure. // Your costs and results may vary. // Performance varies by use, configuration, and other factors. Learn more at intel.com/performanceindex. // See our complete legal Notices and Disclaimers. // Intel is committed to respecting human rights and avoiding causing or contributing to adverse impacts on human rights. See Intel’s Global Human Rights Principles. Intel’s products and software are intended only to be used in applications that do not cause or contribute to adverse impacts on human rights.

Intel Footer Logo