How Prediction Guard Delivers Trustworthy AI

How Prediction Guard Delivers Trustworthy AI on Intel® Gaudi® 2 AI Accelerators

Overview

Large language models (LLM) promise to revolutionize how enterprises operate, but making them production-ready means solving privacy risks, security vulnerabilities, and performance bottlenecks.

Not so easy.

This session focuses on how AI startup Prediction Guard found a solution to these challenges by using the processing power of Intel® Gaudi® 2 AI accelerators in the Intel® Tiber™ AI Cloud.¹The topics include:

Prediction Guard’s pioneering work with hosting open source LLMs like Llama 2 and neural-chat-7B in a secure, privacy-preserving environment with filters for PII, prompt-injection attacks, toxic outputs, and factual inconsistencies.
How Prediction Guard optimized batching, model replication, tensor shaping, and hyperparameters for 2x throughput gains and industry-leading time to first token for streaming.
Architectural insights and best practices for capitalizing on LLMs.

Skill level: Expert

Featured Software

This session showcases the Intel Tiber AI Cloud: Learn More | Sign Up

Download Code Samples

Intel and Hugging Face* Neural-Chat-7B

See All Code Samples

Other Resources

Ecosystem Developer Hub

Intel® Liftoff for Startups

Jump to:

You May Also Like

You May Also Like

Related Articles

Prediction Guard Reduces Risks in LLM Applications

Trusted AI in the Intel Tiber AI Cloud

Seekr*: Build Trustworthy LLMs for Evaluating and Generating Content at Scale

Accelerate Meta* Llama 3 with Intel AI Solutions

Related Webinar

How to Use Intel-Optimized AI Software in the Cloud

<link rel="stylesheet" href="/etc.clientlibs/settings/wcm/designs/ver/251008/intel/clientlibs/pages/commons-page.min.css" type="text/css"><script src="/etc.clientlibs/settings/wcm/designs/ver/251008/intel/clientlibs/pages/commons-page.min.js" defer></script>

<link rel="preload" href="/etc.clientlibs/settings/wcm/designs/ver/251008/intel/clientlibs/pages/atomVideo.min.css" as="style"><link rel="stylesheet" href="/etc.clientlibs/settings/wcm/designs/ver/251008/intel/clientlibs/pages/atomVideo.min.css" type="text/css"><script src="/etc.clientlibs/settings/wcm/designs/ver/251008/intel/clientlibs/pages/atomVideo.min.js"></script>

<link rel="preload" href="/etc.clientlibs/settings/wcm/designs/ver/251008/intel/clientlibs/pages/colorBlock.min.css" as="style"><link rel="stylesheet" href="/etc.clientlibs/settings/wcm/designs/ver/251008/intel/clientlibs/pages/colorBlock.min.css" type="text/css">

<script>!function(){var e=setInterval(function(){"undefined"!=typeof $CQ&&($CQ(function(){CQ_Analytics.SegmentMgr.loadSegments("/etc/segmentation"),CQ_Analytics.ClientContextUtils.init("/etc/clientcontext/intel",window.location.pathname.substr(0,window.location.pathname.indexOf(".")))}),clearInterval(e))},100)}();</script>