Document Automation for Efficient Claims Processing Using NLP

Business Results

Up to 93% faster inference time in batch

Up to 94% faster inference in real time

For workloads and configurations, see GitHub*. Results may vary.

Background

This reference kit uses the healthcare payers industry as an example of how this kit can be applied. Payers in this industry employ hundreds of clinicians to review millions of claim-related documents for relevant data points to then draw a conclusion. Clinicians remain in high demand, which will continue to increase because of an aging population and reduced revenue from the recent pandemic. Organizations are pressured to save costs without impacting the quality of patient care. At the same time, information is exponentially increasing, often in nonstandard formats, making it difficult for payers to manage.

According to Gartner^†, each claim requiring human intervention costs about $4 to process, and auto adjudicated claims cost about $1 to process. Manual rework costs about $25 per claim to process. With more than 3 billion claims processed in the US per year, automation assisted claims processing can be a valuable cost savings measure.

To read and classify document text, Named Entity Recognition (NER), a subtask of information extraction, is used to locate and classify named entities in texts into predefined categories. These categories could include the names of persons, organizations, locations, expressions of times, quantities, monetary values, and percentages.

Solution

In collaboration with Accenture*, Intel developed an AI reference kit that can be used in an application to help payers automate the review of claims. Each reference kit includes:

Training data
An open source, trained model
Libraries
User guides
oneAPI components

At a Glance

Industry: Healthcare, cross-industry back-office document automation
Task: Information extraction to locate and classify named entities in text into nine predefined categories or tags.
Dataset: Real data consisting of 48,000 sentences with corresponding parts of speech and labeled tags in .csv format
- 3 features
- 90:10 split (training:validation)
Type of Learning: Natural language processing (NLP) supervised deep learning
Models: BERT for NER (Named Entity Recognition) model
Output: Predicts the named entity tagging for each word in the sentence into one of the nine predefined categories.
Intel® AI Portfolio:
- Intel® AI Analytics Toolkit
- Intel® Optimization for TensorFlow*

The experiment performed in this kit expedites the classification of key information from the text in the document into predefined categories.

While GPUs are the natural choice for deep learning and AI processing to achieve a higher frame per second (FPS) rate, they are also expensive and memory consuming. This experiment applies model quantization using Intel® technology (Intel® QuickAssist Technology and Intel® Neural Compressor), which compresses the models using the CPU for processing while maintaining accuracy and speeding up the inference time of named-entity tagging.

Technology

Optimized with Intel oneAPI for Better Performance

Performance was tested on Microsoft Azure* Standard_D8_V5 using 3rd generation Intel® Xeon® processors to optimize the kit.

Benefits

To build a named entity recognition solution using the BERT transfer learning approach at scale, data scientists need to train models using substantial datasets and run inference more frequently to accommodate the variety of text in the documents that could change with every claim. The data scientist needs to evaluate data classifications to tag and categorize data so that it can be better understood and analyzed. This task requires repetitive training and retraining, making the job tedious.

With over 90% faster inferencing on Intel®-optimized software, data scientists can accommodate new text constructs and contexts in documents, accelerate the machine learning pipeline, and achieve better accuracy.

For healthcare payers, the Documentation Automation reference kit can minimize a labor-intensive claims process, provide potentially significant cost savings, and deliver greater customer satisfaction from more accurate and timely responses. It also allows healthcare payers a way to scale their business without the constraint of hiring more clinicians to review.

Download Kit

References

† Gartner: "Strategic Automation Decision Framework: From RPA to AI on the Journey to Hyperautomation in Healthcare." Published December 14, 2020

Stay Up to Date on AI Workload Optimizations

Sign up to receive hand-curated technical articles, tutorials, developer tools, training opportunities, and more to help you accelerate and optimize your end-to-end AI and data science workflows.

Take a chance and subscribe. You can change your mind at any time.

Select Your Language

Using Intel.com Search

Quick Links

Recent Searches

Advanced Search

Only search in

Document Automation for Efficient Claims Processing Using NLP

Business Results

Background

Solution

At a Glance

Technology

Benefits

References

Stay Up to Date on AI Workload Optimizations

You’re In!

Failed to submit your form.

Form Submission Failed

Product and Performance Information

Using Intel.com Search

Quick Links

Recent Searches

Advanced Search

Only search in

Document Automation for Efficient Claims Processing Using NLP

Business Results

Background

Solution

At a Glance

Technology

Benefits

References

Stay Up to Date on AI Workload Optimizations

You’re In!

Failed to submit your form.

Form Submission Failed

Your registration cannot proceed. The materials on this site are subject to U.S. and other applicable export control laws and are not accessible from all locations.

Product and Performance Information