Article ID: 000095012 Content Type: Troubleshooting Last Reviewed: 05/22/2023

Troubleshooting Threshold Sensors Dropping below Lower Threshold on Intel® Server

Summary

Steps to isolate or resolve threshold sensors dropping below lower threshold on Intel® Server

Description

One or more threshold sensors have dropped below their warning or critical low thresholds.
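To see which sensors are affected, it can help to compare each reading with its lower thresholds. The sketch below is one possible approach, not an Intel tool. It assumes a pipe-separated sensor listing in the column order printed by `ipmitool sensor` (name, reading, unit, status, lower non-recoverable, lower critical, lower non-critical, then the upper thresholds). The helper names and all sample values are hypothetical.

```python
from typing import List, NamedTuple, Optional


class SensorReading(NamedTuple):
    name: str
    value: float
    unit: str
    lower_critical: Optional[float]
    lower_warning: Optional[float]  # "lower non-critical" in IPMI terms


def _to_float(field: str) -> Optional[float]:
    """Return the field as a float, or None for 'na' and other non-numeric text."""
    try:
        return float(field)
    except ValueError:
        return None


def parse_sensor_listing(text: str) -> List[SensorReading]:
    """Parse a pipe-separated listing; the column order is an assumption."""
    readings = []
    for line in text.splitlines():
        fields = [f.strip() for f in line.split("|")]
        if len(fields) < 7:
            continue  # not a sensor row
        value = _to_float(fields[1])
        if value is None:
            continue  # discrete sensor or no reading available
        readings.append(
            SensorReading(fields[0], value, fields[2],
                          _to_float(fields[5]), _to_float(fields[6]))
        )
    return readings


def low_threshold_state(reading: SensorReading) -> str:
    """Return 'critical', 'warning', or 'ok' based on the lower thresholds only."""
    if reading.lower_critical is not None and reading.value < reading.lower_critical:
        return "critical"
    if reading.lower_warning is not None and reading.value < reading.lower_warning:
        return "warning"
    return "ok"


# Hypothetical sample data, for illustration only.
SAMPLE = """\
P12V AUX      | 11.100 | Volts     | nc     | 10.300 | 10.700 | 11.200 | 12.800 | 13.300 | 13.700
P3VBAT        | 2.500  | Volts     | cr     | na     | 2.700  | 2.950  | na     | na     | na
Exit Air Temp | 30.000 | degrees C | ok     | na     | 0.000  | 5.000  | 70.000 | 75.000 | na
PS1 Status    | 0x0    | discrete  | 0x0100 | na     | na     | na     | na     | na     | na
"""

for reading in parse_sensor_listing(SAMPLE):
    print(f"{reading.name}: {reading.value} {reading.unit} -> {low_threshold_state(reading)}")
```

Sensors reported as `warning` or `critical` can then be matched to the sensor groups in the Resolution section.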

Resolution

Follow these steps:

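The sections below are grouped by sensor name. Each group lists the affected sensors, followed by a description of the fault and the resolution steps.

Sensor names:
  • P12V AUX
  • P12V PSU
  • P3V3
  • P5V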

System voltage has dropped below its normal operating range. Typically, this issue is caused by a failure in one of the power supplies, but it may also be caused by a short circuit on the baseboard, SATA drives, fans, or PCIe cards.

Try these steps to isolate the source of the voltage excursion event:

  • Ensure all cables are connected correctly.
  • Check connections on the fans and SATA drives.
  • Remove and inspect all potentially impacted components. Reinstall the components one at a time to isolate the failure.
  • Inspect for contamination in connectors (DIMMs, PCIe*).
  • If the issue remains, re-seat the power supplies. Replace the power supplies if re-seating does not solve the problem.

Sensor names:
  • P1V8 PCH
  • P105 PCH AUX
  • PVNN PCH AUX

One or more of the voltages on the baseboard is outside its normal operating range. This typically indicates a baseboard failure.

In order, try these steps to isolate the source of the voltage excursion event:

  • Remove all components except the minimum required for operation, then check the sensor.
  • If the error remains, replace the baseboard.

Sensor names:
  • P3VBAT

The system has experienced a baseboard CMOS battery (BB +3.3V Vbat) voltage excursion, typically caused by contamination on the surface of the CMOS battery.

In order, try these steps:

  • Remove the battery, wipe it with alcohol to remove potential contamination, reinstall it, and confirm that the battery voltage is 2.95 V or higher.
  • Replace the CMOS battery. Any battery of type CR2032 can be used.
  • If the error remains, replace the baseboard.

Sensor names:
  • PVCCD HV CPU
  • PVCCFA EHV CPU
  • PVCCINFAON CPU
  • PVCCIN CPU

Processor voltage has dropped below its normal operating range. Typically, this issue is caused by a failure in one of the power supplies, but it may also be caused by a short circuit on the baseboard, SATA drives, fans, or PCIe cards.

If the fault is asserted and then de-asserted immediately afterward, take no action but continue to monitor the system.

Otherwise:

  • Ensure the processor is seated properly and secured with the correct torque value.
  • Cross-test the processors by swapping them between sockets. If the issue stays with the processor socket, replace the baseboard. If the issue follows the processor, replace the processor.

Sensor names:
  • CPU VR Temp
  • Riser Temp
  • Exit Air Temp
  • Front Panel Temp
  • BB M2 Temp
  • BB OCP Temp
  • BB P0/P1 VR Temp
  • BB Riser2 Temp
  • HSBP Temp

The temperature sensor has dropped below its normal operating range.

  • Ensure the air/liquid used to cool the system is within the thermal specification for the system.
  • For the Front Panel Temp sensor, ensure that the front panel cable is connected properly to the baseboard.
  • Ensure the latest FRUSDR update has been run for the system and the correct chassis is detected or selected.
  • Ensure that the server room is maintained at the optimal ambient temperature.

Sensor names:
  • Fan
  • PSU Fan

The fans are spinning at a lower RPM than expected. This event may be preceded by several Non-Critical Fan errors.
  • Ensure the latest FRUSDR update has been run for the system and the correct chassis is detected or selected.
  • Hot-swap the fan with a known working fan.
  • Shut down the system, clean the fan, and plug it back in. Restart the system.
  • For 2U systems, if the fan LED is glowing amber, the fan has malfunctioned. Remove the fan and plug it back in. If the issue persists, replace the fan.
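
Several of the voltage procedures above rely on the same isolation pattern: strip the system down to a minimum configuration, then reinstall components one at a time and re-check the sensor after each one. The sketch below illustrates that loop only. The function name, the component labels, and the `sensor_ok` callback are all hypothetical. In practice, `sensor_ok` would be a manual step (power on the system and check the sensor reading) rather than code.

```python
from typing import Callable, Iterable, List, Optional

# Sentinel returned when the fault is present even in the minimum configuration.
MINIMUM_CONFIG = "<minimum configuration>"


def isolate_faulty_component(
    minimum: Iterable[str],
    optional: Iterable[str],
    sensor_ok: Callable[[List[str]], bool],
) -> Optional[str]:
    """Reinstall optional components one at a time until the sensor fails.

    Returns the first component whose installation makes the sensor fail,
    MINIMUM_CONFIG if the sensor already fails with only the minimum parts,
    or None if the fault never reappears.
    """
    installed = list(minimum)
    if not sensor_ok(installed):
        return MINIMUM_CONFIG
    for component in optional:
        installed.append(component)
        if not sensor_ok(installed):
            return component
    return None


# Hypothetical example: the fault appears only when one PCIe card is installed.
def fake_sensor_check(installed: List[str]) -> bool:
    return "PCIe card, slot 2" not in installed


suspect = isolate_faulty_component(
    minimum=["baseboard", "CPU 0", "DIMM A1", "PSU 1"],
    optional=["SATA drive 0", "PCIe card, slot 1", "PCIe card, slot 2", "fan 3"],
    sensor_ok=fake_sensor_check,
)
print(suspect)
```

A result of `MINIMUM_CONFIG` corresponds to the case where the error remains with only the minimum components installed, which the steps above resolve by replacing the baseboard.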