GPU: Disable Hangcheck
This section applies only to applications with long-running GPU compute
workloads in native environments. It is not recommended for
virtualizations or other standard usages of GPU, such as gaming.
A workload that takes more than four seconds for GPU hardware to execute
is a long-running workload. By default, individual threads that qualify
as long-running workloads are considered hung and are terminated. By
disabling the hangcheck timeout period, you can avoid this problem.
If the system is rebooted, hangcheck is automatically enabled. You
must disable hangcheck again after every reboot or follow the
directions below to disable hangcheck persistently (across multiple
reboots). Please re-run this GPU Hangcheck disable with reboot fix if
you update (or auto-update) the kernel.
To disable hangcheck until the next reboot:
sudo sh -c "echo N> /sys/module/i915/parameters/enable_hangcheck"
To disable hangcheck across multiple reboots:
If the kernel is updated, hangcheck is automatically enabled. Run the
procedure below after every kernel update to ensure hangcheck is
disabled.
- Open a terminal.
- Open the grub file in/etc/default.
- In the grub file, find the lineGRUB_CMDLINE_LINUX_DEFAULT="".
- Enter the following text between the quotes (“”):i915.enable_hangcheck=0
- Run the following command:sudo update-grub
- Reboot the system. Hangcheck remains disabled.