‘udev’ rules continuously being reloaded resulted in ASM nvme disks going offline
July 12, 2020 Leave a comment
Environment:
Linux Server release 7.2
kernel-3.10.0-514.26.2.el7.x86_64
Problem:
When Oracle processes are opening the device for writing and then closing it, this synthesizes a change
event. And udev
rules having ACTION=="add|change"
gets reloaded. This behavior causes ASM nvme disks to go offline:
Thu Jul 09 16:33:16 2020
WARNING: Disk 18 (rac1$disk1) in group 2 mode 0x7f is now being offlined
Fri Jul 10 10:04:34 2020
WARNING: Disk 19 (rac1$disk5) in group 2 mode 0x7f is now being offlined
Fri Jul 10 13:45:45 2020
WARNING: Disk 15 (rac1$disk8) in group 2 mode 0x7f is now being offlined
Solution:
To suppress the false positive change
events disable the inotify
watch
for devices used for Oracle ASM using following steps:
- Create
/etc/udev/rules.d/96-nvme-nowatch.rules
file with just one line in it:
ACTION=="add|change", KERNEL=="nvme*", OPTIONS:="nowatch"
2. After creating the file please run the following to activate the rule:
# udevadm control --reload-rules
# udevadm trigger --type=devices --action=change
The above command will reload the complete udev configuration and will trigger all the udev
rules. On a busy production system this could disrupt ongoing operations, applications running on the server. Please use the above command during a scheduled maintenance window only.
Source: https://access.redhat.com/solutions/1465913 + our experience with customers.