在這裡編輯新頁面「首頁/2021-07-08」
我看到HA日志中有些SAP资源健康状态监控一直存在报错,
1. ha.txt
Failed Actions:
* rsc_SAPHanaTopology_HWP_HDB00_monitor_10000 on HQSAPBOBW01 'not configured' (6): call=60, status=complete, exitreason='none',
last-rc-change='Tue Apr 6 21:46:20 2021', queued=0ms, exec=0ms
* st-HQSAPBOBW02-ipmilan_monitor_3600000 on HQSAPBOBW01 'unknown error' (1): call=59, status=Timed Out, exitreason='none',
last-rc-change='Tue Nov 17 21:32:24 2020', queued=0ms, exec=21629ms2. pacemaker.log
haHQSAPBOBW02 stonith-ng: info: stonith_device_remove: Device 'rsc_SAPHanaTopology_HWP_HDB00' not found (1 active devices) Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng: info: stonith_device_remove: Device 'rsc_SAPHanaTopology_HWP_HDB00' not found (1 active devices) Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng: info: stonith_device_remove: Device 'rsc_SAPHanaTopology_HWP_HDB00' not found (1 active devices) Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng: info: stonith_device_remove: Device 'rsc_SAPHanaTopology_HWP_HDB00' not found (1 active devices) Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng: info: update_cib_stonith_devices_v2: Updating device list from the cib: create operations[@id='rsc_sap2_HWP_HDB00-operations'] Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng: info: cib_devices_update: Updating devices to version 0.1330360.0 Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng: info: cib_device_update: Device st-HQSAPBOBW02-ipmilan has been disabled on HQSAPBOBW02: score=-INFINITY
经过初步排查,我看到近期HA及系统日志(HA日志只有04月07日开始的日志记录)中注意到一些SAP资源健康状态监控一直存在报错,主要是资源/设备“rsc_SAPHanaTopology_HWP_HDB00”显示"not configured"或"no found", 持续出现,这可能导致HA异常。建议确认此资源/设备是否在使用。 PS:但是我比较了2020-09-15的HA日志,对应的服务是"active"状态。
