
The HA logs show that health monitoring for some SAP resources keeps reporting errors:

1. ha.txt

Failed Actions:
* rsc_SAPHanaTopology_HWP_HDB00_monitor_10000 on HQSAPBOBW01 'not configured' (6): call=60, status=complete, exitreason='none',
    last-rc-change='Tue Apr  6 21:46:20 2021', queued=0ms, exec=0ms
* st-HQSAPBOBW02-ipmilan_monitor_3600000 on HQSAPBOBW01 'unknown error' (1): call=59, status=Timed Out, exitreason='none',
    last-rc-change='Tue Nov 17 21:32:24 2020', queued=0ms, exec=21629ms

2. pacemaker.log

haHQSAPBOBW02 stonith-ng:     info: stonith_device_remove: Device 'rsc_SAPHanaTopology_HWP_HDB00' not found (1 active devices)
Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng:     info: stonith_device_remove: Device 'rsc_SAPHanaTopology_HWP_HDB00' not found (1 active devices)
Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng:     info: stonith_device_remove: Device 'rsc_SAPHanaTopology_HWP_HDB00' not found (1 active devices)
Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng:     info: stonith_device_remove: Device 'rsc_SAPHanaTopology_HWP_HDB00' not found (1 active devices)
Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng:     info: update_cib_stonith_devices_v2: Updating device list from the cib: create operations[@id='rsc_sap2_HWP_HDB00-operations']
Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng:     info: cib_devices_update:    Updating devices to version 0.1330360.0
Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng:     info: cib_device_update:     Device st-HQSAPBOBW02-ipmilan has been disabled on HQSAPBOBW02: score=-INFINITY
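
To compare "Failed Actions" entries like the ones from ha.txt across several status dumps, the fields can be split out with a small script. The following is only a minimal sketch in Python, assuming the two-line entry layout shown in the excerpt and an operation key of the form <resource>_<action>_<interval in ms>. The name parse_failed_actions is a hypothetical helper, not part of Pacemaker.

```python
import re

# First line of a "Failed Actions" entry, as it appears in the ha.txt excerpt.
# Assumption: the operation key is <resource>_<action>_<interval-ms>.
ENTRY_RE = re.compile(
    r"^\*\s+(?P<resource>\S+)_(?P<action>[a-z]+)_(?P<interval>\d+)\s+on\s+(?P<node>\S+)\s+"
    r"'(?P<reason>[^']*)'\s+\((?P<rc>\d+)\):.*?status=(?P<status>[^,]+),"
)
# Continuation line carrying the time of the last return-code change.
RC_CHANGE_RE = re.compile(r"last-rc-change='(?P<when>[^']*)'")

# Sample copied from the ha.txt excerpt.
SAMPLE = """\
Failed Actions:
* rsc_SAPHanaTopology_HWP_HDB00_monitor_10000 on HQSAPBOBW01 'not configured' (6): call=60, status=complete, exitreason='none',
    last-rc-change='Tue Apr  6 21:46:20 2021', queued=0ms, exec=0ms
* st-HQSAPBOBW02-ipmilan_monitor_3600000 on HQSAPBOBW01 'unknown error' (1): call=59, status=Timed Out, exitreason='none',
    last-rc-change='Tue Nov 17 21:32:24 2020', queued=0ms, exec=21629ms
"""


def parse_failed_actions(text):
    """Return one dict per failed action found in a status dump (hypothetical helper)."""
    entries = []
    for raw in text.splitlines():
        line = raw.strip()
        match = ENTRY_RE.match(line)
        if match:
            entry = match.groupdict()
            entry["rc"] = int(entry["rc"])
            entry["interval"] = int(entry["interval"])
            entries.append(entry)
        # The timestamp may sit on the entry line or on the following line.
        changed = RC_CHANGE_RE.search(line)
        if changed and entries:
            entries[-1]["last_rc_change"] = changed.group("when")
    return entries


if __name__ == "__main__":
    for e in parse_failed_actions(SAMPLE):
        print(e["resource"], e["action"], e["node"], e["reason"], e["rc"], e["status"])
```

Splitting the operation key from the right keeps resource names that themselves contain underscores, such as rsc_SAPHanaTopology_HWP_HDB00, intact.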

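After an initial investigation of the recent HA and system logs (the HA logs only go back to April 7), I found that health monitoring for some SAP resources keeps reporting errors. The main one is the resource/device "rsc_SAPHanaTopology_HWP_HDB00", which repeatedly shows "not configured" or "not found". This may cause HA to behave abnormally. I recommend confirming whether this resource/device is still in use. PS: For comparison, in the HA log from 2020-09-15 the corresponding service was in the "active" state.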
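
To put a number on how often the "not found" message recurs for each device, the pacemaker.log lines can be tallied. This is a rough sketch in Python, assuming the stonith_device_remove message format seen in the excerpt. The name count_not_found is a hypothetical helper, and the sample lines are copied from the log excerpt above.

```python
import re
from collections import Counter

# Matches the stonith-ng message format seen in the pacemaker.log excerpt.
NOT_FOUND_RE = re.compile(
    r"stonith_device_remove:\s+Device '(?P<device>[^']+)' not found"
)

# Sample lines copied from the pacemaker.log excerpt.
SAMPLE_LINES = [
    "Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng:     info: stonith_device_remove: Device 'rsc_SAPHanaTopology_HWP_HDB00' not found (1 active devices)",
    "Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng:     info: stonith_device_remove: Device 'rsc_SAPHanaTopology_HWP_HDB00' not found (1 active devices)",
    "Apr 08 14:16:58 [5237] HQSAPBOBW02 stonith-ng:     info: cib_device_update:     Device st-HQSAPBOBW02-ipmilan has been disabled on HQSAPBOBW02: score=-INFINITY",
]


def count_not_found(lines):
    """Count 'Device ... not found' messages per device name (hypothetical helper)."""
    counts = Counter()
    for line in lines:
        match = NOT_FOUND_RE.search(line)
        if match:
            counts[match.group("device")] += 1
    return counts


if __name__ == "__main__":
    for device, n in count_not_found(SAMPLE_LINES).most_common():
        print(device, n)
```

Because the function accepts any iterable of lines, an open file handle for the full log can be passed in place of SAMPLE_LINES.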

首頁/2021-07-08 (last edited 2021-07-08 10:21:19 by merlyn)