测试环境存储磁盘问题,导致RAC一个节点CRS启动失败

测试环境存储磁盘问题,导致RAC一个节点CRS启动失败

linux 5.6 *64, 2节点RAC 11.2.0.4

如下,按照日志,说明信息。

主机重启,发现CRS进程并未自动启动完成。
a1:/picclife/app/oracle$ crsctl check crs CRS-4638: Oracle High Availability Services is online CRS-4535: Cannot communicate with Cluster Ready Services CRS-4530: Communications failure contacting Cluster Synchronization Services daemon CRS-4534: Cannot communicate with Event Manager 查询当前节点信息,发现ASM实例 OFFLINE a1:/picclife/app/grid$ crsctl stat res -t -init -------------------------------------------------------------------------------- NAME TARGET STATE SERVER STATE_DETAILS -------------------------------------------------------------------------------- Cluster Resources -------------------------------------------------------------------------------- ora.asm 1 ONLINE OFFLINE Instance Shutdown ora.cluster_interconnect.haip 1 ONLINE OFFLINE ora.crf 1 ONLINE ONLINE a1 ora.crsd 1 ONLINE OFFLINE ora.cssd 1 ONLINE OFFLINE STARTING ora.cssdmonitor 1 ONLINE ONLINE a1 ora.ctssd 1 ONLINE OFFLINE ora.diskmon 1 OFFLINE OFFLINE ora.evmd 1 ONLINE OFFLINE ora.gipcd 1 ONLINE ONLINE a1 ora.gpnpd 1 ONLINE ONLINE a1 ora.mdnsd 1 ONLINE ONLINE a1
手工启动ASM实例,报错 grid$sqlplus
/ as sysasm SQL> startup ORA-01078: failure in processing system parameters ORA-29701: unable to connect to Cluster Synchronization Service
查询集群Alert,发现启动资源失败报错,提升表决磁盘信息存在问题,并且指向日志 grid$cd $ORACLE_HOME
/log/node_name/ grid$tail -200f a*.log 2020-01-18 10:46:30.622: [ohasd(2344)]CRS-2765:Resource 'ora.cssdmonitor' has failed on server 'a1'. 2020-01-18 10:46:32.278: [cssd(4941)]CRS-1713:CSSD daemon is started in clustered mode 2020-01-18 10:46:32.329: [cssd(4941)]CRS-1714:Unable to discover any voting files, retrying discovery in 15 seconds; Details at (:CSSNM00070:) in /picclife/app/11.2.0/grid/log/a1/cssd/ocssd.log 2020-01-18 10:46:33.920: [ohasd(2344)]CRS-2767:Resource state recovery not attempted for 'ora.diskmon' as its target state is OFFLINE 2020-01-18 10:46:33.920: [ohasd(2344)]CRS-2769:Unable to failover resource 'ora.diskmon'. 2020-01-18 10:46:47.335: [cssd(4941)]CRS-1714:Unable to discover any voting files, retrying discovery in 15 seconds; Details at (:CSSNM00070:) in /picclife/app/11.2.0/grid/log/a1/cssd/ocssd.log 2020-01-18 10:47:02.340: [cssd(4941)]CRS-1714:Unable to discover any voting files, retrying discovery in 15 seconds; Details at (:CSSNM00070:) in /picclife/app/11.2.0/grid/log/a1/cssd/ocssd.log 根据集群alert日志,指向,查询ocssd.log日志,
clssnmvDiskVerify: Successful discovery of 0 disks 未发现存在一块磁盘!!! $ tail
-200f /picclife/app/11.2.0/grid/log/a1/cssd/ocssd.log 2020-01-18 10:47:47.358: [ CSSD][1087162688]clssnmReadDiscoveryProfile: voting file discovery string(/dev/asm*) 2020-01-18 10:47:47.358: [ CSSD][1087162688]clssnmvDDiscThread: using discovery string /dev/asm* for initial discovery 2020-01-18 10:47:47.358: [ SKGFD][1087162688]Discovery with str:/dev/asm*: 2020-01-18 10:47:47.358: [ SKGFD][1087162688]UFS discovery with :/dev/asm*: 2020-01-18 10:47:47.358: [ SKGFD][1087162688]Execute glob on the string /dev/asm* 2020-01-18 10:47:47.358: [ SKGFD][1087162688]OSS discovery with :/dev/asm*: 2020-01-18 10:47:47.358: [ CSSD][1087162688]clssnmvDiskVerify: Successful discovery of 0 disks 2020-01-18 10:47:47.358: [ CSSD][1087162688]clssnmCompleteInitVFDiscovery: Completing initial voting file discovery 2020-01-18 10:47:47.358: [ CSSD][1087162688]clssnmvFindInitialConfigs: No voting files found 2020-01-18 10:47:47.358: [ CSSD][1087162688](:CSSNM00070:)clssnmCompleteInitVFDiscovery: Voting file not found. Retrying discovery in 15 seconds 2020-01-18 10:47:47.486: [ CSSD][1100994880]clssscSelect: cookie accept request 0xdee1a0 2020-01-18 10:47:47.486: [ CSSD][1100994880]clssgmAllocProc: (0x122c840) allocated 2020-01-18 10:47:47.486: [ CSSD][1100994880]clssgmClientConnectMsg: properties of cmProc 0x122c840 - 1,2,3,4,5 2020-01-18 10:47:47.486: [ CSSD][1100994880]clssgmClientConnectMsg: Connect from con(0x1416) proc(0x122c840) pid(2686) version 11:2:1:4, properties: 1,2,3,4,5 2020-01-18 10:47:47.486: [ CSSD][1100994880]clssgmClientConnectMsg: msg flags 0x0000 2020-01-18 10:47:47.725: [ CSSD][1100994880]clssscSelect: cookie accept request 0x1222de0 2020-01-18 10:47:47.725: [ CSSD][1100994880]clssscevtypSHRCON: getting client with cmproc 0x1222de0 2020-01-18 10:47:47.725: [ CSSD][1100994880]clssgmRegisterClient: proc(4/0x1222de0), client(73/0x122d7d0) 2020-01-18 10:47:47.725: [ CSSD][1100994880]clssgmExecuteClientRequest(): type(6) size(684) only connect and exit messages are allowed before lease acquisition proc(0x1222de0) client(0x122d7d0) 查询测试环境,默认存储是/dev/asm*磁盘,未发现asm磁盘 [root@a1 dev]# ls -lrt asm* ls: asm*: No such file or directory [root@a1 ~]# fdisk -l 存储问题,未加载ASM磁盘,系统磁盘问题,丢盘了,重新虚拟机加载磁盘,重启主机后,问题解决。
原文地址:https://www.cnblogs.com/lvcha001/p/12218529.html