Symptoms:
On node1 the crs and evm services fail to start, while the CRS services on node2 run normally.
inode1:oracle:orcl1:/home/Oracle> exit
inode1:root::/> /etc/init.crs start
Startup will be queued to init within 30 seconds.
inode1:oracle:orcl1:/home/Oracle> crsctl check crs
CSS appears healthy
Cannot communicate with CRS
Cannot communicate with EVM
inode1:oracle:orcl1:/home/Oracle> crs_stat -t
CRS-0184: Cannot communicate with the CRS daemon.
crsd.log keeps reporting the error below, roughly once per second:
inode1:oracle:orcl1:/db/Oracle/product/10.2.0/crs/log/inode1/crsd> tail -f crsd.log
2011-01-18 09:43:45.471: [ COMMCRS][8751]clsc_connect: (1156cfc70) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))
2011-01-18 09:43:46.660: [ COMMCRS][8752]clsc_connect: (1156cfc70) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))
2011-01-18 09:43:47.843: [ COMMCRS][8753]clsc_connect: (1156cfc70) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))
2011-01-18 09:43:49.278: [ COMMCRS][8754]clsc_connect: (1156cfc70) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))
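To confirm that the log contains only this one repeating error, the entries can be grouped by IPC key. The following is a minimal sketch, not part of the original procedure; the function name summarize_no_listener is hypothetical, and it assumes the log lines have the format shown above.

```shell
# Summarize clsc_connect "no listener" errors by IPC key.
# Reads crsd.log text on stdin and prints "<count> <key>" for each key.
# Assumption: each error line contains "(KEY=<name>)" as in the excerpt above.
summarize_no_listener() {
  grep 'clsc_connect:.*no listener at' \
    | sed -n 's/.*(KEY=\([^)]*\)).*/\1/p' \
    | sort | uniq -c \
    | awk '{ print $1, $2 }'
}
```

It could be used as `summarize_no_listener < crsd.log`. A single key in the result would suggest that crsd is only waiting for the evm listener and is not failing elsewhere.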
Analysis:
A disk had failed earlier, so exportvg and importvg had been run on the datavg volume group:
inode1:oracle:orcl1:/home/Oracle> lsvg -p datavg
datavg:
PV_NAME PV STATE TOTAL PPs FREE PPs FREE DISTRIBUTION
hdisk2 active 799 759 160..120..159..160..160
hdisk3 missing 799 799 160..160..159..160..160
inode1:root:>exportvg datavg
inode1:root:>importvg -V60 -y datavg hdisk2
After these operations, starting CRS produced the symptoms above.
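The missing disk is visible in the PV STATE column of the lsvg -p output above. As a rough sketch, such disks could be listed automatically; the function name list_inactive_pvs is hypothetical, and the code assumes the two-line preamble (volume group name, then column header) shown in the transcript.

```shell
# Print the names of physical volumes whose state is not "active".
# Reads `lsvg -p <vg>` output on stdin.
# Assumption: line 1 is the VG name, line 2 is the column header.
list_inactive_pvs() {
  awk 'NR > 2 && NF >= 2 && $2 != "active" { print $1 }'
}
```

For example, `lsvg -p datavg | list_inactive_pvs` would print hdisk3 for the output shown above.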
Solution:
Delete the file /etc/oracle/scls_scr/rac1/Oracle/CSSfatal on node1.
If this file is not deleted, root.sh reports an error when it is run.
Then rerun the $ORA_CRS_HOME/root.sh script:
inode1:root::/db> cd Oracle/product/10.2.0/crs
inode1:root::/db/Oracle/product/10.2.0/crs> ./root.sh
WARNING: directory '/db/Oracle/product/10.2.0' is not owned by root
WARNING: directory '/db/Oracle/product' is not owned by root
WARNING: directory '/db/Oracle' is not owned by root
WARNING: directory '/db' is not owned by root
Checking to see if Oracle CRS stack is already configured
Checking to see if any 9i GSD is up
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/db/Oracle/product/10.2.0' is not owned by root
WARNING: directory '/db/Oracle/product' is not owned by root
WARNING: directory '/db/Oracle' is not owned by root
WARNING: directory '/db' is not owned by root
clscfg: EXISTING configuration version 3 detected.
clscfg: version 3 is 10G Release 2.
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 1: inode1 inode1-priv inode1
node 2: inode2 inode2-priv inode2
clscfg: Arguments check out successfully.
NO KEYS WERE WRITTEN. Supply -force parameter to override.
-force is destructive and will destroy any previous cluster
configuration.
Oracle Cluster Registry for cluster has already been initialized
Startup will be queued to init within 30 seconds.
Adding daemons to inittab
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
CSS is active on these nodes.
inode1
inode2
CSS is active on all nodes.
Waiting for the Oracle CRSD and EVMD to start
Oracle CRS stack installed and running under init(1M)
Running vipca(silent) for configuring nodeapps
IP address "inode1-vip" has already been used. Enter an unused IP address.
inode1:root::/db/Oracle/product/10.2.0/crs> cd bin
inode1:root::/db/Oracle/product/10.2.0/crs/bin> ./crsctl check crs
CSS appears healthy
CRS appears healthy
EVM appears healthy
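The final check above can also be scripted, for instance to poll until the stack is up after root.sh finishes. This is only a sketch: the function name crs_stack_healthy is hypothetical, and it assumes the 10.2 wording of the crsctl check crs output shown in this post.

```shell
# Return 0 only if CSS, CRS and EVM are all reported as healthy.
# Reads `crsctl check crs` output on stdin.
# Assumption: the "<daemon> appears healthy" wording shown above.
crs_stack_healthy() {
  awk '
    /^CSS appears healthy/ { css = 1 }
    /^CRS appears healthy/ { crs = 1 }
    /^EVM appears healthy/ { evm = 1 }
    END { exit !(css && crs && evm) }
  '
}
```

A possible use is `./crsctl check crs | crs_stack_healthy && echo "stack is up"`. With the failing output from the start of this post ("Cannot communicate with CRS"), the function returns a non-zero status.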