Oracle 11g RAC production case: what to do when the other node cannot start

Updated 2026-02-14 | From: SLTechnology News&Howtos | Section: Database

Shulou (Shulou.com), 05/31

This article walks through a production case in which one node of an Oracle 11g RAC cluster could not start, covering how the problem was diagnosed and how the cluster was brought back up. It may serve as a reference for similar incidents.

I. Environment

Two-node Oracle 11g RAC running on AIX minicomputers.

II. Symptoms

Node 2 failed to start.

Running crsctl start crs returned an error.

III. Analysis and resolution

1. Check the database alert log

    Archived Log entry 399348 added for thread 2 sequence 205493 ID 0xffffffff8452e669 dest 1:
    Sat Dec 09 11:13:47 2017
    Thread 2 advanced to log sequence 205495 (LGWR switch)
      Current log# 3 seq# 205495 mem# 0: +DATA/orcl2/onlinelog/group_3.257.890091875
    Sat Dec 09 11:13:51 2017
    Archived Log entry 399349 added for thread 2 sequence 205494 ID 0xffffffff8452e669 dest 1:
    Sat Dec 09 11:24:07 2017
    NOTE: ASMB terminating
    Errors in file /u01/app/oracle/diag/rdbms/orcl2/PTS22/trace/PTS22_asmb_8847608.trc:
    ORA-15064:? ASM?
    ORA-03113:?
    Errors in file /u01/app/oracle/diag/rdbms/orcl2/PTS22/trace/PTS22_asmb_8847608.trc:
    ORA-15064:? ASM?
    ORA-03113:?
    ASMB (ospid: 8847608): terminating the instance due to error 15064
    Sat Dec 09 11:24:07 2017

    -- Preliminary finding: a possible communication problem

    orcldb2:/u01/app/oracle/diag/rdbms/orcl2/orcl22/trace$ oerr ora 15064
    15064, 00000, "communication failure with ASM instance"
    // *Cause:  There was a failure to communicate with the ASM instance, most
    //          likely because the connection went down.
    // *Action: Check the accompanying error messages for more information on the
    //          reason for the failure. Note that database instances will always
    //          return this error when the ASM instance is terminated abnormally.
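As a rough illustration of this triage step, the sketch below pulls the distinct ORA- codes out of an alert-log excerpt so that each one can be looked up with oerr. The function name `ora_codes` and the `sample` text are illustrative assumptions, not part of any Oracle tooling.

```python
import re

def ora_codes(alert_text):
    """Return the distinct ORA-nnnnn codes, in order of first appearance."""
    seen = []
    for code in re.findall(r"ORA-(\d{5})", alert_text):
        if code not in seen:
            seen.append(code)
    return seen

# Excerpt adapted from the alert log in this article (message text omitted).
sample = """\
NOTE: ASMB terminating
Errors in file /u01/app/oracle/diag/rdbms/orcl2/PTS22/trace/PTS22_asmb_8847608.trc:
ORA-15064:
ORA-03113:
Errors in file /u01/app/oracle/diag/rdbms/orcl2/PTS22/trace/PTS22_asmb_8847608.trc:
ORA-15064:
ORA-03113:
ASMB (ospid: 8847608): terminating the instance due to error 15064
"""

codes = ora_codes(sample)
print(codes)
```

Each code returned can then be passed to oerr, as was done for 15064 above.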

2. Check the cluster logs

    2017-12-09 11:23:[seconds illegible] [cssd(7667900)]CRS-1612:Network communication with node orcldb1 (1) missing for 50% of timeout interval. Removal of this node from cluster in 14.523 seconds
    2017-12-09 11:23:[seconds illegible] [cssd(7667900)]CRS-1611:Network communication with node orcldb1 (1) missing for 75% of timeout interval. Removal of this node from cluster in 6.509 seconds
    2017-12-09 11:24:03.052 [cssd(7667900)]CRS-1610:Network communication with node orcldb1 (1) missing for 90% of timeout interval. Removal of this node from cluster in 2.497 seconds
    2017-12-09 11:24:05.552 [cssd(7667900)]CRS-1609:This node is unable to communicate with other nodes in the cluster and is going down to preserve cluster integrity; details at (:CSSNM00008:) in /u01/app/11.2.0/grid/log/orcldb2/cssd/ocssd.log.
    2017-12-09 11:24:05.552 [cssd(7667900)]CRS-1656:The CSS daemon is terminating due to a fatal error; Details at (:CSSSC00012:) in /u01/app/11.2.0/grid/log/orcldb2/cssd/ocssd.log
    2017-12-09 11:24:05.614 [cssd(7667900)]CRS-1652:Starting clean up of CRSD resources.
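The CRS-1612, CRS-1611 and CRS-1610 messages form a countdown to eviction. A minimal sketch for extracting that countdown from a cluster log excerpt follows. The regular expression is written against the message wording shown in this article, and the names `PATTERN`, `heartbeat_warnings` and `sample` are hypothetical.

```python
import re

# Matches the "missing for N% of timeout interval" heartbeat warnings.
PATTERN = re.compile(
    r"CRS-(\d{4}):\s*Network communication with node (\S+) \(\d+\) "
    r"missing for (\d+)% of timeout interval\.\s+"
    r"Removal of this node from cluster in ([\d.]+) seconds"
)

def heartbeat_warnings(log_text):
    """Return (message id, node, percent of timeout used, seconds left) tuples."""
    return [
        (f"CRS-{code}", node, int(percent), float(seconds))
        for code, node, percent, seconds in PATTERN.findall(log_text)
    ]

# Messages taken from the cluster log in this article, timestamps omitted.
sample = """\
[cssd(7667900)]CRS-1612:Network communication with node orcldb1 (1) missing for 50% of timeout interval. Removal of this node from cluster in 14.523 seconds
[cssd(7667900)]CRS-1611:Network communication with node orcldb1 (1) missing for 75% of timeout interval. Removal of this node from cluster in 6.509 seconds
[cssd(7667900)]CRS-1610:Network communication with node orcldb1 (1) missing for 90% of timeout interval. Removal of this node from cluster in 2.497 seconds
"""

for message_id, node, percent, seconds in heartbeat_warnings(sample):
    print(f"{message_id}: {node} silent for {percent}% of timeout, {seconds}s left")
```

A steadily rising percentage against the same peer node, as here, suggests the interconnect stayed down for the whole interval rather than dropping a single packet.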

3. Check the system error log (AIX errpt)

    IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
    FE2DEE00   1209123617 P S SYSXAIXIF      DUPLICATE IP ADDRESS DETECTED IN THE NET
    FE2DEE00   1209122517 P S SYSXAIXIF      DUPLICATE IP ADDRESS DETECTED IN THE NET
    FE2DEE00   1209114417 P S SYSXAIXIF      DUPLICATE IP ADDRESS DETECTED IN THE NET
    FE2DEE00   1209114317 P S SYSXAIXIF      DUPLICATE IP ADDRESS DETECTED IN THE NET
    A924A5FC   1209112417 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
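The errpt TIMESTAMP column uses the format MMDDhhmmYY, which makes it awkward to line up against the Oracle logs by eye. The following sketch decodes it; the function name `parse_errpt` and the tuple layout are assumptions made for illustration.

```python
from datetime import datetime

def parse_errpt(report):
    """Parse errpt summary lines into (time, identifier, resource, description)."""
    events = []
    for line in report.splitlines():
        fields = line.split(None, 5)
        # Skip the header and anything that does not look like an entry.
        if len(fields) < 6 or not fields[1].isdigit():
            continue
        identifier, stamp, _type, _class, resource, description = fields
        when = datetime.strptime(stamp, "%m%d%H%M%y")  # MMDDhhmmYY
        events.append((when, identifier, resource, description.strip()))
    return events

report = """\
IDENTIFIER TIMESTAMP  T C RESOURCE_NAME  DESCRIPTION
FE2DEE00   1209123617 P S SYSXAIXIF      DUPLICATE IP ADDRESS DETECTED IN THE NET
FE2DEE00   1209122517 P S SYSXAIXIF      DUPLICATE IP ADDRESS DETECTED IN THE NET
FE2DEE00   1209114417 P S SYSXAIXIF      DUPLICATE IP ADDRESS DETECTED IN THE NET
FE2DEE00   1209114317 P S SYSXAIXIF      DUPLICATE IP ADDRESS DETECTED IN THE NET
A924A5FC   1209112417 P S SYSPROC        SOFTWARE PROGRAM ABNORMALLY TERMINATED
"""

events = parse_errpt(report)
for when, identifier, resource, description in sorted(events):
    print(when.strftime("%Y-%m-%d %H:%M"), identifier, resource, description)
```

Decoded this way, the SYSPROC entry falls at 11:24 on 9 December 2017, the same minute as the instance termination in the alert log, and the duplicate-IP entries follow from 11:43 onward.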

Taken together, all three logs point to a communication problem between the nodes.

I checked the heartbeat network first. From node 1, the ping to node 2 succeeded, and node 1 could of course ping itself.

This was puzzling, because it suggested the heartbeat was fine. Thinking it through again, I pinged node 1 from node 2, and this time the ping really did fail. After finding this, I spoke with the customer and learned that the network had just been reconfigured. The network engineer fixed the problem and the heartbeat network was restored. Then it was my turn to bring the cluster back up.
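The takeaway from this step is that a heartbeat check only counts once every direction has been tested. A minimal sketch of that bookkeeping follows, assuming the ping results have already been collected by hand on each node. The function name `untested_or_failed` and the dictionary layout are hypothetical.

```python
from itertools import permutations

def untested_or_failed(nodes, results):
    """Return every ordered (source, destination) pair that is not known-good.

    results maps (source, destination) -> True/False for pings already run.
    A pair that was never tested is treated the same as a failure.
    """
    return [pair for pair in permutations(nodes, 2) if not results.get(pair, False)]

nodes = ["orcldb1", "orcldb2"]

# First check in the article: only node 1 -> node 2 was tried.
first_check = {("orcldb1", "orcldb2"): True}
print(untested_or_failed(nodes, first_check))

# Second check: node 2 -> node 1 was tried and failed.
second_check = {("orcldb1", "orcldb2"): True, ("orcldb2", "orcldb1"): False}
print(untested_or_failed(nodes, second_check))
```

Both checks flag the node 2 to node 1 direction. Counting an untested direction as a failure is a deliberate choice here, so that a one-sided check cannot pass as a healthy interconnect.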

    -- run as the root user
    crsctl stop crs
    -- if the stop reports an error, force the shutdown
    crsctl stop crs -f
    crsctl start crs
    crsctl stat res -t
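The recovery sequence can be sketched as a small script: attempt a normal stop, fall back to a forced stop if it fails, then start the stack and list the resources. This is an illustrative sketch only. The crsctl commands are the ones given in this article, while `restart_clusterware`, the injectable `run` parameter and `fake_run` are assumptions added so that the sequence can be dry-run on a machine without crsctl.

```python
import subprocess
from types import SimpleNamespace

def restart_clusterware(run=subprocess.run):
    """Issue the recovery commands in order (must be run as root).

    `run` is injectable so the sequence can be dry-run without crsctl.
    Returns the list of commands that were issued.
    """
    issued = []

    def call(*args):
        cmd = ["crsctl", *args]
        issued.append(" ".join(cmd))
        return run(cmd)

    if call("stop", "crs").returncode != 0:
        # The normal stop failed, so force the stack down.
        call("stop", "crs", "-f")
    call("start", "crs")
    call("stat", "res", "-t")
    return issued

def fake_run(cmd):
    """Dry-run stand-in: pretend the plain stop fails and everything else works."""
    failed = cmd == ["crsctl", "stop", "crs"]
    return SimpleNamespace(returncode=1 if failed else 0)

print(restart_clusterware(run=fake_run))
```

Calling `restart_clusterware()` with no argument would run the real commands. In practice the stack may take some time to come up after the start command returns, so the final status query may need to be repeated.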
