RAC: When 1st node started, the 2nd node failed to start
I have a problem with Oracle 10gR2 RAC in a Windows 2003 R2 domain-member environment. I have a 2-node RAC using ASM on 2 MS Windows 2003 Standard servers. It is a clean environment with only Oracle and Norton AntiVirus installed.
When the 1st node has booted and started successfully, the 2nd node fails to start. It stays on the Windows startup screen (Applying Computer Settings ...) for more than 1 hour. Eventually the Windows login screen appears, but I cannot log in to the system after entering the username and password. The situation is reversible (the 1st node fails to start if I start the 2nd node first).
If I set the Oracle services (OracleCRService and OracleEVMService) to Manual startup on the 2nd node, the 2nd node starts smoothly. After logging in to the 2nd node, I can start these 2 Oracle services without a problem.
P.S. This problem started right after I applied all MS security updates on 10 Apr 2008.
Any suggestions on how to troubleshoot this problem? Thanks.
Message was edited by:
ckhlam
A couple of things you could try:
a) Disable the Norton AntiVirus software and check whether rebooting the
server allows the CRS stack to come up. I recall reading about an issue
wherein NAV waits for the network stack to come up and blocks
the CRS startup sequence. This is just a guess at this time but worth a try.
b) You might also want to check whether configuring Oracle Process Manager as detailed
in Note:358156.1 delays the CRS stack long enough for the OS stack beneath it
to fully initialize.
c) If none of the above helps, you might want to uninstall the MS security update
to check whether the problem was introduced by this patch. You might then have
to work with MS / Oracle to dig further into this.
Do update this thread with your observations.
Vishwa
Similar Messages
-
Dbms_scheduler job is not running on a 2-node RAC when the 1st node fails
Hi,
I want to create a dbms_scheduler job in a 2-node RAC. The job should always run on node1, and if node1 is down it should run on node2. This is Oracle 10gR2 (10.2.0.3 on Windows). To do this I did the following:
-- First Step
Using DBCA (Service Management), I created a service (BATCH_SERVICE) with node1 as preferred and node2 as available. This created the following entry in tnsnames.ora on both nodes.
BATCH_SERVICE =
  (DESCRIPTION =
    (ADDRESS = (PROTOCOL = TCP)(HOST = node1-vip)(PORT = 1521))
    (ADDRESS = (PROTOCOL = TCP)(HOST = node2-vip)(PORT = 1521))
    (LOAD_BALANCE = yes)
    (CONNECT_DATA =
      (SERVER = DEDICATED)
      (SERVICE_NAME = BATCH_SERVICE)
      (FAILOVER_MODE =
        (TYPE = SELECT)
        (METHOD = BASIC)
        (RETRIES = 180)
        (DELAY = 5)
      )
    )
  )
--- Step 2
-- Created BATCH job classes.
BEGIN
DBMS_SCHEDULER.create_job_class(
job_class_name => 'BATCH_JOB_CLASS',
service => 'BATCH_SERVICE');
END;
-- Step 3 -- created a job using job_class as BATCH_JOB_CLASS
begin
dbms_scheduler.create_job(
job_name => 'oltp_job_test'
,job_type => 'STORED_PROCEDURE'
,job_action => 'schema1.P1'
,start_date => systimestamp at time zone 'US/Central'
,repeat_interval => 'FREQ=DAILY;BYHOUR=11;BYMINUTE=30;'
,job_class => 'BATCH_JOB_CLASS'
,enabled => TRUE
,comments => 'New Job.');
end;
Now when I monitor this job, it runs on node1. I then started testing failover by manually shutting down the 1st instance. As I understand it, the job should then run on the 2nd node, but it is not picked up.
When I run the following command
srvctl status service -d db -s BATCH_SERVICE
service BATCH_SERVICE is running on instance node2.
Any help is really appreciated. Nothing shows whether the job is running or broken.
-
Hi,
this is Malli.
I got the error below while installing the Grid software for RAC.
OS version: Red Hat Linux 5.5
Oracle 11gR2 cluster (RAC)
Thanks in advance. Please help.
Failed to start the Clusterware. Last 20 lines of the alert log follow:
2014-12-24 01:05:40.100
[client(3850)]CRS-2101:The OLR was formatted using version 3.
2014-12-24 01:05:41.923
[client(3852)]CRS-2317:Fatal error: cannot get local GPnP security keys (wallet).
2014-12-24 01:05:41.923
[client(3852)]CRS-2316:Fatal error: cannot initialize GPnP, CLSGPNP_ERR (Generic GPnP error).
2014-12-24 01:05:41.954
[client(3852)]CRS-1013:The OCR location in an ASM disk group is inaccessible. Details in /u01/app/11.2.0/grid_home/log/lnx01/client/clscfg.log.
Have you looked in this log file to see why the OCR is inaccessible? Have you looked in the ASM instance's alert log for other clues?
Cheers,
Brian -
When printing webpages from a website on Firefox the top two lines of pages 2, 3, 4, etc are cut off. I printed the same website article on Safari and have no problem. I also checked out Word documents of more than one page and found no problems with pages 2, 3, etc. I tried to print other articles from Firefox from other websites and found the same problem. I spent an hour with HP tech support for my printer and could not find any problems with the printer. Something is up with Firefox. Please help???
Thank you very much.
Samuel
Hi LouisaKaymen-
I would recommend reading this Support article:
[[Firefox prints incorrectly]]
There are tips on how to troubleshoot printing issues. -
Why RAC when there is virtualization?
I struggle with the question of why to use RAC when virtualization gives you the same benefits for less money. Or am I totally wrong?
I mean, with virtualization you get resource management, failover and all that at a lower cost: "consolidation of servers".
A couple of things that I can think of that RAC supplies, such as load balancing, FAN etc., are nice features but not vital to our needs.
I am hoping for input from other forum members so that I can get this question cleared up, or maybe there is no such answer.
Quote:
Once you've allocate all the resources from one box, where do you go next with virtualization? Ditch the server and move the VM onto a bigger box? That's a bit wasteful.
Yes, this is true, depending on what HW the host is running on and the type of database. In our case we do not have very big databases; we have many, but none of them consume a lot of resources.
But that is exactly the input I am looking for.
The reason is that I am trying to consolidate our current platform, which consists of many different flavors and types (HW, OS and cluster), into one single standard platform for our Oracle databases. And our *NIX team is really into virtualization technology.
For me to propose and stick to a RAC solution I need facts; at the moment I am kind of stuck in the middle.
Quote 2:
With Oracle VM you can run two 2core VMs, one on each server, and only pay Oracle for 4 cores. Leaving all the other cores free to run other stuff, like app servers.
Well, I do not know if Oracle has changed its licensing model when introducing VM, but the last time I checked with Oracle, I had to pay for all 32 CPUs and not only for the "4" that I am actually using. The reason they gave was: "There are 32 CPUs that you can use within the machine, and therefore you have to pay for 32."
This is a really awkward way of licensing, but that may have changed now that the VM has been released. -
Is the 2nd HDD in use if there is no data on it?
I just installed a 2nd HDD in my MacBook Pro. I have not added any data to it yet. I just want to know whether the 2nd HDD spins if I haven't added any data or even accessed it.
Excellent, that is good news. I don't want any access to it until my 1st HDD is full. I just want it to be a dead, unused drive.
I just installed it in the laptop and formatted it. I can go to my computer and check the storage space. It shows 2 storage devices: one for the 1st HDD and one for the 2nd HDD. -
RAC 11gR2 cluster installation: root.sh failed on the 1st node
Hi,
Does anybody know why I might get the following error when I run root.sh on the 1st node during the Oracle 11gR2 RAC installation (cluster installation)?
The following environment variables are set as:
ORACLE_OWNER= oracle
ORACLE_HOME= /oracle/grid
Enter the full pathname of the local bin directory: [usr/local/bin]:
The file "dbhome" already exists in /usr/local/bin. Overwrite it? (y/n) [n]: y
Copying dbhome to /usr/local/bin ...
The file "oraenv" already exists in /usr/local/bin. Overwrite it? (y/n) [n]: y
Copying oraenv to /usr/local/bin ...
The file "coraenv" already exists in /usr/local/bin. Overwrite it? (y/n) [n]: y
Copying coraenv to /usr/local/bin ...
Entries will be added to the /etc/oratab file as needed by
Database Configuration Assistant when a database is created
Finished running generic part of root.sh script.
Now product-specific root actions will be performed.
2010-06-29 14:17:43: Parsing the host name
2010-06-29 14:17:43: Checking for super user privileges
2010-06-29 14:17:43: User has super user privileges
Using configuration parameter file: /oracle/grid/crs/install/crsconfig_params
Creating trace directory
User oracle has the required capabilities to run CSSD in realtime mode
LOCAL ADD MODE
Creating OCR keys for user 'root', privgrp 'system'..
Operation successful.
root wallet
root wallet cert
root cert export
peer wallet
profile reader wallet
pa wallet
peer wallet keys
pa wallet keys
peer cert request
pa cert request
peer cert
pa cert
peer root cert TP
profile reader root cert TP
pa root cert TP
peer pa cert TP
pa peer cert TP
profile reader pa cert TP
profile reader peer cert TP
peer user cert
pa user cert
Adding daemon to inittab
CRS-4123: Oracle High Availability Services has been started.
ohasd is starting
CRS-2672: Attempting to start 'ora.gipcd' on 'trz1test_rac'
CRS-2672: Attempting to start 'ora.mdnsd' on 'trz1test_rac'
CRS-2676: Start of 'ora.gipcd' on 'trz1test_rac' succeeded
CRS-2676: Start of 'ora.mdnsd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'trz1test_rac'
CRS-2676: Start of 'ora.gpnpd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.cssdmonitor' on 'trz1test_rac'
CRS-2676: Start of 'ora.cssdmonitor' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.cssd' on 'trz1test_rac'
CRS-2672: Attempting to start 'ora.diskmon' on 'trz1test_rac'
CRS-2676: Start of 'ora.diskmon' on 'trz1test_rac' succeeded
CRS-2676: Start of 'ora.cssd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.ctssd' on 'trz1test_rac'
CRS-2676: Start of 'ora.ctssd' on 'trz1test_rac' succeeded
clscfg: -install mode specified
Successfully accumulated necessary OCR keys.
Creating OCR keys for user 'root', privgrp 'system'..
Operation successful.
CRS-2672: Attempting to start 'ora.crsd' on 'trz1test_rac'
CRS-2676: Start of 'ora.crsd' on 'trz1test_rac' succeeded
Now formatting voting disk: /data_gpfs/oracle/crs/vdsk.
CRS-4603: Successful addition of voting disk /data_gpfs/oracle/crs/vdsk.
## STATE File Universal Id File Name Disk group
1. ONLINE 653624f2aa1f4f83bf774e8052889a32 (/data_gpfs/oracle/crs/vdsk) []
Located 1 voting disk(s).
CRS-2673: Attempting to stop 'ora.crsd' on 'trz1test_rac'
CRS-2677: Stop of 'ora.crsd' on 'trz1test_rac' succeeded
CRS-2673: Attempting to stop 'ora.ctssd' on 'trz1test_rac'
CRS-2677: Stop of 'ora.ctssd' on 'trz1test_rac' succeeded
CRS-2673: Attempting to stop 'ora.cssdmonitor' on 'trz1test_rac'
CRS-2677: Stop of 'ora.cssdmonitor' on 'trz1test_rac' succeeded
CRS-2673: Attempting to stop 'ora.cssd' on 'trz1test_rac'
CRS-2677: Stop of 'ora.cssd' on 'trz1test_rac' succeeded
CRS-2673: Attempting to stop 'ora.gpnpd' on 'trz1test_rac'
CRS-2677: Stop of 'ora.gpnpd' on 'trz1test_rac' succeeded
CRS-2673: Attempting to stop 'ora.gipcd' on 'trz1test_rac'
CRS-2677: Stop of 'ora.gipcd' on 'trz1test_rac' succeeded
CRS-2673: Attempting to stop 'ora.mdnsd' on 'trz1test_rac'
CRS-2677: Stop of 'ora.mdnsd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.mdnsd' on 'trz1test_rac'
CRS-2676: Start of 'ora.mdnsd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.gipcd' on 'trz1test_rac'
CRS-2676: Start of 'ora.gipcd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.gpnpd' on 'trz1test_rac'
CRS-2676: Start of 'ora.gpnpd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.cssdmonitor' on 'trz1test_rac'
CRS-2676: Start of 'ora.cssdmonitor' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.cssd' on 'trz1test_rac'
CRS-2672: Attempting to start 'ora.diskmon' on 'trz1test_rac'
CRS-2676: Start of 'ora.diskmon' on 'trz1test_rac' succeeded
CRS-2676: Start of 'ora.cssd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.ctssd' on 'trz1test_rac'
CRS-2676: Start of 'ora.ctssd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.crsd' on 'trz1test_rac'
CRS-2676: Start of 'ora.crsd' on 'trz1test_rac' succeeded
CRS-2672: Attempting to start 'ora.evmd' on 'trz1test_rac'
CRS-2676: Start of 'ora.evmd' on 'trz1test_rac' succeeded
*/oracle/grid/bin/srvctl start nodeapps -n trz1test_rac ... failed*
Configure Oracle Grid Infrastructure for a Cluster ... failed
This is because the ora.eONS daemon is not starting. There is a Metalink note suggesting that we MIGHT be able to start this daemon manually, but this is not working.
*./srvctl status nodeapps -n trz1test_rac*
-n <node_name> option has been deprecated.
VIP trz1test_rac_vip is enabled
VIP trz1test_rac_vip is running on node: trz1test_rac
Network is enabled
Network is running on node: trz1test_rac
GSD is disabled
GSD is not running on node: trz1test_rac
ONS is enabled
ONS daemon is running on node: trz1test_rac
eONS is enabled
eONS daemon is not running on node: trz1test_rac
I run my clusterware/DB on AIX 5.3
When I run runcluvfy.sh, these are the checks that do not pass:
Check: Node connectivity of subnet "192.168.1.0"
Source Destination Connected?
trz2test_rac:en5 trz2test_rac:en5 yes
trz2test_rac:en5 trz1test_rac:en5 yes
trz2test_rac:en5 trz1test_rac:en5 yes
trz2test_rac:en5 trz1test_rac:en5 yes
trz2test_rac:en5 trz1test_rac:en5 yes
trz1test_rac:en5 trz1test_rac:en5 yes
Result: Node connectivity passed for subnet "192.168.1.0" with node(s) trz2test_rac,trz1test_rac
Check: TCP connectivity of subnet "192.168.1.0"
Source Destination Connected?
trz1test_rac:192.168.1.140 trz2test_rac:192.168.1.142 failed
trz1test_rac:192.168.1.140 trz2test_rac:192.168.1.142 failed
Result: TCP connectivity check failed for subnet "192.168.1.0"
NTP daemon slewing option check failed on some nodes
PRVF-5436 : The NTP daemon running on one or more nodes lacks the slewing option "-x"
Result: Clock synchronization check using Network Time Protocol(NTP) failed
NTP shouldn't be the problem, I guess, as the dates are identical on the 2 nodes.
I have no idea how to fix the TCP connectivity issue with the subnet "192.168.1.0". Some posts say it could be a firewall issue. Are there any other possible causes?
Thanks to all,
Paul -
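For the TCP connectivity failure on subnet "192.168.1.0" reported above, one way to narrow things down outside of cluvfy is to try a plain TCP connection from one node to the other. The following is only a minimal Python sketch: the function name tcp_reachable is made up for illustration, and it says nothing about how cluvfy performs its own check or which port it uses. You would pick a port on which you know a listener is running on the remote node.

```python
import socket


def tcp_reachable(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout.

    Hypothetical helper for illustration; not part of cluvfy or any Oracle tool.
    """
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable networks.
        return False
```

Calling, for example, tcp_reachable("192.168.1.142", <port>) from trz1test_rac and the reverse from trz2test_rac would show whether the failure is in one direction only. A timeout (rather than an immediate refusal) is often what a packet filter looks like, but that is a rule of thumb and not a diagnosis.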
Can I have RAC 1st node in RHEL 5 and 2nd node in RHEL 4?
Can I have my RAC 1st node in RHEL5 and 2nd node in RHEL 4?
I am just checking if there is any possibility like that.
Thanks,
Mahi
Even if it works by accident, it wouldn't be supported.
-
Installing 11gR1 CRS and receive error when executing root.sh on 2nd node of 2 node cluster
This is the error from the execution of root.sh on 2nd node of 2 node RAC cluster:
dhzusbx98: /u01/app/crs # ./root.sh
WARNING: directory '/u01/app' is not owned by root
Checking to see if Oracle CRS stack is already configured
/etc/oracle does not exist. Creating it now.
Setting the permissions on OCR backup directory
Setting up Network socket directories
PROT-1: Failed to initialize ocrconfig
Failed to upgrade Oracle Cluster Registry configuration
dhzusbx98: /u01/app/crs #
This is what I find in the following log file: /u01/app/crs/log/dhzusbx98/client/ocrconfig_6422692.log
dhzusbx98: /u01/app/crs/log/dhzusbx98/client # cat ocrconfig_6422692.log
Oracle Database 11g CRS Release 11.1.0.6.0 - Production Copyright 1996, 2007 Oracle. All rights reserved.
2015-02-11 11:21:07.384: [ OCRCONF][1]ocrconfig starts...
2015-02-11 11:21:07.388: [ OCRCONF][1]Upgrading OCR data
2015-02-11 11:21:07.455: [ OCROSD][1]utopen:7:failed to open OCR file/disk /dev/lnkocrd01CC3 , errno=16, os err string=Device busy
2015-02-11 11:21:07.465: [ OCRRAW][1]proprinit: Could not open raw device
2015-02-11 11:21:07.465: [ default][1]a_init:7!: Backend init unsuccessful : [26]
2015-02-11 11:21:07.467: [ OCRCONF][1]Exporting OCR data to [OCRUPGRADEFILE]
2015-02-11 11:21:07.467: [ OCRAPI][1]a_init:7!: Backend init unsuccessful : [33]
2015-02-11 11:21:07.467: [ OCRCONF][1]There was no previous version of OCR. error:[PROC-33: Oracle Cluster Registry is not configured]
2015-02-11 11:21:07.524: [ OCROSD][1]utopen:7:failed to open OCR file/disk /dev/lnkocrd01CC3 , errno=16, os err string=Device busy
2015-02-11 11:21:07.524: [ OCRRAW][1]proprinit: Could not open raw device
2015-02-11 11:21:07.524: [ default][1]a_init:7!: Backend init unsuccessful : [26]
2015-02-11 11:21:07.560: [ OCROSD][1]utopen:7:failed to open OCR file/disk /dev/lnkocrd01CC3 , errno=16, os err string=Device busy
2015-02-11 11:21:07.560: [ OCRRAW][1]proprinit: Could not open raw device
2015-02-11 11:21:07.560: [ OCRAPI][1]a_init:6b!: Backend init unsuccessful : [26]
2015-02-11 11:21:07.560: [ OCRCONF][1]Failed to initialized OCR context. error:[PROC-26: Error while accessing the physical storage Operating System error [Device busy] [16]]
2015-02-11 11:21:07.560: [ OCRCONF][1]Exiting [status=failed]...
dhzusbx98: /u01/app/crs/log/dhzusbx98/client # cat /etc/oracle/ocr.loc
ocrconfig_loc=/dev/lnkocrd01CC3
local_only=FALSE
Can I just retry root.sh on the 2nd node, or do I have to do some sort of cleanup and then retry? Please advise. Thanks, Adolfo
Are you using raw disk devices for Oracle 11g RAC? You might want to move to ASM. For starters, your version, 11.1, is no longer fully supported by Oracle Corp. You'll need to be on 11.2.0.3 or higher to be supported. The 11.2.0.3 version loses Extended Support in August I think. The 11.2.0.4 version loses Extended Support in January. All of this means your production Oracle RAC database should be on the 12c version by January. And guess what? Oracle 12c RAC does not support raw disk devices.
Since you're starting with this cluster fresh, why not aim for a configuration that will allow you to go forth into the foreseeable future without major configuration changes?
Cheers,
Brian -
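When reading an ocrconfig log like the one in the question above, it can help to collapse the repeated "failed to open" lines into a short summary of which device failed with which OS error. The following is a rough Python sketch, not an Oracle tool: the regular expression is an assumption based only on the line format shown in that log, and the function name summarize_open_failures is made up for illustration.

```python
import re
from collections import Counter

# Assumed line format, taken from the log in the question, e.g.:
# ... utopen:7:failed to open OCR file/disk /dev/lnkocrd01CC3 , errno=16, os err string=Device busy
OPEN_FAILURE = re.compile(
    r"failed to open OCR file/disk\s+(?P<device>\S+)\s*,\s*"
    r"errno=(?P<errno>\d+),\s*os err string=(?P<message>.+)$"
)


def summarize_open_failures(log_text: str) -> Counter:
    """Count (device, errno, message) triples among the 'failed to open' lines."""
    failures = Counter()
    for line in log_text.splitlines():
        match = OPEN_FAILURE.search(line)
        if match:
            key = (match["device"], int(match["errno"]), match["message"].strip())
            failures[key] += 1
    return failures


# Three lines copied from the log above; only two of them are open failures.
sample = """\
2015-02-11 11:21:07.455: [ OCROSD][1]utopen:7:failed to open OCR file/disk /dev/lnkocrd01CC3 , errno=16, os err string=Device busy
2015-02-11 11:21:07.465: [ OCRRAW][1]proprinit: Could not open raw device
2015-02-11 11:21:07.524: [ OCROSD][1]utopen:7:failed to open OCR file/disk /dev/lnkocrd01CC3 , errno=16, os err string=Device busy
"""

for (device, errno, message), count in summarize_open_failures(sample).items():
    print(f"{device}: errno {errno} ({message}) x{count}")
```

In this log every failure points at the same device, /dev/lnkocrd01CC3, with errno 16 (Device busy), which is the device named in /etc/oracle/ocr.loc.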
SC 3.2 Solaris 10 x86: when one node reboots, the other one does also
I configured a two-node cluster with an EMC CLARiiON SAN (RAID 6) holding a zpool and serving as the quorum device.
When one node goes down, the other one does also.
There seems to be a problem with the quorum.
I cannot figure out what actually goes wrong.
When starting up:
Booting as part of a cluster
NOTICE: CMM: Node cnode01 (nodeid = 1) with votecount = 1 added.
NOTICE: CMM: Node cnode02 (nodeid = 2) with votecount = 1 added.
NOTICE: CMM: Quorum device 1 (/dev/did/rdsk/d1s2) added; votecount = 1, bitmask of nodes with configured paths = 0x3.
NOTICE: clcomm: Adapter nge3 constructed
NOTICE: clcomm: Adapter nge2 constructed
NOTICE: CMM: Node cnode01: attempting to join cluster.
NOTICE: nge3: link down
NOTICE: nge2: link down
NOTICE: nge3: link up 1000Mbps Full-Duplex
NOTICE: nge2: link up 1000Mbps Full-Duplex
NOTICE: nge3: link down
NOTICE: nge2: link down
NOTICE: nge3: link up 1000Mbps Full-Duplex
NOTICE: nge2: link up 1000Mbps Full-Duplex
NOTICE: CMM: Node cnode02 (nodeid: 2, incarnation #: 1248284052) has become reachable.
NOTICE: clcomm: Path cnode01:nge2 - cnode02:nge2 online
NOTICE: clcomm: Path cnode01:nge3 - cnode02:nge3 online
NOTICE: CMM: Cluster has reached quorum.
NOTICE: CMM: Node cnode01 (nodeid = 1) is up; new incarnation number = 1248284001.
NOTICE: CMM: Node cnode02 (nodeid = 2) is up; new incarnation number = 1248284052.
NOTICE: CMM: Cluster members: cnode01 cnode02.
NOTICE: CMM: node reconfiguration #1 completed.
NOTICE: CMM: Node cnode01: joined cluster.
ip: joining multicasts failed (18) on clprivnet0 - will use link layer broadcasts for multicast
/dev/rdsk/c2t0d0s5 is clean
Reading ZFS config: done.
obtaining access to all attached disks
cnode01 console login:
Then this on the second node:
Booting as part of a cluster
NOTICE: CMM: Node cnode01 (nodeid = 1) with votecount = 1
NOTICE: CMM: Node cnode02 (nodeid = 2) with votecount = 1
NOTICE: CMM: Quorum device 1 (/dev/did/rdsk/d1s2) added; votecount = 1, bitmask of nodes with configured paths = 0x3.
NOTICE: clcomm: Adapter nge3 constructed
NOTICE: clcomm: Adapter nge2 constructed
NOTICE: CMM: Node cnode02: attempting to join cluster.
NOTICE: CMM: Node cnode01 (nodeid: 1, incarnation #: 1248284001) has become reachable.
NOTICE: clcomm: Path cnode02:nge2 - cnode01:nge2 online
NOTICE: clcomm: Path cnode02:nge3 - cnode01:nge3 online
WARNING: CMM: Issuing a NULL Preempt failed on quorum device /dev/did/rdsk/d1s2 with error 2.
NOTICE: CMM: Cluster has reached quorum.
NOTICE: CMM: Node cnode01 (nodeid = 1) is up; new incarnation number = 1248284001.
NOTICE: CMM: Node cnode02 (nodeid = 2) is up; new incarnation number = 1248284052.
NOTICE: CMM: Cluster members: cnode01 cnode02.
NOTICE: CMM: node reconfiguration #1 completed.
NOTICE: CMM: Node cnode02: joined cluster.
NOTICE: CCR: Waiting for repository synchronization to finish.
WARNING: CMM: Issuing a NULL Preempt failed on quorum device /dev/did/rdsk/d1s2 with error 2.
ip: joining multicasts failed (18) on clprivnet0 - will use link layer broadcasts for multicast
/dev/rdsk/c2t0d0s5 is clean
Reading ZFS config: done.
obtaining access to all attached disks
cnode02 console login:
But when the first node reboots, this message appears on the second node:
Jul 22 19:24:48 cnode02 genunix: [ID 936769 kern.info] devinfo0 is /pseudo/devinfo@0
Jul 22 19:30:57 cnode02 nge: [ID 812601 kern.notice] NOTICE: nge3: link down
Jul 22 19:30:57 cnode02 nge: [ID 812601 kern.notice] NOTICE: nge2: link down
Jul 22 19:30:59 cnode02 nge: [ID 812601 kern.notice] NOTICE: nge3: link up 1000Mbps Full-Duplex
Jul 22 19:31:00 cnode02 nge: [ID 812601 kern.notice] NOTICE: nge2: link up 1000Mbps Full-Duplex
Jul 22 19:31:06 cnode02 genunix: [ID 489438 kern.notice] NOTICE: clcomm: Path cnode02:nge2 - cnode01:nge2 being drained
Jul 22 19:31:06 cnode02 scsi_vhci: [ID 734749 kern.warning] WARNING: vhci_scsi_reset 0x0
Jul 22 19:31:06 cnode02 genunix: [ID 489438 kern.notice] NOTICE: clcomm: Path cnode02:nge3 - cnode01:nge3 being drained
Jul 22 19:31:11 cnode02 nge: [ID 812601 kern.notice] NOTICE: nge3: link down
Jul 22 19:31:12 cnode02 genunix: [ID 414208 kern.warning] WARNING: QUORUM_GENERIC: quorum preempt error in CMM: Error 5 --- QUORUM_GENERIC Tkown ioctl failed on quorum device /dev/did/rdsk/d1s2.
Jul 22 19:31:12 cnode02 cl_dlpitrans: [ID 624622 kern.notice] Notifying cluster that this node is panicking
Jul 22 19:31:12 cnode02 unix: [ID 836849 kern.notice]
Jul 22 19:31:12 cnode02 ^Mpanic[cpu3]/thread=ffffffff8b5c06e0:
Jul 22 19:31:12 cnode02 genunix: [ID 265925 kern.notice] CMM: Cluster lost operational quorum; aborting.
Jul 22 19:31:12 cnode02 unix: [ID 100000 kern.notice]
Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651b40 genunix:vcmn_err+13 ()
Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651b50 cl_runtime:__1cZsc_syslog_msg_log_no_args6FpviipkcpnR__va_list_element__nZsc_syslog_msg_status_enum__+24 ()
Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651c30 cl_runtime:__1cCosNsc_syslog_msgDlog6MiipkcE_nZsc_syslog_msg_status_enum__+9d ()
Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651e20 cl_haci:__1cOautomaton_implbAstate_machine_qcheck_state6M_nVcmm_automaton_event_t__+3bc ()
Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651e60 cl_haci:__1cIcmm_implStransitions_thread6M_v_+de ()
Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651e70 cl_haci:__1cIcmm_implYtransitions_thread_start6Fpv_v_+b ()
Jul 22 19:31:12 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651ed0 cl_orb:cllwpwrapper+106 ()
Jul 22 19:31:13 cnode02 genunix: [ID 655072 kern.notice] fffffe8002651ee0 unix:thread_start+8 ()
Jul 22 19:31:13 cnode02 unix: [ID 100000 kern.notice]
Jul 22 19:31:13 cnode02 genunix: [ID 672855 kern.notice] syncing file systems...
Jul 22 19:31:13 cnode02 genunix: [ID 733762 kern.notice] 1
Jul 22 19:31:34 cnode02 last message repeated 20 times
Jul 22 19:31:35 cnode02 genunix: [ID 622722 kern.notice] done (not all i/o completed)
Jul 22 19:31:36 cnode02 genunix: [ID 111219 kern.notice] dumping to /dev/dsk/c2t0d0s1, offset 3436511232, content: kernel
Jul 22 19:31:45 cnode02 genunix: [ID 409368 kern.notice] ^M100% done: 136950 pages dumped, compression ratio 4.77,
Jul 22 19:31:45 cnode02 genunix: [ID 851671 kern.notice] dump succeeded
Jul 22 19:33:18 cnode02 genunix: [ID 540533 kern.notice] ^M
Hi,
the problem lies in the error message around the quorum device. The SC documentation, specifically the Sun Cluster Error Messages Guide at http://docs.sun.com/app/docs/doc/820-4681 explains this as follows:
414208 QUORUM_GENERIC: quorum preempt error in CMM: Error %d --- QUORUM_GENERIC Tkown ioctl failed on quorum device %s.
Description:
This node encountered an error when issuing a QUORUM_GENERIC Take Ownership operation on a quorum device. This error indicates that the node was unsuccessful in preempting keys from the quorum device, and the partition to which it belongs was preempted. If a cluster is divided into two or more disjoint subclusters, one of these must survive as the operational cluster. The surviving cluster forces the other subclusters to abort by gathering enough votes to grant it majority quorum. This action is called "preemption of the losing subclusters".
Solution:
Other related messages identify the quorum device where the error occurred. If an EACCES error occurs, the QUORUM_GENERIC command might have failed because of the SCSI3 keys on the quorum device. Scrub the SCSI3 keys off the quorum device and reboot the preempted nodes.
You should try to follow this advice. I would propose choosing a different QD before trying to do this, if you have one available. Is it possible that this LUN has been in use by a different cluster?
To scrub SCSI3 keys, use the scsi command in /usr/cluster/lib/sc: ./scsi -c inkeys -d <device> checks for the existence of keys, and ...-c scrub.. removes any SCSI3 keys.
Regards
Hartmut -
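The vote arithmetic behind the "Cluster lost operational quorum" panic above can be sketched as follows. The vote counts come from the boot messages (each node and the quorum device have votecount = 1), and the majority rule follows the "majority quorum" wording in the quoted description. The Python function names are made up for illustration; this is a simplified model and not how the CMM is implemented.

```python
def has_quorum(votes_held: int, total_votes: int) -> bool:
    """A partition stays operational only with a strict majority of all configured votes."""
    return votes_held > total_votes // 2


# Vote counts as shown in the boot messages: votecount = 1 for each node
# and for quorum device /dev/did/rdsk/d1s2.
NODE_VOTES = {"cnode01": 1, "cnode02": 1}
QUORUM_DEVICE_VOTES = 1
TOTAL_VOTES = sum(NODE_VOTES.values()) + QUORUM_DEVICE_VOTES  # 3 votes in total


def survivor_has_quorum(node: str, owns_quorum_device: bool) -> bool:
    """Check whether a single remaining node keeps quorum after its peer leaves."""
    votes = NODE_VOTES[node]
    if owns_quorum_device:
        votes += QUORUM_DEVICE_VOTES
    return has_quorum(votes, TOTAL_VOTES)


# cnode02 alone with the quorum device: 2 of 3 votes.
print(survivor_has_quorum("cnode02", owns_quorum_device=True))
# cnode02 alone without the quorum device: 1 of 3 votes.
print(survivor_has_quorum("cnode02", owns_quorum_device=False))
```

Under this model, a node whose take-ownership operation on the quorum device fails is left with 1 of 3 votes when its peer reboots, which would match the panic on cnode02 in the log.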
Q: Detecting when a Node is Detached from the Scene Graph, How?
Hi All,
My nodes are registered observers of an Async service. I need to know when a node is detached/removed from the scene graph so that I can remove it from the Async service.
The only thing I can think of at the moment is to bind to Node.scene but I would like to think there is a more "standardized" way of doing this (I don't want to re-invent the wheel).
Example (improved from 1st post):
class MyCustomNode extends CustomNode {
    var sceneMonitor:Scene = bind scene on replace {onSceneChange()};
    public var onSceneChange:function():Void = function():Void{
        if (scene == null){
            println("we have no scene");
        } else {
            println("we have a scene");
        }
    }
}
Edited by: AndrewHughes on Oct 12, 2009 6:13 AM
You can add an "initialized" flag into your CustomNode and set it true in the postinit block
var initialized = false;
postinit { initialized = true }
The parent and scene of the node won't be set until after the CustomNode is past the postinit phase. You can use this flag to determine whether the change in parent or scene is because the node is being added or because it is being removed.
var fubar = bind scene on replace {
    if (initialized and scene == null) {
        // cleanup
    }
}
You could also use the old value and the new value in the on replace:
var fubar = bind scene on replace oldscene = newscene {
    if (oldscene != null and newscene == null) {
        // cleanup
    }
} -
When opening Firefox 6 I receive 2 JavaScript Application errors; the 1st is "TypeError: Component.classes['''TvtPwmComponent_CID'''] is undefined". The 2nd error is; "TypeError:Components.classes['''componentid'''] is undefined." and upon closing Firefox 6 I receive the following JavaScript error; "ReferenceError:'''M_caFxTBobj''' is not defined."
This issue can be caused by an extension that isn't working properly.
Start Firefox in [[Safe Mode]] to check if one of the extensions is causing the problem (switch to the DEFAULT theme: Firefox (Tools) > Add-ons > Appearance/Themes).
*Don't make any changes on the Safe mode start window.
*https://support.mozilla.com/kb/Safe+Mode
*https://support.mozilla.com/kb/Troubleshooting+extensions+and+themes -
Received iPad from relative as Christmas gift as they upgraded to mini. I have registered and verified my account. When I try to download apps I
And I put in my password it says I have not verified my account. Apple wants to charge me 19.00 to help me. This is my 1st Apple product. So far I am not impressed with the customer service. Can someone please help? I have even done the 2nd step verification and still get the same thing. Please help, this was a gift for my child and he cannot even use it! Please help!!!
Leigh...
but cannot do this because I cannot access Mac App Store
If you can't access the App Store from the Apple menu, Dock, or Applications folder installing the Mac OS X 10.6.8 Update Combo will reinstall the App Store for you.
It's ok to do this even you are already running v10.6.8.
Plants vs. Zombies, and that is all for additional software on here
FYI: Some third party software can cause issues with the App Store > Mac App Store: Sign in sheet does not appear, or does not accept typed text
Apple is not responsible for incompatibility issues with third party software. -
My macbook pro (August 2010) keeps freezing when apple mail starts up. This happens every 2nd time I turn on my Macbook Pro. This forces me to hold down the power button to restart my Macbook Pro. Does anyone else experience this, and is there a fix? Thx.
Since upgrading to OS X Lion, my MacBook Pro (August 2010) keeps freezing (non-responsive, requires restart) when I turn on my Mac and the system is trying to open Apple Mail. While Apple Mail is attempting to collect new mail, the MacBook Pro freezes. This happens every 2nd time I try to start up my computer, and it only started after I upgraded to Lion. Please let me know what I need to do to fix this. Being forced to restart every 2nd time I turn on my MacBook Pro has been very frustrating. I am not sure whether Apple Mail is also the cause of my MacBook Pro being very slow during start-up (on the occasions when the computer doesn't freeze).
I am having similar 'freezing' problems with many programs (Mail, Safari, iTunes, etc.). I did a clean install of Lion instead of an upgrade when it came out. Perhaps something went wrong with my install? I'm thinking about wiping and reinstalling.
-
I am developing a 2 page signup form, where the person enters their username on the 1st page and password on the 2nd.
When I test the form, FF asks to remember my password for the site - however it thinks the zipcode (field above the password) is the username on the account.
I have tried to put their username/email in a hidden field on the 2nd page, just above the password field - and that did not work.
How can I force FF to use the username from the previous page when it asks to remember account info?
This website is using autocomplete=off to prevent Firefox from saving and filling the name and password.
You can remove autocomplete=off with a bookmarklet to make Firefox save names and passwords and other form data.
*http://kb.mozillazine.org/User_name_and_password_not_remembered