Ora.reco.acfsvol.acfs only on one node on RAC on ODA
We have an ODA (old model) and by a power failure in the data center both boot disks in one node are we gone faulty.
After replacing the chassis, RAID controllers and disks (Oracle Filed Engenieer) reports crsctl stat res -t following:
[grid @ XXXXXXXXA ~] $ crsctl stat res -t
TARGET NAME SERVER STATE STATE_DETAILS
Local Resources
ora.reco.acfsvol.acfs
ONLINE ONLINE XXXXXXXXXA mounted on / cloudfs
OFFLINE OFFLINE XXXXXXXXXBvolume / cloudfs off
is that correct?
Oracle support referred me to MOS 1319263.1, but that's for Exadata ....
Thx
Christoph
(i masked the hostname)
No, this is not correct. Your resource should be online on both nodes.
What happens if you try and start the resource manually using srvctl start filesystem?
Have you checked to see if your volume is online?
Similar Messages
-
Why RAW partitions are visible only on one node in RAC?
I am having 2 node RAC on Windows 2003 Server.
Can anybody tell me why the RAW partitions are visible on RAC-2 but not on RAC-1.
I Shutted down RAC-2, and I thought all the RAW devices will appear in RAC-1, but it didn't work, it is displaying only LOCAL DRIVE.
Why???I am using Microsoft Virtual Server and I have configured SCSI type shared disks.
I executed the given command
C:\oracle\product\10.2.0\crs\bin>cluvfy comp ssa -n rac-1,rac-2
Result : Shared storage check was successful on both the nodes..
Both the nodes are working properly but I want to know the reason 'why the shared disks are not visible on node1'?
Thanks
Sushil -
Why only gets one node when select many nodes of tree in DWCS4 on Mac OS
I use tag <mm:treecontrol> to create tree in DWCS4 on Mac OS.
When I select many nodes in tree, but I only get one node by method: selectedNodes.
codes of created tree as following:
<mm:treecontrol name='tree' size='20' multiple noheaders>
<mm:treecolumn state='hidden'>
<mm:treenode value='A' state='expanded'></mm:treenode>
<mm:treenode value='B' state='expanded'></mm:treenode>
<mm:treenode value='C' state='expanded'></mm:treenode>
</mm:treecontrol>
Who can tell me reasons?
Thanks!
comments: if don't use tag <mm:treecolumn>, tree will not show on Mac OS.Hi macbig,
I finally got to look at my sister's computer. The HDD "Repair Disk" found missing threads, missing directory records, etc. and ended with:
Error: Disk Utility can't repair this disk. Back up as many of your files as possible, reformat the disk, and restore your backed-
up files.
Then, I tried "Verify Disk" and it found invalid volume file count and ended with:
The volume Macintosh HD was found corrupted and needs to be repaired.
Error: This disk needs to be repaired. Click Repair Disk
I guess running Apple Hardware Test is not going to happen. :/
I've ordered online a new 2.5 disk, make a Maverick boot USB, and start from scratch. Do you have any other suggestions?
As for the corrupted old hard drive, do you have any suggestions of how to get out the data somehow?
Thank you so much! -
EM DB console works only from one node?
In my 2 node RAC the enterprise manager database console works only from one node i.e the url shows login page only from one node.
http://NODE2:5500/em------>WORKS, SHOWS LOGIN PAGE AND CAN MANAGE ALL INSTANCES FROM HERE
http://NODE1:5500/em------>DOES NOT WORK, DOES NOT SHOW THE LOGIN PAGE.
Please clarify.
KadhimI have to guess ... (because you tell neither OS nor database version). Assuming it's 10gR2 or higher, that's expected behaviour, dbconsole runs on one node only, the so-called master node. You can change that, see the documentation or this metalink note:
How to manage DB Control 10.2 for RAC Database with emca
Doc ID: NOTE:395162.1
Werner -
Hi Xperts
We have this environment:
2 Rac Nodes 11.2.0.3 Enterprise on Oracle Linux 5.9 .
We have one production Database on this Rac and the users ask to create two single instance on each node, something like this:
Node1 -> Rac Prod1, Single Test
Node2-> Rac Prod2, Single Dev
I want to create Rac One node for those Database (Dev, Test) and create New Diskgroups for ech database.
Can I install a Rac One node on those Server with DBCA?
Do I nedd to Install new Database Software ?
Does the installed Rac have some affectation ?
I just want to be sure about this procedure, before to do anything.
Thank you
J.A.Hi J.A.
Yes you can! However you need to install Grid Infrastructure (GI) in cluster on both nodes, then install database software. Either during software installation or after that, DBCA would allow you to create 1) Single instance, 2) RAC database, 3) RAC One Node database. Keep in mind that RAC One Node is an option (additional license) to the Enterprise Edition of Oracle Database.
I've talked on that topic at the Bulgarian Oracle Users Group at 2011, here is the link to the presentation, you may find it useful. I might upload the videos as well if you need to have something like a proof of concept of just for your own:
http://sve.to/download/1112/
Also I would go with one disk group for both databases, as long as they share the same physical disks I don't see the point of doing that ? Having one diskgroup would allow you to utilize better you disk/space resources.
The procedure would be:
1. Install GI in cluster.
2. Install software libraries.
3. Patch up to 11.2.0.4.
4. Create RAC One Node database using either command line or using Custom Template of DBCA.
At the end of the day, if you have standard edition license you can still install GI in cluster and create single instance databases on each server. The downside of doing that is that you need to manually failover the database to remaining node in case of disaster.
Regards,
Sve -
Oracle Binary Currepted in One node 11g RAC
Hi Team,
/oracle(Oracle Home ) folder currepted in one node.
IBM AIX-11.2.0.1
How to reolve the same.
Thanks
Manohar.1. take the backup of current oracle_home corrupted.
2. Tar the other node oracle_home as below, and copy it to the corrupted node.
tar cvf location <tarname>.tar oraInventory product
3. Extracted the tar on the corrupted node by delting the existing corrupted oracle_home
tar xvf /location/<name>.tar
Now, complete the Oracle RDBMS Cloning Process of the above mentioned untar done in earlier steps:
cd $ORACLE_HOME/clone/bin
Perl clone.pl ORACLE_HOME="<LOCATION>" ORACLE_HOME_NAME="OraDb10g_home1"
Run manual command of “relink all” -
"ORA-00928: missing SELECT keyword" only on one instance in RAC
I am using oracle 9i RAC.When I run a query in one instance it is working fine but in another instance of RAC it is giving "ORA-00928: missing SELECT keyword".
I am wondering why it happening in the same RAC.Please suggest me where is the problem.
ThanksSTAR_TRANSFORMATION_ENABLED=true in both instances
My query lools like
SELECT * FROM
(SELECT coulmn1,col2
FROM tab1 wh full OUTER JOIN tab2 stg on wh.row_id = stg.row_id
WHERE wh.MODIFICATION_NUM IS NULL
OR stg.MODIFICATION_NUM IS NULL
OR wh.MODIFICATION_NUM <> stg.MODIFICATION_NUM) WHERE REC_STA <>'D';
Please suggest
Thanks -
Ons gsd of one node offline,RAC
crs_start ora.whdb02.ons
Attempting to start `ora.whdb02.ons` on member `whdb02`
Start of `ora.whdb02.ons` on member `whdb02` failed.
whdb01 : CRS-1019: Resource ora.whdb02.ons (application) cannot run on whdb01
CRS-0215: Could not start resource 'ora.whdb02.ons'
oracle@whdb02:/oracle/app/oracle/product/10.2.0/db1/network/admin>$CRS_HOME/bin/onsctl start
ksh: /bin/onsctl: not found.
oracle@whdb02:/oracle/app/oracle/product/10.2.0/db1/network/admin>$ORA_CRS_HOME/bin/onsctl start
clsrons_init failed, stat = 504, ocrerr = 32
clsrons_init failed, stat = 504, ocrerr = 32
onsctl: ons failed to start
oracle@whdb02:/oracle/app/oracle/product/10.2.0/db1/network/admin>
oracle@whdb02:/oracle/app/oracle/product/10.2.0/db1/network/admin>
oracle@whdb02:/oracle/app/oracle/product/10.2.0/db1/network/admin>A_CRS_HOME/bin/onsctl STOP
ksh: A_CRS_HOME/bin/onsctl: not found.
oracle@whdb02:/oracle/app/oracle/product/10.2.0/db1/network/admin>$ORA_CRS_HOME/bin/onsctl stop
onsctl: shutting down ons daemon ...
clsrons_init failed, stat = 504, ocrerr = 32
onsctl: shutdown of ons failed!
oracle@whdb02:/oracle/app/oracle/product/10.2.0/db1/network/admin>$ORA_CRS_HOME/bin/onsctl start
clsrons_init failed, stat = 504, ocrerr = 32
clsrons_init failed, stat = 504, ocrerr = 32
onsctl: ons failed to startoracle@whdb02:/oracle/app/oracle/product/10.2.0/db1/network/adgsd log
2010-11-10 13:16:45.515: [ RACG][1] [274628][1][ora.whdb02.gsd]: clsrcexecut: cmd = /oracle/app/oracle/product/10.2.0/crs/bin/racgeut -e USRORA_DEBUG=0 540 /oracle/app/oracle/product/10.2.0/crs/bin/gsdctl stat
2010-11-10 13:16:45.515: [ RACG][1] [274628][1][ora.whdb02.gsd]: clsrcexecut: rc = 1, time = 2.703s
2010-11-10 13:16:45.515: [ RACG][1] [274628][1][ora.whdb02.gsd]: end for resource = ora.whdb02.gsd, action = start, status = 1, time = 54.109s
2010-11-10 13:16:45.989: [ RACG][1] [274634][1][ora.whdb02.gsd]: clsrcgetprsrctx: prsr_init_ext returned rc = 3
2010-11-10 13:16:48.694: [ RACG][1] [274634][1][ora.whdb02.gsd]: GSD is not running on the local node
2010-11-10 14:09:27.008: [ RACG][1] [573578][1][ora.whdb02.gsd]: clsrcgetprsrctx: prsr_init_ext returned rc = 3
2010-11-10 14:10:18.016: [ RACG][1] [573578][1][ora.whdb02.gsd]: Failed to start GSD on local node
2010-11-10 14:10:18.016: [ RACG][1] [573578][1][ora.whdb02.gsd]: clsrcexecut: cmd = /oracle/app/oracle/product/10.2.0/crs/bin/racgeut -e USRORA_DEBUG=0 540 /oracle/app/oracle/product/10.2.0/crs/bin/gsdctl start
2010-11-10 14:10:18.016: [ RACG][1] [573578][1][ora.whdb02.gsd]: clsrcexecut: rc = 1, time = 51.008s
2010-11-10 14:10:20.720: [ RACG][1] [573578][1][ora.whdb02.gsd]: GSD is not running on the local node
2010-11-10 14:10:20.720: [ RACG][1] [573578][1][ora.whdb02.gsd]: clsrcexecut: cmd = /oracle/app/oracle/product/10.2.0/crs/bin/racgeut -e USRORA_DEBUG=0 540 /oracle/app/oracle/product/10.2.0/crs/bin/gsdctl stat
2010-11-10 14:10:20.720: [ RACG][1] [573578][1][ora.whdb02.gsd]: clsrcexecut: rc = 1, time = 2.704s
2010-11-10 14:10:20.720: [ RACG][1] [573578][1][ora.whdb02.gsd]: end for resource = ora.whdb02.gsd, action = start, status = 1, time = 54.109s
2010-11-10 14:10:21.195: [ RACG][1] [573584][1][ora.whdb02.gsd]: clsrcgetprsrctx: prsr_init_ext returned rc = 3
2010-11-10 14:10:23.900: [ RACG][1] [573584][1][ora.whdb02.gsd]: GSD is not running on the local node
2010-11-10 15:33:02.434: [ RACG][1] [618716][1][ora.whdb02.gsd]: clsrcgetprsrctx: prsr_init_ext returned rc = 3
2010-11-10 15:33:56.467: [ RACG][1] [618716][1][ora.whdb02.gsd]: Failed to start GSD on local node
2010-11-10 15:33:56.467: [ RACG][1] [618716][1][ora.whdb02.gsd]: clsrcexecut: cmd = /oracle/app/oracle/product/10.2.0/crs/bin/racgeut -e USRORA_DEBUG=0 540 /oracle/app/oracle/product/10.2.0/crs/bin/gsdctl start
2010-11-10 15:33:56.467: [ RACG][1] [618716][1][ora.whdb02.gsd]: clsrcexecut: rc = 1, time = 54.024s
2010-11-10 15:33:59.171: [ RACG][1] [618716][1][ora.whdb02.gsd]: GSD is not running on the local node
2010-11-10 15:33:59.171: [ RACG][1] [618716][1][ora.whdb02.gsd]: clsrcexecut: cmd = /oracle/app/oracle/product/10.2.0/crs/bin/racgeut -e USRORA_DEBUG=0 540 /oracle/app/oracle/product/10.2.0/crs/bin/gsdctl stat
2010-11-10 15:33:59.171: [ RACG][1] [618716][1][ora.whdb02.gsd]: clsrcexecut: rc = 1, time = 2.704s
2010-11-10 15:33:59.171: [ RACG][1] [618716][1][ora.whdb02.gsd]: end for resource = ora.whdb02.gsd, action = start, status = 1, time = 57.130s
2010-11-10 15:33:59.646: [ RACG][1] [618722][1][ora.whdb02.gsd]: clsrcgetprsrctx: prsr_init_ext returned rc = 3
2010-11-10 15:34:02.351: [ RACG][1] [618722][1][ora.whdb02.gsd]: GSD is not running on the local node
2010-11-10 15:40:29.176: [ RACG][1] [503954][1][ora.whdb02.gsd]: clsrcgetprsrctx: prsr_init_ext returned rc = 3
2010-11-10 15:41:20.184: [ RACG][1] [503954][1][ora.whdb02.gsd]: Failed to start GSD on local node
2010-11-10 15:41:20.184: [ RACG][1] [503954][1][ora.whdb02.gsd]: clsrcexecut: cmd = /oracle/app/oracle/product/10.2.0/crs/bin/racgeut -e USRORA_DEBUG=0 540 /oracle/app/oracle/product/10.2.0/crs/bin/gsdctl start
2010-11-10 15:41:20.184: [ RACG][1] [503954][1][ora.whdb02.gsd]: clsrcexecut: rc = 1, time = 51.007s
2010-11-10 15:41:22.888: [ RACG][1] [503954][1][ora.whdb02.gsd]: GSD is not running on the local node
2010-11-10 15:41:22.889: [ RACG][1] [503954][1][ora.whdb02.gsd]: clsrcexecut: cmd = /oracle/app/oracle/product/10.2.0/crs/bin/racgeut -e USRORA_DEBUG=0 540 /oracle/app/oracle/product/10.2.0/crs/bin/gsdctl stat
2010-11-10 15:41:22.889: [ RACG][1] [503954][1][ora.whdb02.gsd]: clsrcexecut: rc = 1, time = 2.703s
2010-11-10 15:41:22.889: [ RACG][1] [503954][1][ora.whdb02.gsd]: end for resource = ora.whdb02.gsd, action = start, status = 1, time = 54.106s
2010-11-10 15:41:23.373: [ RACG][1] [290992][1][ora.whdb02.gsd]: clsrcgetprsrctx: prsr_init_ext returned rc = 3
2010-11-10 15:41:26.078: [ RACG][1] [290992][1][ora.whdb02.gsd]: GSD is not running on the local node
2010-11-10 15:50:06.328: [ RACG][1] [442492][1][ora.whdb02.gsd]: clsrcgetprsrctx: prsr_init_ext returned rc = 3
2010-11-10 15:50:57.336: [ RACG][1] [442492][1][ora.whdb02.gsd]: Failed to start GSD on local node
2010-11-10 15:50:57.336: [ RACG][1] [442492][1][ora.whdb02.gsd]: clsrcexecut: cmd = /oracle/app/oracle/product/10.2.0/crs/bin/racgeut -e USRORA_DEBUG=0 540 /oracle/app/oracle/product/10.2.0/crs/bin/gsdctl start
2010-11-10 15:50:57.336: [ RACG][1] [442492][1][ora.whdb02.gsd]: clsrcexecut: rc = 1, time = 51.008s
2010-11-10 15:51:00.043: [ RACG][1] [442492][1][ora.whdb02.gsd]: GSD is not running on the local node
2010-11-10 15:51:00.043: [ RACG][1] [442492][1][ora.whdb02.gsd]: clsrcexecut: cmd = /oracle/app/oracle/product/10.2.0/crs/bin/racgeut -e USRORA_DEBUG=0 540 /oracle/app/oracle/product/10.2.0/crs/bin/gsdctl stat
2010-11-10 15:51:00.043: [ RACG][1] [442492][1][ora.whdb02.gsd]: clsrcexecut: rc = 1, time = 2.706s
2010-11-10 15:51:00.043: [ RACG][1] [442492][1][ora.whdb02.gsd]: end for resource = ora.whdb02.gsd, action = start, status = 1, time = 54.114s
2010-11-10 15:51:01.361: [ RACG][1] [618710][1][ora.whdb02.gsd]: clsrcgetprsrctx: prsr_init_ext returned rc = 3
2010-11-10 15:51:04.066: [ RACG][1] [618710][1][ora.whdb02.gsd]: GSD is not running on the local node -
LSNRCTL> start LISTENER_CORPNG04
Starting /ora00/app/oracle/product/11/db1/bin/tnslsnr: please wait...
TNSLSNR for HPUX: Version 11.1.0.7.0 - Production
System parameter file is /ora00/app/oracle/product/11/db1/network/admin/listener.ora
Error listening on: (DESCRIPTION=(ADDRESS=(PROTOCOL=TCP)(HOST=vir_corpng04)(PORT=1521)(IP=FIRST)))
TNS-12545: Connect failed because target host or object does not exist
TNS-12560: TNS:protocol adapter error
TNS-00515: Connect failed because target host or object does not exist
HPUX Error: 227: Can't assign requested address
Listener failed to start. See the error message(s) above...
LSNRCTL>
Plz help its production system
Thanks in Advance
GaganSee link
Rgds -
Scan-vip running only on one RAC node
Hi ,
While setting up RAC11.2 on Centos 5.7 , I was getting this error during the grid installation:
PRCR-1079 : Failed to start resource ora.scan1.vip
CRS-5005: IP Address: 192.168.100.208 is already in use in the network
CRS-2674: Start of 'ora.scan1.vip' on 'falcen6b' failed
CRS-2632: There are no more servers to try to place resource 'ora.scan1.vip' on that would satisfy its placement policy
PRCR-1079 : Failed to start resource ora.scan2.vip
CRS-5005: IP Address: 192.168.100.209 is already in use in the network
CRS-2674: Start of 'ora.scan2.vip' on 'falcen6b' failed
CRS-2632: There are no more servers to try to place resource 'ora.scan2.vip' on that would satisfy its placement policy
PRCR-1079 : Failed to start resource ora.scan3.vip
CRS-5005: IP Address: 192.168.100.210 is already in use in the network
CRS-2674: Start of 'ora.scan3.vip' on 'falcen6b' failed
CRS-2632: There are no more servers to try to place resource 'ora.scan3.vip' on that would satisfy its placement policy
I figured that the scan service is able to run only on one node at a time. When I stopped the service on rac1 and started it on rac2 the service is starting.
But I think for the grid installation the scan service has to simultaneously run on both the nodes.
How do I resolve it?
Any suggestions please.
PS - I am planning to try with the patch 11.0.2.3 but it will be a while till i get access to it.
Till then can someone suggest a workaround?Hi Balazs Papp and onedbguru,
I was able to resolve that error by running the following command on rac2, now that part of the installer passed.
crsctl start res ora.scan1.vip
However the cluster verification utility is failing at the end of installer.
When I executed the below command, this is my output:
[oracle@falcen6a grid]$ ./runcluvfy.sh stage -post crsinst -n falcen6a,falcen6b -verbose
Performing post-checks for cluster services setup
Checking node reachability...
Check: Node reachability from node "falcen6a"
Destination Node Reachable?
falcen6a yes
falcen6b yes
Result: Node reachability check passed from node "falcen6a"
Checking user equivalence...
Check: User equivalence for user "oracle"
Node Name Comment
falcen6b passed
falcen6a passed
Result: User equivalence check passed for user "oracle"
Checking time zone consistency...
Time zone consistency check passed.
Checking Cluster manager integrity...
Checking CSS daemon...
Node Name Status
falcen6b running
falcen6a running
Oracle Cluster Synchronization Services appear to be online.
Cluster manager integrity check passed
UDev attributes check for OCR locations started...
Result: UDev attributes check passed for OCR locations
UDev attributes check for Voting Disk locations started...
Result: UDev attributes check passed for Voting Disk locations
Check default user file creation mask
Node Name Available Required Comment
falcen6b 0022 0022 passed
falcen6a 0022 0022 passed
Result: Default user file creation mask check passed
Checking cluster integrity...
Cluster is divided into 2 partitions
Partition 1 consists of the following members:
Node Name
falcen6b
Partition 2 consists of the following members:
Node Name
falcen6a
Cluster integrity check failed. Cluster is divided into 2 partition(s).
Checking OCR integrity...
Checking the absence of a non-clustered configuration...
All nodes free of non-clustered, local-only configurations
ERROR:
PRVF-4193 : Asm is not running on the following nodes. Proceeding with the remaining nodes.
Checking OCR config file "/etc/oracle/ocr.loc"...
OCR config file "/etc/oracle/ocr.loc" check successful
ERROR:
PRVF-4195 : Disk group for ocr location "+DATA" not available on the following nodes:
Checking size of the OCR location "+DATA" ...
Size check for OCR location "+DATA" successful...
OCR integrity check failed
Checking CRS integrity...
ERROR:
PRVF-5316 : Failed to retrieve version of CRS installed on node "falcen6b"
The Oracle clusterware is healthy on node "falcen6b"
The Oracle clusterware is healthy on node "falcen6a"
CRS integrity check failed
Checking node application existence...
Checking existence of VIP node application
Node Name Required Status Comment
falcen6b yes unknown failed
falcen6a yes unknown failed
Result: Check failed.
Checking existence of ONS node application
Node Name Required Status Comment
falcen6b no unknown ignored
falcen6a no online passed
Result: Check ignored.
Checking existence of GSD node application
Node Name Required Status Comment
falcen6b no unknown ignored
falcen6a no does not exist ignored
Result: Check ignored.
Checking existence of EONS node application
Node Name Required Status Comment
falcen6b no unknown ignored
falcen6a no online passed
Result: Check ignored.
Checking existence of NETWORK node application
Node Name Required Status Comment
falcen6b no unknown ignored
falcen6a no online passed
Result: Check ignored.
Checking Single Client Access Name (SCAN)...
SCAN VIP name Node Running? ListenerName Port Running?
falcen6-scan unknown false LISTENER 1521 false
WARNING:
PRVF-5056 : Scan Listener "LISTENER" not running
Checking name resolution setup for "falcen6-scan"...
SCAN Name IP Address Status Comment
falcen6-scan 192.168.100.210 passed
falcen6-scan 192.168.100.208 passed
falcen6-scan 192.168.100.209 passed
Verification of SCAN VIP and Listener setup failed
OCR detected on ASM. Running ACFS Integrity checks...
Starting check to see if ASM is running on all cluster nodes...
PRVF-5137 : Failure while checking ASM status on node "falcen6b"
Starting Disk Groups check to see if at least one Disk Group configured...
Disk Group Check passed. At least one Disk Group configured
Task ACFS Integrity check failed
Checking Oracle Cluster Voting Disk configuration...
Oracle Cluster Voting Disk configuration check passed
Checking to make sure user "oracle" is not in "root" group
Node Name Status Comment
falcen6b does not exist passed
falcen6a does not exist passed
Result: User "oracle" is not part of "root" group. Check passed
Checking if Clusterware is installed on all nodes...
Check of Clusterware install passed
Checking if CTSS Resource is running on all nodes...
Check: CTSS Resource running on all nodes
Node Name Status
falcen6b passed
falcen6a passed
Result: CTSS resource check passed
Querying CTSS for time offset on all nodes...
Result: Query of CTSS for time offset passed
Check CTSS state started...
Check: CTSS state
Node Name State
falcen6b Observer
falcen6a Observer
CTSS is in Observer state. Switching over to clock synchronization checks using NTP
Starting Clock synchronization checks using Network Time Protocol(NTP)...
NTP Configuration file check started...
The NTP configuration file "/etc/ntp.conf" is available on all nodes
NTP Configuration file check passed
Checking daemon liveness...
Check: Liveness for "ntpd"
Node Name Running?
falcen6b yes
falcen6a yes
Result: Liveness check passed for "ntpd"
Checking NTP daemon command line for slewing option "-x"
Check: NTP daemon command line
Node Name Slewing Option Set?
falcen6b yes
falcen6a yes
Result:
NTP daemon slewing option check passed
Checking NTP daemon's boot time configuration, in file "/etc/sysconfig/ntpd", for slewing option "-x"
Check: NTP daemon's boot time configuration
Node Name Slewing Option Set?
falcen6b yes
falcen6a yes
Result:
NTP daemon's boot time configuration check for slewing option passed
NTP common Time Server Check started...
NTP Time Server "133.243.236.19" is common to all nodes on which the NTP daemon is running
NTP Time Server "133.243.236.18" is common to all nodes on which the NTP daemon is running
NTP Time Server "210.173.160.86" is common to all nodes on which the NTP daemon is running
NTP Time Server ".LOCL." is common to all nodes on which the NTP daemon is running
Check of common NTP Time Server passed
Clock time offset check from NTP Time Server started...
Checking on nodes "[falcen6b, falcen6a]"...
Check: Clock time offset from NTP Time Server
Time Server: 133.243.236.19
Time Offset Limit: 1000.0 msecs
Node Name Time Offset Status
falcen6b 15.332 passed
falcen6a -1.503 passed
Time Server "133.243.236.19" has time offsets that are within permissible limits for nodes "[falcen6b, falcen6a]".
Time Server: 133.243.236.18
Time Offset Limit: 1000.0 msecs
Node Name Time Offset Status
falcen6b 15.115 passed
falcen6a -1.614 passed
Time Server "133.243.236.18" has time offsets that are within permissible limits for nodes "[falcen6b, falcen6a]".
Time Server: 210.173.160.86
Time Offset Limit: 1000.0 msecs
Node Name Time Offset Status
falcen6b 15.219 passed
falcen6a -1.527 passed
Time Server "210.173.160.86" has time offsets that are within permissible limits for nodes "[falcen6b, falcen6a]".
Time Server: .LOCL.
Time Offset Limit: 1000.0 msecs
Node Name Time Offset Status
falcen6b 0.0 passed
falcen6a 0.0 passed
Time Server ".LOCL." has time offsets that are within permissible limits for nodes "[falcen6b, falcen6a]".
Clock time offset check passed
Result: Clock synchronization check using Network Time Protocol(NTP) passed
Oracle Cluster Time Synchronization Services check passed
Post-check for cluster services setup was unsuccessful on all the nodes.
[oracle@falcen6a grid]$
Any suggestions? -
ASM disk busy 99% only on one cluster node
Hello,
We have a three node Oracle RAC cluster. Our dba(s) called us and said they are getting OEM critical alers for an asm disk on one node only. I checked and the SAN attached drive does not show the same high utilization on either of the other two nodes. I checked the hardware and it seems fine. If the issue was with the SAN attached disk, we would be seeing the same errors on all three nodes since they share the same disks. The system crashed last week(alert dump in the +asm directories), and at the disk has been busy ever since. I asked if the dba reviewed the ADDM reports and he said he had and that there were no suspicious looking entries that would lead us to the root cause based on those reports. CPU utilization is fine. I am not sure where to look at this point and any help pointing me in the right direction would be appreciated. They do use RMAN, could there be a backup running using those disks only on one node? Has anyone ever seen this before?
Thank you,
Benita Ulisano
Unix/SAN Team
Chicago Public Schools
[email protected]Hi Harish,
Thank you for responding. To answer your question, yes, the disks are all of the same spec and are shared among the three cluster node. The asm disk sdw1 is the one with the issue.
Problem Node: coefsdb02
three nodes in RAC cluster
coefsdb01, coefsdb02, coefsdb03
iostat results for all three nodes - same disk
coefsdb01
Device: rrqm/s wrqm/s r/s w/s rsec/s wsec/s avgrq-sz avgqu-sz await svctm %util
sdw1 0.00 1.71 0.12 0.58 1.27 18.78 28.63 0.01 13.38 1.75 0.12
coefsdb02
sdw1 0.11 0.02 4.00 0.62 305.84 21.72 70.93 2.96 12.58 211.95 97.88
coefdb03
sdw1 0.21 0.01 4.70 0.33 224.05 13.52 47.22 0.05 10.11 6.15 3.09
The dba(s) run RMAN backups, but only on coefsdb01.
Benita -
Configuring Scheduler only on single node in a clustered environment
Friends,
I have OIM 11gr2 environment in clustered mode with 6 nodes in it.
As per my client requirement, I need to configure scheduler only on one node and No on all other nodes. So, all schedule jobs OOTB and custom jobs should run only on one node.
What is the process for this?
Thanks,
MMCheck this: Managing the Scheduler - 11g Release 2 (11.1.2.1.0)
-Bikash -
When one node reboot other node in RAC
Hi Friends,
I faced one situation where one node of RAC cluster had been rebooted by other node. This happen due to network interconnect link fluctuation.
Sep 13 16:23:48 kkvs1a su: [ID 810491 auth.crit] 'su admin' failed for wipro1 on /dev/pts/3
Sep 14 00:22:17 kkvs1a ixgbe: [ID 611667 kern.info] NOTICE: ixgbe3: link down
Sep 14 00:22:21 kkvs1a ixgbe: [ID 611667 kern.info] NOTICE: ixgbe3: link up, , full duplex
Sep 14 00:22:31 kkvs1a ixgbe: [ID 611667 kern.info] NOTICE: ixgbe1: link down
Sep 14 00:22:31 kkvs1a ixgbe: [ID 611667 kern.info] NOTICE: ixgbe3: link down
/opt/oracle/product/10.2.0/crs/log/node1/alertkk1a.log
==============================================
2013-09-14 00:22:05.180
[cssd(12561)]CRS-1612:node kk1b (2) at 50% heartbeat fatal, eviction in 14.251 seconds
2013-09-14 00:22:12.180
[cssd(12561)]CRS-1611:node kk1b (2) at 75% heartbeat fatal, eviction in 7.251 seconds
2013-09-14 00:22:13.180
[cssd(12561)]CRS-1611:node kk1b (2) at 75% heartbeat fatal, eviction in 6.251 seconds
2013-09-14 00:22:17.179
[cssd(12561)]CRS-1610:node kk1b (2) at 90% heartbeat fatal, eviction in 2.251 seconds
2013-09-14 00:22:18.180
[cssd(12561)]CRS-1610:node kkvs1b (2) at 90% heartbeat fatal, eviction in 1.251 seconds
This clearly shows CSSD of node kkvs1a has given node eviction message to kkvs1b node.
I got following messages on the instance which got rebooted:
ASM alert log:
Sat Sep 14 00:22:25 IST 2013
Error: KGXGN aborts the instance (6)
Sat Sep 14 00:22:25 IST 2013
Errors in file /opt/oracle/admin/+ASM/bdump/+asm2_lmon_8527.trc:
ORA-29702: error occurred in Cluster Group Service operation
LMON: terminating instance due to error 29702
A network fluctuation shouldn't give reboot like this. Then why oracle design like this way? Is this a bug? My oracle version is: 10.2.0.5.0
Could you tell me the other possible situations when 1 RC instance reboots other RAC instacne.What you are describing is the expected behaviour: if your interconnect fails, you will have a node eviction. Releases < 11.2.0.2 evict a node by reboot, which can fix the problem: the NIC may come up correctly when the machine re-starts. Releases >= 11.2.0.2 can often evict without a re-boot. But either way, if your interconnect goes down, a node must be evicted to prevent uncoordinated disc writes.
If you are interested, you can find some discussion and demos of this in a series of webcasts I've recorded,
Free Oracle Database Tutorials for Administration and Developers
If you really don't like this behaviour and the problems are transient, you can try 'raising the CSS MISSCOUNT parameter.
John Watson
Oracle Certified Master DBA -
Direct Traffic to one node in Cloud Service
Is there a way to make a cloud service (web role) always use one of the nodes, there are two in this scenario, and only direct to the other one if the first one gets shut down. We have a piece of legacy software we use in our Cloud Service and we are waiting
on it being updated so it can work across the load balancer, but currently it does not. So we only have one node in the service. I would like to have two and see if we can always hit node 1 for example, and then hit node 2 if 1 is down. I am assuming, maybe
wrongly, that this would still be within SLA, and if maintenance was happening then both nodes would not get shutdown. Is it possible to maybe have a web role class which controls the load balancer traffic?
Thanks
EamonnHi Eamonn,
Thanks for your posting!
>>Is there a way to make a cloud service (web role) always use one of the nodes, there are two in this scenario,...
Base on my experience, if we have one or more instances of a VM (Web or Worker role), the traffic will be distributed amongst the instances. We isn't allowed to specify which instances. Please see David's psot(http://stackoverflow.com/a/12726613
>> I would like to have two and see if we can always hit node 1 for example, and then hit node 2 if 1 is down. I am assuming, maybe.....
About this requirement, I suggest you could try to those steps:
1.create a web role with adding a web service or create a web role using WCF service.
2.create a service with the instance information, like this:
[WebMethod]
public ReturnResult ReverseString(string value)
ReturnResult rr = new ReturnResult();
rr.ReturnString = new string(value);
rr.HostName = RoleEnvironment.CurrentRoleInstance.Id;;
return rr;
//Class
public class ReturnResult
private String returnString;
public String ReturnString
get { return returnString; }
set { returnString = value; }
private String hostName;
public String HostName
get { return hostName; }
set { hostName = value; }
3.Host this service on azure cloud service
4.Create a new project whatever it is webpage or console application.
5.using the Thread Pool or BackgroundWorker to send the concurrent request.
6.Check the results list and host name.
After made it, you could get the every requests hot which instances .
If you'd like to custom the Azure loadbalancer, I recommend you could refer to those documents:
http://msdn.microsoft.com/en-us/library/azure/jj151530.aspx
http://blogs.msdn.com/b/kdot/archive/2013/06/29/implementing-custom-load-balancing-with-sticky-connections-on-windows-azure.aspx
Regards,
Will
We are trying to better understand customer views on social support experience, so your participation in this interview project would be greatly appreciated if you have time. Thanks for helping make community forums a great place.
Click
HERE to participate the survey. -
Database starting on one node.
Hi All,
After reboot of server and ssh key gen. My RAC instance is running only on one node.
What can be problem?
Thanks.Only you know what can be problem, because only you have valuable inforamation. :)
- Have you checked CRS-related logs?
- Have you checked Oracle alert log and related background process trace files?
If anything interesting found, post and let us know.
Maybe you are looking for
-
Java Callout in OSB failing with null pointer exception
Hi, We have a requirement where we need to convert XML String to org.apache.xmlbeans.impl.values.XmlAnyTypeImpl type using java-callout, but value is not getting set when we are trying to do the same. Below is the code we are using in the java callou
-
Yammer recent activity in SharePoint online page
I want to display recent yammer activity on SharePoint online page. can any body explain is it possible? if yes then how? regards Kapil
-
Can someone help me in understanding Public ,Private ,VIP&SCAN for RAC11gr2
Can someone help me in understanding Public ,Private ,VIP&SCAN for RAC11gr2 as i am new to RAC 11gr2 installtion process
-
Hi, we are facing this dump , past one month, any one can please suggest us.. please find error details.. we checked communication between solman and CRP is seems tobe fine.. Runtime Errors CALL_FUNCTION_NOT_FOUND Date and Time 15.0
-
Grep script that looks for paragraph style A followed by paragraph style B
Hello, I am fairly new to scripting in Indesign and I have run across something I want to script that I don't know how to do. Two of the paragraph styles that are in our files are Normal and Song. I need to add an extra paragraph return between all i