BCC failure in CRS
Hi,
I have installed latest version of JDK and Weblogic for 64 bit windows. Performed fresh installation and was successful till the last step as mentioned in the http://docs.oracle.com/cd/E2263001/CRS.1002/pdf/ATGCRSInstall.pdf_
While configuring the full deployment for BCC i.e. after creating the site and agent and publishing the results, I encountered the below issue.
Message Detail:
CONTAINER:atg.deployment.file.DeploymentSourceException: IOException while reading file /atg/registry/data/scenarios/recorders/profileupdate.sdl; SOURCE:deploymentFileCommandIOExcReadFile: level 4: IOException while reading file /atg/registry/data/scenarios/recorders/profileupdate.sdl atg.deployment.file.DeploymentSourceException: IOException while reading file /atg/registry/data/scenarios/recorders/profileupdate.sdl at atg.deployment.file.FileWorkerThread.handleError(FileWorkerThread.java:1256) at atg.deployment.file.FileWorkerThread.runCommand(FileWorkerThread.java:740) at atg.deployment.file.FileWorkerThread.processMarkerForAddUpdatePhase(FileWorkerThread.java:420) at atg.deployment.DeploymentWorkerThread.processMarkerPhase(DeploymentWorkerThread.java:534) at atg.deployment.DeploymentWorkerThread.run(DeploymentWorkerThread.java:307)
Can anyone please help me on this. Thanks in advance!
-Mohammed
Can you do a
DELETE FROM epub_file_asset;
in publishing schema.
Also can you remove the tmp, data and work folders from your jboss server.
Restart the servers and try deploying.
Thanks
Similar Messages
-
Error on BCC full deployment | CRS set up
All,
I have completed installation and configuration of CRS and I am able to browse the store and BCC applications. To transfer assets from Publishing to Production environment, I am getting the following error when i initiate 'Create Initial Deployment'.
On Browser
Deployment '1700002' to target 'tar171' encountered a system level deployment error during data transfer.
On server log I am getting below error
21:54:10,067 ERROR [DeploymentManager]
CAUGHT AT:
CONTAINER:atg.repository.RepositoryException; SOURCE:com.mysql.jdbc.exceptions.jdbc4.MySQLSyntaxErrorException: You have an error in your SQL syntax;
check the manual that corresponds to your MySQL server version for the right syntax to use near 't1
FROM dcspp_giftcert t1, dcspp_claimable t2
WHERE' at line 1
at atg.adapter.gsa.Utils.deleteAllItems(Utils.java:1384)
at atg.adapter.gsa.Utils.deleteAllItems(Utils.java:1524)
at atg.adapter.gsa.Utils.deleteAllItems(Utils.java:1674)
at atg.deployment.repository.RepositoryWorkerThread.prepareFullDeployment(RepositoryWorkerThread.java:636)
at atg.deployment.repository.RepositoryWorkerThread.preDeploymentPhase(RepositoryWorkerThread.java:525)
at atg.deployment.DeploymentWorkerThread.run(DeploymentWorkerThread.java:310)
Caused by: com.mysql.jdbc.exceptions.jdbc4.MySQLSyntaxErrorException: You have an error in your SQL syntax; check the manual that corresponds to your
MySQL server version for the right syntax to use near 't1
FROM dcspp_giftcert t1, dcspp_claimable t2
WHERE' at line 1
at sun.reflect.NativeConstructorAccessorImpl.newInstance0(Native Method)
at sun.reflect.NativeConstructorAccessorImpl.newInstance(NativeConstructorAccessorImpl.java:39)
at sun.reflect.DelegatingConstructorAccessorImpl.newInstance(DelegatingConstructorAccessorImpl.java:27)
at java.lang.reflect.Constructor.newInstance(Constructor.java:513)
at com.mysql.jdbc.Util.handleNewInstance(Util.java:407)
Also I see below error as well
21:54:10,770 ERROR [DeploymentServer]
atg.deployment.common.DeploymentException: Deployment '1700002' to target 'Production' encountered a system level deployment error during data transf
er.
at atg.deployment.common.ResourceUtil.exception(ResourceUtil.java:306)
at atg.deployment.adapter.DistributedDeploymentAdapter.deployData(DistributedDeploymentAdapter.java:2238)
at atg.deployment.adapter.DistributedDeploymentAdapter.transferData(DistributedDeploymentAdapter.java:305)
at atg.deployment.server.Deployment.run(Deployment.java:1688)
at java.lang.Thread.run(Thread.java:662)
I am using; ATG10.0.2, Java 1.6.0_27, Jboss-5.1.0.GA and MySQL 5.5
Thanks in advance.
Madhu Patchava
Edited by: atg.ebox on Nov 12, 2011 10:13 PMDear Madhu,
you have asked a very technical question on a business forum, I would suggest asking this in a more technical area or escalating to ATG Support
sorry we can't help you
Lisa -
How can you identify when a RP has a bootflash failure on CRS-1?
Hi Everyone,
I am hoping someone can assist me in identifying when a RP has a bootflash issue and requires RMA. We have some CRS-1 running in our network with IOS XR 3.8.2 and some with 4.0.3. All the commands that I am aware of do not identify when a RP has a problem in bootflash. CCO recommends that if there is a problem with upgrade and you are moving to a code using filesystem fat 32 that you format the bootflash of the RP's. It was at this point we started having issues. We went into ROMMON and recieved the following error.
rommon B1 > dir bootflash:disk0/hfr-os-mbi-4.0.3 Checksum failed on hfr-fslib-m
Expected checksum: 6a53, calculated checksum: beba
open: file "hfr-fslib-m" not found
loadprog: error - on file open
cannot load the monitor library "bootflash:%hfr-fslib-m" from device
If someone has some insight on how we can validate the state of the RP outside of the typical command set:
show redundancy summary
show platform
etc
I'd appreciate it.
Cheers,
RashmiHi,
Provide some more information on your problem.
If you want to check the directory contect try
dir bootflash:
is it displaying any output?
In general above error doesn't indicate that there is any problem but it is not able to find particular directory or file.
Thanks
Parthiv -
Crs will not start on 10.2.0.3
Hi All !
RHEL 4 AS U2, 2.6.9-42.0.3.EL
After installation of a patch 10.2.0.3 it has ceased to be started crs.
[root@racnode1 init.d] ./init.crs start
Startup will be queued to init within 30 seconds
Thus crs will not start
cd /u01/app/oracle/product/10.2.0/crs/install
[root@racnode1 install] ./root102.sh
All fades on string "Starting will be queued to init withhin 30 seconds"
In crsd.log:
2007-02-01 12:11:53:10.143 [ COMMCRS][2808851376]clsc_connect (0x8b5b930) no listener at (ADDRESS=(PROTOCOL=ipc)(KEY=SYSTEM.evm.acceptor.auth))
Help me please !!!We experienced the same failure with CRS a none start after the patch (from 10.2.0.1 to 10.2.0.3)-- running RH 4.3 with Linux 2.6.9-42.ELsmp on 32 bit HP boxen. SELINUX is very disabled! Logging is no help. Could there be a connection with the following?:
11.8 VIP May Relocate to the Last Node While Upgrading Oracle
Clusterware
Rolling upgrade or upgrade of Oracle Clusterware to 10.2.0.3 patch set
may cause VIP to move to the last node.
Workaround:
Enter the following command to relocate the VIP to the preferred node:
crs_relocate VIP Resource
Then try to restart the CRS daemons
This issue is tracked with Oracle bug 5673067.
However, I cannot find a bug with that id...
Hope an answer is forthcoming, as this is holding up the proof-of-concept test for a mid-sized project...
Thanks -
Oracle 10g CRS autorecovery from network failures - Solaris with IPMP
Hi all,
Just wondering if anyone has experience with a setup similar to mine. Let me first apologise for the lengthy introduction that follows >.<
A quick run-down of my implementation: Sun SPARC Solaris 10, Oracle CRS, ASM and RAC database patched to version 10.2.0.4 respectively, no third-party cluster software used for a 2-node cluster. Additionally, the SAN storage is attached directly with fiber cable to both servers, and the CRS files (OCR, voting disks) are always visible to the servers, there is no switch/hub between the server and the storage. There is IPMP configured for both the public and interconnect network devices. When performing the usual failover tests for IPMP, both the OS logs and the CRS logs show a failure detected, and a failover to the surviving network interface (on both the public and the private network devices).
For the private interconnect, when both of the network devices are disabled (by manually disconnecting the network cables), this results in the 2nd node rebooting, and the CRS process starting, but unable to synchronize with the 1st node (which is running fine the whole time). Further, when I look at the CRS logs, it is able to correctly identify all the OCR files and voting disks. When the network connectivity is restored, both the OS and CRS logs reflect this connection has been repaired. However, the CRS logs at this point still state that node 1 (which is running fine) is down, and the 2nd node attempts to join the cluster as the master node. When I manually run the 'crsctl stop crs' and 'crsctl start crs' commands, this results in a message stating that the node is going to be rebooted to ensure cluster integrity, and the 2nd node reboots, starts the CRS daemons again at startup, and joins the cluster normally.
For the public network, when the 2nd node is manually disconnected, the VIP is seen to not failover, and any attempts to connect to this node via the VIP result in a timeout. When connectivity is restored, as expected the OS and CRS logs acknowledge the recovery, and the VIP for node 2 automatically fails over, but the listener goes down as well. Using the 'srvctl start listener' command brings it up again, and everything is fine. During this whole process, the database instance runs fine on both nodes.
From the case studies above, I can see that the network failures are detected by the Oracle Clusterware, and a simple command run once this failure is repaired restores full functionality to the RAC database. However, is there anyway to automate this recovery, for the 2 cases stated above, so that there is no need for manual intervention by the DBAs? I was able to test case 2 (public network) with the Oracle document 805969.1 (VIP does not relocate back to the original node after public network problem is resolved), is there a similar workaround for the interconnect?
Any and all pointers would be appreciated, and again, sorry for the lengthy post.
Edited by: NS Selvam on 16-Dec-2009 20:36
changed some minor typoshi
i ve given the shell script.i just need to run that i usually get the op like
[root@rac-1 Desktop]# sh iscsi-corntab.sh
Logging in to [iface: default, target: iqn.2010-02-23.de.sayantan-chakraborty:storage.disk1.amiens.sys1.xyz, portal: 192.168.181.10,3260]
Login to [iface: default, target: iqn.2010-02-23.de.sayantan-chakraborty:storage.disk1.amiens.sys1.xyz, portal: 192.168.181.10,3260]: successfulthe script contains :
iscsiadm -m node -T iqn.2010-02-23.de.sayantan-chakraborty:storage.disk1.amiens.sys1.xyz -p 192.168.181.10 -l
iscsiadm -m node -T iqn.2010-02-23.de.sayantan-chakraborty:storage.disk1.amiens.sys1.xyz -p 192.168.181.10 --op update -n node.startup -v automatic
(cd /dev/disk/by-path; ls -l *sayantan-chakraborty* | awk '{FS=" "; print $9 " " $10 " " $11}')
[root@rac-1 Desktop]# (cd /dev/disk/by-path; ls -l *sayantan-chakraborty* | awk '{FS=" "; print $9 " " $10 " " $11}')
ip-192.168.181.10:3260-iscsi-iqn.2010-02-23.de.sayantan-chakraborty:storage.disk1.amiens.sys1.xyz-lun-1 -> ../../sdc
[root@rac-1 Desktop]# can you post the oput of ls /dev/iscsi ??you may get like this:
[root@rac-1 Desktop]# ls /dev/iscsi
xyz
[root@rac-1 Desktop]# -
CRS Historical Reporting - printing failure !
Hello All,
I am getting consistent failures for my users when they try to print from UCCX 8.5 CRS Historical Reporting. From document ID 108553, and other references, I have set the shared memory to 1 from default of 0 in hrcconfig.ini, still no joy. The error is and remains could not start print job with a window title of crystal report viewer. Our environment has Microsoft print queues set up for users. Any suggestions? Thanks in advance.I believe you are running into this bug:
http://cdetsweb-prd.cisco.com/apps/dumpcr?identifier=CSCtn20575&parentprogram=QDDTS
CSCtn20575 Could not start print job to network on HRC for CCX 8.0(2)
Symptom:
Cutomers on CCX 8.0 may not be able to print to the network in Historical Reports Client even though all other print jobs work fine. Error message states "Could not start print job" from the Crystal report viewer.
Conditions:
Only affects customers on the 8.0 platform of UCCX. Does not seem to matter if they are on a client system of XP or Win7.
Workaround:
They can either map the HRC user system directly to the network printer, bypassing the print server, or save the report to the PC and then open in a PDF viewer and print from there. Bypassing the print server seems to be a good option. -
Auto alert mechanism for ATG scheduled job failure and BCC project failure
Hello all,
Could you please confirm if there are auto alert mechanisms for ATG scheduled job failure and BCC project failure?
Waiting for reply.
Thanks and regards,Hi,
You need to write custom code to get alerts if an ATG Scheduler fails.
For BCC project deployment monitoring, please refer to the below link in the documentation.
Oracle ATG Web Commerce - Configure Deployment Event Listeners
Thanks,
Gopinath Ramasamy -
Clusterware Install:root.sh- Failure at final check of Oracle CRS stack. 10
Hello All,
Image: !http://systemwars.com/rac/cluster_back.jpg!
I was attempting to perform the steps in:
Link: http://www.oracle-base.com/articles/11g/OracleDB11gR1RACInstallationOnLinuxUsingNFS.php
The only difference is that I decided to use fedora core 12 instead. I did this because I added a second NIC card (USB) and only FC12 would recognize it. I tried to get it to work on Cent 5 but it just wouldn't. The second nic on each machine eth1 are connected via crossover cable, and the interfaces can ping each other just fine, rac1-priv and rac2-priv.
So here is my setup:
# Public
192.168.2.11 rac1.localdomain rac1
192.168.2.12 rac2.localdomain rac2
#Private
192.168.0.11 rac1-priv.localdomain rac1-priv
192.168.0.12 rac2-priv.localdomain rac2-priv
#Virtual
192.168.2.111 rac1-vip.localdomain rac1-vip
192.168.2.112 rac2-vip.localdomain rac2-vip
#NAS
192.168.2.10 mini.localdomain mini
Mini refers to my Mac mini which I decided to use as the 3rd "server" in the group. I was able to mount/read & write to the file systems just fine. As you can see.
[root@rac1 ~]# df
Filesystem 1K-blocks Used Available Use% Mounted on
/dev/mapper/vg_rac1-lv_root
8063408 5156268 2497540 68% /
tmpfs 1417456 0 1417456 0% /dev/shm
/dev/sda1 198337 22080 166017 12% /boot
mini:/shared_config 488050688 76719808 411074880 16% /u01/shared_config
mini:/shared_crs 488050688 76719808 411074880 16% /u01/app/crs/product/11.1.0/crs
mini:/shared_home 488050688 76719808 411074880 16% /u01/app/oracle/product/11.1.0/db_1
mini:/shared_data 488050688 76719808 411074880 16% /u01/oradata
[root@rac1 ~]# ssh rac2
Last login: Mon Dec 21 19:33:38 2009 from rac1.localdomain
[root@rac2 ~]# df
Filesystem 1K-blocks Used Available Use% Mounted on
/dev/mapper/vg_rac2-lv_root
8063408 4958008 2695800 65% /
tmpfs 1417456 0 1417456 0% /dev/shm
/dev/sda1 198337 22063 166034 12% /boot
mini:/shared_config 488050688 76719808 411074880 16% /u01/shared_config
mini:/shared_crs 488050688 76719808 411074880 16% /u01/app/crs/product/11.1.0/crs
mini:/shared_home 488050688 76719808 411074880 16% /u01/app/oracle/product/11.1.0/db_1
mini:/shared_data 488050688 76719808 411074880 16% /u01/oradata[color]
CLUSTER VERIFY SEEMS OK APART FROM ONE WARNING
WARNING:
Could not find a suitable set of interfaces for VIPs.
which according to this link, "can be safety ignored", although I noticed in the link its an actual ERROR and not a WARNING => http://www.idevelopment.info/data/Oracle/DBA_tips/Oracle10gRAC/CLUSTER_11.shtml . I also noted that it saw the public IPs as the possible priv IPs, which I also thought could safety be ignored.
oracle@rac1 clusterware]$ ./runcluvfy.sh stage -pre crsinst -n rac1,rac2 -verbose
Performing pre-checks for cluster services setup
Checking node reachability...
Check: Node reachability from node "rac1"
Destination Node Reachable?
rac2 yes
rac1 yes
Result: Node reachability check passed from node "rac1".
Checking user equivalence...
Check: User equivalence for user "oracle"
Node Name Comment
rac2 passed
rac1 passed
Result: User equivalence check passed for user "oracle".
Checking administrative privileges...
Check: Existence of user "oracle"
Node Name User Exists Comment
rac2 yes passed
rac1 yes passed
Result: User existence check passed for "oracle".
Check: Existence of group "oinstall"
Node Name Status Group ID
rac2 exists 501
rac1 exists 501
Result: Group existence check passed for "oinstall".
Check: Membership of user "oracle" in group "oinstall" [as Primary]
Node Name User Exists Group Exists User in Group Primary Comment
rac2 yes yes yes yes passed
rac1 yes yes yes yes passed
Result: Membership check for user "oracle" in group "oinstall" [as Primary] passed.
Administrative privileges check passed.
Checking node connectivity...
Interface information for node "rac2"
Interface Name IP Address Subnet Subnet Gateway Default Gateway Hardware Address
eth0 192.168.2.12 192.168.2.0 0.0.0.0 192.168.2.1 00:01:6C:XXXX
eth2 192.168.0.12 192.168.0.0 0.0.0.0 192.168.2.1 00:25:4B:XXXX
Interface information for node "rac1"
Interface Name IP Address Subnet Subnet Gateway Default Gateway Hardware Address
eth0 192.168.2.11 192.168.2.0 0.0.0.0 192.168.2.1 00:01:6CXXXXX
eth1 192.168.0.11 192.168.0.0 0.0.0.0 192.168.2.1 00:25:4B:XXXX
Check: Node connectivity of subnet "192.168.2.0"
Source Destination Connected?
rac2:eth0 rac1:eth0 yes
Result: Node connectivity check passed for subnet "192.168.2.0" with node(s) rac2,rac1.
Check: Node connectivity of subnet "192.168.0.0"
Source Destination Connected?
rac2:eth2 rac1:eth1 yes
Result: Node connectivity check passed for subnet "192.168.0.0" with node(s) rac2,rac1.
Interfaces found on subnet "192.168.2.0" that are likely candidates for a private interconnect:
rac2 eth0:192.168.2.12
rac1 eth0:192.168.2.11
WARNING:
Could not find a suitable set of interfaces for VIPs.
Result: Node connectivity check passed.
Checking system requirements for 'crs'...
Check: Total memory
Node Name Available Required Comment
rac2 2.7GB (2834912KB) 1GB (1048576KB) passed
rac1 2.7GB (2834912KB) 1GB (1048576KB) passed
Result: Total memory check passed.
Check: Free disk space in "/tmp" dir
Node Name Available Required Comment
rac2 4.58GB (4805204KB) 400MB (409600KB) passed
rac1 10.51GB (11015624KB) 400MB (409600KB) passed
Result: Free disk space check passed.
Check: Swap space
Node Name Available Required Comment
rac2 2GB (2097144KB) 1.5GB (1572864KB) passed
rac1 3GB (3145720KB) 1.5GB (1572864KB) passed
Result: Swap space check passed.
Check: System architecture
Node Name Available Required Comment
rac2 i686 i686 passed
rac1 i686 i686 passed
Result: System architecture check passed.
Check: Kernel version
Node Name Available Required Comment
rac2 2.6.31.5-127.fc12.i686.PAE 2.6.9 passed
rac1 2.6.31.5-127.fc12.i686.PAE 2.6.9 passed
Result: Kernel version check passed.
Check: Package existence for "make-3.81"
Node Name Status Comment
rac2 make-3.81-18.fc12.i686 passed
rac1 make-3.81-18.fc12.i686 passed
Result: Package existence check passed for "make-3.81".
Check: Package existence for "binutils-2.17.50.0.6"
Node Name Status Comment
rac2 binutils-2.19.51.0.14-34.fc12.i686 passed
rac1 binutils-2.19.51.0.14-34.fc12.i686 passed
Result: Package existence check passed for "binutils-2.17.50.0.6".
Check: Package existence for "gcc-4.1.1"
Node Name Status Comment
rac2 gcc-4.4.2-7.fc12.i686 passed
rac1 gcc-4.4.2-7.fc12.i686 passed
Result: Package existence check passed for "gcc-4.1.1".
Check: Package existence for "libaio-0.3.106"
Node Name Status Comment
rac2 libaio-0.3.107-9.fc12.i686 passed
rac1 libaio-0.3.107-9.fc12.i686 passed
Result: Package existence check passed for "libaio-0.3.106".
Check: Package existence for "libaio-devel-0.3.106"
Node Name Status Comment
rac2 libaio-devel-0.3.107-9.fc12.i686 passed
rac1 libaio-devel-0.3.107-9.fc12.i686 passed
Result: Package existence check passed for "libaio-devel-0.3.106".
Check: Package existence for "libstdc++-4.1.1"
Node Name Status Comment
rac2 libstdc++-4.4.2-7.fc12.i686 passed
rac1 libstdc++-4.4.2-7.fc12.i686 passed
Result: Package existence check passed for "libstdc++-4.1.1".
Check: Package existence for "elfutils-libelf-devel-0.125"
Node Name Status Comment
rac2 elfutils-libelf-devel-0.143-1.fc12.i686 passed
rac1 elfutils-libelf-devel-0.143-1.fc12.i686 passed
Result: Package existence check passed for "elfutils-libelf-devel-0.125".
Check: Package existence for "sysstat-7.0.0"
Node Name Status Comment
rac2 sysstat-9.0.4-4.fc12.i686 passed
rac1 sysstat-9.0.4-4.fc12.i686 passed
Result: Package existence check passed for "sysstat-7.0.0".
Check: Package existence for "compat-libstdc++-33-3.2.3"
Node Name Status Comment
rac2 compat-libstdc++-33-3.2.3-68.i686 passed
rac1 compat-libstdc++-33-3.2.3-68.i686 passed
Result: Package existence check passed for "compat-libstdc++-33-3.2.3".
Check: Package existence for "libgcc-4.1.1"
Node Name Status Comment
rac2 libgcc-4.4.2-7.fc12.i686 passed
rac1 libgcc-4.4.2-7.fc12.i686 passed
Result: Package existence check passed for "libgcc-4.1.1".
Check: Package existence for "libstdc++-devel-4.1.1"
Node Name Status Comment
rac2 libstdc++-devel-4.4.2-7.fc12.i686 passed
rac1 libstdc++-devel-4.4.2-7.fc12.i686 passed
Result: Package existence check passed for "libstdc++-devel-4.1.1".
Check: Package existence for "unixODBC-2.2.11"
Node Name Status Comment
rac2 unixODBC-2.2.14-6.fc12.i686 passed
rac1 unixODBC-2.2.14-9.fc12.i686 passed
Result: Package existence check passed for "unixODBC-2.2.11".
Check: Package existence for "unixODBC-devel-2.2.11"
Node Name Status Comment
rac2 unixODBC-devel-2.2.14-6.fc12.i686 passed
rac1 unixODBC-devel-2.2.14-9.fc12.i686 passed
Result: Package existence check passed for "unixODBC-devel-2.2.11".
Check: Package existence for "glibc-2.5-12"
Node Name Status Comment
rac2 glibc-2.11-2.i686 passed
rac1 glibc-2.11-2.i686 passed
Result: Package existence check passed for "glibc-2.5-12".
Check: Group existence for "dba"
Node Name Status Comment
rac2 exists passed
rac1 exists passed
Result: Group existence check passed for "dba".
Check: Group existence for "oinstall"
Node Name Status Comment
rac2 exists passed
rac1 exists passed
Result: Group existence check passed for "oinstall".
Check: User existence for "nobody"
Node Name Status Comment
rac2 exists passed
rac1 exists passed
Result: User existence check passed for "nobody".
System requirement passed for 'crs'
Pre-check for cluster services setup was successful. So now here is the actual problem:
After the installation and during the run of the root.sh I get:
Failure at final check of Oracle CRS stack.
10
[root@rac1 crs]# ./root.sh
WARNING: directory '/u01/app/crs/product/11.1.0' is not owned by root
WARNING: directory '/u01/app/crs/product' is not owned by root
WARNING: directory '/u01/app/crs' is not owned by root
WARNING: directory '/u01/app' is not owned by root
Checking to see if Oracle CRS stack is already configured
/etc/oracle does not exist. Creating it now.
Setting the permissions on OCR backup directory
Setting up Network socket directories
Oracle Cluster Registry configuration upgraded successfully
The directory '/u01/app/crs/product/11.1.0' is not owned by root. Changing owner to root
The directory '/u01/app/crs/product' is not owned by root. Changing owner to root
The directory '/u01/app/crs' is not owned by root. Changing owner to root
The directory '/u01/app' is not owned by root. Changing owner to root
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 1: rac1 rac1-priv rac1
node 2: rac2 rac2-priv rac2
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Now formatting voting device: /u01/shared_config/voting_disk
Format of 1 voting devices complete.
Startup will be queued to init within 30 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
Failure at final check of Oracle CRS stack.
10According to this link => http://blog.contractoracle.com/2009/01/failure-at-final-check-of-oracle-crs.html
To recover from a status 10, one must check:
check firewall / routing / iptables issues
Now I have turned iptables off completely it doesnt even start up at boot time, so I know it can't be that.
ROUTE
[oracle@rac1 clusterware]$ route
Kernel IP routing table
Destination Gateway Genmask Flags Metric Ref Use Iface
192.168.2.0 * 255.255.255.0 U 1 0 0 eth0
192.168.0.0 * 255.255.255.0 U 1 0 0 eth1
default 192.168.2.1 0.0.0.0 UG 0 0 0 eth0
[oracle@rac2 ~]$ route
Kernel IP routing table
Destination Gateway Genmask Flags Metric Ref Use Iface
192.168.2.0 * 255.255.255.0 U 1 0 0 eth0
192.168.0.0 * 255.255.255.0 U 1 0 0 eth2
default 192.168.2.1 0.0.0.0 UG 0 0 0 eth0
[oracle@rac1 clusterware]$ traceroute rac2
traceroute to rac2 (192.168.2.12), 30 hops max, 60 byte packets
1 rac2.localdomain (192.168.2.12) 0.424 ms 0.427 ms 0.096 ms
[oracle@rac1 clusterware]$ traceroute rac2-priv
traceroute to rac2-priv (192.168.0.12), 30 hops max, 60 byte packets
1 rac2-priv.localdomain (192.168.0.12) 1.336 ms 1.238 ms 1.188 ms
[oracle@rac1 clusterware]$ traceroute rac2-vip
traceroute to rac2-vip (192.168.2.112), 30 hops max, 60 byte packets
1 rac1.localdomain (192.168.2.11) 2999.599 ms !H 2999.560 ms !H 2999.523 ms !H
[oracle@rac1 bin]$ ./crs_stat -t
CRS-0184: Cannot communicate with the CRS daemon.
Both rac1 and rac2 get the same output above with the -vip getting !H => !H, !N, or !P (host, network or protocol unreachable), I am assuming this is normal as CRS install did not complete successfully and the virtual IP is not bound yet.
Im pretty sure I have some kind of networking issue here, but I cant put my finger on it. I have tried absolutely everything that is suggested on the internet that I could find. Even deleting the /tmp/.oracle and /var/tmp/.oracle but nothing works. Ssh keys for root and oracle users exist and Ive connected using every possible combination to avoid that first time ssh prompt so users oracle on each node goes directly into rac1/rac2 rac1-priv/rac2-priv & actual IPs as well. Any ideas?
Edited by: Javier on Dec 30, 2009 12:34 PM
Edited by: Javier on Dec 30, 2009 6:58 PMHello
Note 370605.1 (Clusterware Intermittently Hangs And Commands Fail With CRS-184) is telling this.
"This is caused by a cron job that cleans up the /tmp directory which also removes the Oracle socket files in /tmp/.oracle
Do not remove /tmp/.oracle or /var/tmp/.oracle or its files while Oracle Clusterware is up."
Best Regards... -
Failure at final check of Oracle CRS stack.10 on the second node
Hi,
I am trying to install Oracle Clusterware 10.2.0.1.0 in VM machines (2 nodes config) in Linux (OEL5) using VMware Server (2.0). Everything went very well one the first node upto running the root.sh. Running root.sh ended with Failure at final check of Oracle CRS stack 10 error.
RAC1 root.sh output
[root@rac1 crs]# ./root.sh
WARNING: directory '/u01/crs/oracle/product/10.2.0' is not owned by root
WARNING: directory '/u01/crs/oracle/product' is not owned by root
WARNING: directory '/u01/crs/oracle' is not owned by root
WARNING: directory '/u01/crs' is not owned by root
WARNING: directory '/u01' is not owned by root
Checking to see if Oracle CRS stack is already configured
/etc/oracle does not exist. Creating it now.
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/u01/crs/oracle/product/10.2.0' is not owned by root
WARNING: directory '/u01/crs/oracle/product' is not owned by root
WARNING: directory '/u01/crs/oracle' is not owned by root
WARNING: directory '/u01/crs' is not owned by root
WARNING: directory '/u01' is not owned by root
assigning default hostname rac1 for node 1.
assigning default hostname rac2 for node 2.
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 1: rac1 rac1-priv rac1
node 2: rac2 rac2-priv rac2
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Now formatting voting device: /dev/raw/raw2
Format of 1 voting devices complete.
Startup will be queued to init within 90 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
CSS is active on these nodes.
rac1
CSS is inactive on these nodes.
rac2
Local node checking complete.
Run root.sh on remaining nodes to start CRS daemons.
[root@rac1 crs]#
RAC2 root.sh output
[root@rac2 crs]# ./root.sh
WARNING: directory '/u01/crs/oracle/product/10.2.0' is not owned by root
WARNING: directory '/u01/crs/oracle/product' is not owned by root
WARNING: directory '/u01/crs/oracle' is not owned by root
WARNING: directory '/u01/crs' is not owned by root
WARNING: directory '/u01' is not owned by root
Checking to see if Oracle CRS stack is already configured
/etc/oracle does not exist. Creating it now.
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
WARNING: directory '/u01/crs/oracle/product/10.2.0' is not owned by root
WARNING: directory '/u01/crs/oracle/product' is not owned by root
WARNING: directory '/u01/crs/oracle' is not owned by root
WARNING: directory '/u01/crs' is not owned by root
WARNING: directory '/u01' is not owned by root
assigning default hostname rac1 for node 1.
assigning default hostname rac2 for node 2.
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 1: rac1 rac1-priv rac1
node 2: rac2 rac2-priv rac2
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Now formatting voting device: /dev/raw/raw2
Format of 1 voting devices complete.
Startup will be queued to init within 90 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
Failure at final check of Oracle CRS stack.
10
[root@rac2 crs]#
Output of alterrac2.log
[root@rac2 rac2]# more alertrac2.log
2009-08-14 23:02:44.699
[client(5935)]CRS-1006:The OCR location /dev/raw/raw1 is inaccessible. Details in /u01/crs/oracle/product/10.2.
0/crs/log/rac2/client/ocrconfig_5935.log.
2009-08-14 23:02:44.704
[client(5935)]CRS-1006:The OCR location /dev/raw/raw1 is inaccessible. Details in /u01/crs/oracle/product/10.2.
0/crs/log/rac2/client/ocrconfig_5935.log.
2009-08-14 23:02:44.707
[client(5935)]CRS-1006:The OCR location /dev/raw/raw1 is inaccessible. Details in /u01/crs/oracle/product/10.2.
0/crs/log/rac2/client/ocrconfig_5935.log.
2009-08-14 23:02:44.864
[client(5935)]CRS-1001:The OCR was formatted using version 2.
2009-08-14 23:02:50.339
[client(6004)]CRS-1801:Cluster crs configured with nodes rac1 rac2 .
2009-08-14 23:05:07.603
[cssd(6600)]CRS-1605:CSSD voting file is online: /dev/raw/raw2. Details in /u01/crs/oracle/product/10.2.0/crs/l
og/rac2/cssd/ocssd.log.
[root@rac2 rac2]#
Since raw devices are not supported from OEL5, I did do the workaround in *63-oracle-raw.rules file under /etc/udev/rules.d* dir.
ACTION=="add", KERNEL=="sdb1", RUN+="/bin/raw /dev/raw/raw1 %N"
ACTION=="add", KERNEL=="sdc1", RUN+="/bin/raw /dev/raw/raw2 %N"
ACTION=="add", KERNEL=="sdd1", RUN+="/bin/raw /dev/raw/raw3 %N"
ACTION=="add", KERNEL=="sde1", RUN+="/bin/raw /dev/raw/raw4 %N"
ACTION=="add", KERNEL=="sdf1", RUN+="/bin/raw /dev/raw/raw5 %N"
KERNEL=="raw[1-2]*", OWNER="root", GROUP="oinstall", MODE="640"
KERNEL=="raw[3-5]*", OWNER="oracle", GROUP="oinstall", MODE="640"
One thing I have noticed after running root.sh on both the nodes is the permissons on raw devices changed from
Before root.sh
[root@rac2 crs]# ls -ls /dev/raw*
0 crw------- 1 root root 162, 0 Aug 14 22:42 /dev/rawctl
/dev/raw:
total 0
0 crw-r----- 1 root oinstall 162, 1 Aug 14 22:42 raw1
0 crw-r----- 1 root oinstall 162, 2 Aug 14 22:42 raw2
0 crw-r----- 1 oracle oinstall 162, 3 Aug 14 22:42 raw3
0 crw-r----- 1 oracle oinstall 162, 4 Aug 14 22:42 raw4
0 crw-r----- 1 oracle oinstall 162, 5 Aug 14 22:42 raw5
to
[root@rac2 crs]# ls -ls /dev/raw*
0 crw------- 1 root root 162, 0 Aug 14 22:31 /dev/rawctl
/dev/raw:
total 0
0 crw-r----- 1 root oinstall 162, 1 Aug 14 22:56 raw1
0 crw-r--r-- 1 oracle oinstall 162, 2 Aug 14 23:01 raw2
0 crw-r----- 1 oracle oinstall 162, 3 Aug 14 22:31 raw3
0 crw-r----- 1 oracle oinstall 162, 4 Aug 14 22:31 raw4
0 crw-r----- 1 oracle oinstall 162, 5 Aug 14 22:31 raw5
[root@rac1 crs]#
My shared disk listing
[root@www shared]# ls -ltr
total 8780
-rw------- 1 root root 640 Aug 14 22:43 votingdisk.vmdk
-rw------- 1 root root 598 Aug 14 22:43 ocr.vmdk
-rw------- 1 root root 604 Aug 14 22:43 asm3.vmdk
-rw------- 1 root root 604 Aug 14 22:43 asm2.vmdk
-rw------- 1 root root 604 Aug 14 22:43 asm1.vmdk
-rw------- 1 root root 65536 Aug 14 22:44 votingdisk-s006.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 votingdisk-s005.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 votingdisk-s004.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 votingdisk-s003.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 votingdisk-s002.vmdk
-rw------- 1 root root 393216 Aug 14 22:44 votingdisk-s001.vmdk
-rw------- 1 root root 65536 Aug 14 22:44 ocr-s006.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 ocr-s005.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 ocr-s004.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 ocr-s003.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 ocr-s002.vmdk
-rw------- 1 root root 393216 Aug 14 22:44 ocr-s001.vmdk
-rw------- 1 root root 65536 Aug 14 22:44 asm3-s006.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 asm3-s005.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 asm3-s004.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 asm3-s003.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 asm3-s002.vmdk
-rw------- 1 root root 393216 Aug 14 22:44 asm3-s001.vmdk
-rw------- 1 root root 65536 Aug 14 22:44 asm2-s006.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 asm2-s005.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 asm2-s004.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 asm2-s003.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 asm2-s002.vmdk
-rw------- 1 root root 393216 Aug 14 22:44 asm2-s001.vmdk
-rw------- 1 root root 65536 Aug 14 22:44 asm1-s006.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 asm1-s005.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 asm1-s004.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 asm1-s003.vmdk
-rw------- 1 root root 327680 Aug 14 22:44 asm1-s002.vmdk
-rw------- 1 root root 393216 Aug 14 22:44 asm1-s001.vmdk
[root@www shared]#
I don't know how to fix this problem. I did go through many docs and metalink notes.
I am new to RAC world. It took 3 days to come to this stage. Please help me.
Thanks
LeoHi Surachart,
Here is my messages output..
*/var/log/messages*
Aug 20 14:05:01 rac2 avahi-daemon[3627]: Registering new address record for fe80::20c:29ff:fe6b:f9a8 on eth1.
Aug 20 14:05:01 rac2 avahi-daemon[3627]: Registering new address record for 192.168.1.196 on eth1.
Aug 20 14:05:01 rac2 avahi-daemon[3627]: Registering new address record for fe80::20c:29ff:fe6b:f99e on eth0.
Aug 20 14:05:01 rac2 avahi-daemon[3627]: Registering new address record for 192.168.0.196 on eth0.
Aug 20 14:05:01 rac2 avahi-daemon[3627]: Registering HINFO record with values 'I686'/'LINUX'.
Aug 20 14:05:02 rac2 avahi-daemon[3627]: Server startup complete. Host name is rac2.local. Local service cookie is 927471131.
Aug 20 14:05:03 rac2 avahi-daemon[3627]: Service "SFTP File Transfer on rac2" (/services/sftp-ssh.service) successfully established.
Aug 20 14:05:08 rac2 smartd[3739]: smartd version 5.38 [i686-redhat-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Aug 20 14:05:08 rac2 smartd[3739]: Home page is http://smartmontools.sourceforge.net/
Aug 20 14:05:08 rac2 smartd[3739]: Opened configuration file /etc/smartd.conf
Aug 20 14:05:08 rac2 smartd[3739]: Configuration file /etc/smartd.conf was parsed, found DEVICESCAN, scanning devices
Aug 20 14:05:08 rac2 smartd[3739]: Device: /dev/hdc, opened
Aug 20 14:05:08 rac2 kernel: hdc: drive_cmd: status=0x51 { DriveReady SeekComplete Error }
Aug 20 14:05:08 rac2 kernel: hdc: drive_cmd: error=0x04 { AbortedCommand }
Aug 20 14:05:08 rac2 kernel: ide: failed opcode was: 0xec
Aug 20 14:05:08 rac2 smartd[3739]: Device: /dev/hdc, not ATA, no IDENTIFY DEVICE Structure
Aug 20 14:05:08 rac2 smartd[3739]: Device: /dev/sda, opened
Aug 20 14:05:08 rac2 smartd[3739]: Device: /dev/sda, IE (SMART) not enabled, skip device Try 'smartctl -s on /dev/sda' to turn on SMART features
Aug 20 14:05:08 rac2 smartd[3739]: Device: /dev/sdb, opened
Aug 20 14:05:08 rac2 smartd[3739]: Device: /dev/sdb, IE (SMART) not enabled, skip device Try 'smartctl -s on /dev/sdb' to turn on SMART features
Aug 20 14:05:08 rac2 smartd[3739]: Device: /dev/sdc, opened
Aug 20 14:05:08 rac2 smartd[3739]: Device: /dev/sdc, IE (SMART) not enabled, skip device Try 'smartctl -s on /dev/sdc' to turn on SMART features
Aug 20 14:05:08 rac2 smartd[3739]: Device: /dev/sdd, opened
Aug 20 14:05:08 rac2 smartd[3739]: Device: /dev/sdd, IE (SMART) not enabled, skip device Try 'smartctl -s on /dev/sdd' to turn on SMART features
Aug 20 14:05:09 rac2 smartd[3739]: Device: /dev/sde, opened
Aug 20 14:05:09 rac2 smartd[3739]: Device: /dev/sde, IE (SMART) not enabled, skip device Try 'smartctl -s on /dev/sde' to turn on SMART features
Aug 20 14:05:10 rac2 smartd[3739]: Device: /dev/sdf, opened
Aug 20 14:05:10 rac2 smartd[3739]: Device: /dev/sdf, IE (SMART) not enabled, skip device Try 'smartctl -s on /dev/sdf' to turn on SMART features
Aug 20 14:05:10 rac2 smartd[3739]: Monitoring 0 ATA and 0 SCSI devices
Aug 20 14:05:10 rac2 smartd[3741]: smartd has fork()ed into background mode. New PID=3741.
Aug 20 14:05:13 rac2 pcscd: winscard.c:304:SCardConnect() Reader E-Gate 0 0 Not Found
Aug 20 14:05:13 rac2 last message repeated 3 times
Aug 20 14:05:27 rac2 gconfd (root-3967): starting (version 2.14.0), pid 3967 user 'root'
Aug 20 14:05:27 rac2 gconfd (root-3967): Resolved address "xml:readonly:/etc/gconf/gconf.xml.mandatory" to a read-only configuration source at position 0
Aug 20 14:05:27 rac2 gconfd (root-3967): Resolved address "xml:readwrite:/root/.gconf" to a writable configuration source at position 1
Aug 20 14:05:27 rac2 gconfd (root-3967): Resolved address "xml:readonly:/etc/gconf/gconf.xml.defaults" to a read-only configuration source at position 2
Aug 20 14:05:29 rac2 gconfd (root-3967): Resolved address "xml:readwrite:/root/.gconf" to a writable configuration source at position 0
Aug 20 14:05:29 rac2 hald: mounted /dev/hdc on behalf of uid 0
Aug 20 14:05:29 rac2 hcid[3311]: Default passkey agent (:1.8, /org/bluez/applet) registered
Aug 20 14:05:31 rac2 nm-system-settings: Loaded plugin ifcfg-rh: (c) 2007 - 2008 Red Hat, Inc. To report bugs please use the NetworkManager mailing list.
Aug 20 14:05:31 rac2 nm-system-settings: ifcfg-rh: parsing /etc/sysconfig/network-scripts/ifcfg-eth1 ...
Aug 20 14:05:31 rac2 nm-system-settings: ifcfg-rh: read connection 'System eth1'
Aug 20 14:05:31 rac2 nm-system-settings: ifcfg-rh: parsing /etc/sysconfig/network-scripts/ifcfg-lo ...
Aug 20 14:05:31 rac2 nm-system-settings: ifcfg-rh: error: Ignoring loopback device config.
Aug 20 14:05:31 rac2 nm-system-settings: ifcfg-rh: parsing /etc/sysconfig/network-scripts/ifcfg-eth0 ...
Aug 20 14:05:31 rac2 nm-system-settings: ifcfg-rh: read connection 'System eth0'
Aug 20 14:05:31 rac2 pcscd: winscard.c:304:SCardConnect() Reader E-Gate 0 0 Not Found
Aug 20 14:05:32 rac2 last message repeated 4 times
Aug 20 14:12:51 rac2 kernel: FS-Cache: Loaded
Aug 20 14:22:06 rac2 xinetd[3488]: START: shell pid=5193 from=192.168.0.195
Aug 20 14:22:06 rac2 xinetd[3488]: EXIT: shell status=0 pid=5193 duration=0(sec)
Aug 20 14:22:07 rac2 xinetd[3488]: START: shell pid=5217 from=192.168.0.195
Aug 20 14:22:07 rac2 xinetd[3488]: EXIT: shell status=0 pid=5217 duration=0(sec)
Aug 20 14:22:07 rac2 xinetd[3488]: START: shell pid=5241 from=192.168.0.195
Aug 20 14:22:07 rac2 xinetd[3488]: EXIT: shell status=0 pid=5241 duration=0(sec)
Aug 20 14:22:16 rac2 xinetd[3488]: EXIT: shell status=0 pid=6236 duration=0(sec)
Aug 20 14:22:16 rac2 xinetd[3488]: START: shell pid=6265 from=192.168.0.195
Aug 20 14:22:16 rac2 xinetd[3488]: EXIT: shell status=0 pid=6265 duration=0(sec)
Aug 20 14:22:16 rac2 xinetd[3488]: START: shell pid=6291 from=192.168.0.195
Aug 20 14:22:17 rac2 xinetd[3488]: EXIT: shell status=0 pid=6291 duration=1(sec)
Aug 20 14:22:17 rac2 xinetd[3488]: START: shell pid=6317 from=192.168.0.195
Aug 20 14:22:17 rac2 xinetd[3488]: EXIT: shell status=0 pid=6317 duration=0(sec)
[root@rac2 log]# -
Failure at final check of Oracle CRS stack. 10 on the first node.
Hi everyone
I trying to install an Oracle RAC 10gr2 on an Oracle Enterprise Linux AS release 4 (October Update 7) , but I'm having this problem
root@fporn01 crs# ./root.sh
Checking to see if Oracle CRS stack is already configured
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
clscfg: EXISTING configuration version 3 detected.
clscfg: version 3 is 10G Release 2.
assigning default hostname fporn01 for node 1.
assigning default hostname fporn02 for node 2.
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 1: fporn01 fporn01-priv fporn01
node 2: fporn02 fporn02-priv fporn02
clscfg: Arguments check out successfully.
NO KEYS WERE WRITTEN. Supply -force parameter to override.
-force is destructive and will destroy any previous cluster
configuration.
Oracle Cluster Registry for cluster has already been initialized
Startup will be queued to init within 90 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
Failure at final check of Oracle CRS stack.
+10+
forget about the node names!!!!
but on the second node everything went fine, so I'm sure this is not a connectivity issue.
the iptables service is stopped and disabled
check the results after running the root.sh script
root@fporn02 ~# /u01/app/crs/root.sh
Checking to see if Oracle CRS stack is already configured
+/etc/oracle does not exist. Creating it now.+
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
clscfg: EXISTING configuration version 3 detected.
clscfg: version 3 is 10G Release 2.
assigning default hostname fporn01 for node 1.
assigning default hostname fporn02 for node 2.
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 1: fporn01 fporn01-priv fporn01
node 2: fporn02 fporn02-priv fporn02
clscfg: Arguments check out successfully.
NO KEYS WERE WRITTEN. Supply -force parameter to override.
-force is destructive and will destroy any previous cluster
configuration.
Oracle Cluster Registry for cluster has already been initialized
Startup will be queued to init within 90 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
CSS is active on these nodes.
fporn02
CSS is inactive on these nodes.
fporn01
Local node checking complete.
Run root.sh on remaining nodes to start CRS daemons.
this is the log of crs on the first node
root@fporn01 bin# cat /u01/app/crs/log/fporn01/alertfporn01.log
+2009-06-24 17:27:37.695+
client(9045)CRS-1006:The OCR location /u02/oradata/orcl/OCRFile_mirror is inaccessible. Details in /u01/app/crs/log/fporn01/client/ocrconfig_9045.log.
+2009-06-24 17:27:37.741+
client(9045)CRS-1001:The OCR was formatted using version 2.
+2009-06-24 17:28:24.544+
client(9092)CRS-1801:Cluster pdb-rac configured with nodes fporn01 fporn02 .
this is the log of crs on the second node
root@fporn02 ~# cat /u01/app/crs/log/fporn02/alertfporn02.log
+2009-06-24 18:09:09.307+
cssd(16991)CRS-1605:CSSD voting file is online: /u02/oradata/orcl/CSSFile. Details in /u01/app/crs/log/fporn02/cssd/ocssd.log.
+2009-06-24 18:09:09.307+
cssd(16991)CRS-1605:CSSD voting file is online: /u02/oradata/orcl/CSSFile_mirror1. Details in /u01/app/crs/log/fporn02/cssd/ocssd.log.
+2009-06-24 18:09:09.310+
cssd(16991)CRS-1605:CSSD voting file is online: /u02/oradata/orcl/CSSFile_mirror2. Details in /u01/app/crs/log/fporn02/cssd/ocssd.log.
+2009-06-24 18:09:12.441+
cssd(16991)CRS-1601:CSSD Reconfiguration complete. Active nodes are fporn02 .
I have rechecked the Remote Access / User Equivalence
after run the OCRCHECK command ia have this information
root@fporn01 bin# ./ocrcheck
Status of Oracle Cluster Registry is as follows :
Version : 2
Total space (kbytes) : 262144
Used space (kbytes) : 312
Available space (kbytes) : 261832
ID : 255880615
Device/File Name : /u02/oradata/orcl/OCRFile
Device/File integrity check succeeded
Device/File Name : /u02/oradata/orcl/OCRFile_mirror
Device/File integrity check succeeded
Cluster registry integrity check succeeded
on the second node i get the same output
root@fporn02 bin# ./ocrcheck
Status of Oracle Cluster Registry is as follows :
Version : 2
Total space (kbytes) : 262144
Used space (kbytes) : 312
Available space (kbytes) : 261832
ID : 255880615
Device/File Name : /u02/oradata/orcl/OCRFile
Device/File integrity check succeeded
Device/File Name : /u02/oradata/orcl/OCRFile_mirror
Device/File integrity check succeeded
Cluster registry integrity check succeeded
I have reviewed the following metalink notes but none of them seems to solve my problem
*344994.1*
*240001.1*
*725878.1*
*329450.1*
*734221.1*
I have done a research trough many forums, but always the fail is on the second node, but my fail is on the first node.
I hope anyone could help me.
this is the output of cluvfy
Performing pre-checks for cluster services setup
Checking node reachability...
Check: Node reachability from node "fporn01"
Destination Node Reachable?
fporn01 yes
fporn02 yes
Result: Node reachability check passed from node "fporn01".
Checking user equivalence...
Check: User equivalence for user "oracle"
Node Name Comment
fporn02 passed
fporn01 passed
Result: User equivalence check passed for user "oracle".
Checking administrative privileges...
Check: Existence of user "oracle"
Node Name User Exists Comment
fporn02 yes passed
fporn01 yes passed
Result: User existence check passed for "oracle".
Check: Existence of group "oinstall"
Node Name Status Group ID
fporn02 exists 501
fporn01 exists 501
Result: Group existence check passed for "oinstall".
Check: Membership of user "oracle" in group "oinstall" as Primary
Node Name User Exists Group Exists User in Group Primary Comment
fporn02 yes yes yes yes passed
fporn01 yes yes yes yes passed
Result: Membership check for user "oracle" in group "oinstall" as Primary passed.
Administrative privileges check passed.
Checking node connectivity...
Interface information for node "fporn02"
Interface Name IP Address Subnet
eth0 10.218.108.245 10.218.108.0
eth1 192.168.1.2 192.168.1.0
Interface information for node "fporn01"
Interface Name IP Address Subnet
eth0 10.218.108.244 10.218.108.0
eth1 192.168.1.1 192.168.1.0
eth2 172.16.9.210 172.16.9.0
Check: Node connectivity of subnet "10.218.108.0"
Source Destination Connected?
fporn02:eth0 fporn01:eth0 yes
Result: Node connectivity check passed for subnet "10.218.108.0" with node(s) fporn02,fporn01.
Check: Node connectivity of subnet "192.168.1.0"
Source Destination Connected?
fporn02:eth1 fporn01:eth1 yes
Result: Node connectivity check passed for subnet "192.168.1.0" with node(s) fporn02,fporn01.
Check: Node connectivity of subnet "172.16.9.0"
Result: Node connectivity check passed for subnet "172.16.9.0" with node(s) fporn01.
Suitable interfaces for the private interconnect on subnet "10.218.108.0":
fporn02 eth0:10.218.108.245
fporn01 eth0:10.218.108.244
Suitable interfaces for the private interconnect on subnet "192.168.1.0":
fporn02 eth1:192.168.1.2
fporn01 eth1:192.168.1.1
ERROR:
Could not find a suitable set of interfaces for VIPs.
Result: Node connectivity check failed.
Checking system requirements for 'crs'...
Check: Total memory
Node Name Available Required Comment
fporn02 7.93GB (8310276KB) 512MB (524288KB) passed
fporn01 7.93GB (8310276KB) 512MB (524288KB) passed
Result: Total memory check passed.
Check: Free disk space in "/tmp" dir
Node Name Available Required Comment
fporn02 9.57GB (10037300KB) 400MB (409600KB) passed
fporn01 9.55GB (10012168KB) 400MB (409600KB) passed
Result: Free disk space check passed.
Check: Swap space
Node Name Available Required Comment
fporn02 8.81GB (9240568KB) 1GB (1048576KB) passed
fporn01 8.81GB (9240568KB) 1GB (1048576KB) passed
Result: Swap space check passed.
Check: System architecture
Node Name Available Required Comment
fporn02 i686 i686 passed
fporn01 i686 i686 passed
Result: System architecture check passed.
Check: Kernel version
Node Name Available Required Comment
fporn02 2.6.9-78.0.0.0.1.ELhugemem 2.4.21-15EL passed
fporn01 2.6.9-78.0.0.0.1.ELhugemem 2.4.21-15EL passed
Result: Kernel version check passed.
Check: Package existence for "make-3.79"
Node Name Status Comment
fporn02 make-3.80-7.EL4 passed
fporn01 make-3.80-7.EL4 passed
Result: Package existence check passed for "make-3.79".
Check: Package existence for "binutils-2.14"
Node Name Status Comment
fporn02 binutils-2.15.92.0.2-25 passed
fporn01 binutils-2.15.92.0.2-25 passed
Result: Package existence check passed for "binutils-2.14".
Check: Package existence for "gcc-3.2"
Node Name Status Comment
fporn02 gcc-3.4.6-10.0.1 passed
fporn01 gcc-3.4.6-10.0.1 passed
Result: Package existence check passed for "gcc-3.2".
Check: Package existence for "glibc-2.3.2-95.27"
Node Name Status Comment
fporn02 glibc-2.3.4-2.41 passed
fporn01 glibc-2.3.4-2.41 passed
Result: Package existence check passed for "glibc-2.3.2-95.27".
Check: Package existence for "compat-db-4.0.14-5"
Node Name Status Comment
fporn02 compat-db-4.1.25-9 passed
fporn01 compat-db-4.1.25-9 passed
Result: Package existence check passed for "compat-db-4.0.14-5".
Check: Package existence for "compat-gcc-7.3-2.96.128"
Node Name Status Comment
fporn02 missing failed
fporn01 missing failed
Result: Package existence check failed for "compat-gcc-7.3-2.96.128".
++Check: Package existence for "compat-gcc-c++-7.3-2.96.128"++
Node Name Status Comment
fporn02 missing failed
fporn01 missing failed
++Result: Package existence check failed for "compat-gcc-c++-7.3-2.96.128".++
++Check: Package existence for "compat-libstdc++-7.3-2.96.128"++
Node Name Status Comment
fporn02 missing failed
fporn01 missing failed
++Result: Package existence check failed for "compat-libstdc++-7.3-2.96.128".++
++Check: Package existence for "compat-libstdc++-devel-7.3-2.96.128"++
Node Name Status Comment
fporn02 missing failed
fporn01 missing failed
++Result: Package existence check failed for "compat-libstdc++-devel-7.3-2.96.128".++
Check: Package existence for "openmotif-2.2.3"
Node Name Status Comment
fporn02 openmotif-2.2.3-10.2.el4 passed
fporn01 openmotif-2.2.3-10.2.el4 passed
Result: Package existence check passed for "openmotif-2.2.3".
Check: Package existence for "setarch-1.3-1"
Node Name Status Comment
fporn02 setarch-1.6-1 passed
fporn01 setarch-1.6-1 passed
Result: Package existence check passed for "setarch-1.3-1".
Check: Group existence for "dba"
Node Name Status Comment
fporn02 exists passed
fporn01 exists passed
Result: Group existence check passed for "dba".
Check: Group existence for "oinstall"
Node Name Status Comment
fporn02 exists passed
fporn01 exists passed
Result: Group existence check passed for "oinstall".
Check: User existence for "nobody"
Node Name Status Comment
fporn02 exists passed
fporn01 exists passed
Result: User existence check passed for "nobody".
System requirement failed for 'crs'
Pre-check for cluster services setup was unsuccessful on all the nodes.forget about my last post, it was my mistake, I rebooted the server and the clustered file system service did not start up at boot time.
sorry
this is what I really got in /var/log/messages
after manually running crs daemons
Jun 26 16:43:07 fporn01 su(pam_unix)[10020]: session opened for user oracle by (uid=0)
Jun 26 16:43:07 fporn01 su(pam_unix)[10020]: session closed for user oracle
Jun 26 16:43:07 fporn01 logger: Cluster Ready Services completed waiting on dependencies.
Jun 26 16:44:07 fporn01 su(pam_unix)[9977]: session opened for user oracle by (uid=0)
Jun 26 16:45:31 fporn01 su(pam_unix)[10293]: session opened for user oracle by (uid=0)
Jun 26 16:45:32 fporn01 su(pam_unix)[10293]: session closed for user oracle
Jun 26 16:45:32 fporn01 logger: Cluster Ready Services completed waiting on dependencies.
Jun 26 16:45:40 fporn01 su(pam_unix)[10351]: session opened for user oracle by (uid=0)
Jun 26 16:45:40 fporn01 su(pam_unix)[10351]: session closed for user oracle
Jun 26 16:45:40 fporn01 su(pam_unix)[10415]: session opened for user oracle by (uid=0)
Jun 26 16:45:40 fporn01 su(pam_unix)[10415]: session closed for user oracle
Jun 26 16:45:40 fporn01 logger: Cluster Ready Services completed waiting on dependencies.
Jun 26 16:46:32 fporn01 su(pam_unix)[10591]: session opened for user oracle by (uid=0)
Jun 26 16:46:40 fporn01 logger: Running CRSD with TZ =
after running ps -ef | grep -E 'init|d.bin|ocls|oprocd|diskmon|evmlogger|PID'
[root@fporn01 ~]# ps -ef | grep -E 'init|d.bin|ocls|oprocd|diskmon|evmlogger|PID'
UID PID PPID C STIME TTY TIME CMD
root 1 0 0 15:33 ? 00:00:00 init [5]
root 9869 7951 0 16:40 pts/1 00:00:00 [init.crsd] <defunct>
oracle 10053 9977 0 16:44 ? 00:00:00 /u01/app/crs/bin/evmd.bin
root 10249 7951 0 16:45 pts/1 00:00:00 /bin/sh /etc/init.d/init.cssd fatal
root 10341 7951 0 16:45 pts/1 00:00:00 /u01/app/crs/bin/crsd.bin reboot
root 10551 10249 0 16:46 pts/1 00:00:00 /bin/sh /etc/init.d/init.cssd daemon
oracle 10618 10592 0 16:46 ? 00:00:00 /u01/app/crs/bin/ocssd.bin
oracle 10926 10053 0 16:46 ? 00:00:00 /u01/app/crs/bin/evmlogger.bin -o /u01/app/crs/evm/log/evmlogger.info -l /u01/app/crs/evm/log/evmlogger.log
root 16658 9461 0 16:50 pts/2 00:00:00 grep -E init|d.bin|ocls|oprocd|diskmon|evmlogger|PID
CRS daemons finally work
*but i get this error when i run [oracle@fporn01 cluvfy]$ ./runcluvfy.sh stage -post crsinst -n fporn01,fporn02 -verbose*
Performing post-checks for cluster services setup
Checking node reachability...
Check: Node reachability from node "fporn01"
Destination Node Reachable?
fporn01 yes
fporn02 yes
Result: Node reachability check passed from node "fporn01".
Checking user equivalence...
Check: User equivalence for user "oracle"
Node Name Comment
fporn02 passed
fporn01 passed
Result: User equivalence check passed for user "oracle".
ERROR:
CRS is not installed on any of the nodes.
Verification cannot proceed.
Post-check for cluster services setup was unsuccessful on all the nodes. -
ATG+ CRS Error in ATGPublishing server, access not allowed to BCC
Hi,
After the installation of ATG 10.0.3 and Commerce Reference Store on Weblogic, using CIM, I started both ATGProduction and ATGPublishing without trouble. I can access the store (http://localhost:7003/crs/storeus) and dynamo administration console (http://localhost:7003/atg/dyn) but when I try to access BCC (http://localhost:7005/atg/bcc) the web browser shows an error (page can´t be displayed). I have copied the error that appears in the log ATGPublishing.log
I would be very grateful if someone could solve this issue. Thanks in advance
Regards,
Iñigo
Log error:
###<Oct 19, 2011 9:46:17 AM CEST> <Error> <HTTP> <atgappserver> <ATGPublishing> <[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)'> <<WLS Kernel>> <> <> <1319010377001> <BEA-101017> <[ServletContext@1961981706[app:ATGPublishing.ear module:/atg path:/atg spec-version:null], request: weblogic.servlet.internal.ServletRequestImpl@49982a03[
GET /atg/bcc HTTP/1.1
Accept: application/x-ms-application, image/jpeg, application/xaml+xml, image/gif, image/pjpeg, application/x-ms-xbap, application/vnd.ms-excel, application/vnd.ms-powerpoint, application/msword, */*
Accept-Language: es-ES
User-Agent: Mozilla/4.0 (compatible; MSIE 7.0; Windows NT 6.1; WOW64; Trident/4.0; SLCC2; .NET CLR 2.0.50727; .NET CLR 3.5.30729; .NET CLR 3.0.30729; Media Center PC 6.0; .NET4.0C; .NET4.0E; MS-RTC LM 8; InfoPath.3; managedpc)
Accept-Encoding: gzip, deflate, peerdist
Connection: Keep-Alive
Cookie: DYN_USER_ID=140000; DYN_USER_CONFIRM=c2d08d3eb51f5945b8e444d80628b112; ADMINCONSOLESESSION=z6MYTp2Q7Y4s4b3yHQGNPrwS15LmJBZvlLYxhSXvktyFvqfbgJyl!1748133635; JSESSIONID=01p6Tp1J2Y2QPllJJdT7WXvpQSTyq05Mp12FNqY7YMs27QSqtp2V!991620916
X-P2P-PeerDist: Version=1.0
]] Root cause of ServletException.
javax.servlet.ServletException: PageFilter: cannot get a start request servlet.
at atg.filter.dspjsp.PageFilter.doFilter(PageFilter.java:287)
at weblogic.servlet.internal.FilterChainImpl.doFilter(FilterChainImpl.java:56)
at atg.servlet.GenericFilterService.doFilterChain(GenericFilterService.java:599)
at atg.servlet.GenericFilterService.handleDoFilter(GenericFilterService.java:462)
at atg.servlet.GenericFilterService.doFilter(GenericFilterService.java:409)
at weblogic.servlet.internal.FilterChainImpl.doFilter(FilterChainImpl.java:56)
at weblogic.servlet.internal.WebAppServletContext$ServletInvocationAction.wrapRun(WebAppServletContext.java:3715)
at weblogic.servlet.internal.WebAppServletContext$ServletInvocationAction.run(WebAppServletContext.java:3681)
at weblogic.security.acl.internal.AuthenticatedSubject.doAs(AuthenticatedSubject.java:321)
at weblogic.security.service.SecurityManager.runAs(SecurityManager.java:120)
at weblogic.servlet.internal.WebAppServletContext.securedExecute(WebAppServletContext.java:2277)
at weblogic.servlet.internal.WebAppServletContext.execute(WebAppServletContext.java:2183)
at weblogic.servlet.internal.ServletRequestImpl.run(ServletRequestImpl.java:1454)
at weblogic.work.ExecuteThread.execute(ExecuteThread.java:209)
at weblogic.work.ExecuteThread.run(ExecuteThread.java:178)
>
####<Oct 19, 2011 9:46:17 AM CEST> <Info> <ServletContext-/atg> <atgappserver> <ATGPublishing> <[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)'> <<anonymous>> <> <> <1319010377046> <BEA-000000> <JspServlet: param verbose initialized to: true>
####<Oct 19, 2011 9:46:17 AM CEST> <Info> <ServletContext-/atg> <atgappserver> <ATGPublishing> <[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)'> <<anonymous>> <> <> <1319010377046> <BEA-000000> <JspServlet: param packagePrefix initialized to: jsp_servlet>
####<Oct 19, 2011 9:46:17 AM CEST> <Info> <ServletContext-/atg> <atgappserver> <ATGPublishing> <[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)'> <<anonymous>> <> <> <1319010377046> <BEA-000000> <JspServlet: param compilerclass initialized to: null>
####<Oct 19, 2011 9:46:17 AM CEST> <Info> <ServletContext-/atg> <atgappserver> <ATGPublishing> <[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)'> <<anonymous>> <> <> <1319010377046> <BEA-000000> <JspServlet: param compileCommand initialized to: javac>
####<Oct 19, 2011 9:46:17 AM CEST> <Info> <ServletContext-/atg> <atgappserver> <ATGPublishing> <[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)'> <<anonymous>> <> <> <1319010377047> <BEA-000000> <JspServlet: param compilerval initialized to: javac>
####<Oct 19, 2011 9:46:17 AM CEST> <Info> <ServletContext-/atg> <atgappserver> <ATGPublishing> <[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)'> <<anonymous>> <> <> <1319010377047> <BEA-000000> <JspServlet: param pageCheckSeconds initialized to: 1>
####<Oct 19, 2011 9:46:17 AM CEST> <Info> <ServletContext-/atg> <atgappserver> <ATGPublishing> <[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)'> <<anonymous>> <> <> <1319010377047> <BEA-000000> <JspServlet: param encoding initialized to: null>
####<Oct 19, 2011 9:46:17 AM CEST> <Info> <ServletContext-/atg> <atgappserver> <ATGPublishing> <[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)'> <<anonymous>> <> <> <1319010377047> <BEA-000000> <JspServlet: param superclass initialized to null>
####<Oct 19, 2011 9:46:17 AM CEST> <Info> <ServletContext-/atg> <atgappserver> <ATGPublishing> <[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)'> <<anonymous>> <> <> <1319010377047> <BEA-000000> <JspServlet: param workingDir initialized to: /mnt/opt/atguser/weblogic/user_projects/domains/base_domain/servers/ATGPublishing/tmp/_WL_user/ATGPublishing.ear/j3704z>
####<Oct 19, 2011 9:46:17 AM CEST> <Info> <ServletContext-/atg> <atgappserver> <ATGPublishing> <[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)'> <<anonymous>> <> <> <1319010377048> <BEA-000000> <JspServlet: initialization complete>
####<Oct 19, 2011 9:46:17 AM CEST> <Error> <Kernel> <atgappserver> <ATGPublishing> <[ACTIVE] ExecuteThread: '2' for queue: 'weblogic.kernel.Default (self-tuning)'> <<WLS Kernel>> <> <> <1319010377915> <BEA-000802> <ExecuteRequest failed
java.lang.NullPointerException.
java.lang.NullPointerException
at atg.taglib.dspjsp.PageTag.doCatch(PageTag.java:734)
at atg.taglib.dspjsp.elwrap.PageTagWrapper.doCatch(PageTagWrapper.java:36)
at jsp_servlet.__error._jspService(__error.java:405)
at weblogic.servlet.jsp.JspBase.service(JspBase.java:34)
at weblogic.servlet.internal.StubSecurityHelper$ServletServiceAction.run(StubSecurityHelper.java:227)
at weblogic.servlet.internal.StubSecurityHelper.invokeServlet(StubSecurityHelper.java:125)
at weblogic.servlet.internal.ServletStubImpl.execute(ServletStubImpl.java:300)
at weblogic.servlet.internal.ServletStubImpl.execute(ServletStubImpl.java:183)
at weblogic.servlet.internal.RequestDispatcherImpl.invokeServlet(RequestDispatcherImpl.java:523)
at weblogic.servlet.internal.RequestDispatcherImpl.forward(RequestDispatcherImpl.java:253)
at weblogic.servlet.internal.ServletResponseImpl.sendError(ServletResponseImpl.java:720)
at weblogic.servlet.internal.ServletResponseImpl.sendError(ServletResponseImpl.java:591)
at weblogic.servlet.internal.ErrorManager.handleException(ErrorManager.java:150)
at weblogic.servlet.internal.WebAppServletContext.handleThrowableFromInvocation(WebAppServletContext.java:2348)You were right. Some errors where displayed when launching ATGPublishing, some tables where missing. I made the imports again and everything went OK.
Thank you! -
CRS installation: Failure at final check of Oracle CRS stack.10
Hello,
I am trying to install Oracle RAC in 10GR2 to simulate a migration from 1024 to 11GR2. I am using VMWARE with two Linux CentOS 64b 6.2 and shared disks as raw devices. I got "Failure at final check of Oracle CRS stack.10" when running root.sh, on both nodes. The ocrcheck is fine, but I have two different IDs... which is not good and I do not understand why:
- I have shared raw devices
- the devices are the same, I checked this twice
Can anyoane help?
I thank you all in advance.Hello,
The exact error message is the one in my subject description: Failure at final check of Oracle CRS stack.10
This occurs when runing root.sh after the installation is successfull and there is no way to continue, the assistants are failing, which is logic, as the root.sh attempt to configure and run the CRS agents are not done successfully. The OCR ar ok with the same ID on both nodes, the minor in major values for the raw devices are and the firewall is disabled on both nodes. I checked the rights, they are also ok. The exact and complete return of the root.sh script is the following:
Checking to see if Oracle CRS stack is already configured
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 1: cygnus cygnus-priv cygnus
node 2: taurus taurus-priv taurus
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Now formatting voting device: /apps/oracle/oradat/vot10
Now formatting voting device: /apps/oracle/oradat/vot20
Now formatting voting device: /apps/oracle/oradat/vot30
Format of 3 voting devices complete.
Startup will be queued to init within 30 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
Failure at final check of Oracle CRS stack.
10
Any help will be really apreciated and I thank you in advancen guys... -
CRS installation roo.sh failed -- FAILURE AT FINAL CHECK
CRS 10.2.0.1 installation on Solaris 10
root. sh failed, following error appeared.
bash-3.00# sh -x root.sh
+ /opt/oracrs/install/rootinstall
+ /opt/oracrs/install/rootconfig
Checking to see if Oracle CRS stack is already configured
Setting the permissions on OCR backup directory
Setting up NS directories
Oracle Cluster Registry configuration upgraded successfully
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 1: proddb02 proddb02-priv proddb02
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Now formatting voting device: /dev/rdsk/vpath9a
Now formatting voting device: /dev/rdsk/vpath10a
Now formatting voting device: /dev/rdsk/vpath11a
Format of 3 voting devices complete.
Startup will be queued to init within 30 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
Failure at final check of Oracle CRS stack.
10
bash-3.00#
following metalink note 240001.1 as well, no success.
any idea where should I look?You can start looking at the following files to check if there are any errors reported :
a) The OS Messages file ( /var/adm/messages )
b) Any files with names like crsctl* under the /tmp directory
c) The client trace files under $ORA_CRS_HOME/log/<hostname>/client/*
d) The crsd and the cssd logs under $ORA_CRS_HOME/log/<hostname>/[crsd/cssd]/*
Also what does a ps -ef | grep d.bin indicate.
Which daemons get started and which do not.
Vishwa -
Failure at final check of Oracle CRS stac
+++crs ID conflicts ocrcheck scsi
++Referred metalink note:344994.1 but it was specific to RAW devices, Need a solution for scsi device.
oracle@vx0302 bin # ocrcheck
Status of Oracle Cluster Registry is as follows :
Version : 2
Total space (kbytes) : 3306636
Used space (kbytes) : 440
Available space (kbytes) : 3306196
ID : 1425438992 <<different from node1
Device/File Name : /dev/sdb1
Device/File integrity check failed
Device/File not configured
Cluster registry integrity check failed
oracle@vx0301 bin # ocrcheck
Status of Oracle Cluster Registry is as follows :
Version : 2
Total space (kbytes) : 40269168
Used space (kbytes) : 308
Available space (kbytes) : 40268860
ID : 68510624 << different from node1
Device/File Name : /dev/sdb1
Device/File integrity check succeeded
Device/File not configured
Cluster registry integrity check succeeded
According to the above output, the IDs are same. I followed this http://surachartopun.com/2009/01/failure-at-final-check-of-oracle-crs.html link for the workaround. I have taken the following steps:
1. Firewall is off.
2. I have referred to the metalink, but it provides a solution only for raw devices. I have used the link http://www.oracle.com/technology/pub/articles/hunter-rac11gr2-iscsi.html to configure Openfiler for iscsi devices. The configuration is EXACTLY the same. However, when i ran ls -l /dev/iscsi/*, the output was different on both nodes.
Node1:
[oracle@vx0301 ~]# ls -l /dev/iscsi/*
/dev/iscsi/crs1:
total 0
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part -> ../../sdc
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part1 -> ../../sdc1
/dev/iscsi/data1:
total 0
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part -> ../../sdb1
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part1 -> ../../sdb1
/dev/iscsi/fra1:
total 0
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part -> ../../sdd
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part1 -> ../../sdd1
Node2:
[oracle@vx0302 ~]# ls -l /dev/iscsi/*
/dev/iscsi/crs1:
total 0
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part -> ../../sdb
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part1 -> ../../sdb1
/dev/iscsi/data1:
total 0
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part -> ../../sdd1
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part1 -> ../../sdd1
/dev/iscsi/fra1:
total 0
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part -> ../../sdc
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part1 -> ../../sdc1
Is there a way i can fix this?
Edited by: user10594250 on Jan 22, 2010 4:41 AM+++crs ID conflicts ocrcheck scsi
++Referred metalink note:344994.1 but it was specific to RAW devices, Need a solution for scsi device.
oracle@vx0302 bin # ocrcheck
Status of Oracle Cluster Registry is as follows :
Version : 2
Total space (kbytes) : 3306636
Used space (kbytes) : 440
Available space (kbytes) : 3306196
ID : 1425438992 <<different from node1
Device/File Name : /dev/sdb1
Device/File integrity check failed
Device/File not configured
Cluster registry integrity check failed
oracle@vx0301 bin # ocrcheck
Status of Oracle Cluster Registry is as follows :
Version : 2
Total space (kbytes) : 40269168
Used space (kbytes) : 308
Available space (kbytes) : 40268860
ID : 68510624 << different from node1
Device/File Name : /dev/sdb1
Device/File integrity check succeeded
Device/File not configured
Cluster registry integrity check succeeded
According to the above output, the IDs are same. I followed this http://surachartopun.com/2009/01/failure-at-final-check-of-oracle-crs.html link for the workaround. I have taken the following steps:
1. Firewall is off.
2. I have referred to the metalink, but it provides a solution only for raw devices. I have used the link http://www.oracle.com/technology/pub/articles/hunter-rac11gr2-iscsi.html to configure Openfiler for iscsi devices. The configuration is EXACTLY the same. However, when i ran ls -l /dev/iscsi/*, the output was different on both nodes.
Node1:
[oracle@vx0301 ~]# ls -l /dev/iscsi/*
/dev/iscsi/crs1:
total 0
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part -> ../../sdc
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part1 -> ../../sdc1
/dev/iscsi/data1:
total 0
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part -> ../../sdb1
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part1 -> ../../sdb1
/dev/iscsi/fra1:
total 0
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part -> ../../sdd
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part1 -> ../../sdd1
Node2:
[oracle@vx0302 ~]# ls -l /dev/iscsi/*
/dev/iscsi/crs1:
total 0
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part -> ../../sdb
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part1 -> ../../sdb1
/dev/iscsi/data1:
total 0
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part -> ../../sdd1
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part1 -> ../../sdd1
/dev/iscsi/fra1:
total 0
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part -> ../../sdc
lrwxrwxrwx 1 root root 9 Nov 3 18:13 part1 -> ../../sdc1
Is there a way i can fix this?
Edited by: user10594250 on Jan 22, 2010 4:41 AM -
Failure at final check of oracle crs stack
Hi,
I am installing oracle clusterware 11, when I run on the first node root.sh it's ok, but when I run it on the second node recive this message:
Failure at final check of Oracle CRS stack 10
I have stopped firewall and ssh,scp works fine without password using node.domain and node without domain between the nodes.
please help me!thanks for the answer.
this is root.sh output of second node
[root@orac-asbe-c cluster]# ./root.sh
Checking to see if Oracle CRS stack is already configured
/etc/oracle does not exist. Creating it now.
Setting the permissions on OCR backup directory
Setting up Network socket directories
Oracle Cluster Registry configuration upgraded successfully
Successfully accumulated necessary OCR keys.
Using ports: CSS=49895 CRS=49896 EVMC=49898 and EVMR=49897.
node <nodenumber>: <nodename> <private interconnect name> <hostname>
node 1: orac-asbe-d orac-asbe-d-priv orac-asbe-d
node 2: orac-asbe-c orac-asbe-c-priv orac-asbe-c
Creating OCR keys for user 'root', privgrp 'root'..
Operation successful.
Now formatting voting device: /dev/hdd1
Now formatting voting device: /dev/hdd2
Now formatting voting device: /dev/hdd3
Format of 3 voting devices complete.
Startup will be queued to init within 30 seconds.
Adding daemons to inittab
Expecting the CRS daemons to be up within 600 seconds.
Failure at final check of Oracle CRS stack.
10
the ocssd.log in very long, I take fiew pieces
Oracle Database 11g CRS Release 11.1.0.6.0 - Production Copyright 1996, 2007 Oracle. All rights reserved.
[ clsdmt]Listening to (ADDRESS=(PROTOCOL=ipc)(KEY=orac-asbe-cDBG_CSSD))
[ CSSD]2011-06-17 11:04:17.918 >USER: Oracle Database 10g CSS Release 11.1.0.6.0 Production Copyright 1996, 2004 Oracle. All rights reserved.
[ CSSD]2011-06-17 11:04:17.918 >USER: CSS daemon log for node orac-asbe-c, number 2, in cluster orac-as_cluster
[ CSSD]2011-06-17 11:04:17.936 [649053344] >TRACE: clssscmain: local-only set to false
[ CSSD]2011-06-17 11:04:17.970 [649053344] >TRACE: clssnmReadNodeInfo: added node 1 (orac-asbe-d) to cluster
[ CSSD]2011-06-17 11:04:17.994 [649053344] >TRACE: clssnmReadNodeInfo: added node 2 (orac-asbe-c) to cluster
[ CSSD]2011-06-17 11:04:17.997 [649053344] >WARNING: clssnmReadWallet: Open Wallet returned 28759
[ CSSD]2011-06-17 11:04:17.997 [649053344] >WARNING: clssnmInitNMInfo: Node not configured for node kill
[ CSSD]2011-06-17 11:04:18.011 [1133824320] >TRACE: clssnm_skgxninit: Compatible vendor clusterware not in use
[ CSSD]2011-06-17 11:04:18.011 [1133824320] >TRACE: clssnm_skgxnmon: skgxn init failed
[ CSSD]2011-06-17 11:04:18.023 [649053344] >TRACE: clssnmNMInitialize: Network heartbeat thresholds are: impending reconfig 15000 ms, reconfig start (misscount) 30000 ms
[ CSSD]2011-06-17 11:04:18.027 [649053344] >TRACE: clssnmNMInitialize: Voting file I/O timeouts are: short 27000 ms, long 200000 ms
[ CSSD]2011-06-17 11:04:18.039 [649053344] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (0//dev/hdd1)
[ CSSD]2011-06-17 11:04:18.040 [1133824320] >TRACE: clssnmvDPT: spawned for disk 0 (/dev/hdd1)
[ CSSD]2011-06-17 11:04:18.083 [1133824320] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (0//dev/hdd1)
[ CSSD]2011-06-17 11:04:18.088 [1144314176] >TRACE: clssnmvKillBlockThread: spawned for disk 0 (/dev/hdd1) initial : sleep interval (1000)ms
[ CSSD]2011-06-17 11:04:18.185 [649053344] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (1//dev/hdd2)
[ CSSD]2011-06-17 11:04:18.189 [1154804032] >TRACE: clssnmvDPT: spawned for disk 1 (/dev/hdd2)
[ CSSD]2011-06-17 11:04:18.208 [1154804032] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (1//dev/hdd2)
[ CSSD]2011-06-17 11:04:18.212 [1102223680] >TRACE: clssnmvKillBlockThread: spawned for disk 1 (/dev/hdd2) initial sleep interval (1000)ms
[ CSSD]2011-06-17 11:04:18.219 [649053344] >TRACE: clssnmDiskStateChange: state from 1 to 2 disk (2//dev/hdd3)
[ CSSD]2011-06-17 11:04:18.223 [1165293888] >TRACE: clssnmvDPT: spawned for disk 2 (/dev/hdd3)
[ CSSD]2011-06-17 11:04:18.251 [1165293888] >TRACE: clssnmDiskStateChange: state from 2 to 4 disk (2//dev/hdd3)
[ CSSD]2011-06-17 11:04:18.255 [1175783744] >TRACE: clssnmvKillBlockThread: spawned for disk 2 (/dev/hdd3) initial sleep interval (1000)ms
CSSD]2011-06-17 11:04:18.367 [649053344] >TRACE: clssscSclsFatal: read value of disable
[ CSSD]2011-06-17 11:04:18.370 [649053344] >TRACE: clssscSclsFatal: read value of disable
[ CSSD]2011-06-17 11:04:18.373 [1196763456] >TRACE: clssnmFatalThread: spawned
[ CSSD]2011-06-17 11:04:18.375 [1207253312] >TRACE: clssnmClusterListener: Listening on (ADDRESS=(PROTOCOL=tcp)(HOST=orac-asbe-c-priv)(PORT=49895))
[ CSSD]2011-06-17 11:04:18.375 [1207253312] >TRACE: clssnmconnect: connecting to node(1), con(0xc3af000), flags 0x0003
[ CSSD]2011-06-17 11:04:18.381 [1217743168] >TRACE: clssgmDeathChkThread: Spawned
[ CSSD]------- Begin Dump -------
[ CSSD]
[ CSSD]
[ CSSD]2011-06-17 11:04:18.383 [1207253312] >TRACE: clssnmConnComplete: MSGSRC 1, type 6, node 1, flags 0x0003, con 0xc3af000, probe (nil), nodekillsz 0
[ CSSD]2011-06-17 11:04:18.383 [1207253312] >TRACE: clssnmConnComplete: msg src=1 dst=2 seq=0 type=6 birth=203767168 state=3 name=()
[ CSSD]2011-06-17 11:04:18.383 [1207253312] >ERROR: ASSERT clssnm.c 11562
[ CSSD]2011-06-17 11:04:18.383 [1207253312] >ERROR: clssnmConnComplete: OCR id mismatch (1700660325, 1307814030)
[ CSSD]2011-06-17 11:04:18.383 [1207253312] >ERROR: ###################################
[ CSSD]2011-06-17 11:04:18.383 [1207253312] >ERROR: clssscExit: CSSD aborting from thread clssnmClusterListener
[ CSSD]2011-06-17 11:04:18.383 [1207253312] >ERROR: ###################################
[ CSSD]
----- Call Stack Trace -----
[ CSSD]calling call entry argument values in hex
[ CSSD]location type point (? means dubious value)
[ CSSD]-------------------- -------- -------------------- ----------------------------
[ CSSD]sskgds_getexecname: using /proc/self/status and $PATH to get ocssd.bin
[ CSSD]Cannot open ocssd.bin for reading: errno=2
[ CSSD]Cannot open ocssd.bin for reading: errno=2
[ CSSD]Cannot open ocssd.bin for reading: errno=2
[ CSSD]000000000040C2D6 call kgdsdst() 000000000 ? 000000001 ?
[ CSSD] 047F4FB68 ? 047F4E650 ?
[ CSSD] 000000000 ? 000000003 ?
thanks again
Maybe you are looking for
-
I've been experiencing a problem using jikes to compile EJBs. Has anyone has this type of problem and is there a known fix? The beans seem to compile fine, but I get an error when trying to deploy them. Here's the error: weblogic.ejb.common.Deploymen
-
Plug-in registration under Adobe third-party plug-ins list, how?
Hello folks, I would like to list my plugin under Adobe third-party plug-ins list. I searched a little online but couldn't find any information on Adobe's website. Does anybody know how to do this registration? Regards. Mor
-
'All Photos' is sorted randomly. Cannot be changed.
Ok, this is very frustrating. I have 10.10.3 and Photos 1.0 (209.52.0). I've just dragged about 9000 photos in Photos (all taken with my various iPhones over the years, correct metadata, EXIF, etc). Everything imported fine, with the exception that t
-
Hi, Paul Buchkowski mentioned an External Connection Class in framework. Does anyone know whether this class has been documented somewhere ? Thanks, Paolo Paolo Sidoli DS Data Systems Parma, Italy Tel. ++39 521 2781 Fax. ++39 521 272818 [email protec
-
How to drag out additional field in document printing in B1
Hi all expert, My customer need additional field or information to be show at the document printing window in standard B1. Example like customer name or user define field for warehouse. The standard B1 document printing is limited with the form setti