Rs-ora:resource group failed to start on chosen node; it may end up failing
I have configured a two-node failover cluster environment using Netra A/D 1000 storage. When I try to deploy the Oracle server application, it throws the following error:
rs-ora: resource group failed to start on chosen node; it may end up failing over to other node(s)
I created a metaset and added one raw DID disk to it.
I then created a logical hostname resource and an HAStoragePlus resource, and brought the resource group online with the following command:
# clrg online -emM rg-ora
Next, I created the Oracle server resource with the following command:
#clrs create -g rg-ora -t SUNW.oracle_server -p ORACLE_HOME=/global/oracle/product/10.2.0/db_1 -p ORACLE_SID=infra -p Alert_log_file=/global/oracle/product/10.2.0/db_1/admin/infra/bdump/alert_infra.log -p Connect_string=sysdba/dbadmin1@infra -p Resource_dependencies=rs-ora-has rs-ora
node1 - Validation failed. ORACLE_HOME /global/oracle/product/10.2.0/db_1 does not exist
node1 - ALERT_LOG_FILE /global/oracle/product/10.2.0/db_1/admin/infra/bdump/alert_infra.log doesn't exist
node1 - PARAMETER_FILE: /global/oracle/product/10.2.0/db_1/dbs/initinfra.ora nor server PARAMETER_FILE: /global/oracle/product/10.2.0/db_1/dbs/spfileinfra.ora exists
node1 - This resource depends on a HAStoragePlus resouce that is not online on this node. Ignoring validation errors.
rs-ora: resource group failed to start on chosen node; it may end up failing over to other node(s)
The status of the Oracle resource is as follows:
Resource Name    Node Name    State           Status Message
rs-ora           node1        Start failed    Faulted
I am using Solaris 10 Update 6 (kernel patch level Generic_137137-09), Oracle 10.2.0, and Sun Cluster 3.2 Update 1. The vfstab entries and /var/adm/messages from both nodes follow.
Node1#grep ora /etc/vfstab
/dev/md/oradg/dsk/d300 /dev/md/oradg/rdsk/d300 /global/oracle ufs 5 no logging
Node2#grep ora /etc/vfstab
/dev/md/oradg/dsk/d300 /dev/md/oradg/rdsk/d300 /global/oracle ufs 5 no logging
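For readers less familiar with the Solaris vfstab layout, the entry above consists of seven whitespace-separated fields. A minimal Python sketch that labels them is shown here; the names VfstabEntry and parse_vfstab_line are illustrative only and are not part of any Solaris or Sun Cluster tool.

```python
from typing import NamedTuple


class VfstabEntry(NamedTuple):
    """The seven fields of one /etc/vfstab line, in file order."""
    device_to_mount: str
    device_to_fsck: str
    mount_point: str
    fs_type: str
    fsck_pass: str
    mount_at_boot: str
    mount_options: str


def parse_vfstab_line(line: str) -> VfstabEntry:
    """Split one vfstab line into named fields (hypothetical helper)."""
    fields = line.split()
    if len(fields) != 7:
        raise ValueError(f"expected 7 fields, got {len(fields)}")
    return VfstabEntry(*fields)


# The entry posted above for both nodes.
entry = parse_vfstab_line(
    "/dev/md/oradg/dsk/d300 /dev/md/oradg/rdsk/d300 /global/oracle ufs 5 no logging"
)
print(entry.mount_point, entry.mount_at_boot, entry.mount_options)
```

Read this way, the mount-at-boot field is "no", so the file system is not mounted at boot. Something else, presumably the HAStoragePlus resource in this setup, has to mount /global/oracle before ORACLE_HOME is visible on the node.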
Node1#more /var/adm/messages
Oct 17 05:19:17 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hafoip_prenet_start> for resource <ha-
host-1>, resource group <rg-ora>, node <node1>, timeout <300> seconds
Oct 17 05:19:17 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/
lib/rgm/rt/hafoip/hafoip_prenet_start>:tag=<rg-ora.ha-host-1.10>: Calling security_clnt_connect(..., host=<node1>, sec_typ
e {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
Oct 17 05:19:17 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hafoip_prenet_start> completed successfully for
resource <ha-host-1>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <300 seconds>
Oct 17 05:19:17 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hastorageplus_prenet_start> for resour
ce <rs-ora-has>, resource group <rg-ora>, node <node1>, timeout <1800> seconds
Oct 17 05:19:17 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/
lib/rgm/rt/hastorageplus/hastorageplus_prenet_start>:tag=<rg-ora.rs-ora-has.10>: Calling security_clnt_connect(..., host=<tes
tlab5>, sec_type {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
Oct 17 05:19:18 node1 Cluster.RGM.rgmd: [ID 375444 daemon.notice] 8 fe_rpc_command: cmd_type(enum):<2>:cmd=<null>:tag=<rg-
ora.rs-ora-has.10>: Calling security_clnt_connect(..., host=<node1>, sec_type {0:WEAK, 1:STRONG, 2:DES} =<0>, ...)
Oct 17 05:19:18 node1 Cluster.RGM.rgmd: [ID 316625 daemon.notice] Timeout monitoring on method tag <rg-ora.rs-ora-has.10>
has been suspended.
Oct 17 05:19:20 node1 Cluster.Framework: [ID 801593 daemon.notice] stdout: becoming primary for oradg
Oct 17 05:19:21 node1 Cluster.RGM.rgmd: [ID 375444 daemon.notice] 8 fe_rpc_command: cmd_type(enum):<3>:cmd=<null>:tag=<rg-
ora.rs-ora-has.10>: Calling security_clnt_connect(..., host=<node1>, sec_type {0:WEAK, 1:STRONG, 2:DES} =<0>, ...)
Oct 17 05:19:21 node1 Cluster.RGM.rgmd: [ID 316625 daemon.notice] Timeout monitoring on method tag <rg-ora.rs-ora-has.10>
has been resumed.
Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hastorageplus_prenet_start> completed successful
ly for resource <rs-ora-has>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <1800 seconds>
Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hafoip_start> for resource <ha-host-1>
, resource group <rg-ora>, node <node1>, timeout <500> seconds
Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/
lib/rgm/rt/hafoip/hafoip_start>:tag=<rg-ora.ha-host-1.0>: Calling security_clnt_connect(..., host=<node1>, sec_type {0:WEA
K, 1:STRONG, 2:DES} =<1>, ...)
Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hafoip_start> completed successfully for resourc
e <ha-host-1>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <500 seconds>
Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hafoip_monitor_start> for resource <ha
-host-1>, resource group <rg-ora>, node <node1>, timeout <300> seconds
Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hastorageplus_start> for resource <rs-
ora-has>, resource group <rg-ora>, node <node1>, timeout <90> seconds
Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 510020 daemon.notice] 46 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/
lib/rgm/rt/hafoip/hafoip_monitor_start>:tag=<rg-ora.ha-host-1.7>: Calling security_clnt_connect(..., host=<node1>, sec_typ
e {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/
lib/rgm/rt/hastorageplus/hastorageplus_start>:tag=<rg-ora.rs-ora-has.0>: Calling security_clnt_connect(..., host=<node1>,
sec_type {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hafoip_monitor_start> completed successfully for
resource <ha-host-1>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <300 seconds>
Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hastorageplus_start> completed successfully for
resource <rs-ora-has>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <90 seconds>
Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hastorageplus_monitor_start> for resou
rce <rs-ora-has>, resource group <rg-ora>, node <node1>, timeout <90> seconds
Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/
lib/rgm/rt/hastorageplus/hastorageplus_monitor_start>:tag=<rg-ora.rs-ora-has.7>: Calling security_clnt_connect(..., host=<tes
tlab5>, sec_type {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
Oct 17 05:19:25 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hastorageplus_monitor_start> completed successfu
lly for resource <rs-ora-has>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <90 seconds>
Oct 17 05:19:38 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <bin/oracle_server_validate> for resour
ce <rs-ora>, resource group <rg-ora>, node <node1>, timeout <120> seconds
Oct 17 05:19:38 node1 Cluster.RGM.rgmd: [ID 375444 daemon.notice] 8 fe_rpc_command: cmd_type(enum):<1>:cmd=</opt/SUNWscor/
oracle_server/bin/oracle_server_validate>:tag=<rg-ora.rs-ora.2>: Calling security_clnt_connect(..., host=<node1>, sec_type
{0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
Oct 17 05:19:38 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <bin/oracle_server_validate> completed successful
ly for resource <rs-ora>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <120 seconds>
Oct 17 05:19:38 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <bin/oracle_server_init> for resource <
rs-ora>, resource group <rg-ora>, node <node1>, timeout <30> seconds
Oct 17 05:19:38 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</opt/SUNWscor
/oracle_server/bin/oracle_server_init>:tag=<rg-ora.rs-ora.4>: Calling security_clnt_connect(..., host=<node1>, sec_type {0
:WEAK, 1:STRONG, 2:DES} =<1>, ...)
Oct 17 05:19:38 node1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <bin/oracle_server_init> completed successfully f
or resource <rs-ora>, resource group <rg-ora>, node <node1>, time used: 0% of timeout <30 seconds>
Oct 17 05:19:38 node1 Cluster.CCR: [ID 973933 daemon.notice] resource rs-ora added.
Oct 17 05:19:39 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <bin/oracle_server_start> for resource
<rs-ora>, resource group <rg-ora>, node <node1>, timeout <600> seconds
Oct 17 05:19:39 node1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</opt/SUNWscor
/oracle_server/bin/oracle_server_start>:tag=<rg-ora.rs-ora.0>: Calling security_clnt_connect(..., host=<node1>, sec_type {
0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
Oct 17 05:19:48 node1 SC[SUNWscor.oracle_server.start]:rg-ora:rs-ora: [ID 876834 daemon.error] Could not start server
Oct 17 05:19:48 node1 Cluster.RGM.rgmd: [ID 938318 daemon.error] Method <bin/oracle_server_start> failed on resource <rs-o
ra> in resource group <rg-ora> [exit code <1>, time used: 1% of timeout <600 seconds>]
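Most of the node1 log above is daemon.notice traffic; only the last two entries are errors. As a rough aid for picking those out of a wrapped paste like this one, the following Python sketch rejoins continuation lines and keeps only daemon.error entries. The function names are hypothetical, and the sample lines are copied from the log above.

```python
import re

# A syslog entry starts with a timestamp such as "Oct 17 05:19:48 ".
ENTRY_START = re.compile(r"^[A-Z][a-z]{2} +\d{1,2} \d{2}:\d{2}:\d{2} ")


def join_wrapped(lines):
    """Merge terminal-wrapped continuation lines back into whole entries.

    Lines are joined without a separator, which restores tokens split
    mid-word but can drop a space when the wrap fell on whitespace.
    """
    entries = []
    for line in lines:
        line = line.rstrip("\n")
        if ENTRY_START.match(line) or not entries:
            entries.append(line)
        else:
            entries[-1] += line
    return entries


def error_entries(lines):
    """Return only the entries logged at daemon.error priority."""
    return [e for e in join_wrapped(lines) if "daemon.error]" in e]


sample = [
    "Oct 17 05:19:39 node1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <bin/oracle_server_start> for resource",
    "<rs-ora>, resource group <rg-ora>, node <node1>, timeout <600> seconds",
    "Oct 17 05:19:48 node1 SC[SUNWscor.oracle_server.start]:rg-ora:rs-ora: [ID 876834 daemon.error] Could not start server",
    "Oct 17 05:19:48 node1 Cluster.RGM.rgmd: [ID 938318 daemon.error] Method <bin/oracle_server_start> failed on resource <rs-o",
    "ra> in resource group <rg-ora> [exit code <1>, time used: 1% of timeout <600 seconds>]",
]
errors = error_entries(sample)
for e in errors:
    print(e)
```

On the real file you would pass the lines of /var/adm/messages instead of the sample list. The sketch only filters the log; it does not explain why oracle_server_start returned exit code 1.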
Node2# more /var/adm/messages
Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 529407 daemon.notice] resource group rg-ora state on node node2 change to RG_PENDIN
G_OFFLINE
Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource rs-ora-has state on node node2 change to R_MON_STOPP
ING
Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource ha-host-1 state on node node2 change to R_MON_STOPPI
NG
Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hafoip_monitor_stop> for resource <ha-host
-1>, resource group <rg-ora>, node <node2>, timeout <300> seconds
Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hastorageplus_monitor_stop> for resource <
rs-ora-has>, resource group <rg-ora>, node <node2>, timeout <90> seconds
Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 268902 daemon.notice] 45 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/lib/
rgm/rt/hafoip/hafoip_monitor_stop>:tag=<rg-ora.ha-host-1.8>: Calling security_clnt_connect(..., host=<node2>, sec_type {0:WEAK
, 1:STRONG, 2:DES} =<1>, ...)
Oct 14 20:20:04 node2 Cluster.RGM.rgmd: [ID 510020 daemon.notice] 46 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/lib/
rgm/rt/hastorageplus/hastorageplus_monitor_stop>:tag=<rg-ora.rs-ora-has.8>: Calling security_clnt_connect(..., host=<node2>, s
ec_type {0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hastorageplus_monitor_stop> completed successfully f
or resource <rs-ora-has>, resource group <rg-ora>, node <node2>, time used: 0% of timeout <90 seconds>
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource rs-ora-has state on node node2 change to R_ONLINE_UN
MON
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource rs-ora-has state on node node2 change to R_STOPPING
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 784560 daemon.notice] resource rs-ora-has status on node node2 change to R_FM_UNKNO
WN
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 922363 daemon.notice] resource rs-ora-has status msg on node node2 change to <Stopp
ing>
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hastorageplus_stop> for resource <rs-ora-h
as>, resource group <rg-ora>, node <node2>, timeout <1800> seconds
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 510020 daemon.notice] 46 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/lib/
rgm/rt/hastorageplus/hastorageplus_stop>:tag=<rg-ora.rs-ora-has.1>: Calling security_clnt_connect(..., host=<node2>, sec_type
{0:WEAK, 1:STRONG, 2:DES} =<1>, ...)
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hafoip_monitor_stop> completed successfully for reso
urce <ha-host-1>, resource group <rg-ora>, node <node2>, time used: 0% of timeout <300 seconds>
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource ha-host-1 state on node node2 change to R_ONLINE_UNM
ON
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hastorageplus_stop> completed successfully for resou
rce <rs-ora-has>, resource group <rg-ora>, node <node2>, time used: 0% of timeout <1800 seconds>
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource rs-ora-has state on node node2 change to R_STOPPED
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource ha-host-1 state on node node2 change to R_STOPPING
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <hafoip_stop> for resource <ha-host-1>, res
ource group <rg-ora>, node <node2>, timeout <300> seconds
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 784560 daemon.notice] resource ha-host-1 status on node node2 change to R_FM_UNKNOW
N
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 922363 daemon.notice] resource ha-host-1 status msg on node node2 change to <Stoppi
ng>
Oct 14 20:20:05 node2 Cluster.RGM.rgmd: [ID 510020 daemon.notice] 46 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/lib/
rgm/rt/hafoip/hafoip_stop>:tag=<rg-ora.ha-host-1.1>: Calling security_clnt_connect(..., host=<node2>, sec_type {0:WEAK, 1:STRO
NG, 2:DES} =<1>, ...)
Oct 14 20:20:06 node2 ip: [ID 678092 kern.notice] TCP_IOC_ABORT_CONN: local = 192.168.032.244:0, remote = 000.000.000.000:0, s
tart = -2, end = 6
Oct 14 20:20:06 node2 ip: [ID 302654 kern.notice] TCP_IOC_ABORT_CONN: aborted 0 connection
Oct 14 20:20:06 node2 Cluster.RGM.rgmd: [ID 784560 daemon.notice] resource ha-host-1 status on node node2 change to R_FM_OFFLIN
E
Oct 14 20:20:06 node2 Cluster.RGM.rgmd: [ID 922363 daemon.notice] resource ha-host-1 status msg on node node2 change to <Logica
lHostname offline.>
Oct 14 20:20:06 node2 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hafoip_stop> completed successfully for resource <ha
-host-1>, resource group <rg-ora>, node <node2>, time used: 0% of timeout <300 seconds>
Oct 14 20:20:06 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource ha-host-1 state on node node2 change to R_OFFLINE
Oct 14 20:20:06 node2 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource rs-ora-has state on node node2 change to R_POSTNET_S
TOPPING
Similar Messages
-
ONS failed to start on second node
Hi,
I have a problem with ONS on 10g RAC running on Linux 5.3.
On node 1 it runs without problems, but on the second node I get this error:
2009-04-08 16:30:41.318: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the
2009-04-08 16:30:41.319: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
2009-04-08 16:30:41.319: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
Number of onsconfiguration retrieved, numcfg = 2
onscfg[0]
{node = rac1, port = 6200}
Adding remote host rac1:6200
onscfg[1]
{node = rac2, port = 6
2009-04-08 16:30:41.319: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: 200}
Adding remote host rac2:6200
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission d
2009-04-08 16:30:41.319: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: enied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server loca
2009-04-08 16:30:41.319: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: l port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
Number of onsconfiguration retrieved, numcfg = 2
onscfg[0]
{node = rac1, port = 6200}
Adding remote host rac1:6200
o
2009-04-08 16:30:41.319: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: nscfg[1]
{node = rac2, port = 6200}
Adding remote host rac2:6200
onsctl: ons failed to start
2009-04-08 16:30:41.319: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: clsrcexecut: env ORACLE_CONFIG_HOME=/u01/app/crs
2009-04-08 16:30:41.319: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: clsrcexecut: cmd = /u01/app/crs/bin/racgeut -e USRORA_DEBUG=0 540 /u01/app/crs/bin/onsctl start
2009-04-08 16:30:41.320: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: clsrcexecut: rc = 1, time = 2.580s
2009-04-08 16:30:42.148: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the
2009-04-08 16:30:42.150: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
2009-04-08 16:30:42.150: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: RCV: Permission denied
Communication error with the OPMN server local port.
Check the OPMN log files
Number of onsconfiguration retrieved, numcfg = 2
onscfg[0]
{node = rac1, port = 6200}
Adding remote host rac1:6200
onscfg[1]
{node = rac2, port = 6
2009-04-08 16:30:42.150: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: 200}
Adding remote host rac2:6200
ons is not running ...
2009-04-08 16:30:42.151: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: clsrcexecut: env ORACLE_CONFIG_HOME=/u01/app/crs
2009-04-08 16:30:42.151: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: clsrcexecut: cmd = /u01/app/crs/bin/racgeut -e USRORA_DEBUG=0 540 /u01/app/crs/bin/onsctl ping
2009-04-08 16:30:42.151: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: clsrcexecut: rc = 1, time = 0.840s
2009-04-08 16:30:42.153: [ RACG][3065611968] [16553][3065611968][ora.rac2.ons]: end for resource = ora.rac2.ons, action = start, status = 1, time = 3.620s
2009-04-08 16:30:44.376: [ RACG][3066242752] [17061][3066242752][ora.rac2.ons]: onsctl: shutting down ons daemon ...
Number of onsconfiguration retrieved, numcfg = 2
onscfg[0]
{node = rac1, port = 6200}
Adding remote host rac1:6200
onscfg[1]
{node = rac2, port = 6200}
Adding remote host rac2:6200
Any idea how to fix this?
Thanks
Check the output of crs_getperm for the resource on both nodes. If you can, post it here.
Regards,
Ganesh -
Can not start messaging server resource group in cluster 3.2
Hi all,
Please help with the following issue.
I am not able to start the resource group (msg-rg). The error is as follows:
ms1@root# clrg online -M -e msg-rg
clrg: (C748634) Resource group msg-rg failed to start on chosen node and might fail over to other node(s)
clrg: (C135343) No primary node could be found for resource group msg-rg; it remains offline
scstat output (abridged):
-- Device Group Servers --
                        Device Group   Primary   Secondary
  Device group servers: SJMS           ms1       ms2
-- Device Group Status --
                        Device Group   Status
  Device group status:  SJMS           Online
-- Resource Groups and Resources --
             Group Name   Resources
  Resources: msg-rg       mail msg-hasp-rs msg-rs
-- Resources --
            Resource Name   Node Name   State     Status Message
  Resource: mail            ms1         Offline   Offline - LogicalHostname offline.
  Resource: mail            ms2         Offline   Offline - LogicalHostname offline.
  Resource: msg-hasp-rs     ms1         Offline   Offline
  Resource: msg-hasp-rs     ms2         Offline   Offline
  Resource: msg-rs          ms1         Offline   Offline - Stop Succeeded
  Resource: msg-rs          ms2         Offline   Offline - Stop Succeeded
The following is from /var/adm/messages (abridged):
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 224900 daemon.notice] launching method <ims_svc_start> for resource <msg-rs>, resou
rce group <msg-rg>, node <ms1>, timeout <300> seconds
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 784560 daemon.notice] resource msg-rs status on node ms1 change to R_FM_UNKNOWN
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 922363 daemon.notice] resource msg-rs status msg on node ms1 change to <Starting>
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 751138 daemon.notice] 47 fe_rpc_command: cmd_type(enum):<1>:cmd=</usr/cluster/lib/r
gm/rt/hafoip/hafoip_monitor_start>:tag=<msg-rg.mail.7>: Calling security_clnt_connect(..., host=<ms1>, sec_type {0:WEAK, 1:ST
RONG, 2:DES} =<1>, ...)
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 268902 daemon.notice] 45 fe_rpc_command: cmd_type(enum):<1>:cmd=</opt/sun/comms/msg
scha/bin/imssvc_start>:tag=<msg-rg.msg-rs.0>: Calling security_clnt_connect(..., host=<ms1>, sec_type {0:WEAK, 1:STRONG, 2:
DES} =<1>, ...)
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 515159 daemon.notice] method <hafoip_monitor_start> completed successfully for reso
urce <mail>, resource group <msg-rg>, node <ms1>, time used: 0% of timeout <300 seconds>
Sep 26 12:25:19 ms1 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource mail state on node ms1 change to R_ONLINE
Sep 26 12:26:53 ms1 Cluster.PMF.pmfd: [ID 887656 daemon.notice] Process: tag="msg-rg,msg-rs,1.svc", cmd="/bin/sh -c /opt/sun/
comms/messaging64/bin/start-msg watcher", Failed to stay up.
Sep 26 12:26:55 ms1 Cluster.RGM.rgmd: [ID 784560 daemon.notice] resource msg-rs status on node ms1 change to R_FM_ONLINE
Sep 26 12:26:55 ms1 Cluster.RGM.rgmd: [ID 922363 daemon.notice] resource msg-rs status msg on node ms1 change to <Start succe
eded.>
Sep 26 12:26:55 ms1 Cluster.PMF.pmfd: [ID 819736 daemon.notice] PMF is restarting process that died: tag=msg-rg,msg-rs,1.svc,
cmd_path=/bin/sh -c /opt/sun/comms/messaging64/bin/start-msg watcher, max_retries=0, num_retries=0
Sep 26 12:27:25 ms1 SC[SUNW.ims:7.0,msg-rg,msg-rs,ims_svc_start]: [ID 141062 daemon.error] Failed to connect to host 192.168.
0.250 and port 27442: Connection refused.
Sep 26 12:29:55 ms1 last message repeated 6 times
Sep 26 12:30:26 ms1 Cluster.RGM.rgmd: [ID 764140 daemon.error] Method <ims_svc_start> on resource <msg-rs>, resource group <m
sg-rg>, node <ms1>: Timeout.
Sep 26 12:30:26 ms1 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource msg-rs state on node ms1 change to R_START_FAILED
Sep 26 12:30:26 ms1 Cluster.RGM.rgmd: [ID 529407 daemon.notice] resource group msg-rg state on node ms1 change to RG_PENDING_
OFF_START_FAILED
Sep 26 12:30:26 ms1 Cluster.RGM.rgmd: [ID 784560 daemon.notice] resource msg-rs status on node ms1 change to R_FM_FAULTED
Sep 26 12:30:26 ms1 Cluster.RGM.rgmd: [ID 443746 daemon.notice] resource msg-rs state on node ms1 change to R_STOPPING
I found my mistake in the /etc/hosts entry. I am posting the details here for anyone who runs into the same problem.
The entry should have the following format:
192.168.0.250 mail.test.com mail msg-l
Create the logical hostname as follows:
clrslh create -g msg-rg msg-l
Note qfe0:1 in the output below:
# ifconfig -a
lo0: flags=2001000849<UP,LOOPBACK,RUNNING,MULTICAST,IPv4,VIRTUAL> mtu 8232 index 1
inet 127.0.0.1 netmask ff000000
eri0: flags=1000843<UP,BROADCAST,RUNNING,MULTICAST,IPv4> mtu 1500 index 2
inet 192.168.0.240 netmask ffffff00 broadcast 192.168.0.255
groupname sc_ipmp0
ether 0:3:ba:29:8a:ac
eri0:1: flags=9040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4,NOFAILOVER> mtu 1500 index 2
inet 192.168.0.242 netmask ffffff00 broadcast 192.168.0.255
qfe0: flags=9040842<BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4,NOFAILOVER> mtu 1500 index 3
inet 192.168.0.243 netmask ffffff00 broadcast 192.168.0.255
groupname sc_ipmp0
ether 0:3:ba:22:d4:36
qfe0:1: flags=1040843<UP,BROADCAST,RUNNING,MULTICAST,DEPRECATED,IPv4> mtu 1500 index 3
inet 192.168.0.250 netmask ffffff00 broadcast 192.168.0.255
qfe2: flags=1008843<UP,BROADCAST,RUNNING,MULTICAST,PRIVATE,IPv4> mtu 1500 index 5
inet 172.16.0.129 netmask ffffff80 broadcast 172.16.0.255
ether 0:3:ba:22:d4:38
qfe3: flags=1008843<UP,BROADCAST,RUNNING,MULTICAST,PRIVATE,IPv4> mtu 1500 index 4
inet 172.16.1.1 netmask ffffff80 broadcast 172.16.1.127
ether 0:3:ba:22:d4:39
clprivnet0: flags=1009843<UP,BROADCAST,RUNNING,MULTICAST,MULTI_BCAST,PRIVATE,IPv4> mtu 1500 index 6
inet 172.16.4.1 netmask fffffe00 broadcast 172.16.5.255
ether 0:0:0:0:0:1
Now I am able to plumb the logical hostname IP. The messaging resource group can switch over between nodes and can go online (before the messaging server resource, msg-rs, is created).
After creating the messaging server resource, I use the following command to start the messaging resource group:
ms1@root # clrg online -eM msg-rg
I used the following command to create the messaging resource (msg-rs):
clrs create -g msg-rg -t SUNW.ims -x IMS_serverroot=/opt/sun/comms/messaging64 -y Resource_dependencies=msg-l,msg-hasp-rs msg-rs
But I still have a problem starting the resource group after adding msg-rs.
Please advise where I went wrong.
Thanks. -
Switching resource group in 2 node cluster fails
hi,
I configured a two-node cluster to provide high availability for my Oracle DB 9.2.0.7.
I created a resource group named oracleha-rg,
and later created the following resources:
oraclelh-rs for the logical hostname
hastp-rs for the HA storage resource
oracle-server-rs for the Oracle resource
and listener-rs for the listener
Whenever I try to switch the resource group between nodes, it gives me the following in dmesg:
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 224900 daemon.notice] launching method <hafoip_stop> for resource <oraclelh-rs>, resource group <oracleha-rg>, node <DB1>, timeout <300> seconds
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 784560 daemon.notice] resource oraclelh-rs status on node DB1 change to R_FM_UNKNOWN
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 922363 daemon.notice] resource oraclelh-rs status msg on node DB1 change to <Stopping>
Feb 6 16:17:49 DB1 ip: [ID 678092 kern.notice] TCP_IOC_ABORT_CONN: local = 010.050.033.009:0, remote = 000.000.000.000:0, start = -2, end = 6
Feb 6 16:17:49 DB1 ip: [ID 302654 kern.notice] TCP_IOC_ABORT_CONN: aborted 0 connection
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 784560 daemon.notice] resource oraclelh-rs status on node DB1 change to R_FM_OFFLINE
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 922363 daemon.notice] resource oraclelh-rs status msg on node DB1 change to <LogicalHostname offline.>
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 515159 daemon.notice] method <hafoip_stop> completed successfully for resource <oraclelh-rs>, resource group <oracleha-rg>, node <DB1>, time used: 0% of timeout <300 seconds>
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 443746 daemon.notice] resource oraclelh-rs state on node DB1 change to R_OFFLINE
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 224900 daemon.notice] launching method <hastorageplus_postnet_stop> for resource <hastp-rs>, resource group <oracleha-rg>, node <DB1>, timeout <1800> seconds
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 784560 daemon.notice] resource hastp-rs status on node DB1 change to R_FM_UNKNOWN
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 922363 daemon.notice] resource hastp-rs status msg on node DB1 change to <Stopping>
Feb 6 16:17:49 DB1 SC[,SUNW.HAStoragePlus:8,oracleha-rg,hastp-rs,hastorageplus_postnet_stop]: [ID 843127 daemon.warning] Extension properties FilesystemMountPoints and GlobalDevicePaths and Zpools are empty.
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 515159 daemon.notice] method <hastorageplus_postnet_stop> completed successfully for resource <hastp-rs>, resource group <oracleha-rg>, node <DB1>, time used: 0% of timeout <1800 seconds>
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 443746 daemon.notice] resource hastp-rs state on node DB1 change to R_OFFLINE
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 784560 daemon.notice] resource hastp-rs status on node DB1 change to R_FM_OFFLINE
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 922363 daemon.notice] resource hastp-rs status msg on node DB1 change to <>
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 529407 daemon.error] resource group oracleha-rg state on node DB1 change to RG_OFFLINE_START_FAILED
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 529407 daemon.notice] resource group oracleha-rg state on node DB1 change to RG_OFFLINE
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 447451 daemon.notice] Not attempting to start resource group <oracleha-rg> on node <DB1> because this resource group has already failed to start on this node 2 or more times in the past 3600 seconds
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 447451 daemon.notice] Not attempting to start resource group <oracleha-rg> on node <DB2> because this resource group has already failed to start on this node 2 or more times in the past 3600 seconds
Feb 6 16:17:49 DB1 Cluster.RGM.global.rgmd: [ID 674214 daemon.notice] rebalance: no primary node is currently found for resource group <oracleha-rg>.
Feb 6 16:19:08 DB1 Cluster.RGM.global.rgmd: [ID 603096 daemon.notice] resource hastp-rs disabled.
Feb 6 16:19:17 DB1 Cluster.RGM.global.rgmd: [ID 603096 daemon.notice] resource oraclelh-rs disabled.
Feb 6 16:19:22 DB1 Cluster.RGM.global.rgmd: [ID 603096 daemon.notice] resource oracle-rs disabled.
Feb 6 16:19:27 DB1 Cluster.RGM.global.rgmd: [ID 603096 daemon.notice] resource listener-rs disabled.
Feb 6 16:19:51 DB1 Cluster.RGM.global.rgmd: [ID 529407 daemon.notice] resource group oracleha-rg state on node DB1 change to RG_OFF_PENDING_METHODS
Feb 6 16:19:51 DB1 Cluster.RGM.global.rgmd: [ID 529407 daemon.notice] resource group oracleha-rg state on node DB2 change to RG_OFF_PENDING_METHODS
Feb 6 16:19:51 DB1 Cluster.RGM.global.rgmd: [ID 224900 daemon.notice] launching method <bin/oracle_listener_fini> for resource <listener-rs>, resource group <oracleha-rg>, node <DB1>, timeout <30> seconds
Feb 6 16:19:51 DB1 Cluster.RGM.global.rgmd: [ID 515159 daemon.notice] method <bin/oracle_listener_fini> completed successfully for resource <listener-rs>, resource group <oracleha-rg>, node <DB1>, time used: 0% of timeout <30 seconds>
Feb 6 16:19:51 DB1 Cluster.RGM.global.rgmd: [ID 529407 daemon.notice] resource group oracleha-rg state on node DB1 change to RG_OFFLINE
Feb 6 16:19:51 DB1 Cluster.RGM.global.rgmd: [ID 529407 daemon.notice] resource group oracleha-rg state on node DB2 change to RG_OFFLINE
and the resource group fails to switch.
Any help, please?
Hi,
This forum is for Oracle Clusterware, not Solaris Cluster. You should probably close this thread and post your question in the corresponding Solaris Cluster forum to get help.
Regards
Sebastian -
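The two "Not attempting to start resource group" entries in the log above state a rule: a node is skipped once the group has failed to start there 2 or more times in the past 3600 seconds. The Python sketch below models only that wording. It is not the RGM implementation; the function names are hypothetical, and the threshold and window values are taken from the message text (the window most likely corresponds to the resource group's Pingpong_interval property, which is worth verifying for your release).

```python
def start_failures_in_window(failure_times, now, window=3600):
    """Count start failures no older than `window` seconds at time `now`."""
    return sum(1 for t in failure_times if 0 <= now - t <= window)


def node_is_skipped(failure_times, now, threshold=2, window=3600):
    """True if the rule quoted in the log message would skip this node."""
    return start_failures_in_window(failure_times, now, window) >= threshold


# Hypothetical failure timestamps, in seconds since some reference point.
failures_on_db1 = [0, 1200]

skipped_soon_after = node_is_skipped(failures_on_db1, now=1800)
skipped_much_later = node_is_skipped(failures_on_db1, now=5000)
print(skipped_soon_after, skipped_much_later)
```

Under this model, once both nodes have two recent start failures the group has no candidate primary, which matches the "rebalance: no primary node is currently found" entry. The underlying start failure still has to be diagnosed separately.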
How to unregister a resource group LDom without stopping it
Hello, I created a resource group and an LDom resource in a test environment (Sun Cluster 4.2). Now I would like to remove this resource group and LDom resource without stopping the LDom itself.
If I follow the Oracle documentation, the LDom ends up in the inactive state.
clrs disable LDM-sovxxxxx
clrs delete LDM-sovxxxxx
root@ddom14:/etc/cluster/ccr/global# ldm ls
NAME              STATE     FLAGS   CONS  VCPU  MEMORY  UTIL  NORM  UPTIME
primary           active    -n-cv-  UART  4     4G      1.8%  1.8%  105d
sovgvacluster01p  active    -n----  5000  16    10G     0.1%  0.1%  19d 2h 13m
sovgvacluster02p  active    -n----  5001  8     10G     0.1%  0.1%  5d 19h 16m
root@ddom14:/etc/cluster/ccr/global# clresource disable LDM-sovgvacluster02
root@ddom14:/etc/cluster/ccr/global# clrs show
=== Resources ===
Resource: LDM-sovgvacluster01p
Type: SUNW.ldom:4
Type_version: 4
Group: sovgvacluster01p
R_description:
Resource_project_name: default
Enabled{ddom14}: True
Enabled{ddom24}: True
Monitored{ddom14}: True
Monitored{ddom24}: True
Resource: LDM-sovgvacluster02p
Type: SUNW.ldom:4
Type_version: 4
Group: sovgvacluster02p
R_description:
Resource_project_name: default
Enabled{ddom14}: False
Enabled{ddom24}: False
Monitored{ddom14}: True
Monitored{ddom24}: True
root@ddom14:/etc/cluster/ccr/global# ldm ls
NAME STATE FLAGS CONS VCPU MEMORY UTIL NORM UPTIME
primary active -n-cv- UART 4 4G 3.2% 3.2% 105d
sovgvacluster01p active -n---- 5000 16 10G 0.1% 0.1% 19d 2h 15m
sovgvacluster02p inactive ------ 8 10G
Is there any way to do that?
Thanks for your help
Willy
Hi Willy,
when the LDom is configured in a SUNW.ldom resource, it is under the control of the RGM (resource group manager) of the Solaris Cluster software.
And yes, if you disable the SUNW.ldom resource or delete it with 'delete -F', the LDom goes down, which is expected behavior.
I don't know what you are trying to achieve, but maybe 'quiesce' or 'suspend' on the resource group could help?
quiesce:
This command stops a resource group from continuously switching from one node or zone to another node or zone if a START or STOP method fails.
Use the -k option to kill methods that are running on behalf of resources in the affected resource groups. If you do not specify the -k option, methods are allowed to continue running until they exit or exceed their configured timeout.
suspend:
To prevent the resource group from coming online automatically, use the suspend subcommand to suspend the automatic recovery actions of the resource group. To resume automatic recovery actions, use the resume subcommand.
More details in the man page of clrg.
Hth,
Juergen -
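For reference, the two subcommands described above could be wrapped as below. This is a minimal sketch and not something posted in the thread: the `run` dry-run helper and the three wrapper function names are hypothetical, and the resource group name is taken from the `clrs show` output earlier. Neither subcommand takes the LDom out of cluster control; they only change how the RGM reacts to failures.

```shell
#!/bin/sh
# Hypothetical dry-run helper: prints the command unless DRY_RUN=0 is set,
# so the sketch can be read and tried without touching a live cluster.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        printf '%s\n' "$*"
    else
        "$@"
    fi
}

# Suspend automatic recovery actions for a resource group.
pause_recovery() { run clresourcegroup suspend "$1"; }

# Resume automatic recovery actions.
resume_recovery() { run clresourcegroup resume "$1"; }

# Stop a group from switching between nodes; -k also kills running methods.
stop_flapping() { run clresourcegroup quiesce -k "$1"; }

pause_recovery sovgvacluster02p
```

With `DRY_RUN` unset the script only prints the command it would run; the man page of clrg remains the authority on the exact options.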
WebCache Failed to start : Failed to assign port 80: Permission denied
Hi All,
I have three servers running IAS 10.1.2.0.2 with a Forms and Reports application: one Infra and two Midtier.
Midtier1 suddenly crashed, but before that happened I had already backed up the Oracle Home and all its related configuration files with tar.
After the crash, I reinstalled the same version and update of RH Linux and then restored the backup (tar -xvf).
When I run opmnctl startall, all ias-components start EXCEPT one: WEB CACHE.
When I look at the Web Cache event log, here is the error:
[11/May/2004:17:29:05 +0700] [notification 9612] [ecid: -] OracleAS Web Cache 10g (10.1.2), Build 10.1.2.0.2 050802
[11/May/2004:17:29:05 +0700] [notification 9612] [ecid: -] OracleAS Web Cache 10g (10.1.2), Build 10.1.2.0.2 050802
[11/May/2004:17:29:05 +0700] [notification 9403] [ecid: -] Maximum number of file/socket descriptors set to 900.
[11/May/2004:17:29:05 +0700] [notification 9403] [ecid: -] Maximum number of file/socket descriptors set to 900.
[11/May/2004:17:29:05 +0700] [notification 13002] [ecid: -] Maximum allowed incoming connections are 700
[11/May/2004:17:29:05 +0700] [notification 13002] [ecid: -] Maximum allowed incoming connections are 700
[11/May/2004:17:29:05 +0700] [alert 13305] [ecid: -] Failed to assign port 80: Permission denied
[11/May/2004:17:29:05 +0700] [alert 9707] [ecid: -] Failed to start the server.
[11/May/2004:17:29:05 +0700] [alert 9609] [ecid: -] The server process could not initialize.
[11/May/2004:17:29:05 +0700] [notification 9610] [ecid: -] The server is exiting.
[11/May/2004:17:29:05 +0700] [alert 9000] [ecid: -] Process 3268 exit(1) at 890:main.c [Build 10.1.2.0.2 050802]
[11/May/2004:17:29:05 +0700] [warning 11917] [ecid: -] SSL wallet Origin Server Wallet file /etc/ORACLE/WALLETS/oraias/ewallet.p12 does not exist.
[11/May/2004:17:29:05 +0700] [warning 11917] [ecid: -] SSL wallet Origin Server Wallet file /etc/ORACLE/WALLETS/oraias/ewallet.der does not exist.
[11/May/2004:17:29:05 +0700] [warning 11919] [ecid: -] The SSL wallet autologin file /etc/ORACLE/WALLETS/oraias/cwallet.sso does not exist. Wallet does not appear to be autologin wallet.
[11/May/2004:17:29:05 +0700] [warning 11921] [ecid: -] The origin server wallet did not open. Operating without wallet for backend. Only Diffie-Hellman anonymous connections supported to origin servers.
[11/May/2004:17:29:05 +0700] [warning 11922] [ecid: -] Origin Server Wallet wallet fails to open at location /etc/ORACLE/WALLETS/oraias, NZE-28759, as user oraias
[11/May/2004:17:29:06 +0700] [notification 9607] [ecid: -] The admin server started successfully.
How can I solve this problem?
Thank you for your help,
xtanto
Hi xtanto,
The privileges for running on a port below 1024 (i.e. port 80) may not have been set up in your restored tar backup.
Please check Chapter 8, "Running webcached with Root Privilege", in the Web Cache Admin document.
http://download-east.oracle.com/docs/cd/B14099_19/caching.1012/b14046/basics.htm#sthref1060
Regards,
Martin -
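A quick way to check the suggestion above is to look at whether the port is privileged and whether the restored binary still carries the setuid bit. The sketch below is an assumption-laden illustration: the function names are hypothetical, and the path `$ORACLE_HOME/webcache/bin/webcached` is assumed rather than confirmed in this thread. A tar archive extracted by a non-root user generally does not restore root ownership, which is one way such a privilege can be lost.

```shell
#!/bin/sh
# Ports below 1024 are privileged: an unprivileged process cannot bind them.
is_privileged_port() {
    [ "$1" -lt 1024 ]
}

# True when the given file has the setuid bit set.
has_setuid() {
    [ -u "$1" ]
}

# Example usage (path is an assumption, adjust to the real install):
#   is_privileged_port 80 && has_setuid "$ORACLE_HOME/webcache/bin/webcached" \
#       || echo "webcached cannot bind port 80 as a non-root user"
```

If the check fails, the linked chapter describes how to grant the privilege again; follow that procedure rather than setting the bit by hand.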
Arch32.service fails on start. bundled 32bit system
I am following the "install bundled 32bit system in 64bit system" wiki page and I think I am having trouble with the systemd service part.
I am getting an error when starting the arch32 service with systemctl. I have looked at the error messages and I am just not sure what is wrong here. Can someone point out what the error message is telling me, so that I can read error messages better in the future? I have searched, and Google came up short. I am an eager Linux student. I would more than appreciate any "hints" so that I can get better and be of more value as I help others. Thank you.
systemctl start arch32
## Job for arch32.service failed. See "systemctl status arch32.service" and "journalctl -xe" for details.
The result of systemctl status...
# systemctl status -l arch32
● arch32.service - 32-bit chroot
Loaded: loaded (/etc/systemd/system/arch32.service; disabled; vendor preset: disabled)
Active: failed (Result: exit-code) since Fri 2015-05-22 10:32:54 EDT; 8min ago
Process: 980 ExecStart=/usr/local/bin/arch32 start (code=exited, status=1/FAILURE)
Main PID: 980 (code=exited, status=1/FAILURE)
May 22 10:32:53 archhost systemd[1]: Starting 32-bit chroot...
May 22 10:32:54 archhost systemd[1]: arch32.service: main process exited, code=exited, status=1/FAILURE
May 22 10:32:54 archhost systemd[1]: Failed to start 32-bit chroot.
May 22 10:32:54 archhost systemd[1]: Unit arch32.service entered failed state.
May 22 10:32:54 archhost systemd[1]: arch32.service failed.
The result of journalctl...
# journalctl -xe
May 23 07:41:27 archhost org.a11y.Bus[3104]: Activating service name='org.a11y.atspi.Registry'
May 23 07:41:28 archhost org.a11y.Bus[3104]: Successfully activated service 'org.a11y.atspi.Registry'
May 23 07:41:28 archhost org.a11y.atspi.Registry[3233]: SpiRegistry daemon is running with well-known name - org.a11y.atspi.Registry
May 23 07:41:28 archhost polkitd[3111]: Registered Authentication Agent for unix-session:c1 (system bus name :1.13 [/usr/lib/polkit-gnome/polkit-gnome-authentication-agent-1], object path /org/gnome/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8)
May 23 07:41:32 archhost kernel: dell_wmi: Unknown WMI event type 0x11: 0xffd0
May 23 07:41:32 archhost kernel: dell_wmi: Unknown WMI event type 0x11: 0xffd1
May 23 07:41:32 archhost kernel: dell_wmi: Unknown WMI event type 0x11: 0x187
May 23 07:41:32 archhost kernel: dell_wmi: Unknown WMI event type 0x11: 0x188
May 23 07:41:32 archhost kernel: dell_wmi: Unknown WMI event type 0x11: 0x187
May 23 07:41:33 archhost kernel: dell_wmi: Unknown WMI event type 0x11: 0x188
May 23 07:49:50 archhost sudo[3422]: codeamend : TTY=pts/0 ; PWD=/usr/local/bin ; USER=root ; COMMAND=/usr/bin/systemctl start arch32
May 23 07:49:50 archhost sudo[3422]: pam_unix(sudo:session): session opened for user root by codeamend(uid=0)
May 23 07:49:50 archhost polkitd[3111]: Registered Authentication Agent for unix-process:3423:70961 (system bus name :1.14 [/usr/bin/pkttyagent --notify-fd 5 --fallback], object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8)
May 23 07:49:50 archhost systemd[1]: Starting 32-bit chroot...
-- Subject: Unit arch32.service has begun start-up
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- Unit arch32.service has begun starting up.
May 23 07:49:50 archhost arch32[3428]: mount: mount point /opt/arch32/scratch does not exist
May 23 07:49:50 archhost systemd[1]: arch32.service: main process exited, code=exited, status=1/FAILURE
May 23 07:49:50 archhost systemd[1]: Failed to start 32-bit chroot.
-- Subject: Unit arch32.service has failed
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- Unit arch32.service has failed.
-- The result is failed.
May 23 07:49:50 archhost systemd[1]: Unit arch32.service entered failed state.
May 23 07:49:50 archhost systemd[1]: arch32.service failed.
May 23 07:49:50 archhost sudo[3422]: pam_unix(sudo:session): session closed for user root
May 23 07:49:50 archhost polkitd[3111]: Unregistered Authentication Agent for unix-process:3423:70961 (system bus name :1.14, object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8) (disconnected from bus)
May 23 07:53:11 archhost systemd[1]: Starting Cleanup of Temporary Directories...
-- Subject: Unit systemd-tmpfiles-clean.service has begun start-up
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- Unit systemd-tmpfiles-clean.service has begun starting up.
May 23 07:53:11 archhost systemd[1]: Started Cleanup of Temporary Directories.
-- Subject: Unit systemd-tmpfiles-clean.service has finished start-up
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- Unit systemd-tmpfiles-clean.service has finished starting up.
-- The start-up result is done.
Last edited by AcousticBruce (2015-05-23 15:57:21)
graysky wrote:
Edit your /usr/local/bin/arch32 script, removing the reference to '/scratch' unless you actually have it on your system. That was a typo from me on the wiki which has now been corrected.
I removed /scratch from two places and made a copy of the wiki version just in case I was wrong. I still got these errors.
$ systemctl status -l arch32.service
● arch32.service - 32-bit chroot
Loaded: loaded (/etc/systemd/system/arch32.service; disabled; vendor preset: disabled)
Active: failed (Result: exit-code) since Sun 2015-05-24 08:53:49 EDT; 20s ago
Process: 2139 ExecStart=/usr/local/bin/arch32 start (code=exited, status=1/FAILURE)
Main PID: 2139 (code=exited, status=1/FAILURE)
May 24 08:53:49 archhost systemd[1]: Starting 32-bit chroot...
May 24 08:53:49 archhost systemd[1]: arch32.service: main process exited, code=exited, status=1/FAILURE
May 24 08:53:49 archhost systemd[1]: Failed to start 32-bit chroot.
May 24 08:53:49 archhost systemd[1]: Unit arch32.service entered failed state.
May 24 08:53:49 archhost systemd[1]: arch32.service failed
$ journalctl -xe
-- Unit arch32.service has begun starting up.
May 24 08:49:36 archhost systemd[1]: arch32.service: main process exited, code=exited, status=1/FAILURE
May 24 08:49:36 archhost systemd[1]: Failed to start 32-bit chroot.
-- Subject: Unit arch32.service has failed
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- Unit arch32.service has failed.
-- The result is failed.
May 24 08:49:36 archhost polkitd[663]: Unregistered Authentication Agent for unix-process:2103:877657 (system bus name :1.20, object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8) (disconnected from bus)
May 24 08:49:36 archhost systemd[1]: Unit arch32.service entered failed state.
May 24 08:49:36 archhost systemd[1]: arch32.service failed.
May 24 08:53:45 archhost polkitd[663]: Registered Authentication Agent for unix-process:2132:902936 (system bus name :1.24 [/usr/bin/pkttyagent --notify-fd 5 --fallback], object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8)
May 24 08:53:49 archhost polkitd[663]: Operator of unix-session:c1 successfully authenticated as unix-user:codeamend to gain TEMPORARY authorization for action org.freedesktop.systemd1.manage-units for system-bus-name::1.25 [systemctl start arch32] (owned by unix-user:codeamend)
May 24 08:53:49 archhost systemd[1]: Starting 32-bit chroot...
-- Subject: Unit arch32.service has begun start-up
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- Unit arch32.service has begun starting up.
May 24 08:53:49 archhost systemd[1]: arch32.service: main process exited, code=exited, status=1/FAILURE
May 24 08:53:49 archhost systemd[1]: Failed to start 32-bit chroot.
-- Subject: Unit arch32.service has failed
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- Unit arch32.service has failed.
-- The result is failed.
May 24 08:53:49 archhost systemd[1]: Unit arch32.service entered failed state.
May 24 08:53:49 archhost systemd[1]: arch32.service failed.
May 24 08:53:49 archhost polkitd[663]: Unregistered Authentication Agent for unix-process:2132:902936 (system bus name :1.24, object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8) (disconnected from bus)
May 24 08:56:04 archhost sudo[2156]: codeamend : TTY=pts/2 ; PWD=/opt/arch32/usr/local/bin ; USER=root ; COMMAND=/usr/bin/nano arch32
May 24 08:56:04 archhost sudo[2156]: pam_unix(sudo:session): session opened for user root by codeamend(uid=0)
May 24 08:56:21 archhost sudo[2156]: pam_unix(sudo:session): session closed for user root
May 24 08:56:24 archhost polkitd[663]: Registered Authentication Agent for unix-process:2160:918835 (system bus name :1.28 [/usr/bin/pkttyagent --notify-fd 5 --fallback], object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8)
May 24 08:56:28 archhost polkitd[663]: Operator of unix-session:c1 successfully authenticated as unix-user:codeamend to gain TEMPORARY authorization for action org.freedesktop.systemd1.manage-units for system-bus-name::1.29 [systemctl start arch32] (owned by unix-user:codeamend)
May 24 08:56:28 archhost systemd[1]: Starting 32-bit chroot...
-- Subject: Unit arch32.service has begun start-up
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- Unit arch32.service has begun starting up.
May 24 08:56:28 archhost systemd[1]: arch32.service: main process exited, code=exited, status=1/FAILURE
May 24 08:56:28 archhost systemd[1]: Failed to start 32-bit chroot.
-- Subject: Unit arch32.service has failed
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- Unit arch32.service has failed.
-- The result is failed.
May 24 08:56:28 archhost systemd[1]: Unit arch32.service entered failed state.
May 24 08:56:28 archhost systemd[1]: arch32.service failed.
May 24 08:56:28 archhost polkitd[663]: Unregistered Authentication Agent for unix-process:2160:918835 (system bus name :1.28, object path /org/freedesktop/PolicyKit1/AuthenticationAgent, locale en_US.UTF-8) (disconnected from bus) -
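The one concrete message in the first journal excerpt was "mount: mount point /opt/arch32/scratch does not exist". A check like the one below could be run before starting the service to list any bind-mount targets that are missing under the chroot. It is a minimal sketch: the function name `missing_targets` is hypothetical, and the directory list in the usage comment is only an example.

```shell
#!/bin/sh
# Print every target directory under the chroot root that does not exist.
# $1 is the chroot root; the remaining arguments are the paths to bind.
missing_targets() {
    root=$1
    shift
    for d in "$@"; do
        [ -d "$root$d" ] || printf '%s\n' "$root$d"
    done
}

# Example usage:
#   missing_targets /opt/arch32 /tmp /dev /dev/pts /home /scratch
```

An empty result means every target exists, so a remaining failure has to come from somewhere else in the script.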
I was going about the process to set up the 32-bit chroot so I could use Adobe Acrobat on my system, but systemctl failed to start the service. Output of "systemctl status arch32.service" is as follows:
Loaded: loaded (/etc/systemd/system/arch32.service; disabled; vendor preset: disabled)
Active: failed (Result: exit-code) since Fri 2015-05-29 18:10:29 EDT; 11s ago
Process: 1295 ExecStart=/usr/local/bin/arch32 start (code=exited, status=1/FAILURE)
Main PID: 1295 (code=exited, status=1/FAILURE)
May 29 18:10:29 the-enforcer systemd[1]: Starting 32-bit chroot...
May 29 18:10:29 the-enforcer systemd[1]: arch32.service: main process exited, code=exited, status=1/FAILURE
May 29 18:10:29 the-enforcer systemd[1]: Failed to start 32-bit chroot.
May 29 18:10:29 the-enforcer systemd[1]: Unit arch32.service entered failed state.
May 29 18:10:29 the-enforcer systemd[1]: arch32.service failed.
output of "journalctl -xe" is as follows
-- Subject: Unit arch32.service has failed
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- Unit arch32.service has failed.
-- The result is failed.
May 29 18:09:07 the-enforcer systemd[1]: Unit arch32.service entered failed state.
May 29 18:09:07 the-enforcer systemd[1]: arch32.service failed.
May 29 18:09:07 the-enforcer polkitd[309]: Unregistered Authentication Agent for unix-process:1242:62178 (system bus name :1.65, object path /org/freedesktop/PolicyKit1/Authen
May 29 18:09:07 the-enforcer sudo[1241]: pam_unix(sudo:session): session closed for user root
May 29 18:09:25 the-enforcer sudo[1258]: shaun : TTY=pts/0 ; PWD=/home/shaun ; USER=root ; COMMAND=/usr/local/bin/arch32 start
May 29 18:09:25 the-enforcer sudo[1258]: pam_unix(sudo:session): session opened for user root by shaun(uid=0)
May 29 18:09:25 the-enforcer sudo[1258]: pam_unix(sudo:session): session closed for user root
May 29 18:09:51 the-enforcer /usr/lib/gdm/gdm-x-session[517]: Activating service name='org.gnome.Terminal'
May 29 18:09:51 the-enforcer /usr/lib/gdm/gdm-x-session[517]: Successfully activated service 'org.gnome.Terminal'
May 29 18:10:02 the-enforcer sudo[1281]: shaun : TTY=pts/0 ; PWD=/home/shaun ; USER=root ; COMMAND=/usr/local/bin/arch32 stop
May 29 18:10:02 the-enforcer sudo[1281]: pam_unix(sudo:session): session opened for user root by shaun(uid=0)
May 29 18:10:03 the-enforcer sudo[1281]: pam_unix(sudo:session): session closed for user root
May 29 18:10:29 the-enforcer sudo[1289]: shaun : TTY=pts/0 ; PWD=/home/shaun ; USER=root ; COMMAND=/usr/bin/systemctl start arch32.service
May 29 18:10:29 the-enforcer sudo[1289]: pam_unix(sudo:session): session opened for user root by shaun(uid=0)
May 29 18:10:29 the-enforcer polkitd[309]: Registered Authentication Agent for unix-process:1290:70456 (system bus name :1.66 [/usr/bin/pkttyagent --notify-fd 5 --fallback], o
May 29 18:10:29 the-enforcer systemd[1]: Starting 32-bit chroot...
-- Subject: Unit arch32.service has begun start-up
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- Unit arch32.service has begun starting up.
May 29 18:10:29 the-enforcer systemd[1]: arch32.service: main process exited, code=exited, status=1/FAILURE
May 29 18:10:29 the-enforcer systemd[1]: Failed to start 32-bit chroot.
-- Subject: Unit arch32.service has failed
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/mailman/listinfo/systemd-devel
-- Unit arch32.service has failed.
-- The result is failed.
May 29 18:10:29 the-enforcer systemd[1]: Unit arch32.service entered failed state.
May 29 18:10:29 the-enforcer systemd[1]: arch32.service failed.
May 29 18:10:29 the-enforcer sudo[1289]: pam_unix(sudo:session): session closed for user root
May 29 18:10:29 the-enforcer polkitd[309]: Unregistered Authentication Agent for unix-process:1290:70456 (system bus name :1.66, object path /org/freedesktop/PolicyKit1/Authen
lines 1322-1364/1364 (END)
Here's what my arch32.service file looks like:
[Unit]
Description=32-bit chroot
[Service]
Type=oneshot
RemainAfterExit=yes
ExecStart=/usr/local/bin/arch32 start
ExecStop=/usr/local/bin/arch32 stop
[Install]
WantedBy=multi-user.target
Here's what my /usr/local/bin/arch32 file looks like
#!/bin/bash
## User variables.
MOUNTPOINT=/opt/arch32
## Set MANAGEPARTITION to any value if /opt/arch32 resides on a separate
## partition and not mounted by /etc/fstab or some other means.
## If /opt/arch32 is part of your rootfs, leave this empty.
MANAGEPARTITION=
## Leave USEDISTCC empty unless you wish to use distccd from within the chroot.
USEDISTCC=
DISTCC_SUBNET='10.9.8.0/24'
## PIDFILE shouldn't need to be changed from this default.
PIDFILE=/run/arch32
start_distccd() {
    [[ ! -L "$MOUNTPOINT"/usr/bin/distccd-chroot ]] &&
        ln -s /usr/bin/distccd "$MOUNTPOINT"/usr/bin/distccd-chroot
    DISTCC_ARGS="--user nobody --allow $DISTCC_SUBNET --port 3692 --log-level warning --log-file /tmp/distccd-i686.log"
    [[ -z "$(pgrep distccd-chroot)" ]] &&
        linux32 chroot "$MOUNTPOINT" /bin/bash -c "/usr/bin/distccd-chroot --daemon $DISTCC_ARGS"
}
stop_distccd() {
    [[ -n "$(pgrep distccd-chroot)" ]] &&
        linux32 chroot "$MOUNTPOINT" /bin/bash -c "pkill -SIGTERM distccd-chroot"
}
case $1 in
    start)
        [[ -f "$PIDFILE" ]] && exit 1
        if [[ -n "$MANAGEPARTITION" ]]; then
            mountpoint -q $MOUNTPOINT || mount LABEL="arch32" $MOUNTPOINT
        fi
        dirs=(/tmp /dev /dev/pts /home)
        for d in "${dirs[@]}"; do
            mount -o bind $d "$MOUNTPOINT"$d
        done
        mount -t proc none "$MOUNTPOINT/proc"
        mount -t sysfs none "$MOUNTPOINT/sys"
        touch "$PIDFILE"
        [[ -n "$USEDISTCC" ]] && start_distccd
        ;;
    stop)
        [[ ! -f "$PIDFILE" ]] && exit 1
        [[ -n "$USEDISTCC" ]] && stop_distccd
        if [[ -n "$MANAGEPARTITION" ]]; then
            umount -R -A -l "$MOUNTPOINT"
        else
            dirs=(/home /dev/pts /dev /tmp)
            [[ -n "$USEDISTCC" ]] && stop_distccd
            umount "$MOUNTPOINT"/{sys,proc}
            for d in "${dirs[@]}"; do
                umount -l "$MOUNTPOINT$d"
            done
        fi
        rm -f "$PIDFILE"
        ;;
    *)
        echo "usage: $0 (start|stop)"
        exit 1
        ;;
esac
Any ideas as to what's going on? -
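One possible explanation, offered as a hypothesis rather than a confirmed diagnosis: in the posted script the last command of the start branch is `[[ -n "$USEDISTCC" ]] && start_distccd`. With USEDISTCC empty, that test fails, and its status would become the script's exit status, which systemd reports as status=1/FAILURE even though the mounts succeeded. The reduction below uses hypothetical function names and the portable `[ ]` test, which behaves the same way for this purpose.

```shell
#!/bin/sh
# Reduced stand-in for the end of the "start" branch; USEDISTCC is empty,
# as in the posted script.
USEDISTCC=

# Same shape as the posted script: a short-circuit test is the last command.
start_as_posted() {
    [ -n "$USEDISTCC" ] && echo "start distccd"
}

# Alternative shape: an if statement returns 0 when its condition is false.
start_with_if() {
    if [ -n "$USEDISTCC" ]; then
        echo "start distccd"
    fi
}

start_as_posted && status=0 || status=$?
echo "as posted: exit status $status"   # prints 1
start_with_if && status=0 || status=$?
echo "with if: exit status $status"     # prints 0
```

If an earlier attempt left /run/arch32 behind, the posted start branch would also exit 1 at its first line, before mounting anything; the logs shown here cannot tell the two cases apart.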
Failover Cluster Core Resources question on a Windows 2008R2 three node cluster
We have a three-node Windows 2008R2 cluster with SQL Server 2008 R2 as a clustered resource. There are three resource groups in this cluster: 1) Available Storage, 2) Cluster Group, 3) SQL Server. The Available Storage and SQL Server resource groups reside on one node while the Cluster Group resides on another. The only resources in the Cluster Group are the cluster name and IP. I'd like to fail over the Cluster Group so that it is on the same node as everything else.
I'm not sure what the implications of doing this are. Failing over the Cluster Group shouldn't have any impact on the SQL Server resource group, correct? Or would there be an interruption to SQL because of the failover of the Cluster Group? It's a critical application, and I'm trying to gather information for a change request; I know I'm going to be asked whether this impacts the production database and everybody using it.
Thanks
RG
No, that should not impact anything. The cluster group is completely separate from the SQL group.
. : | : . : | : . tim -
Facing prcr-1079 failed to start resource ora.orcl.db problem
Hi all,
I am installing an Oracle database with ASM.
I have installed the ASM libraries and created an ASM instance with the diskgroup successfully mounted. I started the ASM instance as the oragrid user.
Now I ran the DBCA command to create a database named ORCL as the oracle user.
I followed the procedure, and during the creation phase (at 80%) I got multiple errors:
prcr-1079 failed to start resource ora.orcl.db
ORA-01031: insufficient privileges
Is there anything wrong, or do I need to grant the oracle user system privileges over directories?
Thanks for help
Hi,
Please check the following link: PRCR-1079: Failed to start resource ora.test.db, ORA-01031,ORA-2674 | Just Innovation
# usermod -G asmadmin,asmdba,asmoper,dba grid
Now you will notice the Privileges are changed
# id grid
uid=501(grid) gid=1000(oinstall) groups=1000(oinstall),1100(dba),1300(asmadmin),1400(asmdba),1500(asmoper)
Now try to runInstaller again!
Thank you -
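To confirm the group change suggested above took effect, the output of `id` can be compared against the groups the answer lists. The helper below is a minimal sketch with a hypothetical name, `missing_groups`; it takes the group list as a string so it can be fed from `id -Gn`. Note that `usermod -G` replaces the whole supplementary group list, so any group left out of the command is dropped.

```shell
#!/bin/sh
# Print each required group that is absent from a space-separated list.
# $1 is the list of groups the user has; the rest are the required groups.
missing_groups() {
    have=" $1 "
    shift
    for g in "$@"; do
        case "$have" in
            *" $g "*) ;;                 # group present, nothing to report
            *) printf '%s\n' "$g" ;;     # group missing
        esac
    done
}

# Example usage, with the user and groups named in the answer:
#   missing_groups "$(id -Gn grid)" asmadmin asmdba asmoper dba
```

An empty result means the account already has every group listed.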
PRCR-1079 Failed to start resource ora.rac.db - during installation
Hi
After successful installation of Grid Infrastructure I proceeded with the database installation on the clusterware, and at the stage when the installer was creating the clone database I got the following errors (this was my 2nd attempt and I got the same errors both times):
Errors:
PRCR-1079 : Failed to start resource ora.rac.db
ORA-01092 : ORACLE instance terminated. Disconnection forced
ORA-00704 : bootstrap process failure
ORA-00604 : error occurred at recursive SQL level 2
ORA-01578 : ORACLE data block corrupted (file # 1, block # 5505)
ORA-01110 : data file 1:'+DATA/rac/datafile/system.256.799676855'
Process ID : 23498
Session ID : 63 Serial number 3
CRS-2674 Start of 'ora.rac.db' on 'rac2' failed
CRS-2632 There are no more servers to try to place resource 'ora.rac.db' on that would satisfy its placement policy
There are no logs on that node (rac2)
I am running Oracle Linux 5.4 64 bit
As mentioned above, this was my 2nd fresh attempt and I got the same errors both times. Please let me know what the problem is, as the rac2 is replica of rac2 in VMWare.
Thanks for your help
Rgds
T
Hi
I tried again for the 3rd time and got the same error again; this time I rebuilt node 2. Can someone please help me with this issue? Why does it keep failing on node 2 at the same stage for the 3rd time in a row?
Also, please help me clone the database manually from node 1 to node 2 so I don't have to reinstall it again; there must be ways to do it.
Thanks for your help in advance
Rgds
T -
Hi DBA's.
Im, running
Finalizing Installation 96% the following Warning:
[Thread-288] [ 2010-01-21 14:28:57.456 ARST ] [CRSNative.internalStartResource:352] Failed to start resource: Name: ora.racdb.db, node: null, filter: null, msg CRS-2674:
Start of 'ora.racdb.db' on 'linux2' failed
CRS-2678: 'ora.racdb.db' on 'linux2' has experienced an unrecoverable failure
CRS-0267: Human intervention required to resume its availability.
CRS-5807: Agent failed to process the message
ORA-01034: ORACLE not available
ORA-27101: shared memory realm does not exist
Linux Error: 2: No such file or directory
Process ID: 0
Session ID: 0 Serial number: 0
[Thread-288] [ 2010-01-21 14:28:57.457 ARST ] [PostDBCreationStep.executeImpl:828] Exception while Starting with HA Database Resource PRCR-1079 : Failed to start resource ora.racdb.db
CRS-2674: Start of 'ora.racdb.db' on 'linux2' failed
CRS-2678: 'ora.racdb.db' on 'linux2' has experienced an unrecoverable failure
CRS-0267: Human intervention required to resume its availability.
CRS-5807: Agent failed to process the message
ORA-01034: ORACLE not available
ORA-27101: shared memory realm does not exist
Linux Error: 2: No such file or directory
Process ID: 0
Session ID: 0 Serial number: 0
oracle$ dbca
Hi...
It is OK now.
I did:
srvctl start instance -d racdb -i racdb2
[oracle@linux1 oracle]$ su - grid -c "crsctl status resource -w \"TYPE co 'ora'\" -t"
Password:
NAME           TARGET  STATE        SERVER       STATE_DETAILS
Local Resources
ora.CRS.dg
               ONLINE  ONLINE       linux1
               ONLINE  ONLINE       linux2
ora.FRA.dg
               ONLINE  ONLINE       linux1
               ONLINE  ONLINE       linux2
ora.LISTENER.lsnr
               ONLINE  ONLINE       linux1
               ONLINE  ONLINE       linux2
ora.RACDB_DATA.dg
               ONLINE  ONLINE       linux1
               ONLINE  ONLINE       linux2
ora.asm
               ONLINE  ONLINE       linux1       Started
               ONLINE  ONLINE       linux2       Started
ora.eons
               ONLINE  ONLINE       linux1
               ONLINE  ONLINE       linux2
ora.gsd
               OFFLINE OFFLINE      linux1
               OFFLINE OFFLINE      linux2
ora.net1.network
               ONLINE  ONLINE       linux1
               ONLINE  ONLINE       linux2
ora.ons
               ONLINE  ONLINE       linux1
               ONLINE  ONLINE       linux2
Cluster Resources
ora.LISTENER_SCAN1.lsnr
      1        ONLINE  ONLINE       linux1
ora.linux1.vip
      1        ONLINE  ONLINE       linux1
ora.linux2.vip
      1        ONLINE  ONLINE       linux2
ora.oc4j
      1        OFFLINE OFFLINE
ora.racdb.db
      1        ONLINE  ONLINE       linux1       Open
      2        ONLINE  ONLINE       linux2       Open
ora.scan1.vip
      1        ONLINE  ONLINE       linux1
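For a listing this long, it can help to pull out only the resources that are not ONLINE. The following is a minimal Python sketch, assuming the output of `crsctl status resource -t` has been captured as text; the function name `offline_resources`, the set of state keywords, and the sample excerpt are illustrative assumptions, not part of any Oracle tool.

```python
# Assumed set of values that can appear in the TARGET and STATE columns.
STATES = {"ONLINE", "OFFLINE", "INTERMEDIATE", "UNKNOWN"}


def offline_resources(text):
    """Return (resource, server) pairs whose STATE column is not ONLINE."""
    result = []
    current = None
    for raw in text.splitlines():
        fields = raw.split()
        if not fields:
            continue
        if fields[0].startswith("ora."):
            current = fields[0]  # a resource name line starts a new group
            continue
        if fields[0].isdigit():
            fields = fields[1:]  # cluster resources carry an instance number
        if current and len(fields) >= 2 and fields[0] in STATES and fields[1] in STATES:
            state = fields[1]
            server = fields[2] if len(fields) > 2 else ""
            if state != "ONLINE":
                result.append((current, server))
    return result


# Short excerpt of the listing above, used only as sample input.
sample = """ora.asm
ONLINE ONLINE linux1 Started
ora.gsd
OFFLINE OFFLINE linux1
OFFLINE OFFLINE linux2
ora.oc4j
1 OFFLINE OFFLINE
ora.racdb.db
1 ONLINE ONLINE linux1 Open
"""

for resource, server in offline_resources(sample):
    print(resource, server or "(no server)")
```

In the listing above, only ora.gsd and ora.oc4j are OFFLINE; both instances of ora.racdb.db are ONLINE and Open.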
Thanks. -
ORA-12500 TNS listener failed to start the dedicated server process.
Hi all,
I am getting the error message below when I try to connect to the database through DB Console.
OS: Windows Server
Database: 10g R1
ORA-12500 TNS listener failed to start the dedicated server process.
Kindly help me out.
Thanks in advance
Edited by: rajaryan on Sep 17, 2009 4:46 AM
You are running short of resources. Check v$resource_limit. You may also use the orastack utility for minimizing the memory utilized by oracle.
http://hrivera99.blogspot.com/2008/01/orastack.html
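The v$resource_limit check could look roughly like the sketch below. The query uses the standard columns of that view; the helper `near_limit`, the 90% threshold, and the sample rows are assumptions made for illustration and do not come from this thread.

```python
# Query against Oracle's V$RESOURCE_LIMIT view; run it in SQL*Plus or any client.
RESOURCE_LIMIT_SQL = """
SELECT resource_name, current_utilization, max_utilization, limit_value
  FROM v$resource_limit
 WHERE resource_name IN ('processes', 'sessions')
"""


def near_limit(rows, threshold=0.9):
    """Return names of resources whose peak usage reached `threshold` of the limit.

    `rows` are (resource_name, current_utilization, max_utilization, limit_value)
    tuples as returned by the query above; limit_value may be 'UNLIMITED'.
    """
    flagged = []
    for name, _current, peak, limit in rows:
        limit_text = str(limit).strip()
        if not limit_text.isdigit():
            continue  # UNLIMITED or otherwise non-numeric: nothing to compare
        limit_num = int(limit_text)
        if limit_num > 0 and int(peak) >= threshold * limit_num:
            flagged.append(name)
    return flagged


# Made-up sample rows for illustration only.
sample_rows = [("processes", 148, 150, "150"), ("sessions", 90, 120, "170")]
print(near_limit(sample_rows))
```

If max_utilization for processes or sessions has reached its limit_value, the instance has probably hit that limit at some point, which would be consistent with the "short of resources" explanation.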
Regards,
S.K. -
'ORA-12500: TNS: Listener failed to start a dedicated server process'
Hi,
When connecting from one database to another, users get the error below when they issue a SELECT statement; it takes around 30 minutes and then shows
'ORA-12500: TNS: Listener failed to start a dedicated server process'...
What could be the issue?
Thanks,
Kr.
If the database they are trying to connect to is running, then check listener.log ($ORACLE_HOME/network/log).
There are probably not enough system resources; check the operating system logs as well (/var/log or /var/adm on Unix, the event logs on Windows).
Look into the database alert.log as well. -
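When going through listener.log and alert.log as suggested, a quick tally of the error codes can show which failure dominates. This is a minimal Python sketch; the regular expression, the function name `count_error_codes`, and the sample lines are assumptions for illustration, and the file path in the usage comment is hypothetical.

```python
import re
from collections import Counter

# Matches Oracle-style codes such as ORA-12500, TNS-12518 or CRS-2674.
CODE_PATTERN = re.compile(r"\b(?:ORA|TNS|CRS)-\d{4,5}\b")


def count_error_codes(lines):
    """Count every ORA-/TNS-/CRS- code found in an iterable of log lines."""
    counts = Counter()
    for line in lines:
        counts.update(CODE_PATTERN.findall(line))
    return counts


# Made-up sample lines; with a real log, pass the open file object instead, e.g.
#   with open("/path/to/listener.log", errors="replace") as fh:  # hypothetical path
#       counts = count_error_codes(fh)
sample_lines = [
    "ORA-12500: TNS: Listener failed to start a dedicated server process",
    "retrying connection after ORA-12500",
    "CRS-2674: Start of 'ora.racdb.db' on 'linux2' failed",
]

for code, hits in count_error_codes(sample_lines).most_common():
    print(code, hits)
```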
Hello Gurus,
I have installed Windows Server 2012 RTM with Hyper-V. I have already created a virtual machine with a virtual Fibre Channel adapter connected to a physical one. Sometimes when I restart the virtual machine, it fails to start again and the following errors appear in the event viewer of the host:
Error ID 21502
'Virtual Machine xyz' failed to start.
'xyz' failed to start. (Virtual machine ID number)
'xyz' Synthetic FibreChannel Port: Failed to start reserving resources with Error 'Insufficient system resources exist to complete the requested service.' (0x800705AA). (Virtual machine ID number)
'xyz': Operation for virtual port (C003FF18F98C000E) failed with an error: No physical port available to satisfy the request (Virtual machine ID number).
Error ID 1069
Cluster resource 'Virtual Machine xyz' of type 'Virtual Machine' in clustered role 'xyz' failed. The error code was '0x5aa' ('Insufficient system resources exist to complete the requested service.').
Based on the failure policies for the resource and role, the cluster service may try to bring the resource online on this node or move the group to another node of the cluster and then restart it. Check the resource and group state using Failover Cluster Manager or the Get-ClusterResource Windows PowerShell cmdlet.
I appreciate your help.
Ashraf
Dear All,
Subject: Need to create a file cluster in guest VMs (host machine: Windows 2012 R2), using MSA P2000 storage with a QLogic HBA (HP 8Gb PCIe Host Bus Adapter)
Note: It's a direct connection from the HBA to the HP MSA P2000 (no SAN switch in between).
Currently I am using a QLogic HBA (HP 81Q 8Gb 1-port PCIe Fibre Channel Host Bus Adapter, part no. AK344A).
I am unable to create a vPort, either using Microsoft Hyper-V (Win 2012 R2) or using the QConverge Console utility.
When trying to create it in Microsoft Hyper-V (Win 2012 R2), I received the below error.
Secondly, when I try to create a vSAN switch using the QLogic QConverge utility, the same error pops up.
Currently I am using the following latest versions of the QLogic firmware and drivers:
HBA - Running Firmware Version: 7.00.02
HBA - Driver Version: STOR Miniport 9.1.11.24
If anybody has had the same issue, could you please update me as early as possible?
Regards,
Mirza