Failover in BODI 11.7.3 AIX
Can a failover be achieved by Server Groups rather than HACMP on AIX. The latter requres running jobs in Recovery mode, which brings a lot of overhead. What happens if a Job server 1 in a server group stops responding in the middle of the job, will the Job Server 2 of the same server group take over the processing?
Sorry experts ,
My problem is solved . Job server associated to the selected port is not working . So the job did not get started.
Thanks for your suport.
Regards,
Dirasa
Similar Messages
-
BODS 3.1 and Teradata 12 : AL_ENGINE process hanging issue
Hi All,
We are loading into Teradata 12 target tables using BODS 3.1 and sometime a BODS process is left hanging on Teradata side which blocks the target table. Lock has to be removed by Teradata DBA before the job can be executed sucessfully. On the Teradata side the log shows its a AL_ENGINE process via port 3326 using javaw. on BODS side as the job hangs we kill the job and looking at the processes on BODS server no AL_ENGINE process from the repository from where the ETL job was executed from is hanging. BODS is installed on AIX OS and connectivity to Teradata 12 is via an ODBC driver.
The occurence of this issue is highly erratic and not confined to one specific ETL job.
Majority of the job executions are being initiated from the BODS management console.
Thanks,
ranaCheck licensing on 3750x stack. suspect master has IP services, other member have only IP Base.
-
Issues with DB/CI and DI @ different levels
SAP gurus - Hi
Recently migrated my AIX servers (from 5.3 to 6.1) for the CI and DI instances and ran into issues -
Here is the order of events..
1- First migrated the DI instance, applications started alright, active in SAP
2- Then I migrated the inactive node of the CI HACMP cluster - no impact to service
3- Then I did the fail-over from to migrate the other HACMP node, at this point after the cluster failover (the CI is also at AIX 6.1 now), the DI instance is not showing active in SAP GUI anymore, although the SAP is running on both CI and DI at the OS level.
4- We tried to stop/start just the DI, that didn't work
5- Then we stopped/started both DI and CI in the correct order (CI first then DI), still not good
6- Finally rebooted the DI server, started SAP after reboot and then it showed up
Luckily this was in the test environment, I have to progress the change to Production and I cannot imagine having to reboot the Prod DI instances, that is a big no no.
Does this ring a bell? anyone, anybody seen this before? or know what might have caused the app servers to disappear from SAP? anything we can do proactively to avoid this situation? or avoid reboot post-actively?I'm not sure what you mean by "DI instance is not showing active in SAP GUI anymore", how did you see that ?
What error you captured in screen for fail logon ?
Is this issue reproduciable after migration completed to 6.1 ?
For improper instance stop/start, dev_* trace files / sapstart/stderr logs are the one the look at as well to find out the reason.
Regards,
Vincent -
SAP ECC 6.0 SR3 Cluster failover not working in AIX with DB2 UDB V9.1 FP6
Hi Gurus,
We have installed the SAP ECC 6.0 SR3 High Availability with DB2 UDB V9.1 FP6 in AIX cluster environment.
After installation we are doing the cluster fail test.
Node A
Application Server
Mount Points:
/sapmnt/<SID>
/usr/sap/<SID>
/usr/sap/trans
Node B
Database Server
Mount Points:
/db2//<SID>
The procedure followed to do the cluster failover:
We have down the cluster on Node A and all the resources of the Node A has been moved to Node B.
On Node B when we issued a command to start the SAP. It says u201Cno start profiles foundu201D
WE have down the cluster on Node B and moved the Resource from Node B to Node A . There the db2 User IDu2019s are not available. We have crated the user Idu2019s manually on Node A. however it did not work.
Please suggest the procedure to start the sap in cluster failover.
Best Regards
SijaHi Sija,
Can i have detailed scenario in your cluster configuration.
Means you are saying that going to start cluster package manually, if it is right please make sure that you had the same copy of start, instance profiles of NodeA to Node B. Means you need to maintain two startup, two instance profiles for both nodes. In a normal situation it will picik the profile of node A to start databse from A node. But in a failover situation it will not pick node A profile to start, it should pick Node B s profiles.
Just make a copy from node A and change the profile name accordingly to Node b. Then try to restart.
Regards
Nick Loy -
BODI XI crashes with signal 15 when attempting to select from DB2 on AIX.
I have inherited the task, started 18 months ago, of upgrading from DI 6.1 to XI release 2. My predecessor set up an AIX evalution box to test the impact of the changes to DI, especially the handling of Nulls. As soon as we try extracting DB2 data on this box (using a query transform) we get a signal 15 error, and a core dump. The query can be as simple as selecting a single column from a single table into a template table with no where clause. An SQL function in a script also crashes.
DB2 connect (version 7.1) is on the AIX evaluation box, and is working - I can go to the DB2 prompt, connect to a database and select data. Designer works with DB2, and I can create a new datastore, import metadata, and view data. Oracle to Oracle and oracle to XML work fine on the job server, DB2 to anything optimises the query then falls over.
We thought that it may be a problem with the version of DB2 on the mainframe, but I've tried against DB2 OS/390 7.1.2 and against DB2/6000 7.2.4, both with the same result.
As far as I can see all the environmental variables on the AIX box are OK. Most are the same as on the functioning development box, except for changes that are part of the XI install. One difference is in the libpath - the working box includes '/usr/opt/db2_08_01/lib', which does not exist!!! The evalution box has '/usr/opt/db2_07_01/lib' instead, which does exist. - can a valid reference instead of an invalid one cause a problem?
BODI 6.1 continues to work fine with DB2, as did 5.5 and 5.1.
Anyone seen anything similar? Anyone got any clues as to what is happening?Which version of Data Integrator are you exactly migrating to ? It this is DI 11.7.x (XI Release 2 Accelerated) -> a big difference with DI 6.1 is that DI 11.7 is a 64-bit application now, hence also needs 64-bit versions of the database drivers. Could that be the issue ?
I would definitely look into that direction to try to locate what's going on. You already indicated there is a difference in the libpath for the working and not-working environment. Maybe do some cleanup in the libpath to make sure DI will pick u the correct version.
You could also open a case with support, they will be able to give you a faster response time and work with you to troubleshoot the issue. -
Need Suggestion on DB2 HA Setup on AIX 6.1
Dear All,
We are configuring HA for PRD System having below configuration:
OS: AIX 6.1
Database: DB2 9.1 Fix pack 7
System: ERP 6.0 EHP4
What we have done so far:
We have installed DB & CI on node 1 (primary node) and ASCS & Dialog Instance on node 2 (secondary node). ASCS instance on node 2 and DB on node 1 is on movable resource group. CI on node 1 and Dialog Instance on node 2 is local. So, e.g. if there is failover on node 1 then DB will move on node 2 and Dialog Instance will take over service provided by CI.
Now my doubts are:
1) I have installed DB2 in resource group and nothing local. But If I am checking redbook for HA on DB2 then it is mentioned there that we should install db2 binaries local on both nodes. Is it mandatory because my understanding is that if whole db2 is moving (log, data and binaries) then it should start there ?
2) I am using the concept of service IP's to install this HA. When I installed CI and DB on node 1 then i used hostname associated with service ip of node 1. Same when i installed ASCS and Dialog instance then I used hostname associated with service IP of node 2. And Even If I am rebooting my node 1 then all resources are going to node 2 and I am also getting hostname and service IP of node 1 on node 2. Then I think I need not to change hostname of node 1 when resources are there on node 2. Am I right ?
3) Is it necessary to catalog database on node 2 ?
4) Am I need to change host entry in db2nodes.cfg when db will move on node 2 ? My understains is that I need not to change hostname in db2nodes.cfg as I am getting ip address and hostname of node 1 on node 2. Since IP are service ip's so the host name associates with them.
Please suggest.
Thanks
SunnyNo need to install binaries on node 2 if it is on movable resource group like in my case.
Database catalog is not required on other node.
Since only IP is moving but hostname is not moving. So need to change change host name in db2nodes.cfg file.
Thanks
Sunny -
Strange error when starting tnslsnr on AIX machine
Hello!
I have the following system:
there are two nodes (say, node1 and node2) in HACMP cluster (OS is AIX 5.2). ORACLE runs on node2 and our software - on node1 (in normal state). Also there is an external storage, where ORACLE's and software's file systems are situated.
So, in normal state ORACLE runs on node2 and software - on node1. ORACLE's filesystems are mounted on node2 and software's filesystem - on node1.
There can be two more states of working system:
1. when node1 fails and all it's resources are acquired by node2 (filesystems and IP). In this state ORACLE and software are executed on node2.
2. when node2 fails and it's IP and filesystems are acquired by node1. ORACLE and software are executed on node1.
So, in failover state no.2 everything works ok. In failover state no.1 and in normal state i get an error when i try to run tnslsnr:
oracleuser$> lsnrctl start
LSNRCTL for IBM/AIX RISC System/6000: Version 9.2.0.6.0 - Production on 26-SEP-2005 14:43:58
Copyright (c) 1991, 2002, Oracle Corporation. All rights reserved.
Starting /oracle/MFP/920_64/bin/tnslsnr: please wait...
TNSLSNR for IBM/AIX RISC System/6000: Version 9.2.0.6.0 - Production
System parameter file is /oracle/MFP/920_64/network/admin/listener.ora
Log messages written to /oracle/MFP/920_64/network/log/listener.log
Listening on: (DESCRIPTION=(ADDRESS=(PROTOCOL=ipc)(KEY=EXTPROC)))
Error listening on: (DESCRIPTION=(ADDRESS=(PROTOCOL=TCP)(HOST=prdr3dbt)(PORT=1527)))
TNS-12545: Connect failed because target host or object does not exist
TNS-12560: TNS:protocol adapter error
TNS-00515: Connect failed because target host or object does not exist
IBM/AIX RISC System/6000 Error: 13: Permission denied
Listener failed to start. See the error message(s) above...
oracleuser$
When i edit listener.ora (replace "prdr3dbt" by corresponding IP-address) and try to start tnslsnr again, the startup procedure hangs and does not finish for a long time (it hangs until I press CONTROL-C key combination) and i get a huge amount of such error:
TNS-12531: TNS:cannot allocate memory
26-SEP-2005 15:17:50 * 12531
TNS-12531: TNS:cannot allocate memory
26-SEP-2005 15:17:50 * 12531
TNS-12531: TNS:cannot allocate memory
26-SEP-2005 15:17:50 * 12531
TNS-12531: TNS:cannot allocate memory
26-SEP-2005 15:17:50 * 12531
TNS-12531: TNS:cannot allocate memory
26-SEP-2005 15:17:50 * 12531
TNS-12531: TNS:cannot allocate memory
26-SEP-2005 15:17:50 * 12531
TNS-12531: TNS:cannot allocate memory
26-SEP-2005 15:17:50 * 12531
TNS-12531: TNS:cannot allocate memory
26-SEP-2005 15:17:50 * 12531
TNS-12531: TNS:cannot allocate memory
I am sure that there is nothing wrong with oracle's files, becouse they are on EXTERNAL storage and the system doesn't work only in two of three states (in one of states it works fine).'TNS-12531: TNS:cannot allocate memory' may be misleading, it seems to be a permission problem (see also IBM/AIX RISC System/6000 Error: 13: Permission denied). A possible reason is:
Oracle (more specific the listener) is unable to read /etc/hosts, because of permission problems. So host resolution is not possible. -
HA Installation of ERP EHP4 on AIX and DB2
Dear All,
I have to do a HA installation of ERP EHP4 system on AIX and DB2. I have followed the standard inst.guide from SAP but I want more information. Can anyone help me in understanding how to go about installing HA in detail. I would be very grateful if anyone can provide me a guide/document of what and how they have done. Can you pls explain me the steps.
I also have few questions
1. Shall i do a Central installation or distributed installation ( meaning CI, ASCS and DB in one server failover to other; CI,ASCS on one server and DB running on different server which in case of failure will move to other server)
2. I know that OS team will do the cluster configuration but is there any other software to be bought for an automatic switchover in case of failures?
3. Mainly, as a BASIS consultant what is our role in HA configuration? What file systems should be defined and nfs mounted?
Request you to take time and answer. Thanks in Advance.
Lastly, i hope this is the right forum. If not , please let me know.Hi,
>
> 1. Shall i do a Central installation or distributed installation ( meaning CI, ASCS and DB in one server failover to other; CI,ASCS on one server and DB running on different server which in case of failure will move to other server)
>
I think you did not get basic of HA installation. Its neither central nor distributed installation. You need to install CI and Dialog Instance locally on 2 nodes. DB and ASCS instance on the disk that will move across 2 nodes.
> 2. I know that OS team will do the cluster configuration but is there any other software to be bought for an automatic switchover in case of failures?
>
OS team will do file system config so that file system having ASCS and DB should move across 2 nodes. And rest of the file system should be local.
After installation making cluster movement automatic both OS and SAP Basis guy have to work together.
> 3. Mainly, as a BASIS consultant what is our role in HA configuration? What file systems should be defined and nfs mounted?
>
I would say primary role in HA installation is of Basis guy. Because he is the person who will decide what approach will you take to install HA either active-active or active-passive. Its you on whom requirement OS guy will work.
Thanks
Sunny -
Data Guard Broker and Cold Failover clusters
Hi,
I wanted to use Data Guard Broker to control primary/standby systems on AIX clusters but found that without Oracle clusterware it is not supported on failover of a database to another node (unless you drop and recreate the configuration which in my opinion is not support at all!).
The 11g documentation states that DG Broker offers support for single instance databases configured for HA using Oracle Clusterware and "cold failover clusters".
Does anyone know whether this support for cold failover clusters in 11g means "proper support" i.e the configuration detects that the hostname has changed and automatically renames it so it continues to work?, or is the support in 11g the same as that in 10g?.
Thanks,
AndyHi Jan,
We already use virtual hostnames.
When I set up the configuration the hostname for the databases default to the server name. The only way I know of changing this is to disable the configuration, then use the "edit database...set property" command but when I enable the configuration again it reverts back to its previous value.
regards,
Andy -
Hello forum users
I installed Oracle 9i (9.2.0.1) on AIX 5.2 ML7 in a HACMP environment.
Before installing Oracle I created users & groups via "smitty hacmp - Users & groups" so I have same users on each node.
On each node i installed Oracle on internal disk drives and i have an external drive for failover with my database files.
When I try to upgrade to Oracle 9.2.0.6 with Oracle Universal Installer I have a screen "Selected Node" with :
- my first server PRDWSS2 : Available
- my second server PRDWSS3 : Unreachable
I tried to stop HACMP on second server but i have the same message.
I'd like to upgrade on each node separately.
Is anyone know how to do that ?
Thanks for any help
GuillaumeHi Laurent,
Thanks for your answer.
I have just Failover Cluster.
Do you first install Oracle on each server and then install HACMP or install HACMP and then Oracle ?
I cannot ignore messages because I have :
PRDWSS2 Available
PRDWSS3 Unreachable
Button "install" is unavailable ...
Do you know if i can force "Universal Installer" to only run on available node ?
It's very difficult to find ressources on HACMP/AIX 5.2 with Oracle 9i.
Guillaume -
Do we know of any problems with BODS and Oracle Real Application Clusters architecture? For example, a RAC TNS entry would specify one service name but multiple hosts for load balancing and failover. In BODS the connections are specified in many places (web console, datastores, job server). As far as I remember in some of those places we do have to enter the host name. Would this disqualify BODS from benefiting from RAC or would it (even worse) cause problems? Any official note by SAP on BODS and Oracle RAC?
Thanks.Hi Mahir,
As far as I know it's only the web console that requires the hostname, however I am using DS XI 3.1 and when selecting database type Oracle in the web console there is a checkbox for 'Oracle RAC'. If you check this box then you do not require to enter the hostname. -
Failover - How to achieve a transparent failover using SQLPlus
AIX 5.3 Oracle Clusterware 10.2.0.4.0 Oracle Enterprise Edition 10.2.0.4.0
This is the behavior I see from an Oracle Client session which is to be expected if I read RAC: Frequently Asked Questions [ID 220970.1] see below
(1) SQLPlus session connected to NodeA
(2) NodeA - Clusterware services stopped
(3) NodeA-vip has failed over to Node B
(4) SQLPlus session receives an error
(5) SQLPlus establish new connection to NodeA-vip
My question is how is a transparent SQLPLus session failover achieved as illustrated in [ID 339107.1] see below
*** Dedicated Connections to a Migrated VIP Can Lose their Connection after the VIP is Switched Back [ID 339107.1] ***
SQL> select instance_name from v$instance;
INSTANCE_NAME
rac11g1
$ crsctl stop crs
SQL> /
INSTANCE_NAME
rac11g2
RAC: Frequently Asked Questions [ID 220970.1]
*** Why do we have a Virtual IP (VIP) in Oracle RAC 10g or 11g? Why does it just return a dead connection when its primary node fails? ***
The goal is application availability.
When a node fails, the VIP associated with it is automatically failed over to some other node. When this occurs, the following things happen.
(1) VIP detects public network failure which generates a FAN event.
(2) the new node re-arps the world indicating a new MAC address for the IP.
(3) connected clients subscribing to FAN immediately receive ORA-3113 error or equivalent. Those not subscribing to FAN will eventually time out.
(4) New connection requests rapidly traverse the tnsnames.ora address list skipping over the dead nodes, instead of having to wait on TCP-IP timeouts
Without using VIPs or FAN, clients connected to a node that died will often wait for a TCP timeout period (which can be up to 10 min) before getting an error.
As a result, you don't really have a good HA solution without using VIPs and FAN. The easiest way to use FAN is to use an integrated client with Fast Connection Failover (FCF) such as JDBC, OCI, or ODP.NET.
*** What do the VIP resources do once they detect a node has failed/gone down? Are the VIPs automatically acquired, and published, or is manual intervention required? Are VIPs mandatory? ***
With Oracle RAC 10g or higher, each node requires a VIP. With Oracle RAC 11g Release 2, 3 additional SCAN vips are required for the cluster. When a node fails, the VIP associated with the failed node is automatically failed over to one of the other nodes in the cluster. When this occurs, two things happen:
1. The new node re-arps the world indicating a new MAC address for this IP address. For directly connected clients, this usually causes them to see errors on their connections to the old address;
2. Subsequent packets sent to the VIP go to the new node, which will send error RST packets back to the clients. This results in the clients getting errors immediately.
In the case of existing SQL conenctions, errors will typically be in the form of ORA-3113 errors, while a new connection using an address list will select the next entry in the list. Without using VIPs, clients connected to a node that died will often wait for a TCP/IP timeout period before getting an error. This can be as long as 10 minutes or more. As a result, you don't really have a good HA solution without using VIPs.
With Oracle RAC 11g Release 2, you can delegate the management of the VIPs to the cluster. If you do this, the Grid Naming Service (part of the Oracle Clusterware) will automatically allocated and manage all VIPs in the cluster. This requires a DHCP service on the public network.
Thank you
SteveAnswer = Follow MetaLink 377100.1
-
Service registration 11gr2 RAC AIX
Hi Gurus,
I have created an database service called "MTRXUATR" with srvctl as both nodes as prefered as show below ,this service got registered on both nodes (mvx20b) but only instance 1 is providing the service and not by the other instance .where logically should be provided by both instances.When i shutdown the instance 1(node 1) and tried connecting through sqlplus using this services, started getting error "ORA-12514" where i should have been redirected to instance 2(connect time failover) as i'm using scan name,correct?
Why is it that the service is not providing by both instances?is it some configuration issue that i missed?Services should be registered with the local listeners or SCAN listeners?
Node 1:
Code: Select all
mvx20b:/u01/app/omvmxu1/product/11.2.0/dbhome_1/racg>srvctl config service -d mvmxu2
Service name: mtrxuatr
Service is enabled
Server pool: mvmxu2_mtrxuatr
Cardinality: 2
Disconnect: false
Service role: PRIMARY
Management policy: AUTOMATIC
DTP transaction: false
AQ HA notifications: true
Failover type: SELECT
Failover method: BASIC
TAF failover retries: 180
TAF failover delay: 1
Connection Load Balancing Goal: LONG
Runtime Load Balancing Goal: NONE
TAF policy specification: NONE
Preferred instances: mvmxu21,mvmxu22
Available instances:
mvx20b:/home/gmvmxu1>lsntrctl services
ksh: lsntrctl: not found
mvx20b:/home/gmvmxu1>lsnrctl services
LSNRCTL for IBM/AIX RISC System/6000: Version 11.2.0.1.0 - Production on 27-OCT-2010 09:58:02
Copyright (c) 1991, 2009, Oracle. All rights reserved.
Connecting to (DESCRIPTION=(ADDRESS=(PROTOCOL=IPC)(KEY=LISTENER)))
Services Summary...
Service "+ASM1" has 1 instance(s).
Instance "+ASM1", status UNKNOWN, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:52549 refused:0
LOCAL SERVER
Service "matrxuat" has 1 instance(s).
Instance "mvmxu11", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:24 refused:0 state:ready
LOCAL SERVER
Service "mtrxuatr" has 1 instance(s).
Instance "mvmxu21", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:4 refused:0 state:ready
LOCAL SERVER
Service "mvmxu1" has 1 instance(s).
Instance "mvmxu11", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:24 refused:0 state:ready
LOCAL SERVER
Service "mvmxu1XDB" has 1 instance(s).
Instance "mvmxu11", status READY, has 1 handler(s) for this service...
Handler(s):
"D000" established:0 refused:0 current:0 max:1022 state:ready
DISPATCHER <machine: mvx20b.us.hp.com, pid: 1687592>
(ADDRESS=(PROTOCOL=tcp)(HOST=mvx20b.us.hp.com)(PORT=50696))
Service "mvmxu2" has 1 instance(s).
Instance "mvmxu21", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:4 refused:0 state:ready
LOCAL SERVER
Service "mvmxu2XDB" has 1 instance(s).
Instance "mvmxu21", status READY, has 1 handler(s) for this service...
Handler(s):
"D000" established:0 refused:0 current:0 max:1022 state:ready
DISPATCHER <machine: mvx20b.us.hp.com, pid: 1667116>
(ADDRESS=(PROTOCOL=tcp)(HOST=mvx20b.us.hp.com)(PORT=52509))
The command completed successfully
mvx20b:/home/gmvmxu1>srvctl config service -d mvmxu2
Service name: mtrxuatr
Service is enabled
Server pool: mvmxu2_mtrxuatr
Cardinality: 2
Disconnect: false
Service role: PRIMARY
Management policy: AUTOMATIC
DTP transaction: false
AQ HA notifications: true
Failover type: SELECT
Failover method: BASIC
TAF failover retries: 180
TAF failover delay: 1
Connection Load Balancing Goal: LONG
Runtime Load Balancing Goal: NONE
TAF policy specification: NONE
Preferred instances: mvmxu21,mvmxu22
Available instances:
mvx20b:/home/gmvmxu1>srvctl status database -d mvmxu2
Instance mvmxu21 is running on node mvx20b
Instance mvmxu22 is running on node mvx21b
mvx20b:/home/gmvmxu1>srvctl status service -d mvmxu2
Service mtrxuatr is running on instance(s) mvmxu21
mvx20b:/home/gmvmxu1>srvctl stop instance -d mvmxu2 -i mvmxu21
mvx20b:/home/gmvmxu1>srvctl status database -d mvmxu2
Instance mvmxu21 is not running on node mvx20b
Instance mvmxu22 is running on node mvx21b
mvx20b:/home/gmvmxu1>srvctl status service -d mvmxu2
Service mtrxuatr is not running.
mvx20b:/home/gmvmxu1>sqlplus system@mtrxuatr
SQL*Plus: Release 11.2.0.1.0 Production on Wed Oct 27 10:27:27 2010
Copyright (c) 1982, 2009, Oracle. All rights reserved.
Enter password:
ERROR:
ORA-12514: TNS:listener does not currently know of service requested in connect
descriptor
Enter user-name:
mvx20b:/home/gmvmxu1>tnsping mtrxuatr
TNS Ping Utility for IBM/AIX RISC System/6000: Version 11.2.0.1.0 - Production on 27-OCT-2010 10:28:51
Copyright (c) 1997, 2009, Oracle. All rights reserved.
Used parameter files:
/u01/app/11.2.0/grid/network/admin/sqlnet.ora
Used TNSNAMES adapter to resolve the alias
Attempting to contact (DESCRIPTION = (ADDRESS = (PROTOCOL = TCP)(HOST = mvxuatscan.us.hp.com)(PORT = 1521)) (CONNECT_DATA = (SERVER = DEDICATED) (SERVICE_NAME = MTRXUATR)))
OK (10 msec)
mvx20b:/home/gmvmxu1>nslookup mvxuatscan.us.hp.com
Server: einstein.us.hp.com
Address: 10.35.15.20
Name: mvxuatscan.us.hp.com
Addresses: 10.35.33.170, 10.35.33.172, 10.35.33.171
mvx20b:/home/gmvmxu1>host mvxuatscan.us.hp.com
mvxuatscan.us.hp.com is 10.35.33.171
mvx20b:/home/gmvmxu1>host mvxuatscan.us.hp.com
mvxuatscan.us.hp.com is 10.35.33.171
mvx20b:/home/gmvmxu1>ping mvxuatscan.us.hp.com
PING mvxuatscan.us.hp.com (10.35.33.172): 56 data bytes
64 bytes from 10.35.33.172: icmp_seq=0 ttl=255 time=0 ms
64 bytes from 10.35.33.172: icmp_seq=1 ttl=255 time=0 ms
64 bytes from 10.35.33.172: icmp_seq=2 ttl=255 time=0 ms
64 bytes from 10.35.33.172: icmp_seq=3 ttl=255 time=0 ms
--- mvxuatscan.us.hp.com ping statistics ---
4 packets transmitted, 4 packets received, 0% packet loss
round-trip min/avg/max = 0/0/0 ms
mvx20b:/home/gmvmxu1>ping mvxuatscan.us.hp.com
PING mvxuatscan.us.hp.com (10.35.33.171): 56 data bytes
64 bytes from 10.35.33.171: icmp_seq=0 ttl=255 time=0 ms
64 bytes from 10.35.33.171: icmp_seq=1 ttl=255 time=0 ms
64 bytes from 10.35.33.171: icmp_seq=2 ttl=255 time=0 ms
--- mvxuatscan.us.hp.com ping statistics ---
3 packets transmitted, 3 packets received, 0% packet loss
round-trip min/avg/max = 0/0/0 ms
mvx20b:/home/gmvmxu1>ping mvxuatscan.us.hp.com
PING mvxuatscan.us.hp.com (10.35.33.170): 56 data bytes
64 bytes from 10.35.33.170: icmp_seq=0 ttl=255 time=0 ms
64 bytes from 10.35.33.170: icmp_seq=1 ttl=255 time=0 ms
64 bytes from 10.35.33.170: icmp_seq=2 ttl=255 time=0 ms
--- mvxuatscan.us.hp.com ping statistics ---
3 packets transmitted, 3 packets received, 0% packet loss
round-trip min/avg/max = 0/0/0 ms
mvx20b:/home/gmvmxu1>lsnrctl services
LSNRCTL for IBM/AIX RISC System/6000: Version 11.2.0.1.0 - Production on 27-OCT-2010 10:34:16
Copyright (c) 1991, 2009, Oracle. All rights reserved.
Connecting to (DESCRIPTION=(ADDRESS=(PROTOCOL=IPC)(KEY=LISTENER)))
Services Summary...
Service "+ASM1" has 1 instance(s).
Instance "+ASM1", status UNKNOWN, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:52549 refused:0
LOCAL SERVER
Service "matrxuat" has 1 instance(s).
Instance "mvmxu11", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:24 refused:0 state:ready
LOCAL SERVER
Service "mvmxu1" has 1 instance(s).
Instance "mvmxu11", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:24 refused:0 state:ready
LOCAL SERVER
Service "mvmxu1XDB" has 1 instance(s).
Instance "mvmxu11", status READY, has 1 handler(s) for this service...
Handler(s):
"D000" established:0 refused:0 current:0 max:1022 state:ready
DISPATCHER <machine: mvx20b.us.hp.com, pid: 1687592>
(ADDRESS=(PROTOCOL=tcp)(HOST=mvx20b.us.hp.com)(PORT=50696))
The command completed successfully
mvx20b:/home/gmvmxu1>srvctl start instance -d mvmxu2 -i mvmxu21
mvx20b:/home/gmvmxu1>srvctl status service -d mvmxu2
Service mtrxuatr is running on instance(s) mvmxu21
mvx20b:/home/gmvmxu1>
Node 2:
Code: Select all
mvx21b:/home/gmvmxu1>lsnrctl services
LSNRCTL for IBM/AIX RISC System/6000: Version 11.2.0.1.0 - Production on 27-OCT-2010 10:38:52
Copyright (c) 1991, 2009, Oracle. All rights reserved.
Connecting to (DESCRIPTION=(ADDRESS=(PROTOCOL=IPC)(KEY=LISTENER)))
Services Summary...
Service "+ASM" has 1 instance(s).
Instance "+ASM2", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:3973 refused:0 state:ready
LOCAL SERVER
Service "+ASM2" has 1 instance(s).
Instance "+ASM2", status UNKNOWN, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:9 refused:0
LOCAL SERVER
Service "mvmxu1" has 1 instance(s).
Instance "mvmxu12", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:951 refused:0 state:ready
LOCAL SERVER
Service "mvmxu1XDB" has 1 instance(s).
Instance "mvmxu12", status READY, has 1 handler(s) for this service...
Handler(s):
"D000" established:0 refused:0 current:0 max:1022 state:ready
DISPATCHER <machine: mvx21b.us.hp.com, pid: 1552450>
(ADDRESS=(PROTOCOL=tcp)(HOST=mvx21b.us.hp.com)(PORT=34128))
Service "mvmxu2" has 1 instance(s).
Instance "mvmxu22", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:0 refused:0 state:ready
LOCAL SERVER
Service "mvmxu2XDB" has 1 instance(s).
Instance "mvmxu22", status READY, has 1 handler(s) for this service...
Handler(s):
"D000" established:0 refused:0 current:0 max:1022 state:ready
DISPATCHER <machine: mvx21b.us.hp.com, pid: 2064564>
(ADDRESS=(PROTOCOL=tcp)(HOST=mvx21b.us.hp.com)(PORT=64601))
The command completed successfully
mvx21b:/home/gmvmxu1>ps -ef |grep lsnr
gmvmxu1 1695876 1 0 Sep 13 - 20:40 /u01/app/11.2.0/grid/bin/tnslsnr LISTENER -inherit
gmvmxu1 1732752 1 0 Sep 13 - 5:52 /u01/app/11.2.0/grid/bin/tnslsnr LISTENER_SCAN1 -inherit
gmvmxu1 1855706 2195662 0 10:39:13 pts/0 0:00 grep lsnr
mvx21b:/home/gmvmxu1>lsnrctl services LISTENER_SCAN1
LSNRCTL for IBM/AIX RISC System/6000: Version 11.2.0.1.0 - Production on 27-OCT-2010 10:39:25
Copyright (c) 1991, 2009, Oracle. All rights reserved.
Connecting to (DESCRIPTION=(ADDRESS=(PROTOCOL=IPC)(KEY=LISTENER_SCAN1)))
Services Summary...
Service "+ASM" has 2 instance(s).
Instance "+ASM1", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:0 refused:0 state:blocked
REMOTE SERVER
(DESCRIPTION=(ADDRESS=(PROTOCOL=TCP)(HOST=mvx21b-vip.us.hp.com)(PORT=1521)))
Instance "+ASM2", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:0 refused:0 state:ready
REMOTE SERVER
(DESCRIPTION=(ADDRESS=(PROTOCOL=TCP)(HOST=mvx21b-vip.us.hp.com)(PORT=1521)))
Service "matrxuat" has 1 instance(s).
Instance "mvmxu11", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:2 refused:0 state:ready
REMOTE SERVER
(DESCRIPTION=(ADDRESS=(PROTOCOL=TCP)(HOST=mvx20b-vip.us.hp.com)(PORT=1521)))
Service "mvmxu1" has 2 instance(s).
Instance "mvmxu11", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:2 refused:0 state:ready
REMOTE SERVER
(DESCRIPTION=(ADDRESS=(PROTOCOL=TCP)(HOST=mvx20b-vip.us.hp.com)(PORT=1521)))
Instance "mvmxu12", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:0 refused:0 state:ready
REMOTE SERVER
(ADDRESS=(PROTOCOL=TCP)(HOST=mvx21b.us.hp.com)(PORT=1521))
Service "mvmxu1XDB" has 2 instance(s).
Instance "mvmxu11", status READY, has 1 handler(s) for this service...
Handler(s):
"D000" established:0 refused:0 current:0 max:1022 state:ready
DISPATCHER <machine: mvx20b.us.hp.com, pid: 1687592>
(ADDRESS=(PROTOCOL=tcp)(HOST=mvx20b.us.hp.com)(PORT=50696))
Instance "mvmxu12", status READY, has 1 handler(s) for this service...
Handler(s):
"D000" established:0 refused:0 current:0 max:1022 state:ready
DISPATCHER <machine: mvx21b.us.hp.com, pid: 1552450>
(ADDRESS=(PROTOCOL=tcp)(HOST=mvx21b.us.hp.com)(PORT=34128))
Service "mvmxu2" has 1 instance(s).
Instance "mvmxu22", status READY, has 1 handler(s) for this service...
Handler(s):
"DEDICATED" established:0 refused:0 state:ready
REMOTE SERVER
(ADDRESS=(PROTOCOL=TCP)(HOST=mvx21b.us.hp.com)(PORT=1521))
Service "mvmxu2XDB" has 1 instance(s).
Instance "mvmxu22", status READY, has 1 handler(s) for this service...
Handler(s):
"D000" established:0 refused:0 current:0 max:1022 state:ready
DISPATCHER <machine: mvx21b.us.hp.com, pid: 2064564>
(ADDRESS=(PROTOCOL=tcp)(HOST=mvx21b.us.hp.com)(PORT=64601))
The command completed successfully
mvx21b:/home/gmvmxu1>
Instance 1 parameter:
Code: Select all
SQL> show parameter local_
NAME TYPE VALUE
local_listener string (DESCRIPTION=(ADDRESS_LIST=(AD
DRESS=(PROTOCOL=TCP)(HOST=mvx2
0b-vip.us.hp.com)(PORT=1521)
log_archive_local_first boolean TRUE
SQL> show parameter remote_
NAME TYPE VALUE
remote_dependencies_mode string TIMESTAMP
remote_listener string mvxuatscan:1521
Instance 2 parameter:
Code: Select all
SQL> show parameter local_
NAME TYPE VALUE
local_listener string
log_archive_local_first boolean TRUE
SQL> show parameter remote_
NAME TYPE VALUE
remote_dependencies_mode string TIMESTAMP
remote_listener string mvxuatscan:1521
remote_login_passwordfile string EXCLUSIVE
remote_os_authent boolean FALSE
remote_os_roles boolean FALSE
result_cache_remote_expiration integer 0
Edited by: user9097501 on Oct 27, 2010 12:21 PMI tried starting service on instance 2 but getting the following error...
mvx21b:/home/omvmxu1>srvctl status service -d mvmxu1
Service matrxuat is running on instance(s) mvmxu11
mvx21b:/home/omvmxu1>srvctl start service -d mvmxu1 -s matrxuat -i mvmxu12
PRCR-1064 : Failed to start resource ora.mvmxu1.matrxuat.svc on node mvx21b
CRS-2674: Start of 'ora.mvmxu1.matrxuat.svc' on 'mvx21b' failed
CRS-2678: 'ora.mvmxu1.matrxuat.svc' on 'mvx21b' has experienced an unrecoverable failure
CRS-0267: Human intervention required to resume its availability.
When i say 'srvctl start service -d mvmxu1' should not it bring up services on both nodes won which it is configured as prefered?
Edited by: user9097501 on Oct 28, 2010 6:44 AM -
SFSB Instance Sharing across multiple servers/SFSB failover practices
Hello:
My question is in two parts. I have spent a great deal of time searching the forums without a satisfactory answer, so I thought I'd post my question directly.
1) I have two clients that can potentially talk to two different servers, but they both need to interact with the same instance of a SFSB. For example, the first client calls the SFSB and causes it to save some state in its instance variables. The second client connects to a different server (because of a "network dispatcher" load-balancing architecture), but needs to use the same instance of the SFSB that the first client initialized. The two clients will not always talk to different servers, but the possibility exists that they might.
2) What is the proper design pattern for "fail-over" for a SFSB. For example, a client establishes a session, tickles a SFSB and causes its instance variables to contain state, and then WHAMO the application server crashes or becomes unavailable for some reason. We have in place a mechanism to reroute further client request to a second application server, but currently it is a problem because even though the failover is transparent to the client, the backup server creates a new instance of the SFSB and therefore it has none of the previous state information.
I'm thinking that the answer to one of these questions will be the answer to the other.
A little bit of background: this is not a theoretical application. We are building 250+ cars per day, ramping up quickly to 500-650 cars per day. I'm hoping I can solve this problem from an architectural standpoint, without having to modify each individual bean, because we have on the order of 200-300 SFSBs that would have to be changed--not a pretty thing in a production critical application. We have two AIX servers, but one of them is currenlty just a hot standby because we cannot run both of them at the same time because of problem 1) from above. We'd like to be able to run both servers at the same time for load-balancing purposes. Furthermore, if one box fails then all SFSB data will be lost because of the problem is mentioned problem 2) above.
(As a point of clarification, and only because I don't know if this affects any possible suggestions, but the clients do not use remote references to the EJBs. They simply pass "data containers" via HTTP to the server, and this data is passed to the various EJBs and returned back to the client via HTTP Response and in some cases TCP/IP. In either case, we do not use "remote object" references in the most typical sense. The servlet maintains the references to the EJBs.)
I've read things about "session clustering" but have not pinned down the subject. I have the "Core J2EE Patterns" book on order.
Any suggestions or pointers to reading materials would be greatly, greatly appreciated. I also welcome the "what you should have done" variety suggestions.
Thanks!
Regards,
Doug WilkersonDoug,
Here are my thoughts, I hope they help.
I don't think there is a possibility to have two different client (different sessions) talk to one and the same SFSB. A SFSB is specific to a users session and cannot be shared.
The way I would tackle this problem is probably by using SLSB which might access either an entity bean or the DB directly. This way, you might cause DB overhead, but you can share the data between clustered servers and the users will work with the correct data.
About the second question, I don't really now a pattern to provide fail-over for SFSB. To my opinion that is the weakest side of SFSB (beside all the load-balancing that needs to take place).
Hope this helps. -
HACMP Clustering Script for SAP ECC 6.0 (SR1) - Automatic Failover
Hello,
I have installed the SAP ECC 6.0 (SR1) under AIX 5.3 / DB2 V8 FP12 with HACMP Clustering environment. Manual Failover is working fine. Central System has been installed in share drive with Virtual IP and Virtual name inNode A. Dialog Instance is loaded locally in Node B. I want to get HACMP Clustering script(automatic failover script) for Automation. Please help me if you have. It is single package clustering. If Node A fails, Node B will take care ( Central System and Dialog instance will run in Node B)
Thanks
Gautam Poddarthis post is duplicated at Upgrade to ERP 2005/ECC 6.0 from R/3 4.72/Basis 640 on Z/OS 1.4 DB2 8.1
Maybe you are looking for
-
How to use multiple patterns for masking/format the input text
Hi All, I am using Jdeveloper 11.1.1.5 and i have a requirement where i need to format my input Text value in these below patterns:- Format Example AA9A 9AA EC1A 1BB A9A 9AA W1A 1HQ A9 9AA M1 1AA B33 8TH A99 9AA AA9 9AA CR2 6XH DN55 1PT AA99 9AA For
-
I am new to Apple, somehow I figured out how to make font of Web pages bigger, but it is too big and I can't use sites properly. for the life of me I can not how to get my font size reduced. Help anyone? thank you, sbb
-
Very frustrated with the battery exchange program
I'm very frustrated with the battery exchange program. I signed up online, double checked all my information, and submitted. No where on the page did it say that a signature would be required upon delivery. I work during the day, so I'm not home to r
-
Problems with a VBA Userform using Multipage (2) and DTPicker.
Hi Problems with a VBA Userform using Multipage (2) and DTPicker (4) On Page1 I've got 2 DTPicker, one for the date and the second for the time. Same thing on Page 2. Problem: Only one set will work, if I close the Userform with" MultiPage"on page2
-
I just up-dated to the latest version and each time i log on it runs for a few seconds then showns an error message telling me that Skype has stopped working. Can someone please fix it...