3 Node HA cluster issues.

Hi,
We are having a tough time setting up a 3-node HA cluster (Sun Cluster 3.2 on Solaris 10 systems). Two systems join the cluster fine, but the third does not. I changed the netconfig and rebooted all three systems. After the reboot, again only two systems joined the cluster (scstat hangs on these two systems for a while but works fine after a couple of hours or so). The third system never connects to the cluster, and scstat hangs on it with the message:
libsecurity: create of rpc handle to program rgmd_receptionist (100141) failed, will keep trying
libsecurity: create of rpc handle to program rgmd_receptionist (100141) failed,will keep trying
Any debugging hints to resolve this would be greatly appreciated. Thanks.

Check the following entry:
http://blogs.sun.com/TF/entry/secure_by_default_and_sun
Also do the basic checks, such as OS version and patch level.
Best Regards,
Madhan Kumar
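
One commonly documented cause of RPC handle failures on Solaris 10 is the "secure by default" profile restricting rpcbind to local requests, while Sun Cluster needs rpcbind to accept remote ones. A minimal sketch of that check, assuming it applies to this setup (the function name is made up for illustration; run it as root on the node that fails to join):

```shell
# Sketch: make sure rpcbind accepts remote requests on a Solaris 10 node.
# Uses the standard SMF tools svcprop, svccfg and svcadm.
allow_remote_rpcbind() {
    # Read the current value of the local_only property of rpcbind.
    current=$(svcprop -p config/local_only network/rpc/bind) || return 1
    if [ "$current" = "true" ]; then
        # Allow remote requests, then make the running service pick it up.
        svccfg -s network/rpc/bind setprop config/local_only=false &&
        svcadm refresh network/rpc/bind
    fi
}
```

Call allow_remote_rpcbind on each node and then retry scstat. If the property was already false, the function changes nothing and the cause lies elsewhere.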

Similar Messages

  • 4 Node Q-Master Cluster issue with Nodes stuck at "Waiting"

    I have a 4-node Qmaster cluster configured with no problem: one cluster controller and 3 service nodes. When I export a project from FCP to Compressor, I can assign the cluster without trouble. But then the only node that starts to process is the cluster controller; the service nodes stay at "Waiting...". If I make any other service node the cluster controller, then THAT node starts to process and the 3 service nodes are stuck at waiting.
    I can mount the cluster storage on all nodes and I am able to read/write to that volume.
    The source material (Firewire drive) is also mounted on all nodes.
    The setup is a new MB-PRO running FCP 5.0 and Compressor 2.0, QMaster 2.0 , the cluster nodes are all 1ghz G4's
    Any ideas?
    Many thanks
    MACBOOK PRO 2.1 Mac OS X (10.4.6) Multiple G4's

    I don't recommend submitting to a cluster directly from the timeline. Instead, export a QT reference movie and submit that to the cluster via Compressor, using a setting that has 'allow job segmenting' checked. If that is not checked, you are telling Qmaster to let only one machine work on that file. That is recommended for high-quality, multi-pass VBR MPEG-2 jobs because of how multi-pass and VBR work.
    If you submit from outside FCP but are doing multi-pass VBR, you can still send your cluster 4 jobs that will all run at the same time.
    Basically, to use all the nodes, Qmaster needs to have the whole file and be allowed to break it into segments and send those segments to the nodes. When you export off the timeline, FCP sends the data to Compressor a frame at a time, not as a whole file, so there is no segmenting.

  • 2 node Webcenter Cluster setup

    Hi,
    I am trying to configure a 2-node WebCenter cluster. So far I have been able to accomplish the following:
    i) Created a 2-node WebLogic cluster with WebCenter: [WLS_Spaces1, WLS_Spaces2], [WLS_Services1, WLS_Services2], [WLS_Portlet1, WLS_Portlet2].
    ii) Configured WebLogic to authenticate using AD, and configured Jive Admin to log in using AD.
    iii) Configured JOC (Java Object Cache) for WebCenter Spaces.
    iv) I am having issues configuring WS-Security for Spaces to access the discussion services.
    v) I am also not able to see any policy store, and when I try to add a user to the admin role for WebCenter, the list is empty.
    In a single-node setup I am able to assign admin roles to users from AD with no issues.
    Can someone help me configure the backend services of WebCenter Spaces in cluster mode?
    Thank you
    A/
    * here is the doc link i am using for the setup
    http://sqltech.cl/doc/oas11gR1/core.1111/e12037/extend_wc.htm
    Edited by: user10696627 on Oct 7, 2010 10:28 PM


  • Install multiple RAC databases on 2-node RAC cluster

    I am installing 5 RAC databases on a 2-node RAC cluster. I have setup SCAN using 3 IP addresses.
    Do I have to use SCAN listener for all databases?
    When installing the 3 database, I get ORA-12537: TNS connection closed error.
    ENV: 11gR2 2-node RH5.x
    Thanks!

    > I have setup SCAN using 3 IP addresses.
    > Do I have to use SCAN listener for all databases?
    These 3 SCAN IPs will work for all databases running under this cluster setup.
    You may also use the VIPs to make connections, as in 10g.
    > I get ORA-12537: TNS connection closed error.
    This appears to be a connectivity/configuration issue. Please try the following MOS doc, which contains details on this:
    How to Troubleshoot Connectivity Issue with 11gR2 SCAN Name [ID 975457.1]

  • How to Delete the node from cluster when the machine crashed?

    In a three-node 11g R2 RAC, how do I delete a node from the cluster when the machine has crashed?
    There is no way to repair the machine, and we have to add a new one.
    What steps should I follow?

    Hi,
    If you want to delete node RAC1:
    Check the current node list with $ ./olsnodes
    1) Delete the instance using dbca from any active node
    crs_stat -t
    srvctl stop asm -n rac1
    2) Delete the listener
    3) Delete the Oracle home as the oracle user
    $ORACLE_HOME/bin/runInstaller -updatenodelist ORACLE_HOME=<db_home> "CLUSTER_NODES={RAC1}"
    4) Delete the ASM home
    $ORACLE_HOME/bin/runInstaller -updatenodelist ORACLE_HOME=<asm_home> "CLUSTER_NODES={RAC1}"
    5) Update the cluster node list
    $ORACLE_HOME/bin/runInstaller -updatenodelist ORACLE_HOME=<db_home> "CLUSTER_NODES={active nodes like rac2,rac3}"
    6) Update the ASM home
    $ORACLE_HOME/bin/runInstaller -updatenodelist ORACLE_HOME=<asm_home> "CLUSTER_NODES={active nodes like rac2,rac3}"
    cd $ORA_CRS_HOME
    cd crs/opmn/conf
    Check the remote port:
    $ cat ons.config
    remoteport=6200
    cd crs_home/bin
    $ ./racgons remove_config rac1:6200
    Go to the CRS home and run:
    $ORA_CRS_HOME/crs/install/rootdelete.sh
    $ORA_CRS_HOME/crs/install/rootdeletenode.sh
    Check the node list again with $ ./olsnodes
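
    The node list for steps 5 and 6 above can be built rather than typed by hand. A small sketch, assuming node names are passed as plain arguments (the helper name surviving_nodes is hypothetical and not part of any Oracle tool):

```shell
# Print the nodes that remain after dropping the failed one,
# comma-separated, ready for use inside CLUSTER_NODES={...}.
surviving_nodes() {
    failed=$1
    shift
    list=""
    for node in "$@"; do
        if [ "$node" != "$failed" ]; then
            # Add a comma only when the list already has an entry.
            list="${list:+$list,}$node"
        fi
    done
    printf '%s\n' "$list"
}

# Example: rac1 crashed in a cluster of rac1, rac2 and rac3.
nodes=$(surviving_nodes rac1 rac1 rac2 rac3)
printf '"CLUSTER_NODES={%s}"\n' "$nodes"
```

    The first argument is the failed node and the rest is the full node list, so the same call works whichever node is lost.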

  • Multiple databases/instances on 4-node RAC Cluster including Physical Stand

    OS: Windows 2003 Server R2 X64
    DB: 10.2.0.4
    Virtualization: NONE
    Node Configuration: x64 architecture - 4-Socket Quad-Core (16 CPUs)
    Node Memory: 128GB RAM
    We are planning the following on the above-mentioned 4-node RAC cluster:
    Node 1: DB1 with instanceDB11 (Active-Active: Load-balancing & Failover)
    Node 2: DB1 with instanceDB12 (Active-Active: Load-balancing & Failover)
    Node 3: DB1 with instanceDB13 (Active-Passive: Failover only) + DB2 with instanceDB21 (Active-Active: Load-balancing & Failover) + DB3 with instanceDB31 (Active-Active: Load-balancing & Failover) + DB4 with instance41 (Active-Active: Load-balancing & Failover)
    Node 4: DB1 with instanceDB14 (Active-Passive: Failover only) + DB2 with instanceDB22 (Active-Active: Load-balancing & Failover) + DB3 with instanceDB32 (Active-Active: Load-balancing & Failover) + DB4 with instance42 (Active-Active: Load-balancing & Failover)
    Note: DB1 will be the physical primary PROD OLTP database and will be open in READ-WRITE mode 24x7x365.
    Note: DB2 will be a Physical Standby of DB1 and will be open in Read-Only mode for reporting purposes during the day-time, except for 3 hours at night when it will apply the logs.
    Note: DB3 will be a Physical Standby of a remote database DB4 (not part of this cluster) and will be mounted in Managed Recovery mode for automatic failover/switchover purposes.
    Note: DB4 will be the physical primary Data Warehouse DB.
    Note: Going to 11g is NOT an option.
    Note: Data Guard broker will be used across the board.
    Please answer/advise of the following:
    1. Is the above configuration supported and why so? If not, what are the alternatives?
    2. Is the above configuration recommended and why so? If not, what are the recommended alternatives?

    Hi,
    As far as I understand, there is nothing wrong with the configuration, but you need to consider the points below when implementing the final design.
    1. Number of CPUs on each server
    2. Memory on each server
    3. If you have a RAC physical standby, then the apply process (MRP0) will run on only one instance.
    4. You are configuring the physical standby on the 3rd and 4th nodes of the 4-node DB1 cluster, where the DB13 and DB14 instances are used only for failover. Assuming your primary and physical standby reside in the same data center, a disaster or power failure affecting the entire data center would take out both primary and standby, so this may not be a highly available architecture. If you are going to use extended RAC for this configuration, then it makes sense for Node 1 and Node 2 to reside in Datacenter A and Nodes 3 and 4 in Datacenter B.
    Thanks,
    Keyur

  • How to remove a node from 4 node sun cluster 3.1

    Dear All,
    We have four nodes in a cluster.
    Could anyone please guide me on how to remove a single node from a 4-node cluster?
    What procedure and steps do I have to follow?
    Thanks in advance.
    Veera.

    Google is pretty good at finding the right pages in our docs quickly. I tried >how to remove a node Solaris Cluster< and it came up with
    http://docs.sun.com/app/docs/doc/819-2971/gcfso?a=view
    Tim
    ---

  • Automatic restart of services on a 1 node rac cluster with Clusterware

    How do we enable a service to automatically start up when the DB starts up?
    Thanks,
    Dave

    srvctl enable service -d DB

    Thanks for your reply, M. Nauman. I researched that command and found that we do have it enabled, and that it only works if the database instance was previously taken down. Since the database does not go down on an Archiver Hung error (we are using the FRA with an alternate location), this never kicks in and brings up the service. What we are looking for is something that triggers when archive logging errors out and switches from the FRA (Flash Recovery Area) to our alternate disk location. Or, more precisely, when the FRA goes back to a Valid status (after we've run an archive log backup to clear it).
    I found out from our two senior DBAs that our other 2-node RAC environment does not suffer from this problem, only the newly created 1-node RAC cluster environment. The problem is that we don't know what the difference is (a parameter on the DB, on the cluster, or something else) or how to set it.
    Anyone know?
    Thanks,
    Gib
    Message was edited by:
    Gib2008

  • Failed to add node to cluster

    Hey, I am currently migrating my cluster.
    I removed the server pool master according to the Metalink note by doing a failover (stopped the agent on the server pool master).
    Deleted the old master (node2) from the server pool.
    Executed the cleanup script on node2 and switched it off.
    Modified the cluster.conf on the remaining node and removed the entries for the old master, node2.
    Replaced the old server with new hardware: same name, same IP.
    Now I try to add this server to the server pool, but I get a timeout message:
    OVM-1006 Register Oracle VM Server (node2) Failed: errcode=00001, errmsg=CDS accquire lock /etc/ovs-agent/db/srv.lock timeout. locker process is 8339
    Where can I look?
    Christian

    Lemeunier wrote:
    > environment: sles 10 sp3, oes2, cluster services
    >
    > problem: reconfiguring oes to add a node to the cluster is causing the
    > error *failed to add node to cluster*
    >
    > history: I installed a 4 node cluster in a HP C7000 blade. We had to
    > replace the network switch in the blade center by a virtual connect
    > flex-10. This resulted in a loss of network connectivity, so I removed 3
    > of 4 nodes from cluster and eDirectory.
    > This worked fine, replication and time synchronisation was succesfully
    > and all server objects belonging to these 3 servers were deleted.
    >
    > Now the new switch has been configured and network connection
    > reestablished. Reconfiguring eDirectory and other oes2 services
    > succeeds, alle server objects are recreated, eDirectory is in sync, but
    > reconfiguring cluster services does not succeed.
    >
    > What do I have to do, to reconfigure cluster service and add nodes to
    > the cluster?
    >
    > Thank you for all hints.
    >
    > Ursula
    >
    >
    Did you remove the cluster rpms and then reinstall them? I would
    recommend following TID 3131978 to see if that helps.

  • All connections are connecting to 2nd node only in a 2 Node RAC Cluster

    Hello,
    I have a 10.2.0.3 database on a two-node RAC cluster with only one service configured. This service is set to be preferred on both nodes.
    However, all the connections are landing on Node2 only. Any idea where to look?
    $> srvctl config service -d PSDB
    psdbsrv1 PREF: psdb1 psdb2 AVAIL:
    Thanks,
    MM

    Application is using the following connection string.
    jdbc:oracle:thin:@(DESCRIPTION =(ADDRESS = (PROTOCOL = TCP)(HOST = PQ2-PS-db-01-vip)(PORT = 1521))(ADDRESS = (PROTOCOL = TCP)(HOST = PQ2-PS-db-02-vip)(PORT = 1521)) (LOAD_BALANCE = yes) (CONNECT_DATA =(SERVER = DEDICATED)(SERVICE_NAME = PSDBSRV1)(FAILOVER_MODE =(TYPE = SELECT)(METHOD = BASIC)(RETRIES = 180)(DELAY = 5))))
    --MM

  • 2 Node Failover Cluster - ISCSI Disks as 1 volume?

    Hi,
    Not sure if I'm in the correct forum. If I'm not, I apologize. I need some advice.
    I have created a 2-node failover cluster with 2 HP blades. I also currently have 2 NAS servers (HP X1600 24 TB servers running 2008 Storage Server). The ultimate goal would be to combine all of the storage space from the NAS servers into 1 volume addressable by the failover cluster (as well as disk space from any additional NAS servers added in the future).
    Right now, I can add the iSCSI disk space from the NAS targets as different volumes under Cluster Shared Volumes. Because of the 16 TB limit in the iSCSI target, I essentially have 2 iSCSI disks on each NAS: one of 16 TB and one of 4 TB (the NAS drives are configured for RAID 5, so there's a 4 TB loss). So I have 4 iSCSI disks in the cluster, each as its own volume.
    Any thoughts on making the 4 drives addressable as one volume? 
    Regards,
    -Eric

    We're running Server 2012 Data Center on the cluster nodes.
    I was thinking the same about the 3rd party software to do what I'd like it to do. The data is mostly security camera video from our security system. Since it's not really critical data, I'm just looking for a way to maximize the available hard drive space and make it addressable as one volume or network share...
    -Eric
    You can build Storage Spaces (simple, not clustered as it would waste 50% of your capacity, MSFT can do mirror and parity with R2 for clustered only) from iSCSI LUs. Dog slow and unsupported but you'll have linear spanned space. See:
    Rough Guide To Setting Up A Scale-Out File Server
    http://www.aidanfinn.com/?p=13176
    Creating Virtual SoFS with shared VHDX
    http://www.aidanfinn.com/?p=15145
    You don't need SoFS (obviously), but in this article Aidan creates Storage Spaces from iSCSI LUNs.
    Good luck!
    StarWind VSAN [Virtual SAN] clusters Hyper-V without SAS, Fibre Channel, SMB 3.0 or iSCSI, uses Ethernet to mirror internally mounted SATA disks between hosts.

  • Replace 2 Nodes in Cluster

    I have a 2-node SQL cluster and am looking for the best way to replace both nodes with two new servers. I was thinking of removing one SQL node and then removing that node from the Windows failover cluster MMC, then unplugging the crossover cable, plugging it into the new server, and giving the new server the same name as the one just removed. Then add it to the cluster and start with the SQL nodes. Thoughts? Any articles, etc. to follow?

    Hi,
    You can refer to the following resources, which cover the same scenario:
    Add or Remove Nodes in a SQL Server Failover Cluster (Setup)
    http://technet.microsoft.com/en-us/library/ms191545.aspx
    Replace broken node on SQL 2008 failover cluster
    http://social.msdn.microsoft.com/Forums/sqlserver/en-US/a25cba7a-4762-45b5-be4c-18fc13ec7eab/replace-broken-node-on-sql-2008-failover-cluster?forum=sqldisasterrecovery
    Hope this helps.

  • Adding nodes to cluster in 10g r2 10.1.0.3

    I apologize if this is a repeat, but my browser crashed before I could watch my post. I am asking a hypothetical question regarding adding nodes to the cluster. I am trying to get a feel for how much risk is involved in the operation and whether there is any chance we could corrupt the current configuration.
    I was reading the article from Murali Vallath and noticed that he made it a point to say that you should make a full cold backup before you perform step 6...
    Step 6: Add New Instance(s)
    DBCA has all the required options to add additional instances to the cluster.
    Requirements:
    Make a full cold backup of the database before commencing the upgrade process.
    Is there risk of corrupting the database during this step?
    We are running 10.2.0.3 on linux Itanium on RHEL4 and we are running a 2 node cluster. We are using OCFS2 for the OCR and Voting devices and we are using ASM and also ASMLIB for our shared storage option. We also are running EMC Powerpath on our hosts.
    Any tips or heads up would be greatly appreciated.
    Thanks.

    Duplicate post :- adding nodes to cluster in 10g r2 10.1.0.3

  • 2 node failover cluster power down

    I have a 2-node failover cluster. When I power down a node that has the SQL Server instance and resources, all the resources and services fail over to the other node. When I see that all the resources and services report "online", I then power that node. I am being told that this is improper because failover may not have completed. Is that correct?
    Also, in our 2 node failover cluster is there a proper sequence to restarting the powered down nodes?

    Hi,
    The cluster group containing SQL Server can be configured for automatic failback to the primary node when it becomes available again. By default, this is set to off.
    To Configure:
    Right-click the group containing SQL Server in the cluster administrator, select 'properties' then 'failback' tab.
    To prevent an auto-failback, select 'Prevent Failback', to allow select 'Allow Failback' then one of the following options:
    Immediately: Not recommended as it can disrupt clients
    Failback between n and n1 hours: allows a controlled failback to a preferred node (if it's online) during a certain period.
    The related article:
    Windows Failover Clustering Overview
    http://blogs.technet.com/b/rob/archive/2008/05/07/failover-clustering.aspx
    Hope this helps.

  • How to check that data was propagated to all nodes in cluster?

    Hi.
    We are using WebLogic 10.3.5 and Coherence 3.6. Both applications run in cluster mode, and we use replicated mode as the Coherence topology. A NamedCache is used to store and retrieve data from the Coherence cluster. Now I have a task to measure the time it takes for data to propagate to all nodes. From my point of view, Coherence should raise some kind of event when every node in the cluster has been filled with the same data. Or maybe there is a standard Coherence (WebLogic?) listener that provides such information.
    I would appreciate help on how to solve this task.

    Jonathan.Knight wrote:
    > Hi,
    > If you are using a replicated cache then the time taken to replicate the data is the time taken to do a put. Coherence will not return from a put method call on a NamedCache until the data has reached all the nodes. That is why replicated caches are a bad idea for clusters with a lot of nodes where there are frequent updates as they are slow.
    > JK
    Hi JK,
    actually, AFAIK, it is not 100% correct.
    From what I remember from an earlier discussion or email, replication in a replicated cache is synchronous to one other member (the lease owner), and asynchronous thereafter. The synchronous part of the protocol involves the mutating member and the entry lease owner (which may be the same). As I understand the lease owner orders the operations and resolves races between multiple mutators, and drives the asynchronous part of the replication to all other members.
    In short, total network cost is linear with nodes, but latency wise you do not need to wait until all updates actually took place on all other nodes (that would be a really sad scenario when some nodes are communicating slowly).
    Best regards,
    Robert
