Can we monitor Sun Cluster events

Hi,
Whenever there are some events, issues in cluster, we see logs on the node consoles regarding the same. Is there any way to capture these
events through some other application? Better, is there any way by which we can send these events, in some standard format (protocol way) to some application that can send emails etc to administrators so that one need not keep eye continuously on the console for issus in cluster?

Well, I did not get a chance to experiment with CRNP, however, came up with a strategy that can be used.
First of all, we can do this with Solaris 9 and above as we do not have a resource type SUNW.Event available with Solaris 8. This resource type needs to be registered with only one resource group on the cluster, the resource group can be related to your data service.
So, one should first get the file SUNW.Event from the cluster source, copy it at /usr/cluster/lib/rgm/rtreg, also, copy corresponding RT_BASEDIR contents mentioned in this file at appropriate location. This directory contains start, stop, check and monitor scripts.
One can set the required properties SUNW.Event.
All this needs to be done on all the nodes.
The properties of this resource type cover the port and logical address (which is usually the cluster address) where the events would be sent. One can then write a client to listen to this port and process the events.
Hope this clarifies the point, please confirm if you see any issues with this.

Similar Messages

  • Can I install Sun Cluster on LDOM guest domain. Is Oracle RAC a supported c

    Hello,
    Can I install Sun Cluster on LDOM guest domains. Is Oracle RAC on LDOM guest domains of 2 physical servers a supported configuration from oracle?
    Many thanks in advance
    Ushas Symon

    Hello,
    The motive behind using LDOm Guest domains as RAC node is to have a better control of the resource allocation, since i will be having more than one guest domains which should perform different functions. The customer wants to have ORACLE RAC alone (without sun cluster).
    I will have two T5120's and one 2540 shared storage.
    My plan of configuration is to have
    Control&IO Domain with 8VCPU, 6GB mem
    one LDOM guest domain on each physical machine with 8 VCPU's, 8GB of memory, shared n/w and disks participating as RAC node's. (Don't know yet if i will use solaris cluster or not)
    one guest domain on each physical machine with 12 VCPU's, 14GB of memory, shared n/w and disks participating as BEA weblogic cluster nodes (not on solaris cluster)
    One guest domain on each physical machine with 4 VCPU's, 4GB of memory,shared n/w and disks participating as apache web cluster (on solaris cluster)
    Now, My question is, is it a supported configuration to have guest domains as Oracle RAC participants for 11gR2 (either with or without solaris cluster).
    If I need to configure RAC nodes on solaris cluster, is it possible to have two independent clusters on LDOM , one 2 node cluster for RAC and another 2 node cluster for apache web?
    Kindly advise
    Many thanks in advance
    Ushas Symon

  • Content Server on Sun Cluster

    Hello.
    Does anybody know, is Contenr Server can work on Sun Cluster?
    I did not find any information in notes or PAM. Technically, Sun Cluster support both Apache and MaxDB.

    HI,
    It is supported on sun cluster. You can use Apache 2.052 and MaxDB 7.6
    Anirudh,

  • Dependency between Sun Cluster 2.2 and Sun hardware ?

    I am evaluating Sun Cluster 2.2 for the usage as clustering software within our project.
    Can anybody say me, if there is a dependency between Sun Cluster 2.2 and Sun hardware ?
    Can I use Sun Cluster 2.2 with other hardware platforms running Sun Solaris 7 and Veritas Volume Manager ?

    I have had this discussion with fellow cluster admins, and we think a port of cluster 2.2 could be done to Intel running Solaris. And here is the BUT, BUT SUN does not support anything except a SPARC based cluster. So currently there is definately a depedancy bewteen the 2.2 software and the hardware.
    Hope this helps
    Heath

  • Auditing cluster events - can we see who paused a node?

    Hello
    We have a 2 node cluster with an active passive configuration - running server 2008 r2 / SQL 2008
    We had an issue today where one of the nodes was paused; and the active node had an issue with the cluster services, which ended up taking the resources offline (as you would expect)
    I fixed the issue via resuming the paused node; failing over the resources and restarting the cluster services - and bringing the resources back online
    Im trying to find out the 'why' behind the inactive node being left paused, as such i want to know if there are any events logged when a node is paused (i.e. a security event)
    I did a test scenario on the server, I paused the node but nothing is logged in any of the usual event places. Does the inactive node have to be left paused for a certian period of time before the cluster events / cluster operations log start logging things?
    Any help would be appreciated

    Hello,
    After checking the cluster related logs and systme/application logs, it appears that there're no logs recorded when pausing a cluster node.
    And generally, pausing a node is usually done when applying software updates to the node, where the recommended sequence is to move all services and applications off of the node, pause the node, then apply software updates to the node.
    You may refer to the following link for more information.
    Pause or Resume a Node in a Failover Cluster
    http://technet.microsoft.com/en-us/library/cc731291.aspx
    Best Regards,
    Sophia Sun
    Please remember to click “Mark as Answer” on the post that helps you, and to click “Unmark as Answer” if a marked post does not actually answer your question. This can be beneficial to other community members reading the thread.
    Thanks - I was afraid of that
    I know that pausing a node is only usually done when updating / patching etc, but windows updates fore this cluster are not done whatsoever - which put me down this path to begin with
    Appreciate everyones input on this

  • What kind of KVM switch can I use to monitor sun rack with 490s, 245s, and

    Hi,
    What kind of KVM switch can I use to monitor sun rack with 490s, 245s, and T2000s ?
    Ushas Symon

    It sounds like the customer and server room operator are using a Microsoft/X86 mindset. There are several selling points on using RSCs & SCs. First off are the big 3 that are easy for any good server room operator to understand: cost, space & power. If you don't have to buy a 1U Tripp-lite pull out rack KVM console ($1700) and a video card for each of your SUN servers ($400, yes SUN vid cards are $$$), you start off by saving several thousand dollars.
    Next you need to let them know that you require remote access to the systems from your desk and a standard KVM doesn't allow this (with Unix, if you have to get up from your desk then its a hardware problem). They can choose to purchase a network enabled KVM, but that adds $700 to your $1700 KVM. By using the SC you will be able to administer the system remotely.
    Third, remember that SUNs do not show all the critical boot information on a frame-buffer console output. If you are in diagnostic mode during boot, the monitor doesn't show results until it completes. If some process is stuck (missing NFS or fibre-channel hard mounts, memory tests, etc) then you need to be able to see it to fix the problem. Plus, console frame-buffers only give you 25 lines of output that you can't scroll back.
    The only small hitch is security. The new SCs (on T5220s) have SSH built in, but the older V240s and V490s use telnet which some security people balk at. The solution that we used is to put the older system SCs on a private LAN. Access to that private LAN from our intranet is via a SSH enabled dual-homed (non-routing) workstation with local user accounts. We SSH to that workstation & then telnet to the SC on the relevant server. That solution allows me to access our servers that are 40 miles away, saving response time to solve customer problems. A crude diagram is below:
    your workstation------intranet------[dual homed, non-routing workstation]------[inexpensive switch]------[server SC]
    Good luck.

  • Sun Cluster.. Why?

    What are the advantages of installing RAC 10.2.0.3 on a Sun Cluster.? Are there any benefits?

    Oracle 10g onward, there is no such burning requirement for Sun Cluster (or any third party cluster) as far as you are using all Oracle technologies for your Oracle RAC database. You should Oracle RAC with ASM for shared storage and that would not require any third party cluster. Bear inmind that
    You may need to install Sun Cluster in the following scenarios:
    1) If there is applicaiton running with in the cluster along with Oracle RAC database that you want to configure for HA and Sun Cluster provide the cluster resourced (easy to use) to manage and monitor the application. THIS can be achieved with Oracle Clusterware but you will have to write your own cluster resource for that.
    2) If you want to install cluster file system such as QFS then you will need to install the Sun Cluster. If this cluster is only running the Oracle RAC database then you can rely on Oracle technologies such as ASM, raw devices without installing Sun Cluster.
    3) Any certification conflicts.
    Any correction is welcome..
    -Harish Kumar Kalra

  • Beta Refresh Release Now Available!  Sun Cluster 3.2 Beta Program

    The Sun Cluster 3.2 Release team is pleased to announce a Beta Refresh release. This release is based on our latest and greatest build of Sun Cluster 3.2, build 70, which is close to the final Revenue Release build of the product.
    To apply for the Sun Cluster 3.2 Beta program, please visit:
    https://feedbackprograms.sun.com/callout/default.html?callid=%7B11B4E37C-D608-433B-AF69-07F6CD714AA1%7D
    or contact Eric Redmond <[email protected]>.
    New Features in Sun Cluster 3.2
    Ease of use
    * New Sun Cluster Object Oriented Command Set
    * Oracle RAC 10g improved integration and administration
    * Agent configuration wizards
    * Resources monitoring suspend
    * Flexible private interconnect IP address scheme
    Availability
    * Extended flexibility for fencing protocol
    * Disk path failure handling
    * Quorum Server
    * Cluster support for SMF services
    Flexibility
    * Solaris Container expanded support
    * HA ZFS
    * HDS TrueCopy campus cluster
    * Veritas Flashsnap Fast Mirror Resynchronization 4.1 and 5.0 option support
    * Multi-terabyte disk and EFI label support
    * Veritas Volume Replicator 5.0 support
    * Veritas Volume Manager 4.1 support on x86 platform
    * Veritas Storage Foundation 5.0 File System and Volume Manager
    OAMP
    * Live upgrade
    * Dual partition software swap (aka quantum leap)
    * Optional GUI installation
    * SNMP event MIB
    * Command logging
    * Workload system resource monitoring
    Note: Veritas 5.0 features are not supported with SC 3.2 Beta.
    Sun Cluster 3.2 beta supports the following Data Services
    * Apache (shipped with the Solaris OS)
    * DNS
    * NFS V3
    * Java Enterprise System 2005Q4: Application Server, Web Server, Message Queue, HADB

    Without speculating on the release date of Sun Cluster 3.x or even its feature list, I would like to understand what risk Sun would take when Sun Cluster would support ZFS as a failover filesystem? Once ZFS is part of Solaris 10, I am sure customers will want to use it in clustered environments.
    BTW: this means that even Veritas will have to do something about ZFS!!!
    If VCS is a much better option, it would be interesting to understand what features are missing from Sun Cluster to make it really competitive.
    Thanks
    Hartmut

  • Solaris cluster event log

    Someone please tell me there is a way to get readable data from the even log in /var/cluster/log. I'm not finding anything in my searches, so I really hope I'm just overlooking something simple. Thanks in advance for any help.

    Hi,
    there's a binary called showev4 but I do not remember from where I've downloaded it. :)
    Sun Cluster 3.x series clusters records event logs as binary log files. Log files can not be read with standard cluster commands. showev4 is a binary to read these log files.
    Usage :
    ./showev4 /var/cluster/logs/eventlog
    Sample output :
    Tue May 29 08:37:38 2007
    class: EC_Cluster subclass: ESC_cluster_gds_probe
    vendor: SUNW pub: gds pid: 2347
    cluster_id: 0x45E81ADA cluster: test3210g node: testxdb1
    ts_sec: 1180417058 ts_usec: 538717
    severity: 0 (Info) initiator: 3 (Agent)
    r_name: mysql-server
    rg_name: mysql-rg status_msg: Probe has been executed with exit code 0 [opt/SUNWscmys/bin/probe_mysql -R mysql-server -G mysql-rg -B /global/mysql -D /global/mysql-data -U mysql   -H mysql-IP -F fmuser%fmuser -L /global/mysql-data/logs -C ]
    Tue May 29 08:37:38 2007
    class: EC_Cluster subclass: ESC_cluster_gds_probe
    vendor: SUNW pub: gds pid: 2347
    cluster_id: 0x45E81ADA cluster: test3210g node: testxdb1
    ts_sec: 1180417058 ts_usec: 539543
    severity: 0 (Info) initiator: 3 (Agent)
    r_name: mysql-server
    rg_name: mysql-rg status_msg: The probe result is 0

  • Sun Cluster 3.2  without share storage. (Sun StorageTek Availability Suite)

    Hi all.
    I have two node sun cluster.
    I am configured and installed AVS on this nodes. (AVS Remote mirror replication)
    AVS working fine. But I don't understand how integrate it in cluster.
    What did I do:
    Created remote mirror with AVS.
    v210-node1# sndradm -P
    /dev/rdsk/c1t1d0s1      ->      v210-node0:/dev/rdsk/c1t1d0s1
    autosync: on, max q writes: 4096, max q fbas: 16384, async threads: 2, mode: sync, group: AVS_TEST_GRP, state: replicating
    v210-node1# 
    v210-node0# sndradm -P
    /dev/rdsk/c1t1d0s1      <-      v210-node1:/dev/rdsk/c1t1d0s1
    autosync: on, max q writes: 4096, max q fbas: 16384, async threads: 2, mode: sync, group: AVS_TEST_GRP, state: replicating
    v210-node0#   Created resource group in Sun Cluster:
    v210-node0# clrg status avs_test_rg
    === Cluster Resource Groups ===
    Group Name       Node Name       Suspended      Status
    avs_test_rg      v210-node0      No             Offline
                     v210-node1      No             Online
    v210-node0#  Created SUNW.HAStoragePlus resource with AVS device:
    v210-node0# cat /etc/vfstab  | grep avs
    /dev/global/dsk/d11s1 /dev/global/rdsk/d11s1 /zones/avs_test ufs 2 no logging
    v210-node0#
    v210-node0# clrs show avs_test_hastorageplus_rs
    === Resources ===
    Resource:                                       avs_test_hastorageplus_rs
      Type:                                            SUNW.HAStoragePlus:6
      Type_version:                                    6
      Group:                                           avs_test_rg
      R_description:
      Resource_project_name:                           default
      Enabled{v210-node0}:                             True
      Enabled{v210-node1}:                             True
      Monitored{v210-node0}:                           True
      Monitored{v210-node1}:                           True
    v210-node0# In default all work fine.
    But if i need switch RG on second node - I have problem.
    v210-node0# clrs status avs_test_hastorageplus_rs
    === Cluster Resources ===
    Resource Name               Node Name    State     Status Message
    avs_test_hastorageplus_rs   v210-node0   Offline   Offline
                                v210-node1   Online    Online
    v210-node0# 
    v210-node0# clrg switch -n v210-node0 avs_test_rg
    clrg:  (C748634) Resource group avs_test_rg failed to start on chosen node and might fail over to other node(s)
    v210-node0#  If I change state in logging - all work.
    v210-node0# sndradm -C local -l
    Put Remote Mirror into logging mode? (Y/N) [N]: Y
    v210-node0# clrg switch -n v210-node0 avs_test_rg
    v210-node0# clrs status avs_test_hastorageplus_rs
    === Cluster Resources ===
    Resource Name               Node Name    State     Status Message
    avs_test_hastorageplus_rs   v210-node0   Online    Online
                                v210-node1   Offline   Offline
    v210-node0#  How can I do this without creating SC Agent for it?
    Anatoly S. Zimin

    Normally you use AVS to replicate data from one Solaris Cluster to another. Can you just clarify whether you are replicating to another cluster or trying to do it between a single cluster's nodes? If it is the latter, then this is not something that Sun officially support (IIRC) - rather it is something that has been developed in the open source community. As such it will not be documented in the Sun main SC documentation set. Furthermore, support and or questions for it should be directed to the author of the module.
    Regards,
    Tim
    ---

  • TimesTen database in Sun Cluster environment

    Hi,
    Currently we have our application together with the TimesTen database installed at the customer on two different nodes (running on Sun Solaris 10). The second node acts as a backup to provide failover functionality, although right now only manual failover is supported.
    We are now looking into a hot-standby / high availability solution using Sun Cluster software. As understood from the documentation, applications can be 'plugged-in' to the Sun Cluster using Agents to monitor the application. Sun Cluster Agents should be already available for certain applications such as:
    # MySQL
    # Oracle 9i, 10g (HA and RAC)
    # Oracle 9iAS Application Server
    # PostgreSQL
    (See http://www.sun.com/software/solaris/cluster/faq.jsp#q_19)
    Our question is whether Sun Cluster Agents are already (freely) available for TimesTen? If so, where to find them. If not, should we write a specific Agent separately for TimesTen or handle database problems from the application.
    Does someone have any experience using TimesTen in a Sun Cluster environment?
    Thanks in advance!

    Yes, we use 2-way replication, but we don't use cache connect. The replication is created like this on both servers:
    create replication MYDB.REPSCHEME
    element SERVER01_DS datastore
    master MYDB on "SERVER01_REP"
    transmit nondurable
    subscriber MYDB on "SERVER02_REP"
    element SERVER02_DS datastore
    master MYDB on "SERVER02_REP"
    transmit nondurable
    subscriber MYDB on "SERVER01_REP"
    store MYDB on "SERVER01_REP"
    port 16004
    failthreshold 500
    store MYDB on "SERVER02_REP"
    port 16004
    failthreshold 500
    The application runs on SERVER01 and is standby on SERVER02. If an invalid state is detected in the application, the application on SERVER01 is stopped and the application on SERVER02 is started.
    In addition to this, we want to fail over if the database on the SERVER01 is in invalid state. What should we have monitored by the Clustering Agent to detect an invalid state in TT?

  • Close/shutdown the Sun Cluster Package/resource Group

    Hi,
    I have a SUN cluster system.
    I want to know what script do when the SUN cluster shutdown the package "app-gcota-rg" as I may need to modify it ?? Where can I find out this information in the system??
    In which directory and log file ???
    Any suggestion ???
    Resource Groups --
    Group Name Node Name State
    Group: ora_gcota_rg ytgcota-1 Online
    Group: ora_gcota_rg ytgcota-2 Offline
    Group: app-gcota-rg ytgcota-1 Online
    Group: app-gcota-rg ytgcota-2 Offline

    Hi,
    you would first find out which resources belong to app-gcota-rg.
    Do a "clrs list -g app-gcota-rg". Then find out which of the resource is the one dealing with your application. Then try to find out its resource type:
    "clrs show -v <resource name>| fgrep Type". If it is a standard type like HA Oracle, it is an extremely bad idea to hack the scripts, as you'll lose support. If type is SUNWgds, the scripts to start, stop and monitor the application are user supplied. You can find their pathnames using:
    "clrs show -v <resource-name>| fgrep _command". This should display full pathnames.
    Regards
    Hartmut

  • Sun Cluster question

    Hello everyone
    I've inherited an Oracle Solaris system holding ASE Sybase databases. The system consists of two nodes inside a Sun Cluster. Each of the nodes is hosting 2 Sybase database instances, where one of the nodes is active and other is standing by. The scenario at hand is that when any of the databases on one node fails for whatever reason, the whole system gets shifted to the second node to keep the environment going. That works fine.
    My intended scenario:
    Each node is holding 2 database instances, both nodes ARE working at the same time so that each one is serving one instance of the database. In the event of failure on one node, the other one should assume the role of BOTH database instances till the first one gets fixed.
    The question is: is that possible? and if it is, does that require breaking the whole cluster and rebuilding it? or can this be done online without bringing down the system?
    Thanks a lot in advance

    What you propose will not work either. E.g. there is no logic implemented to fence the underlying zpool from one node to the other in such a configuration.
    Also the current SUNW.HAStoragePlus(5) manpage document:
            Note -   SUNW.HAStoragePlus does not support  file  sys-
                     tems created on ZFS volumes.
                     You cannot use SUNW.HAStoragePlus  to  manage  a
                     ZFS storage pool that contains a file system for
                     which the ZFS  mountpoint  property  is  set  to
                     legacy or none.[...]
    Greets
    Thorsten

  • Sun Cluster and SMF

    Folks,
    Did you come across any products, who SMFied their product and how it impacts their cluster installations ?. Please provide details if you have any
    Thanks mUch
    _D                                                                                                                                                                                                                                                                                                                                                               

    Since I am currently not aware of third party products shipping an SMF service for their application (which does not mean much, ie. they could well exist) I just make two generic statements:
    1) If for that application a standard Sun Cluster agent exists, then the instructions for installing this data service would include to disable the SMF on the cluster nodes (like on pre Solaris 10 systems the legacy runlevel script would need to get deactivated).
    2) If for that application no standard Sun Cluster agent exists, then one would have the ability to leverage the new proxy SMF resource types coming with Sun Cluster 3.2. They would use the SMF manifest to start and stop the application. Note that by using this resource type you would not have any application fault monitoring.
    Details to 2) can be found at
    http://blogs.sun.com/SC/entry/making_smf_services_highly_available and
    http://docs.sun.com/app/docs/doc/819-2974/6n57pdk2b?a=view
    Greets
    Thorsten
    Message was edited by:
    Thorsten.Frueauf

  • How Can I create a cluster of two instances running on the same server?

    Hello,
    I can't find how to create a cluster or even a failover rule for my two instances running on the same server ODSEE 11gR1.
    Could you please help?
    Thanks,

    Hello,
    Clustering (intended as Sun Cluster, Veritas Cluster, etc... ) IS NOT SUPPORTED since 7.0:
    http://docs.oracle.com/cd/E20295_01/pdf/821-1216.pdf [page 20]
    and, based on my own experience, is the WORST way of achieving high availability of an LDAP Directory Service and an almost GRANTED way to get replication issues in your topology (lock, split, re-init, etc...).
    Please use instead the 'application-level' clustering that you get out of the box with the Multi-Master Replication.
    http://docs.oracle.com/cd/E20295_01/html/821-1217/index.html
    http://docs.oracle.com/cd/E20295_01/html/821-1220/fhkry.html#scrolltoc
    HTH,
    Marco

Maybe you are looking for