UDLM rac-udlm-rs failing to come online

Hi all,
I'm setting up a zonecluster that will be running RAC. This has gone well, up to the point where, using clsetup, I attempt to create the rac-framework-rg (and associated rac-udlm-rs, rac-framework-rs).
In attempting to create the resourcegroup, the rac-udlm-rs fails to come online, causing the rest of the framework to fail.
The error in the logfile for UDLM (/var/cluster/ucmm/dlm_<zone>/logs/dlm.log is:
CONNECT ERROR: 'Invalid argument', family=0, port=6009, in=0.0.0.0
This repeats 5 times per second on both zonecluster nodes. regularly the error changes port (i.e. it cycles through the ports).
I've tried rebuilding the zonecluster, to no avail. I am at a loss as to what the error might be, or where to start looking, since there is no evidence anyone has ever hit this error before.
Any suggestions on where to look and what I can try to fix this is will be appreciated.
Possibly related, or even the cause, is that the zonecluster has been configured with:
add net
     address=xxx.xxx.xxx.xxx
end
at the zonecluster level, but these addresses are not showing up under 'ifconfig -a'. Maybe I'm misunderstanding how they are used, and when they should show up though.
Some notes:
the zonecluster in all other respects seems fine - I have QFS resources configured and these appear to successfully work in all circumstances.
the ORCLudlm package is installed (the one that comes with Oracle 11.2.0.3)

Right, in case this helps someone else in future:
I found the problem.   In a fully working zonecluster configuration the /etc/nsswitch.conf file includes these lines:
hosts:  cluster files
netmasks: cluster files
but for sad, sad, reasons on the system I was setting up this file had:
hosts:  files
netmasks: files
and you know what, this allows you to go:
ping clusternode1-priv
and this does a lookup and resolves correctly. Now, if this doesn't resolve correctly, basically the UDLM daemon cannot communicate with the other cluster nodes, because it must do a nameserver lookup on clusternode1-priv internally.
So, in essence, make sure your nsswitch.conf file looks up the cluster first for hostname lookups.
This issue shows itself if you do a traceroute to the IP of the other cluster node (from one of the zonecluster nodes) using the private interconnect IP address. In a working system it'll show the name (clusternode1-priv). In a non-working system it shows the IP only.

Similar Messages

  • Cable Modem fails to come online ?

    Hi,
    Cable modem fails to come online
    Never gets beyond state init(i) wend I try to boot
    other than defaul file ??
    cisco CMTS uBR7114E IOS-Version: 12.1(13)EC3
    Cable modem Motorola SB5100
    ----- CMTS Konfig ----
    cable config-file platinum.cm
    service-class 1 max-upstream 256
    service-class 1 guaranteed-upstream 256
    service-class 1 max-downstream 2250
    service-class 1 max-burst 1600
    cpe max 2
    timestamp
    !- This config not work ? -
    ip dhcp pool cm-0011.1a04.9944
    host 10.10.6.63 255.255.255.0
    client-identifier 01.111a.0499.44
    bootfile silver.cm

    Are you getting any errors when you run the "debug cable mac log ". Under normal conditions ,when you run the "debug cable mac log" command and you see nothing, it means that the modem has completed the process of coming online. If you run the "show controller cable 0 mac state" command, you should see the MAC state set to "maintenance_state", which is code in the DOCSIS world for "online". Check if you are getting any errors and if yes what is the error message that you find...

  • SCC Cluster name failed to come online on perticular node

    Hi all,
     I'm working on a 2-node  SCC cluster that I've had up and running for quite a while.  For some reason now, I cannot move one of my CMS' to a particular node.  I am getting two events in the event viewer: 1207 & 1069. 
    1207 from event viewer is:
    Cluster network name resource 'Network Name (EXG)' cannot be brought online. The computer object associated with the resource could not be updated in domain 'ADI.local' for the following reason:
    Unable to obtain the Primary Cluster Name Identity token.
    The text for the associated error code is: An attempt has been made to operate on an impersonation token by a thread that is not currently impersonating a client.
    The cluster identity 'CLUS$' may lack permissions required to update the object. Please work with your domain administrator to ensure that the cluster identity can update computer objects in the domain.
    "EXG" is Exchange application instance (CMS)Network name which is failing on perticular node. Find cluster log below for more info
     So, my question is, does anyone know what needs to be done in Active Directory to rectify this problem? 
    -------- Cluster log ----------
    00001824.000025f4::2012/10/16-23:39:11.550 WARN  [RES] Network Name <Network Name (EXG)>: Trying to remove credentials for LocalSystem returned status C0000225, STATUS_NOT_FOUND is a non-critical failure for a remove operation
    00001824.000025f4::2012/10/16-23:39:11.613 INFO  [RES] Network Name <Network Name (EXG)>: Initiating the Network Name operation : 'Verifying computer object associated with network name resource EXG'
    00001824.000025f4::2012/10/16-23:39:11.613 INFO  [RES] Network Name <Network Name (EXG)>: Trying to find computer account EXG object GUID(6a0d9900d2122d4480ea5acc1653e0be) on any available domain controller.
    00001824.000025f4::2012/10/16-23:39:11.738 INFO  [RES] Network Name <Network Name (EXG)>: Found computer account EXG on domain controller
    domain.com 00001824.000025f4::2012/10/16-23:39:11.738 INFO  [RES] Network Name <Network Name (EXG)>: Trying to obtain the VSToken for Core Cluster Name resource
    00001824.000025f4::2012/10/16-23:39:11.784 ERR   [RES] Network Name <Network Name (EXG)>: Can't acquire crypto context for container 814f67ea-6f38-41ed-a331-a421cb4de9cc-Netname Resource Data with provider "1\Microsoft Enhanced Cryptographic
    Provider v1.0". status 2148073494.
    00001824.000025f4::2012/10/16-23:39:11.784 ERR   [RES] Network Name <Network Name (EXG)>: Unable to decrypt Core netname resource's ResourceData, status 2148073494.
    00001824.000025f4::2012/10/16-23:39:11.784 INFO  [RES] Network Name <Network Name (EXG)>: GetCoreNetnameObject_VSToken returning status 2148073494
    00001824.000025f4::2012/10/16-23:39:11.784 ERR   [RES] Network Name <Network Name (EXG)>: This Netname resource can not be brought online, Failed getting token for CNO
    00001824.000025f4::2012/10/16-23:39:11.800 WARN  [RES] Network Name <Network Name (EXG)>: Trying to remove credentials for LocalSystem returned status C0000225, STATUS_NOT_FOUND is a non-critical failure for a remove operation
    00001824.000025f4::2012/10/16-23:39:11.800 ERR   [RHS] Online for resource Network Name (EXG) failed.

    Hello,
    How this is done ? I couldn't find any documentation, and i struggle with the issue
    "ERR   [RES] Network Name <Cluster Name>: Can't acquire context handle for container 24263d98-4b17-4cff-866c-80f90ea0623f-Netname Resource Data with provider "1\Microsoft Enhanced Cryptographic
    Provider v1.0". status 0X80090016."
    ERR   [RES] Network Name <Cluster Name>: Unable to Decrypt the password. status 2148073494."
    Thanks for the help

  • Failover Cluster 2012 Network Name fails to come online.

    Hi,
    I created a new one node Windows 2012 failover cluster.  The cluster was created successfully, but configuring Client Access Point  finished without creating VCO in Active Directory. Then I created computer object in the same OU
    with cluster according article
    http://technet.microsoft.com/en-us/library/cc732035(v=ws.10).aspx 
    Permissions on OU and  VCO for cluster account where escalated to FULL, DNS A and PTR records where created, quota related to creating computer objects was increased, but has fixed my problem: events 1194 and 1069 are generated on attempt to online
    network name. Following one guide, I tried to find the registry key for network name resource that corresponded with the GUID of the object of VCO, but it was absent here.
    I've investigated everything I could find and found no solution.
    Any help would be appreciated.

    Hi,
    Please install the Recommended hotfixes and updates for Windows Server 2012-based failover clusters update first, then try again.
    Recommended hotfixes and updates for Windows Server 2012-based failover clusters
    http://support.microsoft.com/kb/2784261
    Hope this helps.
    We
    are trying to better understand customer views on social support experience, so your participation in this
    interview project would be greatly appreciated if you have time.
    Thanks for helping make community forums a great place.

  • Multi-lingual Error: The translation failed because the online translation service was unavailable

    Hi,
    We need to implement multi-lingual functionality in SharePoint 2013 on premise server. We have implemented the functionality by referring the below URL.
    http://blogs.technet.com/b/sharepoint_quick_reads/archive/2013/08/12/sharepoint-2013-variations-creating-site-and-variation-labels.aspx
    But after this once we translate any page/document, we are getting below error.
    Error: The
    translation failed because the online translation service was unavailable. Please resubmit this file for translation. If the files fails again with this error message, contact system administrator.
    Machine
    We have configured the internet over SharePoint server, also referred below url, but
    no success.
    http://blogs.msdn.com/b/weslbo/archive/2012/11/07/sharepoint-2013-machine-translations-the-translation-failed-because-the-online-translation-service-was-unavailable.aspx
    http://technet.microsoft.com/en-us/library/jj729796(v=office.15).aspx
    https://social.msdn.microsoft.com/Forums/office/en-US/dcd0e1d3-2b26-41fa-ad07-77fe234fbc23/machine-translation-service?forum=sharepointdevelopment
    If you have faced a similar issue and has any information on how to resolve this issue or how to achieve multi-lingual functionality or configure machine translation service in SharePoint 2013,
    it will be great help.
    Regards,
    Shailendra Gupta

    Hi Shailendra
    Please check this below
    http://blogs.msdn.com/b/weslbo/archive/2012/11/07/sharepoint-2013-machine-translations-the-translation-failed-because-the-online-translation-service-was-unavailable.aspx
    https://social.technet.microsoft.com/Forums/sharepoint/en-US/a7bf9604-d652-4566-aac5-6bcb10dd6ce5/failed-because-there-was-no-translation-available-online-translation-service?forum=sharepointgeneral
    Please remember to click 'Mark as Answer' on the answer if it helps you

  • Printer connected to Airport Express USB port won't come online after firmware 7.6 upgrade. I've tried backing up to 7.5.2, but it still shows offline. It is a Canon iP4600 printer and I'm running OS 10.6.8. I've powered everything down and still no go.

    Printer connected to Airport Express USB port won't come online after firmware 7.6 upgrade. I've tried backing up to 7.5.2, but it still shows offline. It is a Canon iP4600 printer and I'm running OS 10.6.8. I've powered everything down and still no go. I've connected the printer directly to my MacBook's USB port and it works fine. Any suggestions?

    I have an iP4500, which I assume is quite similar to your model.
    I had to reset the printing system to get things printing again. Only takes a few minutes and no problems at all since I did it a week ago or so.
    Plug the printer into the USB port on the Express and restart the printer.
    Open System Preferences (gear icon on the dock) and then open Print and Fax
    Right-Click in the printer area on the left and select Reset Printing System
    Then click the + (plus) button at the bottom of the printer list to install the printer again.

  • Domain Controller require ADC to sync before DC come online

    Hi,
    I have an urgent query that my domain controller I think might not working properly. The reason for this if I restart my DC and ADC and when try to open Active Directory console it gives error until the ADC comes online completely. I don't know why DC is
    doing such behavior that depend on ADC. The all FSMO roles are on DC and ADC is only DNS and GC.
    Kindly suggest what is the cause of this issue.
    Please help

    Hello,
    What is the error message says ? what is the error you see in event viewer ?
    Thanks
    Dishan M. Francis
    MVP – Directory Services
    Dishan M. Francis www.rebeladmin.com

  • Volfs in virtual machine doesn't come online

    Hi,
    I am running Solaris10 inside Virtual Box 2.1.0.
    Even though I am mounting the ISO image for vol2 of Solaris10 installation, when the system reboots, the system doesn't recognize the cdrom and I have to skip installing the rest of Solaris cd's.
    The volfs doesn't exist and svcadm enable says that it doesn't see "volfs" pattern ...
    what the heck is going on?
    Does anyone know on this forum?
    Please help ...
    Thanks

    mario_garcia wrote:
    Hello
    i have been trying to mount a cd in a virtual machine but volfs is disabled and it just doesn't come online.
    i do svcadm enable volfs but it still remains offline
    I try with this : svcadm enable -s volfs
    svcadm: Instance "svc:/system/filesystem/volfs:default" has unsatisfied dependencies So what are the dependencies? What does svcs -xv show?
    Darren

  • PDK 6.0 SP3 deployment failed on com.sap.pct.pdk.pct.epa

    Hello,
    I am trying to deploy Portal Development Kit 6.0 SP3 on WAS 6.40 (J2EE
    6.30) with EP 6.0 SP3 and it is failing on the last package with the
    error below. It is HP-Unix 11i system and the same PDK file
    (20040506.sca) was used with no issues on Windows based WAS and EP same
    releases.
    Please, advise.
    Vladimir
    04/08/06 17:51:47 - Start updating archive file...
    04/08/06 17:51:47 - Archive file updated successfully for 102ms.
    04/08/06 17:51:47 - Start deploying ...
    04/08/06 17:51:48 - Archive file uploaded to server for 327ms.
    04/08/06 17:52:02 - ERROR: Not deployed. Deploy Service returned
    ERROR:
    java.rmi.RemoteException: Cannot deploy
    application sap.com/com.sap.pct.pdk.pct.epa..
    Reason: <--Localization failed:
    ResourceBundle='com.sap.engine.services.deploy.DeployResourceBundle',
    ID='Exception while deploying:
    com.sap.portal.transport.deploy.IncompleteDeploymentException: The
    archive ./temp/deploy/work/deploying/com.sap.pct.pdk.pct.epa.epa could
    not be deployed completely. Import failed for 2 of 13 objects.
    [Unexpected exception during import.
    [com.sapportals.portal.transport.EptFile]<init>:
    transport file not
    found: /usr/sap/HP1/JC00/j2ee/temp/pcd/transport/IMPORT-
    0806_175148_720_97add7f324f87b25/EPT/com.sap.pct.pdk.personalizationexam
    ple.ept, Unexpected exception during import.
    [com.sapportals.portal.transport.EptFile]<init>:
    transport file not
    found: /usr/sap/HP1/JC00/j2ee/temp/pcd/transport/IMPORT-
    0806_175148_720_97add7f324f87b25/EPT/com.sap.pct.pdk.WebDynpro.ept, ]',
    Arguments: []--> : Can't find resource for bundle
    java.util.PropertyResourceBundle, key Exception while deploying:
    com.sap.portal.transport.deploy.IncompleteDeploymentException: The
    archive ./temp/deploy/work/deploying/com.sap.pct.pdk.pct.epa.epa could
    not be deployed completely. Import failed for 2 of 13 objects.
    [Unexpected exception during import.
    [com.sapportals.portal.transport.EptFile]<init>:
    transport file not
    found: /usr/sap/HP1/JC00/j2ee/temp/pcd/transport/IMPORT-
    0806_175148_720_97add7f324f87b25/EPT/com.sap.pct.pdk.personalizationexam
    ple.ept, Unexpected exception during import.
    [com.sapportals.portal.transport.EptFile]<init>:

    Apparently, there is a bug in the .sca file that prevents it from deployment on UNIX servers.
    SAP is working on the patch, but meantime we can use the following workaround:
    unzip the .sca file
    unzip the com.sap.pct.pdk.pct.epa.epa from the .sca
    Copy all .ept files contained into the WebAS folder <(>
    <<)>WASROOT>/JC00/j2ee/cluster/server0/apps/sap.com/irj/servlet_jsp/irj/root/web-inf/deployment/pcdcontent
    restart the server.
    Regards,
    Vladimir

  • Can't execute command - drive didn't come online; check configuration/hardw

    I have a Scalar i500 on a Linux host attached via fiber channel. I have everything working accept automounting the drive. I can't seem to get it working.
    I can use the web interface of the i500 and then put a tape in the drive. I can then mount the drive from Secure Backup and run a back up just fine. Yet, I can't run a backup without a tape mounted. If I choose "Load Volume (Drive).... I get the error "can't execute command - drive didn't come online; check configuration/hardware"
    I can inventory the library and do just about anything except automount the drives. I run a verify and the configuration comes back clean with no errors. I have tried using the SCSI params in the obl0 and obt0 ,and obt2 with no success. I have tried using the raw /dev/sg paths with no success. Same error.
    This is a upgraded installation from 10.1 to 10.3. I have tried doing a clean install with same issue. I have restored the installation and then run a upgrade with the same issue.
    I am stuck. Please help :)
    Thanks

    It appears you may have the DTE's swapped.
    Swap the DTE #'s assigned to the drive device objects and try again. This could happen during an upgrade if you reassigned scsi settings and the obt0 and obt1 devices swapped scsi settings from when they ran things last.

  • How do I set an alert to let me know when someone comes online in Messages?

    I was pretty sure that in iChat there was a feature to alert me when someone comes online. I can't seem to find this in Messages in Mountain Lion. Does anyone know where that went?
    Sniffles

    for the Points
    8:14 pm      Friday; October 11, 2013
      iMac 2.5Ghz 5i 2011 (Mountain Lion 10.8.4)
     G4/1GhzDual MDD (Leopard 10.5.8)
     MacBookPro 2Gb (Snow Leopard 10.6.8)
     Mac OS X (10.6.8),
     Couple of iPhones and an iPad
    "Limit the Logs to the Bits above Binary Images."  No, Seriously

  • I'm connected to Wi-Fi, but my NEW MacBook Pro won't come online.  This happened right after I transferred my data from my old Mac.

    I'm connected to Wi-Fi, but my NEW MacBook Pro won't come online.  This happened right after I transferred my data from my old Mac.

    The warranty entitles you to complimentary phone support for the first 90 days of ownership.
    If you bought the product in the U.S. directly from Apple (not from a reseller), you have 14 days from the date of delivery in which to exchange or return it for a refund. In other countries, the return policy may be different. If you bought from a reseller, its return policy applies.

  • My Firefox browser at home does not come online. I tried reinstalling, but it will not come online.

    # Question
    My Firefox browser at home does not come online. I tried reinstalling, but it will not come online.
    IE works fine. Microsoft outlook mail works fine. Norton anti virus works fine. Therefore I can access the internet.

    Reset your Firewall permissions for Firefox.
    http://support.mozilla.com/en-US/kb/Firewalls

  • How do I hide from my contacts when I come online

    Is it possible to hide from my contacts when I go online?  I don't change to invisible etc, I mean the actual point when I log on.  Currently even if I am set to invisible, my contacts who have "notify me when my contacst come online" activated still see when I come online.  Kind of defeats the whole being invisible point.

    joshkreger wrote:
    Skype-User wrote:
    I think when you sign in as invisible, your contacts will not be notified that you logged in.
    Tried that but didn't work. My contacts still know when I come online even if I sign in as invisible.
    how did you know?  I think the only feature included in Skype today is the feature to notify your contacts when you login as online, not when you login as invisible (which will appear as offline).
    ...and just tested it and still working in my case (latest Skype for windows version)
    IF YOU FOUND OUR POST USEFUL THEN PLEASE GIVE "KUDOS". IF IT HELPED TO FIX YOUR ISSUE PLEASE MARK IT AS A "SOLUTION" TO HELP OTHERS. THANKS!
    ALTERNATIVE SKYPE DOWNLOAD LINKS | HOW TO RECORD SKYPE VIDEO CALLS | HOW TO HANDLE SUSPICIOS CALLS AND MESSAGES

  • Notify me when a specific user comes online

    As said in subject, seems to be reasonably better feature, than notify me when users come online.
    Can be implemented via mouse menu over contact. Same as here:
    http://community.skype.com/t5/Windows-desktop-client-Ideas/Pop-up-notification-when-a-specific-user-...
    Vlad.

    or even better, "notify me once when this user comes online" with missed notification on skype tray icon if notification was missed. Would be a great feature.
    (And yes, please reduce skype files size and and all that social posts integration, much better if social integration would be a super cool option just to call people over a stable 3g connection)

Maybe you are looking for

  • Webutil runs in my Dev env but errors out in my Prod env

    I have the Webutil demo working fine in my Dev environment, but I keep getting errors in my production environment. Here is the output from the java console. java.io.IOException: javax.net.ssl.SSLException: SSL handshake failed: X509CertChainInvalidE

  • IPhoto 09 8.1.1 update: no longer able to sync photos with AppleTV

    I have Itunes 9.0.2 (25) and just updated iPhoto 8.1.1 (419) and when I try to synch with my AppleTv 3.0.1 I get an error -50 trying to sync the photos. If I unselect photo sync it will work sync just fine. Also I am now unable to select individual e

  • ADF Faces RichTable doesn't validate but still display changed data

    Hello, I think I miss something in the ADF Faces lifecycle management. Can you help me? In a adf faces/jsf page, 1- I have a RichTable which DataModel is based on a List<Department> stored in a managed bean (session). 2- I have a button "Cancel" with

  • Extract Data from essbase to ODI

    Hello , I'm trying to extract data from Essbase cube to Oracle . Reverse engineering successfully completed , When running the interface I'm getting the following error : "com.hyperion.odi.essbase.ODIEssbaseException: com.hyperion.odi.essbase.ODIEssb

  • Start services connector connect to sap gateway failed

    hi experts when i want to start services connector saprouter saprouter string :/H/172.16.2.11/S/3299 a error window say "connect to sap gateway failed error  partner 'host:3299' not reached" where is the possible problem here ? regards ying xie