Error disabled. Reason:DCX-No ACK in 100 PDUs

Hi,
  I have a customer who lost all connectivity from the ESX host for both networking and FCoE because (as the title suggests) the interfaces were error disabled.  This happened across all 8, dual ported, dual homed CNAs at the same time.  Does anyone have any idea what this error comes from?  The are using ESX 4.0 and are running Nexus 5020 with 4.2(1)N2(1a).
Thanks,
Thom

DCBX Type Length Values(TLV) are packaged within a LLDP frame which  is exchanged between the switch and the CNA. One such Control Sub-TLV is  used for ACK which is sequence based. For example, the switch sends  this control Sub-TLV with SeqNo of 1 and AckNo of 2. The host is  supposed to inverse this and send a LLDP frame with control sub-TLV with  SeqNo of 2 and AckNo of 1.
We expect this exchange every 30 seconds from the host and if the  switch does not see it for 100 times 30 which is 3000 seconds (or 50  minutes), the switch error disables with following error
2011 May 13 12:03:23 CSX_5020_A1 %ETHPORT-2-IF_DOWN_ERROR_DISABLED: Interface Ethernet115/1/17 is down (Error disabled. Reason:DCX-No ACK in 100 PDUs)
2011 May 13 12:03:27 CSX_5020_A1 %ETHPORT-2-IF_DOWN_ERROR_DISABLED: Interface Ethernet116/1/16 is down (Error disabled. Reason:DCX-No ACK in 100 PDUs)
Some commands on the switch which helps in narrowing down root cause.
F340.24.10-5548-1# show lldp interface ethernet 1/22
Interface Information:
  Enable (tx/rx/dcbx): Y/Y/Y    Port Mac address: 00:05:73:ab:29:bd
Peer's LLDP TLVs:
Type Length Value
001  007    040000c9 9d2372
002  007    030000c9 9d2372
003  002    0078
006  045    456d756c 6578204f 6e65436f 6e6e6563 74203130 4762204d 756c7469
            2066756e 6374696f 6e204164 61707465 72
007  004    00800080
127  055    001b2102 020a0000 00000002 00000001 04110000 c0000001 00003232
            00000000 00000206 060000c0 00080808 0a0000c0 00890600 1b2108
000  000   
F340.24.10-5548-1# show lldp dcbx interface ethernet 1/22
Local DCBXP Control information:
Operation version: 00  Max version: 00  Seq no: 1  Ack no: 2  <<---Our sequence # and Ack #
Type/
Subtype    Version    En/Will/Adv Config
003/000     000        Y/N/Y      0808
004/000     000        Y/N/Y      8906001b21 08
002/000     000        Y/N/Y      0001000032 32000000 00000002
Peer's DCBXP Control information:
Operation version: 00  Max version: 00  Seq no: 2  Ack no: 1  <<---Peer sequence # and Ack # should be reversed.
Type/      Max/Oper
Subtype    Version    En/Will/Err Config
002/000     000/000    Y/Y/N      0001000032 32000000 00000002
003/000     000/000    Y/Y/N      0808
004/000     000/000    Y/Y/N      8906001b21 08
F340.24.10-5548-1#
Root cause for this problem in most cases is misbehaving CNA/server or incorrect firmware/driver on the CNA.

Similar Messages

  • Nexus 5500 - Fabricpath Core Port - Error disabled. Reason:DCX-No ACK in 100 PDUs

    Has anyone seen Fabricpath Core Interfaces between two Nexus 5596UP switches error-disabled because of missing DCBX Acks after 50mins?
    I do not see interface errors and the peer is another 5500.
    Both switches are running 5.1(3)N2(1) with this port config:
    int e1/3
    switchport mode fabricpath
    ! Cisco 5m Twinax cables
    Log messages
    2012 May 25 17:40:59 nexus1 %L3VM-5-FP_TPG_INTF_DOWN: Interface Ethernet1/3 down in fabricpath topology 0 - Interface down
    2012 May 25 17:40:59 nexus1 %ETHPORT-5-IF_DOWN_NONE: Interface Ethernet1/3 is down (None)
    2012 May 25 17:40:59 nexus1 %ISIS_FABRICPATH-5-ADJCHANGE:  isis_fabricpath-default [3365]  P2P adj L1 nexus5 over Ethernet1/3 - DOWN (Delete All) on MT-0
    2012 May 25 17:40:59 nexus1 %CDP-5-NEIGHBOR_REMOVED: CDP Neighbor nexus5(FOX1550GDH1) on port Ethernet1/3 has been removed
    2012 May 25 17:40:59 nexus1 %LLDP-5-SERVER_REMOVED: Server with Chassis ID 547f.ee63.fa88 Port ID Eth1/1 on local port Eth1/3 has been removed
    2012 May 25 17:40:59 nexus1 %ETHPORT-2-IF_DOWN_ERROR_DISABLED: Interface Ethernet1/3 is down (Error disabled. Reason:DCX-No ACK in 100 PDUs)
    Robert

    Can you send the output of
    show lldp interface ethernet 1/3
    show lldp dcbx interface ethernet 1/3
    a workaround may be to disable lldp on both sides on these physical interfaces

  • N5K err-disabled a port due to DCX-No ACK in 100 PDUs

    I'm super new to Nexus, so I'm not really sure how to troubleshoot this. I did a quick search and found that this is related to DCBX TLVs in LLDP, which we apparently shouldn't be getting on a regular ethernet port. I'm pretty sure this is just a regular ethernet port. (Like I said, I'm pretty new to nexus. lol)  I wonder if the following output indicates that we are receiving and sending DCBX TLVs in LLDP. If so, it sounds like the interface will go into err-disable state if the server stops sending ACK frames.
    # show lldp dcbx int e1/17
    Local DCBXP Control information:
    Operation version: 00  Max version: 00  Seq no: 1  Ack no: 1 
    Type/
    Subtype    Version    En/Will/Adv Config
    004/000     000        Y/N/Y      8906001b21 08
    002/000     000        Y/N/Y      0000000064 00000000 00000001
    Peer's DCBXP Control information:
    Operation version: 00  Max version: 00  Seq no: 1  Ack no: 1 
    Type/      Max/Oper
    Subtype    Version    En/Will/Err Config
    004/000     000/000    Y/Y/N      8906001b21 08
    003/000     000/000    Y/Y/Y      ff08
    002/000     000/000    Y/Y/N      ffffffff00 00000000 00000008
    This is the first time we've run into this. Any idea what might really be going on?
    Thanks!

    Hi,
    Per the display it  the Server Adapter is doing LLDP DCBX negotiation.
    If not needed, then you might want to check the Server adapter settings.
    DCBX is an extension of LLDP link layer discovery protocol; not restricted for FCOE usage.
    Further informatin on LLDP for Nexus:
    http://www.cisco.com/en/US/docs/switches/datacenter/nexus5500/sw/layer2/602_N1_1/b_5500_Layer2_Config_602N11_chapter_01010.html#task_1152779
    Thanks!
    Regards,
    Carlos

  • (Error disabled. Reason:Disabled by Server Mgr triggered)

    I´ve some ports in my Nexus 5k going to err-disable with the following message:
    (Error disabled. Reason:Disabled by Server Mgr triggered)
    These ports are connected in HPBlade 7000 through FEX Nexus B22 does someone know about this errors ? 

    Hi,
    Not sure if you already found the root cause of the issue,but this message generally comes from blade FEX's when there is an internal communication error or no connection between the FEX HIF's and the server chassis/software. This might come when a port is made "admin up" while is not configured or mapped blade from server perspective.
    Thanks,
    Ivan.

  • Vfc error disabled because of Ethernet port down?

    We are building a new datacenter and is starting to set up the SAN.
    This small DC consist of 2 Nexus 5596UP, a Compellent SAN and 2 Dell M1000e cassis with blade servers.
    naive FC part in Nexus has been easy to set up but we have problems with the FCOE setup.
    Since the blades are not installed yet none of the Ethernet ports are up. As soon as we bind an ethernet port to a vfc the vfc goes to error disable with this message:
    %PORT-2-IF_DOWN_ERROR_DISABLED: %$VSAN 1900%$ Interface vfc101 is down (Error disabled)
    Config snippets:
    vlan 1900
      fcoe vsan 1900
      name VSAN-A-FCOE
    vlan 1901
      name VSAN-B-FCOE
    vsan database
      vsan 1900 name "VSAN-A"
      vsan 1901 name "VSAN-B"
    interface vfc101
      bind interface Ethernet1/1
      no shutdown
    vsan database
      vsan 1900 interface vfc101
      vsan 1900 interface fc2/13
      vsan 1900 interface fc2/14
      vsan 1900 interface fc2/15
      vsan 1900 interface fc2/16
    interface Ethernet1/1
      switchport mode trunk
      spanning-tree port type edge trunk
    vlan 1900
      fcoe vsan 1900
      name VSAN-A-FCOE
    vlan 1901
      name VSAN-B-FCOE
    vsan database
      vsan 1900 name "VSAN-A"
      vsan 1901 name "VSAN-B"
    vsan database
      vsan 1900 interface vfc101
      vsan 1900 interface fc2/13
      vsan 1900 interface fc2/14
      vsan 1900 interface fc2/15
      vsan 1900 interface fc2/16
    interface vfc101
      bind interface Ethernet1/1
      no shutdown
    interface Ethernet1/1
      switchport mode trunk
      spanning-tree port type edge trunk
    Do the vfc go error disable because the ethernet interface being down? I was under the impression it should just go "down"

    What you are seeing is normal.. Here are outputs from my lab switch where I shut Eth1/20 which is tied to vFC 20
    24.10.5020A.1(config)# int ethernet 1/20
    24.10.5020A.1(config-if)# shut
    2011 Oct 21 10:16:01 24 %ETHPORT-5-IF_DOWN_CFG_CHANGE: Interface Ethernet1/20 is down(Config change)
    2011 Oct 21 10:16:01 24 %FLOGI-5-MSG_PORT_LOGGED_OUT: %$VSAN 100%$ [VSAN 100, Interface vfc20: mode[TF]] Nx Port 21:00:00:c0:dd:12:0e:35 logged OUT.
    2011 Oct 21 10:16:01 24 %PORT-5-IF_PORT_QUIESCE_FAILED: Interface vfc20 port quiesce failed due to failure reason: Epp Not Supported by Peer (0x19d)
    2011 Oct 21 10:16:01 24 %PORT-5-IF_DOWN_NONE: %$VSAN 100%$ Interface vfc20 is down (None)  
    2011 Oct 21 10:16:01 24 %PORT-2-IF_DOWN_ERROR_DISABLED: %$VSAN 100%$ Interface vfc20 is down (Error disabled)  
    2011 Oct 21 10:16:02 24 %ETHPORT-5-IF_DOWN_ADMIN_DOWN: Interface Ethernet1/20 is down (Administratively down)
    24.10.5020A.1(config-if)#

  • UDLD Detection & Error Disable On Cat 6513

    Hi
    We have a problem with an etherchannel trunk between 2 Cat 6513's. The etherchannel is 8Gb split across 2 Copper 10/100/1000Mb 16 port cards in each chassis. The trunk uses ports 13 - 16 on slot 5 in one chassis to 13 - 16 on slot 5 in the other chassis and 13 - 16 on slot 6 in one chassis to 13 - 16 in slot 6 on the other chassis.
    We appear to have had a UDLD Detection which error disabled all 4 of the links from slot 6 to slot 6 at the same time. IE ports 13 - 16 on slot 6 of both chassis went into error disabled state.
    Could this be a ASIC hardware problem on one of the cards? If so, how do we establish which end of the trunk the problem exists? All other connections on these slot 6 cards are working fine.

    UDLD is a protocol that discovers if communication over a link is one-way only, and therefore partially broken. A damaged fiber cable or other cabling/port issue could cause this one-way only communication. Spanning tree loops can occur with this problem. UDLD allows the port to detect a unidirectional link, and can be configured to put a port in errDisable state when it detects this condition.

  • Cisco Prime Infrastructure 2.1 error-disable alert

    We have a cisco PI 2.1 managing switches and a lot of switchports have BPDUGuard enabled. When occur error-disable , request send email notification to administrator .
    By default, when a port of a switch goes down, the Prime generates alarm for that. (this is a problem, because every laptop disconnection will generate alarm for administrator)
    Can i change the alert just for error-disable and how to ?
    Thanks

    Causes of Errdisable
    This feature was first implemented to handle special collision situations in which the switch detected excessive or late collisions on a port. Excessive collisions occur when a frame is dropped because the switch encounters 16 collisions in a row. Late collisions occur after every device on the wire should have recognized that the wire was in use. Possible causes of these types of errors include:
    A cable that is out of specification (either too long, the wrong type, or defective)
    A bad network interface card (NIC) card (with physical problems or driver problems)
    A port duplex misconfiguration
    A port duplex misconfiguration is a common cause of the errors because of failures to negotiate the speed and duplex properly between two directly connected devices (for example, a NIC that connects to a switch). Only half-duplex connections should ever have collisions in a LAN. Because of the carrier sense multiple access (CSMA) nature of Ethernet, collisions are normal for half duplex, as long as the collisions do not exceed a small percentage of traffic.
    There are various reasons for the interface to go into errdisable. The reason can be:
    Duplex mismatch
    Port channel misconfiguration
    BPDU guard violation
    UniDirectional Link Detection (UDLD) condition
    Late-collision detection
    Link-flap detection
    Security violation
    Port Aggregation Protocol (PAgP) flap
    Layer 2 Tunneling Protocol (L2TP) guard
    DHCP snooping rate-limit
    Incorrect GBIC / Small Form-Factor Pluggable (SFP) module or cable
    Address Resolution Protocol (ARP) inspection
    Inline power
    Note: Error-disable detection is enabled for all of these reasons by default. In order to disable error-disable detection, use the no errdisable detect cause command. The show errdisable detect command displays the error-disable detection status.

  • Error disable due to loopback

    Hi,
    We have a campus network. A student's hostel room port that is connected to a 2960 switch get error-disabled time and again and it shows the reason being "loopback".
    From switch patch panel to user's LAN port in room , we have tested the connectivity through cable tester and it is found to be proper.
    He has changed his LAN cable also, what can be the exact reasons causing this problem

    Thanks for the output to the command "sh version".  
    Look at the uptime of the switch.  Because of this the output to the "loopback_error_2960" is totally useless.  Why?  If there was any line errors which can determine if there was a cabling issue, a NIC card issue or something more sinister, then there's only 4-days worth of data.  Not much to run with.  
    The IOS is very, very old.  
    Currently, the only thing I can think of is remove the configuration of setting the speed to the port to 10 Mbps.  (I don't see the benefit of punishing students by slowing down their network speed.)  Once the port is running auto speed/auto duplex, wait for approximately 30 minutes and run a TDR on the port.   Even better if you can move the cable to the GigabitEthernet port and run the TDR there so you'll get a better picture of all the pairs.
    Another thing, post the output to the command "sh post".

  • Error disabled ports

    I have a 3550 switch that gets a error-disabled copper ports. There is no errors ont the port. What else would cause it to be disabled?

    Although there is no error on the port , there should be an error message in the log that should tell you the reason why the port got error disabled like port security violation , loopback detection , etherchannel misconfig etc.
    You can enable errordisable recovery for all different causes by setting a timer. What happens is once this timer expires , the port is brough out of error dsiabled state.
    Here are some of the useful commands.
    D-C3550-2A(config)#errdisable ?
    detect Error disable detection
    recovery Error disable recovery
    D-C3550-2A(config)#errdisable recovery ?
    cause Enable error disable recovery for application
    interval Error disable recovery timer value
    Hope this helps.
    Salman Z.

  • Nexus 4k error disable

    Anyone ever seen this before? Trying to understand why the port error disabled.
    2011 Jan 18 11:53:09 MTWDAVDC1BLDB9001 %ETHPORT-2-IF_DOWN_ERROR_DISABLED: Interface Ethernet1/1 is down (Error disabled. Reason:requested by sap: MTS_SAP_DCX, req down_type: 2, req down_reason: 222 )
    MTWDAVDC1BLDB9002# sh int eth1/1
    Ethernet1/1 is down (DcxMultipleMSAPs)
      Hardware: 1000/10000 Ethernet, address: 0027.0d23.4d8d (bia 0027.0d23.4d8d)
      MTU 1500 bytes, BW 1000000 Kbit, DLY 10 usec,
         reliability 255/255, txload 1/255, rxload 1/255
      Encapsulation ARPA
      Port mode is trunk
      full-duplex, 10 Gb/s, media type is 10g
      Input flow-control is off, output flow-control is off
      Rate mode is dedicated
      Switchport monitor is off
      Last link flapped 1d03h
      Last clearing of "show interface" counters never
      1 minute input rate 0 bits/sec, 0 packets/sec
      1 minute output rate 0 bits/sec, 0 packets/sec
      Rx
        153349455 input packets 143221479 unicast packets 3730739 multicast packets
        6397237 broadcast packets 0 jumbo packets 0 storm suppression packets
        66448449173 bytes
      Tx
        328643307 output packets 90031378 multicast packets
        113132871 broadcast packets 12877361 jumbo packets
        62121338327 bytes
        0 input error 0 short frame 0 watchdog
        0 no buffer 0 runt 0 CRC 0 ecc
        0 overrun  0 underrun 0 ignored 0 bad etype drop
        0 bad proto drop 0 if down drop 0 input with dribble
        1416 input discard
        0 output error 0 collision 0 deferred
        0 late collision 0 lost carrier 0 no carrier
        0 babble
        0 Rx pause 0 Tx pause
      8 interface resets

    I looked into the error and would suggest opening a TAC case so our engineering team can look into this error.  You can open a case using the supportforum as well to turn the thread into a case.
    Sorry to not be able to help more, but looks like our internal team needs to look into this further.
    Chad

  • Vfc Error disabled

    Hi,
    I have a problem connecting a CNA (Qlogic 8152) to a Nexus 5010, the network part is working goot but not the FCoE.
    What I have noticed is that the vfc is down (Error disabled), the DCBX works any how:
    Nex1# sh system internal dcbx info interface ethernet 1/16
    Interface info for if_index: 0x1a00f000(Eth1/16)
    tx_enabled: TRUE
    rx_enabled: TRUE
    dcbx_enabled: TRUE
    DCX Protocol: CEE
    This is the port configuration:
    interface port-channel16
      switchport mode trunk
      vpc 16
      switchport trunk native vlan 501
      switchport trunk allowed vlan 501,810
      spanning-tree port type edge
      flowcontrol receive on
      flowcontrol send on
    interface Ethernet1/16
      switchport mode trunk
      switchport trunk native vlan 501
      switchport trunk allowed vlan 501,810
      spanning-tree port type edge
      flowcontrol receive on
      flowcontrol send on
      channel-group 16 mode active
    Any ideas what can be the problem? I have seen that if change "channel-group 16 mode active" to "channel-group 16 mode on" the interface goes up but the network connectivity is lost...
    Br
    Per

    http://www.cisco.com/en/US/docs/switches/datacenter/nexus5000/sw/operations/n5k_fcoe_ops.html#wp1080158
    --snip--
    LACP and FCoE To The Host
    Today, when deploying FCoE over a host-facing vPC, the vFC interface is  bound to the port channel interfaces associated with the vPC.  This  requires that the port channel interface be up and forwarding before  FCoE traffic can be switched.  Cisco recommends when running vPC in an  Ethernet environment is to use LACP in order to negotiate the parameters  on both sides of the port channel to ensure that configurations between  both sides is consistent.
    However, if there are inconsistencies in any of the Ethernet  configuration parameters LACP uses to bring up the port channel  interface, both sides of the virtual port channel will remain down.   This means that FCoE traffic from the host is now dependent on the  correct configuration on the LAN/Ethernet side.  When this dependency  occurs, Cisco recommends that you use the static port channel  configuration (channel-group # mode on) when deploying vPC and FCoE to  the same host.
    --snip--
    I'm guessing there's something about the way your CNA / host is handling LACP that caused some kind of mismatch. A packet trace may give a clue. Did you try 'mode passive' as well?

  • Error-disabled cause by loopback?

    Hello All,
    Does anyone please explain me about error-disabled cause by loopback? What is loopback? When does it happen? And why?
    Thank you very much,
    Nitass

    This happens usually with IP Phones using power over ethernet. Hope this link helps
    http://www.cisco.com/en/US/products/hw/phones/ps379/products_field_notice09186a008031575e.shtml
    Iam not sure of any other reasons though,

  • Add-on Conflict error while installing add-on LOCIN release 100

    Hi experts,
    I want to install the add-on for India Localization for IS-Utilities i.e., LOCIN release 100. I have downloaded the add-on and also the CRT(SAPK-10001INLOCINISU) required for the add-on.
    we got an error while installing the add-on it is
    "OCS package SAPK-10001INLOCINISU does not match the current software component vector"
    if we remove the CRT from the queue and start the installation then it will give add-0n conflict error.
    Conflicts Between Add-On LOCINISU 100 and Support Packages
    Component    Release      Support Package        Information on
                                                      Conflict Resolution
    FI-CA        600          SAPK-60004INFICA       Include CRT
                               SAPK-60005INFICA       Include CRT
                               SAPK-60006INFICA       Include CRT
                               SAPK-60007INFICA       Include CRT
                               SAPK-60008INFICA       Include CRT
                               SAPK-60009INFICA       Include CRT
                               SAPK-60010INFICA       Include CRT
    Plz tell the solution for this.

    Hi,
    I had somthng like this recently with SCMEWM....
    1. Clear out your EPS/in directory
    2. Download the latest version of the add-on and all the patches ===> the latest CRT's will be in the patches.
    3. Unpack them and try import again.
    Also if you have not seen or read:
    https://websmp230.sap-ag.de/sap(bD1lbiZjPTAwMQ==)/bc/bsp/spn/sapnotes/index2.htm?numm=950513
    Mark

  • Could not open error log file ''. Operating system error = 5(failed to retrieve text for this error. Reason: 15105).

    Hello
    When I try to start the SQl server service i get the following error:
    Event id 17058
    Could not open error log file ''. Operating system error = 5(failed to retrieve text for this error. Reason: 15105).
    As a test I have made sure the errorlog file ,and the entire drive it is, has everyone full control permissions, but to no avail. Does anyone have any ideas to resolve this issue?
    Thank you

    Hi,
    Try running:
    SELECT SERVERPROPERTY('ErrorLogFileName')
    Then verify that the account being used to run the SQL Server service account has access to the path output above.  If possible, you could try logging onto the server with the same account used to run SQL Server then navigate to the errorlog folder.
    Thanks,
    Andrew Bainbridge
    SQL Server DBA
    Please click "Propose As Answer" if a post solves your problem, or "Vote As Helpful" if a post has been useful to you

  • Campus Manager report - Ports in error Disabled state

    Hi,
    I have LMS 3.2 and I wonder how Campus Manager collects information from the switch to generate a report of discrepancies, namely a report of "Ports in Error Disabled state"??
    I find that I have ports in errDisabled state but Campus Manger doesn´t show this information in "Ports in Error Dissabled state" report. What could be the problem?
    Thanks.

    Hi,
    Campus Manager do snmpwalk on the ciscoErrDisableMIB to get the status of the error disabled ports.
    Thanks,
    Gaganjeet Singh

Maybe you are looking for

  • Back Order per Vendor

    Hi All, I want create Value of Back Order per Vendor report, Any body help me, What are the tables I have to use. My selection options are : Vendor and Delivery Date. Thanks,

  • I dropped my iPhone 5s and white vertical bars problem

    Hey all, Recently today I dropped my iPhone 5s and white vertical bars started to appear in the right hand side of my phone... Do I just need to get my screen replaced or is there more to the problem?

  • How to hide this table if my record in display table =null?

    i create a list to display record inside the table. If i wanna hide my table in case the list no any record. What statement should i add on? i may using <c:choose> ? <tr> <td colspan="3"> <fieldset> <legend>CSC Listing</legend> <table> <tr> <td colsp

  • Regarding background job.

    Hi All, I have to debug background job which is currently running. I know how to get the job in debug mode, that is through SM51 transaction...But once i m done with seeing some values in debug mode. I need to put this job back again in the backgroun

  • Payment Methods for iPhone 4: Retail Stores

    Sorry if this question has already been posted, but I am going to go pick up an iPhone 4 on launch day, but I don't have a credit card to buy it with. I was wondering if I could pay with cash? And if not, could I use a pre-paid Visa card or an Apple