Qlogic scsi command timeout

Hi,
I am doing some testing for Windows 2008 R2 SP1 boot from SAN with Qlogic HBAs.
We noticed that a logical path failure hangs I/O on the server for about 30 seconds.
From traces, it looks like 20 seconds of that is some kind of timeout waiting to expire (I suspect a SCSI command timeout).
Another 7 seconds is spent by MPIO disabling that logical path.
I tried searching the internet for such a timeout on Windows and could not find one.
Does anyone know about this? Is there a way to verify this parameter's value?
Thanks
Offer Baruch

You may increase the default 30-second timeout to whatever you want (don't go too high, however; 180 seconds is about the maximum that still works).
Changing the disk timeout on Microsoft Windows Server
Learn how to change the disk timeout value on Windows Server 2000, 2003, and 2008 operating systems.
On your Windows Server hosts, change the disk I/O timeout value to xxx in the Windows registry, as follows:
1. In Windows, click Start > Run.
2. In the dialog text box, type regedit and press Enter.
3. In the registry browsing tool, locate the HKEY_LOCAL_MACHINE\System\CurrentControlSet\Services\Disk\TimeOutValue key.
4. Confirm that the value for the key is xxx (decimal value). If necessary, change the value to xxx.
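If you only want to verify the current value, as asked, it can also be read from a script instead of regedit. This is a minimal Python sketch, assuming Python is installed on the host; the function name read_disk_timeout is made up for illustration. It returns None when TimeOutValue is not set, or when it is not run on Windows.

```python
import sys

# Registry location named in the steps above.
DISK_KEY = r"System\CurrentControlSet\Services\Disk"
VALUE_NAME = "TimeOutValue"


def read_disk_timeout():
    """Return the disk timeout in seconds, or None if it is unset or unavailable."""
    if sys.platform != "win32":
        return None  # the registry only exists on Windows
    import winreg  # standard library, Windows only

    try:
        with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, DISK_KEY) as key:
            value, _value_type = winreg.QueryValueEx(key, VALUE_NAME)
    except FileNotFoundError:
        return None  # key or value does not exist
    return int(value)


if __name__ == "__main__":
    print("Disk TimeOutValue:", read_disk_timeout())
```

The sketch only reads the value; changing it still goes through the steps above.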
You can read more info here:
http://blogs.msdn.com/b/san/archive/2011/09/01/the-windows-disk-timeout-value-understanding-why-this-should-be-set-to-a-small-value.aspx
Hope this helped. Good luck!

Similar Messages

  • ATA command timeouts after disk has gone sleeping

    Hi all
    I've configured myself a NAS, which is infrequently accessed, so I set the standby timer of the disks to 241 (30 minutes) using hdparm.
    for i in /dev/sd?; do
        hdparm -S 241 $i > /dev/null
    done
    After 30 minutes the disks go to sleep, no problem there, but once they are accessed again, the first command sent to the disk times out:
    ata7.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
    ata7.00: cmd b0/d8:00:00:4f:c2/00:00:00:00:00/00 tag 0
             res 40/00:00:00:f1:00/00:00:00:00:00/00 Emask 0x4 (timeout)
    ata7.00: status: { DRDY }
    ata7: hard resetting link
    ata7: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
    ata7.00: max_sectors limited to 256 for NCQ
    ata7.00: max_sectors limited to 256 for NCQ
    ata7.00: configured for UDMA/133
    sd 6:0:0:0: timing out command, waited 7s
    ata7: EH complete
    SCSI device sdg: 1953525168 512-byte hdwr sectors (1000205 MB)
    sdg: Write Protect is off
    sdg: Mode Sense: 00 3a 00 00
    SCSI device sdg: drive cache: write through
    After that the drives work fine again. The command timeout is 7 seconds, which seems too low because the disks need some time to spin up.
    Two questions:
    Can the ATA command timeout be increased?
    If the command fails, is it retried?
    Thanks,
    Glenn
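
    A side note on the 241 used above: the hdparm -S argument is an encoded value, not a number of minutes. The sketch below decodes it following the encoding described in the hdparm man page; the function name standby_timeout_seconds is made up for illustration.

```python
def standby_timeout_seconds(value):
    """Decode an `hdparm -S` argument into seconds (None means the timer is disabled)."""
    if not 0 <= value <= 255:
        raise ValueError("hdparm -S takes a value from 0 to 255")
    if value == 0:
        return None                      # standby timer disabled
    if value <= 240:
        return value * 5                 # 1..240: multiples of 5 seconds
    if value <= 251:
        return (value - 240) * 30 * 60   # 241..251: 1..11 units of 30 minutes
    if value == 252:
        return 21 * 60                   # 21 minutes
    if value == 255:
        return 21 * 60 + 15              # 21 minutes 15 seconds
    raise ValueError("253 is vendor-defined and 254 is reserved")


if __name__ == "__main__":
    print(standby_timeout_seconds(241))  # 241 is the value used in the post
```

    With 241 this gives 1800 seconds, which matches the 30 minutes mentioned in the post.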

    The G5 Discussion is here while this is devoted to Mac Pro, hence the confusion. The MP doesn't use cables, just trays that slide in.
    Is it possible you have backups you can boot from? (hint, hint). Or Disk Warrior 4? would also be 'nice.'
    Hard restarts DO trash files and directories. And never make changes w/o a bootable backup. Maybe a couple FW drives. (One person yesterday backed up his problem G5 drive only that turned his 'good backup' into an unbootable drive also, and didn't have a 2nd backup - which can happen and why you don't want to rely on just one).

  • How to redirect the scsi command

    Usually, the SCSI command goes to the HBA driver after it is built by the target driver. However, my project is to intercept the SCSI command and send it to a remote server, because we want to implement network storage. Originally, the structure scsi_hba_tran has a pointer, tran_start, pointing to the function which sends the command. To redirect the command, I want the pointer to point to my kernel function.
    Now I can get the HBA driver dip and then get the tran_start pointer. But after I change the pointer to point to my function, mount the SCSI disk, and use it, I can't see any difference between before and after the change. I'm not sure whether I really got the pointer, or whether I should use another method to test.
    Any suggestion is precious. Thank you in advance.

    java -DAPRO_HOME=%APRO_HOME% -DInit=E:\Test_B2b\XicoB2Bi3.0\Repository\Configs\CF_Init.xml com.b2binternet.apro.Startup -debug
    The class com.b2binternet.apro.Startup is being executed with the command-line argument "-debug", and the properties "APRO_HOME" and "Init" are being set for the System.

  • How to redirect scsi command

    Usually, the SCSI command goes to the HBA driver after it is built by the target driver. However, my project is to intercept the SCSI command and send it to a remote server, because we want to implement network storage. Originally, the structure scsi_hba_tran has a pointer, tran_start, pointing to the function which sends the command. To redirect the command, I want the pointer to point to my kernel function.
    Now I can get the HBA driver dip and then get the tran_start pointer. But after I change the pointer to point to my function, mount the SCSI disk, and use it, I can't see any difference between before and after the change. I'm not sure whether I really got the pointer, or whether I should use another method to test.
    Any suggestion is precious. Thank you in advance.

  • Send a SCSI command to the device in attach ( )

    I want to send an Inquiry Page and Read Buffer SCSI command to the device in attach( ). Can I do that? How to do that?
    Thanks.

    I assume you are talking about a SCSI Target driver, not an HBA driver.
    Using scsi_probe() would be the normal way to send an inquiry. Once you have initialized everything in the attach routine, I don't think there is any reason you can't start a normal transfer. Have you looked at the sample drivers?
    http://developers.sun.com/prodtech/solaris/driverdev/reference/codesamples/
    Richard
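
    For reference, the two commands named in the question have simple CDB layouts. The following is only a user-space Python sketch of the command bytes, not Solaris driver code; the helper names inquiry_cdb and read_buffer_cdb are made up for illustration, and how the CDB is attached to a packet in attach() is not shown.

```python
import struct


def inquiry_cdb(allocation_length, evpd=False, page_code=0):
    """Build a 6-byte INQUIRY CDB (opcode 0x12); set evpd=True to request a VPD page."""
    if not evpd and page_code != 0:
        raise ValueError("page_code must be 0 unless evpd is set")
    # opcode, EVPD bit, page code, 16-bit allocation length, control byte
    return struct.pack(">BBBHB", 0x12, 1 if evpd else 0, page_code,
                       allocation_length, 0)


def read_buffer_cdb(mode, buffer_id, buffer_offset, allocation_length):
    """Build a 10-byte READ BUFFER CDB (opcode 0x3C); offset and length are 24-bit."""
    for field in (buffer_offset, allocation_length):
        if not 0 <= field < (1 << 24):
            raise ValueError("offset and length must fit in 24 bits")
    return (bytes([0x3C, mode & 0x1F, buffer_id])
            + buffer_offset.to_bytes(3, "big")
            + allocation_length.to_bytes(3, "big")
            + b"\x00")  # control byte


if __name__ == "__main__":
    # Unit Serial Number VPD page (0x80), 96-byte response buffer
    print(inquiry_cdb(96, evpd=True, page_code=0x80).hex())
    # Mode 0x02 (data), buffer 0, offset 0, 512 bytes
    print(read_buffer_cdb(0x02, 0, 0, 512).hex())
```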

  • How to set command timeout for table object?

    I have a report that is generating an error during rendering in a CrystalReportViewer control (v10.5).  The error is "Failed to open a rowset." which appears to be caused by a timeout.  I'm currently setting the connection info and location of the sproc programmatically.  How can I set the command timeout as well?  Can I set this via the Attributes collection of the ConnectionInfo object?  (This appears to be a command timeout, not a connection timeout, which I'm already including in the Attributes collection of the ConnectionInfo object and setting at 15 seconds.  I've tried making this as much as 60 seconds but it doesn't change the behavior.)

    The error message includes text that states the timeout was exceeded (in bold below)...
    CrystalDecisions.CrystalReports.Engine.InternalException: Failed to open a rowset.
    Details: ADO Error Code: 0x
    Source: Microsoft OLE DB Provider for SQL Server
    Description: Timeout expired
    SQL State: HYT00
    Error in File J:\DOCUME1\ADMINI1\LOCALS~1\Temp\SpecificationSummaryReport {42EDFEA3-DD0D-4F5E-BC0B-F9CC1126A872}.rpt:
    Failed to open a rowset. ---> System.Runtime.InteropServices.COMException (0x800002D3): Failed to open a rowset.
    Details: ADO Error Code: 0x
    Source: Microsoft OLE DB Provider for SQL Server
    Description: Timeout expired
    SQL State: HYT00
    Error in File J:\DOCUME1\ADMINI1\LOCALS~1\Temp\SpecificationSummaryReport {42EDFEA3-DD0D-4F5E-BC0B-F9CC1126A872}.rpt:
    Failed to open a rowset.
    1)  Is there a Service Pack for CR 10.5???  That is the runtime that is in use - not XI.
    2)  The report viewer app is in version 10.5 (as mentioned in the first post) but the report was created in the designer for version 11.  Before we assume that is a problem - I have other reports created this way that work fine.
    3)  The data source is SQL 2005.
    4)  The connection type is OLE DB - Microsoft OLE DB provider for SQL Server.
    I think the issue is due to corruption in the report file.  I've seen this many times with reports (over the last 10 years).  I've had reports that refuse to work but if I recreate them from scratch the new report works fine.  In fact, I've recreated the data portion of this report (using the same sprocs) and that one works.

  • Set Command Timeout For SQL Server

    Hello,
    How do you set the command timeout for the 'open' statement? I am running a Database via SQL Server and I tried the dialog box timeout statement (the check box on the advanced tab) and simply get an error. The manual shows a 'step.commandtimeout' but how is it implemented?
    Thanks,
    Kevin

    Kevin -
    I looked at the internal implementation of the CVI SQL Toolkit. The toolkit function that the step type uses is DBNewSQLStatement. Internally, the toolkit function opens a recordset instead of a command object. The command timeout attribute is only available on a command object and not on a recordset object, so as implemented inside the toolkit, the error is appropriate. I think the toolkit could have used a command object to create the recordset object, which would have let you set the timeout attribute. Unfortunately, this is not the way it was done.
    For the future, I may investigate whether there is a way to bypass this limitation by using different toolkit functions, but I am not sure if there are any side effects.
    Scott Richardson (NI)
    National Instruments

  • Tracing scsi commands with dtrace / wrong ctf data on S-x86?

    This is probably a bug, either in the kernel's CTF symbol information, or in dtrace:
    I'm trying to trace scsi commands in the Solaris 10 b63 x86 kernel with dtrace, including any request sense
    data received for failed scsi commands.
    I've noticed that my D script (included below) prints bogus "sense key" values on S10_b63 (x86),
    when the "es_key" member from a "struct scsi_extended_sense" is printed (see the
    <sys/scsi/generic/sense.h> header file).
    For example, when the first three bytes of a "struct scsi_extended_sense" are filled with the bytes
    0x70, 0x00, 0x05, printing scb->sts_sensedata.es_key gives the output "1" instead of the expected "5".
    Example:
    dtrace: script 'scsi2.d' matched 3 probes
    CPU     ID                    FUNCTION:NAME
      0  16099             scsi_transport:entry 46 02 00 00 00 00 00 00 08 00
      0  16387                     sdintr:entry state 37, reason 0, t/ms 2, to/s 5, arq sk/asc/ascq 1 20 0
    The user-level command that sent the "0x46" SCSI GET CONFIGURATION command via a USCSI
    ioctl prints the correct sense data ("5 20 0"):
    get configuration: scsi command failed with status illegal request, invalid command operation code (sk/asc/ascq 5 20 00)
    Here's my D script:
    #!/usr/sbin/dtrace -s

    /* Print the first ten CDB bytes of every packet passed to scsi_transport(). */
    fbt::scsi_transport:entry
    {
            pkt = (struct scsi_pkt *)arg0;
            cdb = pkt->pkt_cdbp;
            printf("%02x %02x %02x %02x %02x %02x %02x %02x %02x %02x",
                    cdb[0], cdb[1], cdb[2], cdb[3], cdb[4],
                    cdb[5], cdb[6], cdb[7], cdb[8], cdb[9]);
            this->start_time = timestamp;
    }

    /* On completion in sdintr(), print the packet state and any auto request sense data. */
    fbt::sdintr:entry
    {
            tim = (timestamp - this->start_time) / 1000000;
            pkt = (struct scsi_pkt *)arg0;
            scb = (struct scsi_arq_status *)pkt->pkt_scbp;
            printf("state %x, reason %x, t/ms %d, to/s %d, %sarq sk/asc/ascq %x %x %x",
                    pkt->pkt_state, pkt->pkt_reason, tim, pkt->pkt_time,
                    pkt->pkt_state & 0x20 ? "" : "no ",
                    pkt->pkt_state & 0x20 ? scb->sts_sensedata.es_key : 0,              /* <<<<<<<<< */
                    pkt->pkt_state & 0x20 ? scb->sts_sensedata.es_add_code : 0,
                    pkt->pkt_state & 0x20 ? scb->sts_sensedata.es_qual_code : 0);
    }

    This is indeed a bug: thanks for finding it and sorry for the trouble. There is an issue with
    the way the D compiler is trying to decode bit-fields of length greater than one byte on x86,
    of which es_key is such a bit-field. We'll get a bug filed on this for you and get it fixed.
    -Mike
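
    To make the expected "5 20 0" concrete: in fixed-format sense data, the sense key is the low four bits of byte 2, and ASC and ASCQ are bytes 12 and 13. The Python sketch below decodes them by hand. The first three bytes come from the post; the remaining bytes are filled in as an assumption to match the reported ASC/ASCQ of 20/00, and the function name decode_fixed_sense is made up for illustration.

```python
def decode_fixed_sense(sense):
    """Return (sense key, ASC, ASCQ) from fixed-format SCSI sense data."""
    response_code = sense[0] & 0x7F
    if response_code not in (0x70, 0x71):
        raise ValueError("not fixed-format sense data")
    key = sense[2] & 0x0F                      # low nibble of byte 2
    asc = sense[12] if len(sense) > 12 else 0  # additional sense code
    ascq = sense[13] if len(sense) > 13 else 0 # additional sense code qualifier
    return key, asc, ascq


if __name__ == "__main__":
    # 0x70, 0x00, 0x05 are from the post; bytes 3..13 are assumed for illustration.
    sense = bytes([0x70, 0x00, 0x05] + [0] * 9 + [0x20, 0x00])
    print("%x %x %x" % decode_fixed_sense(sense))
```

    Because the mask keeps only the low nibble of byte 2, a value of 0x05 there decodes to sense key 5 (ILLEGAL REQUEST), which is what the user-level command reported.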

  • How to find QLogic SCSI card driver...

    Hi
    I am using an E2 server configured with RAID 0 and connected to a QLogic SCSI card. Somehow it fails to reconnect to the E2 server running Solaris Server OS 7.
    Could you give me some basic knowledge of how to configure it, or the path where I can find the driver for the QLogic SCSI card?
    Thanks
    Mohammed Tanvir

    There are [at least] two examples of PCMCIA drivers in the DDK supplementary suite. There is also an example of a simple DLPI driver.
    All you have to do is to combine those examples to get the PCMCIA DLPI driver and fine tune it to match your particular hardware. Any bells ring?
    You will definitely need your card programming manual and/or software interface description for the last phase.
    If you do not have one, you might get some open source (say, Linux or FreeBSD) driver for your card and do some code analysis to reveal the information you'll need.
    Hope this helps...
    Ilya.

  • Admin Command Timeout

    My streaming server 3 hung last night and can be restarted.
    I found that the admin logs have thousands of lines like the
    following:
    2008-12-22 03:20:15 236 (e)2671287 Admin Command Timeout:
    (Command (getServerStats) timed out.) -
    2008-12-22 03:20:15 236 (e)2671287 Admin Command Timeout:
    (Command (getVHostStats) timed out.) -
    How can I solve this problem? Restarting the computer helps, but
    I wonder whether there is a way to solve it properly.
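
    With thousands of such lines, it can help to see which admin commands time out most often. This is a small Python sketch that tallies them, assuming the log entries look like the two samples above; the function name count_timeouts is made up for illustration.

```python
import re
from collections import Counter

# Matches entries like the samples above; \s* tolerates a line wrap after the colon.
TIMEOUT_RE = re.compile(
    r"Admin Command Timeout:\s*\(Command \((\w+)\) timed out\.\)"
)


def count_timeouts(log_text):
    """Count 'Admin Command Timeout' entries per admin command name."""
    return Counter(TIMEOUT_RE.findall(log_text))


if __name__ == "__main__":
    sample = (
        "2008-12-22 03:20:15 236 (e)2671287 Admin Command Timeout: "
        "(Command (getServerStats) timed out.) -\n"
        "2008-12-22 03:20:15 236 (e)2671287 Admin Command Timeout: "
        "(Command (getVHostStats) timed out.) -\n"
    )
    for command, count in count_timeouts(sample).most_common():
        print(command, count)
```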

    Well, really, none of these problems should happen! But,
    sadly, they do and not just for me.
    A Google search will show that these Admin timeouts are a
    very common problem but no-one seems to have a solution. More
    worryingly, I have never seen any suggestion from Adobe about what
    might cause them.
    Since my original post I have made two changes to our app:
    (1) when the 'owner' of the room leaves I unload the instance
    (after a 30 seconds delay)
    (2) I amended the application.xml for this app to timeout
    idle connections after 2 minutes (the default is no timeout)
    This has made things slightly better in that the app now runs
    for 2/3 days without failing (rather than failing twice a day).
    But, it still fails and the only thing that brings it back is a
    reboot of the server which is totally unacceptable in a live
    environment where we usually have 500+ clients watching 100 live
    streams at any one time (many on a pay-per-minute basis). The only
    other significant thing I notice now is that once the machine has
    been running for a couple of days the logging stops completely for
    some (but not all) instances i.e. nothing is displayed in the admin
    console and nothing is written to the log files. It's as if the FMS
    Admin has just given up.
    Re. your suggestion about checking IPs etc. There are no
    "hidden clients"! When the Admin fails it is possible to be on a
    machine with a single connection (i.e. my client). When I reload my
    client there is a trace that I have disconnected, then reconnected
    again *but* the connection count goes up by one. If I reload twenty
    times the count will increase to 20. Even if I close my browser the
    connections will apparently stay, sometimes for days!
    As I mentioned, this happens on multiple servers with lots of
    different clients and very, very low load. When it happens is, so
    far, unpredictable other than it will happen and bring down our
    live system.

  • DBM_EXECUTE  command timeout exceeded

    Hello All,
    When I execute any DB t-code from my system,
    I get "DBM_EXECUTE command timeout exceeded".
    I have tried it with a different user but face the same problem.
    Kindly let me know the solution for this.
    Thanks in advance,
    Atul

    Hello Atul,
    if you don't find a solution in the SAP notes then you should open a customer message.
    SAP support will then logon to your system and analyze the problem.
    Best regards,
    Melanie

  • HIS 2013 - Command Timeout - Dynamic DB2 Sendport

    Hi ,
    In one of our applications (BizTalk 2013 R2), we are integrating with DB2 (Host Integration Server 2013) using a dynamic send port. The default command timeout for DB2 is 30 seconds. This timeout can be increased in DB2 static send ports.
    But as we are using a dynamic send port, we do not see such an option in the context properties to set the command timeout.
    Any suggestions on how to set the command timeout for a dynamic DB2 send port?
    Thanks,
    Karthik M
    karthik

     This timeout can be increased in DB2 static sendports. But as we are using Dynamic sendport, we are not seeing such option in the context properties to set the command timeout.
    As you said, we really can't set the timeout for a dynamic send port. You could consider
    contacting Microsoft support directly (http://support.microsoft.com).

  • FailedSend Playback Command Timeout!

    hi all
    I am getting the error below after clicking the Distributions button in the Invoice Workbench form when I run the script from the command line.
    Failed     Send Playback Command Timeout! command:forms.button("//forms:button[(@name='LINE_SUM_CONTROL_DISTRIBUTIONS_0')]").click(), timeout: 10000
    Can anybody tell me the fix for this issue? It is a bit urgent.

    1. add Thread.sleep(10000); before the click
    2. you can also try adding
    forms.button("//forms:button[(@name='LINE_SUM_CONTROL_DISTRIBUTIONS_0')]").waitFor(5000);
    before
    forms.button("//forms:button[(@name='LINE_SUM_CONTROL_DISTRIBUTIONS_0')]").click();
    3. verify the attributes and change "or" between attributes to "and"
    In any case, if it's not working, please add 2-3 lines of code before and after the problematic line, and the attributes of the button. It would help us understand the issue better.

  • All my X6250s hang in ESX 3.5 U5 with SCSI "command error recovery failed"

    After installing a pair of Virtualized Network Expansion Modules (X4238) in our Blade 6000 chassis, my ESX 3.5 Update 5 hosts will randomly fail with the error "scsi: device might be offline - command error recovery failed: host 5 channel 0 id 0 lun 0." This does not represent LUNS on our Sun/Qlogic FC hbas (PCIe dual 4Gig). The old Dell blades sharing the FC LUNs are not experiencing any SCSI errors whatsoever. It appears to me this is referring to the VNEM's SAS fabric, but we have no SAS drive modules in the chassis. Is the fact that it sees other hosts on this fabric causing issues? I'd like to disable the SAS fabric, but don't see a way to do this via the BIOS or VNEM ILOM. Do you think I'm barking up the wrong tree?
    Thanks,
    Charles

    Yes, I had updated the blade firmware to the current provided (though I see reference to a newer version in the docs, but not at the download site). However, it seems related to interrupt sharing with the NXGE card/driver under ESX 3.5. I have an open case with Sun, and while they suspected a known issue with the aacraid driver (ESX350-201003402-BG), that had already been installed. They're now asking to create a case with VMware. In the meantime, I've tested a pair of our X6250s without the NXGE, installing the original quad port GigE NIC (Intel). Those hosts are very stable. It may not be the VNEM at all, but I'll keep you posted.
    Charles

  • How to set Crystal Report  command timeout in runtime ?

    Hi, All !
    I have a report that generates a timeout error while executing a command (a Crystal Report command with a SQL query). How can I set the timeout for the command from application code? I am using CR 12 and C# (.NET Framework 2.0).
    I have investigated CommandTableClass (CrystalDecisions.ReportAppServer.DataDefModel.CommandTableClass) but I didn't find a timeout property or something like that.
    Please help.
    Thank you in advance.

    I do not believe there is such an API. How long does it take before you get the timeout?
    Ludek
