Scanning always produces invalid PDF files

I recently purchased an HP OfficeJet 4636, and I notice that when scanning documents to PDF, the generated PDF files are always invalid. When opening these files in Acrobat Reader or Acrobat Pro, one gets the message "The file is damaged but is being repaired." Many other PDF tools can't open the files at all.
I have installed the newest firmware update and restarted the printer, but the problem was not solved.
I tried many different settings (e.g., resolution, paper size, scan from ADF, scan from Glass), but the problem is always the same. For the record, I am using the "Web scan" interface: http://XX.XX.XX.XX/#hId-pgWebScan.
If there were an opportunity to attach a file to this bug report, I would attach an actual PDF file generated by this printer. I am happy to supply the file later, and I will refer to this specific file in my analysis of the bug below.
I am familiar with Adobe's PDF definition:
PDF Reference, sixth edition. Adobe Portable Document Format
Version 1.7, November 2006. Abobe Systems Incorporated.
I examined the PDF file in detail and it is quite clear what the bug is. Object 1 of the PDF file starts at byte offset 10, and it is a stream object containing the main image data in compressed JPEG format. The Length field for the stream object (at byte offset 168) is set to 1051875. However, the actual compressed stream data is only 64914 bytes long.
This is incorrect, because the PDF Reference (Section 3.2.7, p.61, "Stream Extent") specifies that "If the stream has a filter, Length is the number of bytes of encoded data." (emphasis in original). In other words, the stream length should be set to 64914, which is the length of the JPEG encoded data, and not 1051875 (which is presumably the length of the raw unencoded image data).
Moreover, four of the entries in the PDF xref table (at offset 65504) are incorrect:
the byte offset for object 2 is given as 1052085 (actual location of object 2 is 65124);
the byte offset for object 3 is given as 1052140 (actual location of object 3 is 65179);
the byte offset for object 4 is given as 1052206 (actual location of object 4 is 65245);
the byte offset for object 5 is given as 1052383 (actual location of object 5 is 65422).
Also, the PDF startxref pointer (at offset 65674) points to 1052465, whereas the actual location of the xref table is byte offset 65504.
Note that in all six cases, the error is exactly equal to the difference between the declared stream length (1051875) and the actual stream length (64914):
1051875 - 64914 = 986961
1052085 - 65124 = 986961
1052140 - 65179 = 986961
1052206 - 65245 = 986961
1052383 - 65422 = 986961
1052465 - 65504 = 986961
It looks like this is a simple programming error in the printer's PDF generation software: all the offsets are computed as if the length of the embedded image stream were 1051875, whereas it is actually 64914.
I scanned at different page sizes, resolutions, and so on. Each time, the actual byte offsets were slightly different (depending on the length of the encoded image stream), but the above relationships still hold in each case.
Please fix this! It is not really acceptable for an HP scanner to produce broken PDF files. Thanks, -- Peter

Hi @Selinger 
One of the important things to point out is that the Webscan feature was designed for diagnostic purposes. The intended method of scanning is with HP software and the software build into the OS.
For Mac users this includes Apple Preview, Image Capture, or scanning from the Print and Fax Window.
For Windows users, non HP software includes, Windows Live Photo GAllery, Paint, and Windows Fax and Scan.
Webscan is a great alternative, but it is very basic. Your best option it to install the HP software and us the HP Scan program.
If there is a particular reason you are using Webscan and prefer to scan this way, the only thing I can really suggest is to try a different browser.
Please let me know the outcome of a different Browser, what Browser you are currently using, and what happens when you scan with HP software. If you require further assistance, please also include your operating system. What operating system, and version do you have? Mac or Windows?
I hope this helps.
Please click the Thumbs up icon below to thank me for responding.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Please click “Accept as Solution” if you feel my post solved your issue, it will help others find the solution.
Sunshyn2005 - I work on behalf of HP

Similar Messages

  • How do I save a scanned document as a PDF file?

    How can I save a scanned document as a PDF file.  I'm using a Photosmart C6280.
    This question was solved.
    View Solution.

    If you still have the document re-scan it using the Solution Center saving it as a pdf file.
    Please mark the post that solves your issue as "Accept as Solution".
    If my answer was helpful click the “Thumbs Up" on the left to say “Thanks”!
    I am not a HP employee.

  • Add multiple scans to the same pdf file scan at time of scanning

    Hello, HP Officejet Pro 6830 How do I get it to add multiple scans to the same pdf file? I need to scan multiple documents and have them all end up in one pdf. Some may be double-sided and it is fine if I have to scan them individually. At present it will only do one scan or a double-sided scan then it wants to save the scan and not ask if there are any more pages. The only option is to save or not. Thank you.

    After your first scan you need to click the + sign at the 7 o'clock position. 

  • Attaching a scanned document to a pdf file

    how do I attach a scanned document into a pdf file that needs to be faxed                                       

    The ExportPDF service doesn't do anything like that.  Do you have Adobe Acrobat?

  • How do scan items to a pdf file on computer from scanner

    how to send scanned items to a pdf file on computer 

    Yes, very easy: Place a thumbnail picture and link the file to it (link command on top of the window).

  • I scan a page to PDF file, but I  couldn't to highlight it.

    I scan a page to PDF file, but I  couldn't to highlight it. I use Adobe Reader, is any way I can use this funtion?

    You need run OCR first using Acrobat, then you should be able to highlight text.

  • How do I save a scanned image as a pdf file w/ my MG5420?

    I just installed my MG5420 printer and when I scan an image I do not see an option to save the file as a .pdf file.  Is there a scan utility to assist in scanning documents?  There was a utility for my MP800 printer that made this very easy.

    Hi delach,
    It sounds like you may not have the My Image Garden software which allows you to scan documents to PDF.  You can download the software by following these steps:
    Go to Canon's Support & Drivers website at:
    http://www.usa.canon.com/cusa/support/consumer
    Enter your model name into the Model Name field, then click GO.
    Select your operating system in the Select Operating System and Select OS Version dropdown lists, respectively.
    Expand the Software section by clicking the red triangle.
    Click on the file named My Image Garden.
    After reading the details and disclaimer, click 'I Agree Begin Download' and save the file to your computer.
    Once the download is complete, double-click the file from its download location to begin the installation.
    Did this answer your question? Please click the Accept as Solution button so that others may find the answer as well.

  • Is it possible that 1 RTF template with 1 xml file produces many PDF files

    Hello,
    We are using XML publisher with eBS.
    As part of it, we've got treatments that generate invoices.
    One treatment produces 1 big XML file with many invoices in it.
    Currently, this big XML file generates one big PDF file with all the invoices.
    I'd like to know if it's possible to keep the big XML file (with many invoices), and instead, generate 1 PDF file per invoice ?
    Basically we would like to split the big PDF file into small PDF, one PDF file per invoice .
    If it's possible, then, how can we do it ?
    I hope my explanation is clear.
    Thanks in advance for your help,
    Olivier

    Have you tried the BI publisher bursting feature?
    Take a look at this:
    http://www.strsoftware.com/wp-content/uploads/2011/09/Oracle-EBS-and-BI-Publisher-Report-Creation-Bursting-and-Delivery.pdf
    http://garethroberts.blogspot.com/2008/03/bi-publisher-ebs-bursting-101.html
    Thanks,
    Bipuser

  • Does mail always insert the pdf file into the email rather than attachment

    How can I set mail to always attach pdf files as attachments rather than insert into the mail message?

    But if you receive an email with a PDF attachment - which appears as text embedded in the email rather than an attachment are you really saying that the only way to solve this is to resend the email to yourself so that it will then appear as a PDF attachment?

  • Getting Canon 4770 to scan multipages into a PDF file while using the document feeder

    Hi All
    I'm new to this forum.  So if I break any rules with my questions below, kindly forgive me.
    But my questions are about my Canon MF4770 printer.
    1.  Why can't I used the document feeder to scan multiple pages into one PDF file?
    2.  Why do I have to constantly select the remote scanner option when the printer is connected to my computer?
    Any help I can get will be very appreciated.
    Starlene

    Hi Starlene and welcome to the Canon forum,
    The ability to scan multi-page documents can depend on your operating system and version, and if the scanning software being used supports the feature.  However, the MF Toolbox application that accompanies your laser product offers this option when scanning via a USB connection to your computer.
    For example, if you are using Windows, below are the steps that would be performed:
    1.  Place documents.
    Note: Up to 35 documents can be loaded in the feeder.  Be sure to fan the stack, place it face up, and align the document guides to the width of document.
    2. Press [SCAN].
    3. Press [^] or [v] to highlight <Remote Scanner>, and then press [OK].
    The machine is now waiting to be scanned.
    4. Double-click the [Canon MF Toolbox 4.9] icon on the desktop.
    The MF Toolbox starts.
    5. Click [PDF].
    Select [PDF (Multiple Pages)] in [Save as Type].
    7. Specify the required settings as needed and click [PDF Settings].
    The [PDF Settings] dialog box appears.
    8. Specify the required settings as needed and click [OK]:
    [Create Searchable PDF] Converts the characters in the document to text data and makes the PDF document searchable with keywords.
    [Text Language] Select the language of the text to be scanned. The characters may be recognized more accurately if you select [English] from the drop-down list and set [Image Quality] to [300 dpi] or higher in [Scanner Settings].
    [PDF Compression] Select [High] for color images such as photos or illustrations to reduce file sizes.
    9. Click [Start].
    Note:
    A folder with the scanning date will be created in the [MY PICTURES] folder in the [MY DOCUMENTS] folder, and your document will be saved in this folder. If there is no [MY PICTURES] folder, the folder with the scanning date will be created in the [MY DOCUMENTS] folder and your document saved in this folder.
    For text documents or black-and-white documents, it is recommended you select either [BLACK and WHITE] or [GRAYSCALE] in [SCAN MODE].
    If making a Multiple PDF with color documents ([IMAGE QUALITY] set to [300 dpi]), it is recommended that the PDF have fewer than 20 pages.
    The MF toolbox can be loaded from your software CD or downloaded here from the "Drivers & Software" tab on the Canon USA website.
    I can understand some confusion on having to select "Remote Scanner" when you have a direct computer connection to your product.  The reason for this is that first, the "Remote Scanner" mode refers to seeing your computer as a stand alone, or remote, device; however, the "Computer" scanning mode option allows you to select which computer will be performing a scan when doing so over a network (Max: up to 10 computers).
    Another reason for having to select the scan mode is that the selection is not programmed but selected based on the type of scan you will be performing at the time.
    If this does not answer your questions or we can offer further assistance, please feel free to Contact Us.
    Did this answer your question? Please click the Accept as Solution button so that others may find the answer as well.

  • Is this possible? Name xml file produced by pdf file

    Hi,
    At work we have pdf files on the intranet completed by managers, which produces an xml file that is sent to a specific mailbox. Via a button, the form opens an Outlook email with the xml file attached. The user just has to click send. This works fine. However, we end up with a mailbox of xml files called 'reader.xml'.
    Is it possible for Adobe to rename the produced xml file using data from one or two text boxes in the completed pdf file?
    Would it be possible to add a timestamp to the file name as well?
    I've not used java before, but use VBA regularly, and would enjoy the challenge.
    At the moment I'm just exploring the possibility.
    Many thanks.

    The data is text and dates, not that much. I'd want the file to be named after the first and last name fields.
    I'm not sure the IT dept of our organisation would want to install script locally, it would be a very low priority for them apart from anything else.

  • Scan and name a PDF file before Emailing

    I have a client who scans a lot of files into PDF via Acrobat and then emails them. When she does this, the file name in the email is "untitled.pdf" (File > Attach to Email). She does not need to save copies of these PDFs to her machine, but she would like them to have a more descriptive name before attaching them to email. Does anyone know how to name a file once it's scanned and before it's emailed? This could save a lot of time for her. Any help is much appreciated!!

    What do you mean by scan -- a page of paper with a scanner or something else? Do you have Acrobat?

  • Downloading pdf files with firefox 6.0.2 doesn't always work but pdf files do download with IE

    for example the home page for the web site kerrville.org has several items to download and most are pdf files using firefox version 6.0.2 the download seems to start but never completes . If I use IE version 8 at the same web site I'm able to download any and all of the pdf files. I have tried this several times and get the same results each time.

    I would just get rid of the create PDF 1.1 and go with CutePDF instead. It's a much better program, it's free, and it doesn't create all the issues that the Adobe software does.

  • Interactive Report Produces Corrupt PDF File

    Hello.
    I am using Apex 3.1.2 and have created an Interactive Report (IR). I then chose the "Download as PDF" option and chose the "Open" option. This caused an error about a corrupt PDF file.
    I then saved the file to my Windows drive and attempted to open it using Wordpad. I saw the following error as the very first line in the file:
    ORA-06502: PL/SQL: numeric or value error: character string buffer too small
    I would much appreciate any suggestions as to what to do.
    Thank you.
    Elie

    No Christina,
    First I went under report attributes on Interactive reports under download
    Download formats:
    CSV and PDF
    I checked both. CSV format is fine. PDF format is corrupt.
    Next I then went in using the below:
    http://www.oracle.com/technology/obe/apex/apex31nf/apex31rpt.htm
    It tells me I don't have a print server defined. I am trying to get the INLINE method to work.
    I see in the ADMIN there is a print server configuration section though it is not totally clear as I would think that you would get the default print configuration as you would with any other PDF docuement.

  • Adobe Reader don't always open instruction pdf files

    I have a 27 inch iMac running the latest osx software 10.8.2. and Adobe Reader 10.1.4. Often when I download instruction manuals, such as the one for iDraw, which I purchased from the App Store;  it says the file is corrupt and can't open it. I have experienced this problem with several user manual pdf files. Is there another pdf reader that might be able to open these files. I don't mind paying for the software if it will work. Any help and suggestions would be greatly appreciated.
    Thank you,
    Leonorafromarcadia

    I did get around this problem by trashing Reader 8 and confirming that I still had Reader 7.05 installed
    This tip in the MacWorld forum worked for me:
    http://forums.macworld.com/thread/102114
    Now I can open pdf files from the internet or email, in Reader 7.05.

Maybe you are looking for

  • OIM 9.1.0.2 Resource Profile Query

    Hello I need to get the usr_key or any info on the users in OIM that have a certain condition. I have a number of OID resources for users that have a status or 'provisioned' but when looking at the resource tasks the system validation is set to 'canc

  • ICal not syncing from existing iPhone to new macbook pro hard drive

    Hi everyone, Can anyone help? I have just had the Hard drive on my Macbook Pro (mid 2009) replaced and I have then restored all previous hard drive content and settings from my time machine. Everything transferred across with no problem except that w

  • The Shared list does not appear in my iTunes

    The Shared list dos not appear in my iTunes, turned Home Sharing off then on many times, still not working, how could I solve this problem?

  • Auto-Login into these forums here possible ? Enabling by cookie ?

    I would appreciate if there is an auto-login into these forums from Oracle. Currently every time I re-visit the forum pages I have to type in my login+passwd again. Usually in hundreds of other forums there is an auto-login option selectable. Could e

  • How to set JTable column's color?

    How can I set JTable Columns' color? I only found this class DefaultTableCellRenderer which can set cell's color.