Identifying corrupt PDFs

Please bear with me if I've not put this thread in the right place, but I have a problem that I haven't been able to solve, so I thought I'd try this forum.
I have an automated process whereby I receive PDFs via FTP from a supplier, merge a batch of them together using PDFTK, and then send them on to a printing company. Recently the printing company has started complaining about corruption, so I investigated and found that some of the PDFs I was receiving were corrupt in some way. Adobe Reader and Acrobat still open the files and they look fine, although a distorted image is sometimes spotted; PDFTK still merges them, and printing is also fine for me. However, if I test such a file with an application called Solid PDF Converter, it won't open the file (with no specific error, unfortunately), which confirms the corruption.
I'm assuming that the Adobe products are clever enough to work around the corruption, but what I need is something that will check each PDF for corruption before I process it. Although there are a number of tools that claim to fix corrupted files, all I want is something that will just report the corruption to me so that I can send the file back to where it came from.
I'd appreciate any ideas you may have.

It still could be PDFTK's fault. The only way to tell is to experiment with the PDFs before you use them. But why not simply use Acrobat? Combine a batch of PDFs with Acrobat and send that in to see whether the printer still has problems.
By stress test I mean opening them. If they are reported as corrupt, then you know. I don't know of anything that will tell you this before you open them.
IanClements wrote:
I don't actually produce the PDFs; they come from an outside company who are being extremely unhelpful (ever heard of Right Now?). I just use PDFTK to merge them, so it's not PDFTK's fault if the result is faulty - garbage in, garbage out.
What do you mean by the "stress test"?
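
A report-only check of the kind asked for above could start with a few structural tests on the raw bytes. The following is a minimal sketch in Python, not a full validator: the function name `quick_pdf_check` is made up for illustration, and it only looks at the file header, the end-of-file marker, and the `startxref` pointer. It would catch truncated or badly mangled files, but it would probably not catch damage inside an image stream such as the distorted images described above.

```python
import re


def quick_pdf_check(data: bytes) -> list[str]:
    """Return the structural problems found; an empty list means none were detected.

    Hypothetical helper for illustration: a crude sanity check, not a PDF parser.
    """
    problems = []

    # A PDF normally begins with a "%PDF-x.y" header.
    if not data.startswith(b"%PDF-"):
        problems.append("missing %PDF- header at start of file")

    # The end-of-file marker should appear close to the end of the file.
    if b"%%EOF" not in data[-1024:]:
        problems.append("missing %%EOF marker near end of file")

    # "startxref" is followed by the byte offset of the cross-reference section.
    offsets = re.findall(rb"startxref\s+(\d+)", data)
    if not offsets:
        problems.append("missing startxref entry")
    else:
        # Use the last entry, since incremental updates append new ones.
        target = data[int(offsets[-1]):][:32]
        # A classic table starts with "xref"; an xref stream starts with "N G obj".
        if not (target.startswith(b"xref") or re.match(rb"\d+\s+\d+\s+obj", target)):
            problems.append("startxref does not point at a cross-reference section")

    return problems
```

In a batch job this could be called as `quick_pdf_check(open(path, "rb").read())` for each incoming file, with any file that returns a non-empty list held back and reported to the supplier instead of being merged.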

Similar Messages

  • Any way to repair corrupted pdf file?

    I created a PDF document on my PC and it became corrupted, showing all these strange characters.
    Now when I open the document I get an error message that wasn't coming up before. I'm hoping this message will shed some light on the situation.
    If anyone can help me, I'd be very thankful.

    General repair of a corrupted .pdf file in Adobe Acrobat Reader:
    1. Go to the official Adobe site, click the "Support" tab near the top of the screen (a white word against a black backdrop), click "Adobe Reader" under "Product Support Centers", and then select "I can't open a PDF document" under "Troubleshooting".
    2. Follow the instructions there to complete a general repair and installation of Adobe Acrobat Reader. For the most basic repair method, open Adobe Acrobat Reader from the Start menu (bottom left of the screen), select "Help" from the menu bar at the top, and click "Repair Adobe Reader Installation". A progress meter should appear while your installation is checked for errors and checked against the Adobe site for updates. If an update is available, another pop-up will ask whether you want to update; click "Yes".
    3. Reopen the PDF file. It should now be readable. If not, try the next step.
    4. If nothing has helped and you can't re-download the .pdf file from its original source, you can try a third-party online PDF repair service: https://onlinefilerepair.com/en/pdf-repair-online.html

  • Interactive Report Produces Corrupt PDF File

    Hello.
    I am using Apex 3.1.2 and have created an Interactive Report (IR). I then chose the "Download as PDF" option and chose the "Open" option. This caused an error about a corrupt PDF file.
    I then saved the file to my Windows drive and attempted to open it using Wordpad. I saw the following error as the very first line in the file:
    ORA-06502: PL/SQL: numeric or value error: character string buffer too small
    I would much appreciate any suggestions as to what to do.
    Thank you.
    Elie

    No Christina,
    First I went under report attributes on Interactive reports under download
    Download formats:
    CSV and PDF
    I checked both. CSV format is fine. PDF format is corrupt.
    Next I then went in using the below:
    http://www.oracle.com/technology/obe/apex/apex31nf/apex31rpt.htm
    It tells me I don't have a print server defined. I am trying to get the INLINE method to work.
    I can see that there is a print server configuration section in the admin pages, though it is not entirely clear to me; I would have thought that you would get a default print configuration, as you would with any other PDF document.
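
The Wordpad step described in this thread (finding an error message where the PDF header should be) can be automated. The following is a small sketch in Python; `describe_pdf_start` is a hypothetical helper name, and it assumes that a server-side failure writes its message as plain text at the start of the file, as happened here.

```python
def describe_pdf_start(data: bytes) -> str:
    """Say whether the data starts like a PDF; if not, show its first line as text."""
    if data.startswith(b"%PDF-"):
        return "looks like a PDF"
    # Not a PDF header: the first line is often an error message from the generator.
    first_line = data.splitlines()[0] if data else b""
    # latin-1 maps every byte to a character, so decoding cannot fail.
    text = first_line[:200].decode("latin-1").strip()
    return "not a PDF; first line: " + text
```

For the file in this thread, the function would return the `ORA-06502` line instead of leaving the PDF viewer to report a generic "corrupt file" error.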

  • Officejet L7680, corrupt PDF file when scanning

    When scanning on my L7680, I have just started to have issues with a "General Error" message (on and off) and also a corrupt PDF file error.  What can I do to fix this? 
    HELP!!!!

    Hi 1193,
    Welcome to the HP Forums!
    I see that you cannot scan with your HP Officejet L7680, and I am happy to help you with this scanning issue!
    For further assistance, I will need to know the following:
    Whether you are using a Windows or Mac operating system, and the version number. To find the exact version, you can use Whatsmyos.
    Whether the printer is connected by wireless, Ethernet, or USB.
    Whether the power cable is plugged into a surge protector or directly into the wall outlet (see "Issues when Connected to an Uninterruptible Power Supply/Power Strip/Surge Protector"; this applies to Inkjet printers as well).
    Whether the printer is able to make copies by itself.
    If you are using Windows, please try our HP Print and Scan Doctor, and let me know what happens!
    Hope to hear from you, and have a great day!
    RnRMusicMan
    I work on behalf of HP

  • Can't download PDF through IronPort (result:corrupted pdf)

    Hi,
    Users can't download PDFs through the IronPort web proxy (7.3.1).
    The result is always the same: a corrupted PDF.
    Who can help me?

    Hi,
    Please check the access policy that is matched: in the 'Objects' column, see whether you have any limit set or any file types blocked.
    You can also perform a test: create a Custom URL Category and add the website you are downloading the PDF from to it, then in the access policy, under URL Filtering, set this Custom URL Category to 'Allow'.
    Regards,
    Kush

  • Pdf image recovery from corrupt pdf files

    The PDF file in which I kept my pictures got corrupted. I used image extractor tools, but to no avail. Please help me; I am clueless about what to do.

    How silly I was: I had been using PDF files since the day I learned to operate a computer and didn't know anything about them. Then one day I thought, why don't I learn something about PDF and become a master of it?
    These days PDF files are among the most commonly used worldwide. Whatever part of society we belong to, we use PDF files as individuals or as organizations, so they have become an essential element of computing.
    "Generally a PDF, or Portable Document Format, file is a self-contained cross-platform document which appears the same as a soft copy or a hard copy. PDF files are used by all of us because they contain the complete formatting of the original document, including fonts and images. PDF files are highly compressed, allowing complex information to be downloaded efficiently."
    PDF is very popular because it is an easy way of transferring files over the internet: it maintains the original formatting and secures documents in a way that other file formats don't.
    A PDF file contains text, images, or both. It can be used for an office presentation, a school assignment, or a personal collection. But sometimes we don't need the text inside a PDF file; we need only the pictures. We then usually copy the images from the PDF file and paste them into a new PDF file. That copy-and-paste process takes a long time and is tiring, so what we need is an application that can extract all the images from our PDF files in a very short time.
    But think about this: how can you extract images from a PDF file which is corrupted? Surely there is no software application that can extract images from a corrupt PDF file. Or is there?
    Actually there is a tool which can extract images not only from a normal PDF file but also from a corrupt one. With it anyone can extract the images from a single PDF file or from multiple PDF files of all versions (1.3/1.4/1.5/1.6/1.7, from Adobe Acrobat 3.x to Adobe Acrobat X), whether normal or corrupted, and it is very simple to use. After extracting the images, it lets you save them in different formats such as JPEG, BMP, PNG and GIF. It is one of the fastest extraction tools and completes the extraction in very little time.
    I used this tool as it was referred to earlier in this thread, and I am totally satisfied with it: PDF Image Extractor from SysInfoTools. What a utility; excellent work done by the experts.
    http://www.sysinfotools.com/recovery/pdf-image-extractor.html

  • Where can I download a corrupt PDF viewer?

    I found some broken PDF files while doing a raw recovery with Ontrack, and I get similar issues when trying to open these files (I tried to match some up based on file size). How can I view a corrupted PDF? Where can I download a corrupt-PDF viewer?

    Wait a few hours and you will get a spam reply, like the ones on forums.planetpdf.com.

  • How to repair a corrupted PDF file?

    Hi there !
    I am having problems with a couple of PDF files which got damaged by a hard drive breakdown and the subsequent file recovery.
    I cannot open them ("...not supported"), and several attempts to repair them with PDF repair software have been unsuccessful.
    The interesting point is that in terms of file characteristics everything seems to be OK (file size, file type etc. are recognized by the computer).
    Does anyone have an idea what could be done to get my PDFs back to a readable state?
    Thanks     timrelo
    PS: In case you have an idea about how to fix a corrupted PDF file, I am willing to send you one of my files.
          To do so, please send an email to    [email protected]    and I will email you a file.

    Hi Bernd,
    Unfortunately I don't have the source file (InDesign etc.) to re-create or export a new PDF.
    All I have is a couple of PDF files restored from the crashed hard drive, which are reported as "corrupt/damaged".
    greets   tim

  • SPro | SQR APY1060 is randomly generating Corrupted PDF's

    Hi,
    We're in the process of implementing SPro FSCM 8.9 on 8.47.11 for one of our clients. While running the OOB SQR APY1060, we've found that the SQR randomly generates a corrupted PDF. When we run the process again, it works fine. It is also difficult to replicate in other environments.
    Has anybody faced an issue like this before with any PS-delivered SQR?
    We've been able to zero in on the culprit: Control-M. We've noticed a pattern: using the same run control ID, if we run the PSJob from the online page, it runs absolutely fine. However, when we schedule it through Control-M, the corruption happens. Also, if we create separate Process Requests in Control-M to schedule the individual processes rather than the OOB PSJob, the SQR runs fine. Not sure if someone has come across such an issue.
    Thank You
    Prashant
    Edited by: PSFT_PP on Feb 9, 2009 8:47 AM

    I am not sure about this, but do you have different run controls for the different OSes, or the same run control? Unix always recognises CAPS run control values. Please check that.

  • Corrupt PDF from Photoshop CC

    This week I found that I cannot save a file to a PDF from Photoshop CC. Both Mac and Windows.
    The file shows that it is saving and gets to 10%, then acts like it has saved.
    Trying to open the PDF brings up an "Out of Memory" error.
    Is anyone else experiencing this issue?

    Hi,
    I think you can try a utility called Advanced PDF Repair to repair your PDF files. It works rather well for my corrupt PDF files. Its web address is http://www.datanumen.com/apdfr/
    Hope this will help.
    Alan

  • Detecting corrupt PDFs programmatically

    Hello,
    Background: I have written and am using a C++ plug-in for Acrobat on Windows.
    I've encountered a couple of corrupt pdf files that cause a problem with my plug-in. I am unable to open these files manually in Acrobat or Foxit Reader (just to demonstrate it's not an Acrobat bug).
    I was wondering if there was a way to detect these files prior to opening a pdf using PDDocOpen() in my plug-in? I haven't come across anything in the SDK/API.
    I did figure out how to detect these corrupt files manually. I use the Recognize Text in multiple files->Select a folder option and it will indicate any problem files with a red X symbol.
    If there's a call in the API that can detect these files, let me know.
    Thanks.

    No, there is no way to detect them other than having PDDocOpen() throw an error or return NULL.
    Leonard Rosenthol (reply sent by email to the Adobe Forums notification dated Mon, 28 Nov 2011 10:43:54 -0800; original post created by zephed56 in Acrobat SDK, http://forums.adobe.com/message/4050665#4050665)
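
If a failed open is the only available signal, as the reply above says, a batch can still be sorted by attempting the open on every file and recording which ones fail. The sketch below shows that pattern in Python rather than with the Acrobat SDK. The `open_pdf` parameter is a hypothetical stand-in for whatever open call is actually used (something that raises an error or returns nothing on a bad file, in the way the reply describes for `PDDocOpen()`), and `split_by_openability` is a made-up name.

```python
from typing import Callable, Iterable


def split_by_openability(
    paths: Iterable[str],
    open_pdf: Callable[[str], object],
) -> tuple[list[str], list[str]]:
    """Try to open each path; return (opened_ok, failed).

    open_pdf is a hypothetical stand-in for a real PDF open call.
    A raised exception or a None result is treated as "could not open".
    """
    ok: list[str] = []
    bad: list[str] = []
    for path in paths:
        try:
            doc = open_pdf(path)
        except Exception:
            # The opener rejected the file; record it and keep going.
            bad.append(path)
            continue
        if doc is None:
            bad.append(path)
        else:
            ok.append(path)
    return ok, bad
```

The point of the design is that one bad file does not stop the run: the failures are collected in a list that can be reported or moved aside while the rest of the batch proceeds.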

  • Corrupt PDF File (Error 109) detection

    Hi there,
    I might well be in the wrong place here ...
    As a company we use List&Labels to create our invoices in PDF format. We create about 70000 invoices each month.
    Now and then a single PDF document is corrupt. I'd like to program or use (in Visual Studio 2010) some PDF validation before sending the PDFs to our print provider, because this dialog pops up in the production environment and somebody has to react to it, which is not good :-) as none of the 70000 documents gets printed until somebody clicks the OK button.
    We get the following error dialogs (see below):
    I'd like to catch the error programmatically, or analyze the PDF in code to see whether it is corrupt or not.
    The PDF itself has misplaced text and invalid character spacings so it is useless to be sent to customers.
    Any ideas or recommendations?
    TIA a lot
    Ole Grossklaus

    Sven-Ole,
    I don't think you can attach files to forum posts - regardless of how you
    try to submit the attachment. You can either host the file on your own
    server, or upload it to some other file sharing service and post the link
    here.  Something that is easy to use is Dropbox. Just upload the file and
    then select to share it.
    As has been mentioned before, you cannot use Acrobat on a server, but if
    the problem is always the same, there may be an easy way to detect that
    specific problem. You would have to use a PDF library/toolkit/framework to
    get access to the structure of the PDF file. One option for this would be
    the Adobe PDF library (and because this is an Adobe forum, I'll refrain
    from mentioning any non-Adobe software).
    The key would be to - with the help of the PDF specification - identify the
    nature of the problem, and then verify that all your
    non-printing/non-processing files have the same problem. Once the source
    for the problem is identified, you can then use the PDF library (that is,
    if the file loads at all) to look for the problem in the files before you
    hand them off.
    Karl Heinz Kremer
    PDF Acrobatics Without a Net
    [email protected]
    http://www.khkonsulting.com

  • Mail is corrupting PDF attachments

    About 25% of the PDF files I receive as attachments show as an icon in the top part of the message, but also show as unreadable hex "text" in the bottom part of the message.
    If I click on the PDF icon in Mail.app, it opens fine in Preview.app.
    If I forward any mail with this sort of PDF attachment (showing as "text" at the bottom) to a Windows user, they can NOT open the file; Adobe Acrobat Reader says the file is corrupt.
    A corollary: a LOT of the PDF files I receive from an e-fax service show as CORRUPT on the Mac but open FINE on Windows (Adobe Acrobat Reader); the error says the file is not "correctly encoded".
    This is a SERIOUS PROBLEM. If I can't get Mail to reliably open and forward PDF files to Windows users, I have no choice but to give up on Mail.app and use Windows.
    I have to deal with way too many PDF files as attachments to worry that every one I send may be corrupt for some Windows user.
    If Apple would give us a way to submit one of these files, I would gladly do so, but apparently from Mail.app they do not!
    Any ideas?

    Keith,
    I saw your other post.
    For the forwarding problem, try this again, but before sending, click Format in the menu bar and choose Make Plain Text. Then let me know what happens.
    I do not think you have a transmission problem. The problem in the other topic, with Walter, is still under review, but it has been traced to one particular IMAP server and occurs regardless of which network you are connected to.
    The text at the bottom may be a clue that Rich Text is being converted to HTML at some point, and that the HTML has compromised the content of the PDF.
    Feel free to send me an example, by email. My address can be found by clicking on my name.
    Ernie
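
One way to test the theory that the attachment is altered somewhere between sender and recipient is to compare the bytes of the file as sent with the bytes of the file as saved by the recipient. The following is a minimal sketch in Python; `first_difference` is a hypothetical helper name, and it assumes both copies are available on one machine.

```python
from typing import Optional


def first_difference(sent: bytes, received: bytes) -> Optional[int]:
    """Return the offset of the first differing byte, or None if the copies are identical."""
    for offset, (a, b) in enumerate(zip(sent, received)):
        if a != b:
            return offset
    # All shared bytes match; the copies differ only if one is longer.
    if len(sent) != len(received):
        return min(len(sent), len(received))
    return None
```

A result of `None` would mean the file arrived intact and the problem lies in how it is displayed or opened. An early offset would suggest the content was rewritten in transit, and looking at the bytes around that offset may show what kind of conversion was applied.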

  • Preview corrupts PDF documents when saving

    There appears to be a serious bug in Preview and/or OS X's PDF creation libraries, that causes PDFs to be "invisibly" corrupted.
    My situation is: I have a number of PDFs that have been created using Windows-based OCR software. They are standard PDF/A documents, and when I open them in Preview they display fine. More importantly, I can "copy" the text from the document to the clipboard and it works as you would expect.
    However, if the PDF document is edited in any way (page order changed, a page from another document moved into the document, etc.) and then saved, the resulting PDF is corrupt. Although the text appears on the screen normally, any text copied to the clipboard is garbled: for example, the displayed text:
    AUTUMN SPECIAL!
    appears in one of my documents. However, highlighting it and copying it results in
    *)﴿*%&(﴾'"!# $ 
    in the clipboard.
    I now have hundreds of documents that are effectively useless, as I cannot accurately copy the document text. I know others have had the same issue (see the posts in this Superuser.com thread for examples). The issue would appear to go as far back as Lion and possibly before then.
    Is this a "known issue"? And has anyone come up with work-arounds - other than re-OCRing the files (usually by exporting as TIFFs, reOCRing etc)?

    This issue is still happening on Mavericks and it also happens with Adobe Acrobat Reader. If you highlight some lines on a PDF text and then save it, the OCR becomes unreadable. If you try to redo the OCR on Acrobat Pro, this is impossible because the pages 'contain renderable text'. The only solution I can fathom is not to use annotation on any PDFs. The strange thing is that I cannot find any solutions to this, or any bugs submitted on Acrobat's forums, where they should also be, because this is not just a problem with Apple.

  • Annotations made in Preview to PDFs created from MS Word corrupts PDF

    In Preview, when I save annotations to PDFs created from an MS Word document, the PDF seems to be corrupted. Searching in the PDF no longer shows any results, and text copied from the PDF is displayed as a series of meaningless characters when pasted into a note annotation, or pasted into another file (e.g. a TextEdit file). So for example, "the monitoring of progress" becomes ")@%8,1),&12%,=%4&,2&//" when copied and pasted.
    Is there any way to prevent PDFs from being corrupted in this way, or to restore the PDF so that searching and copying work correctly?

    I suspect this is a problem with M$ failing to handle PDF files correctly. I would suggest you ask your question over on the M$ forums instead of here.
    Allan
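
For anyone with many files affected by the copy-and-paste garbling described in the last two threads, a rough heuristic can flag extracted text that looks like the examples above. The following is a sketch in Python: `looks_garbled` and its `threshold` parameter are made up for illustration, and the text is assumed to have already been extracted from the PDF by some other tool. It simply measures what share of the visible characters are letters, so it would also flag legitimately number-heavy text such as price lists.

```python
def looks_garbled(text: str, threshold: float = 0.5) -> bool:
    """Return True if fewer than `threshold` of the non-space characters are letters.

    Crude heuristic for illustration only; digits and punctuation count against the text.
    """
    visible = [ch for ch in text if not ch.isspace()]
    if not visible:
        # Nothing to judge; treat empty text as not garbled.
        return False
    letters = sum(1 for ch in visible if ch.isalpha())
    return letters / len(visible) < threshold
```

With the samples from the thread, "the monitoring of progress" is not flagged, while ")@%8,1),&12%,=%4&,2&//" is.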
