Auto delete duplicate pages in a merged pdf?

Hi,
I was wondering if anyone knew of a way to automatically have Adobe Professional search a pdf and delete/remove all of the duplicate pages?  I have merged a bunch of separate pdf files where some of the files have duplicate pages...thus instead of searching the entire pdf for the duplicate pages manually (+500 pages), I was hoping Adobe Professional could do this somehow automatically.
Thanks,
kmullen

Attached a VBS example how to get the text of a PDF page.
So you can loop trough the pages and delete if text1 = text2
The documentation you will find in the Acrobat SDK.
Section: IAC (InterApplication Comm..).
If you need examples for single commands search in Google for that.
The search function here is not very good.
HTH, Reinhard
GetText.vbs
'// Save this as GetText.vbs and start with Double Click
'// Acrobat with a Document must be OPEN
set WshShell = CreateObject ("Wscript.Shell")
WshShell.AppActivate("Adobe Acrobat")
WScript.Sleep 500
'// get the active Document
Set AcroApp = CreateObject("AcroExch.App")
Set AVDoc = AcroApp.GetActiveDoc
Set PDDoc = AVDoc.GetPDDoc
Set PdfPage = PDDoc.AcquirePage(0) '<<--SET in FILE! =Pagenumber
Set PageHL = CreateObject("AcroExch.HiliteList")
PageHLRes = PageHL.Add(0,9000) '<<--SET in FILE! (Start,END[9000=All])
Set PageSel = PdfPage.CreatePageHilite(PageHL)
for i = 0 to PageSel.Getnumtext - 1
  pdfData =PDFData & PageSel.GetText(i)
Next
msgBox PDFDATA

Similar Messages

  • How do I delete some pages from an existing pdf file?

    I have an existing pdf file that is too large to send to some people. How can I delete some pages from this existing pdf file, and break it up into two files?

    Acrobat
    You can download a 30 day trial at that link.
    You may also be able to do it with CreatePDF, but I'm not sure.

  • How can I auto delete certain pages

    I need to delete some pages in a large PDF file. The headings are unique. For example, If each page of the file contained the name of a country as the heading and a description of that country in the body; I would need to identify certain countries and delete those pages. Thanks, in advance.

    That is not a simple task. You would need to loop over the pages, checking their contents word-by-word using the getPageNthWord method until you find what you're after. Then you would delete the page using the deletePages method. Just keep in mind that removing items in an array that you're iterating over can cause problems, so it's better to iterate from the end of it to the start.

  • Standardise page sizes in merged PDF?

    Hi everyone,
    I'm trying to merge together a PowerPoint '07 and an Excel '07 file, both set to A4 lanscape page layouts, in Acrobat 9 Standard.  However, in the final merged PDF, the old Excel pages are huge compared to the PowerPoint slides - about 6 times larger.
    Any ideas how I can force Adobe output a merged PDF all in the same page size and orientation?
    Thanks in advance,
    Simon

    Acrobat's combine feature to merge files into one PDF does not provide a page size "editor".
    Likely a cleaner work flow if the input files had page size adjusted by their native application.
    Once done, use combine to merge into the single PDF.
    You may find this is (over all) easier than using the Crop Tool resize features.
    To view any PDF's page size when open in Adobe Reader/Acrobat go into Preferences.
    Select the Page Display catagory then tick the "Always show document page size" choice.
    Be well

  • Use varying page sizes when merging PDFs in Preview?

    In 10.5, one could use Preview to merge multiple images or pages into a single PDF. Just open one document, show thumbnails in the sidebar, and drag in any additional items. The additional items could be other PDFs or images (like JPGs, Photoshop files, etc.). The dimensions and page size of the new items were maintained.
    For example, I would use this a lot to package website mockups and other designs into a single document to share with clients. The title page could be a different size from the mockups.
    In 10.6's Preview, when I drag items into the sidebar, they are added to the PDF, but placed on a page that is the size of the pages already in the document.
    For example, I have a couple pages of 8.5x11 in my PDF. Then I drag in an image that is much smaller, but Preview frames it on a white 8.5x11 page, so there are huge white borders around the image.
    Is there a way to get 10.6 Preview to behave more like 10.5 in this respect? Thanks!

    In Preview's sidebar, just drag one document on top of (i.e. not above and not below) another, but it sounds as if you're doing that.
    Make sure you are doing this in the sidebar:
    and make sure they are both PDFs.

  • Can't delete blank page in combined files PDF

    I combined a number of files to create a new PDF binder in Acrobat X, then added a blank page in the middle using Tools > More Insert Options > Insert blank page. I later decided I didn't need the blank page and tried to delete it but received this error message:
    One or more pages are in use and could not be deleted.
    I gave up, trashed the PDF, and started all over again. But the new binder included the blank page in the same spot, and it won't let me delete it! I tried creating the PDF using my old Acrobat Pro 8. It created the file correctly, but when I closed the file and reopened it, it opened in Acrobat X - with the blank page again! How do I get rid of the blank page?

    Open the file in Acrobat, delete the blankpage, and save the file.

  • Deleting one page?

    How do I delete one page in an Adobe PDF?  No editing restrictons.

    thanks for this.  Actually, following another thread after I posted this, I saw a recommendation to sue some free software called PDFsplitter (www.pdfsplit.com) - and I have successfully used that to remove the one page I wanted to.

  • Can I rearrange pages when I merge files with Adobe PDF Pack?

    Hi
    Have purchased PDF pack as an upgrade, but not happy with what it does as doesn't seem to be any better than what I had before
    When you merge documents there doesn't appear to be any tool to rearrange the order or to delete any pages you don't want, which leaves it very limited
    Perhaps I'm missing something, but without this feature it's no better than what I had before
    Can anyone either help or perhaps someone from Adobe can contact me to discuss a refund
    Thanks

    Hi mdibmead,
    I'm sorry that PDF Pack isn't meeting your needs. It's true that to alter PDF pages in any way, you need Acrobat. PDF Pack lets you convert files to and from PDF and combine files to PDF, but not edit PDFs.
    I would be happy to take care of a cancelation and refund for you. Please let me know how you'd like me to proceed.
    Best,
    Sara

  • Delete A Large Number of Randomly Distributed Duplicate Pages

    Hello Sir,
    I have a pdf file with as many as 10,000 pages, which actually should has 6,000 pages. The extra duplicate 4,000 pages are randomly distributed in the file without any discipline. Do we have such as tool can scan, compare and pick out the duplicate pages by a batch processing  so that I don't need to do this manually. If we don't have such a function, can we develop it for next version/generation? Thank you very much. My email is [email protected]

    You can manual mark the duplicate pages with comments. After this you can delete the marked pages with Acrobat Javascript.

  • How to delete duplicate templates in pages 09?

    I want to delete duplicate templates from "My Templates" but can't.  Tried to drag them to the trash bin but they won't budge.  Can't find a delete command on the menu bar, either.

    You can't delete templates or any file from within Pages, it's done from Finder. Pages stores those you created & saved as templates in (your account) > Library > Application Support > iWork > Pages > Templates > My Templates. The door to the user's Library is hidden in Lion but it is easy to open. In Finder, hold down the Option key while clicking on the Go menu & your users Library will appear about halfway down the list.

  • How do you duplicate a PDF fill in form page, WITHOUT linking the text (for MAC)???  That is, while still being able to type different things into the blanks on the duplicate pages?

    The Situation (again, I have a Mac):  I am filling out a 5 job application- it is PDF format, with questions, and fill in the blanks for your information- such as name, date, and employment history.  Page 3, is the page used for employment history.  But, there is only room for two jobs on page 3.  The instructions tell you, you can "have as many page three's" as you want.  In my case, I will be putting 8 jobs- at two jobs per page, that is a total of 4 page three's.   I already know how to duplicate a PDF page, simply by opening the thumbnail viewer on the side, holding down the "alt" key, and clicking and dragging the desired page to create a duplicate.  So what is the problem??? When you duplicate the page, the boxes into which you type, are linked.  So if on the original, you type "Job A", into the experience box, it forces the same answer on the duplicate page, making it impossible to describe different positions!!!!  Is there a way to work around that? Thank you!!!!!

    You're probably using Acrobat if you can duplicate pages like that. This is the Adobe Reader forum...
    To answer your question, though: The only way this can be done automatically, without having to manually rename each field, is to use a Template to spawn new pages. This is something the form's author should have done, but you can do it yourself as well, using some code.

  • Merging PDF / Page Numbers / Acrobat SDK V9  & LiveCycle

    Hello everyone,
    I use Adobe LiveCycle to create forms, Visual Studio 2005 and the Acrobat SDK for the application I'm programming to fill in these forms. The application fills in the forms and merges them with no issues.
    The problem I have is that these pages have page numbers in the upper right. I use the Page N of M object on these forms in LiveCycle. PDF page numbers are filled in correctly when filling in the forms, but when I merge PDFs, the pages keep their original page numbers. I've looked at the Windows - Interapplication Communications even using templates to no avail.
    How can I merge these pdf's and have my program renumber these pages correctly and how can insert pages anywhere I want in the merge document?
    Below is the code I use to merge the PDF's. It was posted in a forum.
    Sub MergePDF(ByVal ThePath As String, ByVal outFileName As String)
    On Error GoTo serror
    Dim dPDDocMerge As New Acrobat.AcroPDDoc
    Dim dPDDoc As New Acrobat.AcroPDDoc
    Dim strFiles() As String
    Dim numPage As Integer
    Dim TotalPage As Integer
    Dim objThisFile As IO.FileInfo 'get FileInfo object for file string
    strFiles = System.IO.Directory.GetFiles(ThePath) ' Read in the file names
    Dim b As Boolean ' mostly for testing purposes... could use it for error 'checking to make sure that a file is really added before deleleting it...
    For i As Integer = 0 To strFiles.Length - 1 ' run through all the files in 'the directory
    objThisFile = New IO.FileInfo(strFiles(i)) ' Get the extension
    If objThisFile.Extension = ".pdf" Then ' Only add in PDFs
    If dPDDocMerge.GetFileName = "" Then ' check if it's the first file
    dPDDocMerge = New Acrobat.AcroPDDoc
    b = dPDDocMerge.Open(strFiles(i)) ' open first file
    TotalPage = dPDDocMerge.GetNumPages
    Else
    dPDDoc = New Acrobat.AcroPDDoc
    b = dPDDoc.Open(strFiles(i)) ' open other files
    numPage = dPDDocMerge.GetNumPages ' get the page count
    TotalPage += numPage
    b = dPDDocMerge.InsertPages(numPage - 1, dPDDoc, 0, dPDDoc.GetNumPages, _ False) ' Insert
    End If
    End If
    Next
    'b = dPDDocMerge.Save(1, ThePath & "\" & outFileName) ' save file
    b = dPDDocMerge.Save(1, ThePath & "\" & "\MyTest.PDF") ' save file
    b = dPDDocMerge.Close()
    Exit Sub
    serror:
    MsgBox(ErrorToString)
    End Sub
    Thanks for any code or advice.

    You can't merge LiveCycle forms this way :(. LC forms are NOT standard PDF files and can't be processed in the same way.

  • Which product do I purchase???? - Creating a merged PDF report from excel with cover page

    Hi Guys,
    I work in the BI field and have created a pdf document using PDFCreator and another third part scripting tool called jpdfbookmarks.
    We have delivered something to the client but the quality of the output ( not the best) and the time it takes to cycle through each page ( when being viewed) within the merged
    pdf document takes a little time...
    So my boss as asked me to explore something within the Adobe suite of products BUT.... i have no idea which one i should choose.
    My requirements are as follows:
    To deliever this report to the client the program needs to be able to do the following:
    1)  convert excel files to pdf
    2) take these individually created converted excel to pdf converted files and and merge them into 1 document
    3) within this merged document have the abilty to be able to bookmark each page
    4) have a cover page with hyperlinks
    5) and finally but most important of all to be able perform steps 1 to 4 above programatically through scripting language vb script / vba
    nice to haves:
    6) abilty to email each merged doc to list of email address
    7) abilty to edit document to add in hyperlinks etc
    Need to get an understanding of costs ie
    1) how much per licence
    2) what product with have all the features above and how much
    Can anyone please be kind enough to point me in the right direction as to what product would be best suited?? Being able
    to scipt a solution together is key.
    Cheers
    Shockwave

    You can do everything except the cover page with Adobe Acrobat (Standard or Pro).  There are demos on our website that you can install and try out.  You will need the SDK documentation and sample code to learn how to automate/program.

  • Cannot delete pages after saving a PDF in X Pro

    I am on Windows 7 and Adobe Acrobat XPro.  Every time I try to delete pages in any PDF after saving I get the error message "one or more pages are in use and could not be deleted".  the work around is to close the file and then reopen.  Then you can delete the pages.  It is annoying to have to shut and reopen the file everytime after making a change.  Is there a fix to this?

    Have you tried Help > Check for updates (in Acrobat)?
    That very annoying problem is fixed in XI. I seem to recall it could be fixed by updating X, but I could be mistaken. Please report back.

  • When I try to delete page(s) from a PDF document I created, I geth the following message: "One or more pages are in use and could not be deleted." Any suggestions?

    When I try to delete page(s) from a PDF document I created, I geth the following message: "One or more pages are in use and could not be deleted." Any suggestions?

    I sent an email to TerraGo Support and they sent the following response: "This has bug has been discovered and our development team is quickly creating a patch to resolve the problem. The patch should be available within a week or two. For your reference, the bug number assigned to this case is: 3620. Check back in about two weeks and hopefully the patch will be available. Again, I apologize for the inconvenience."

Maybe you are looking for

  • Cannot send email from iPhne 5s

    I have just upgrade from iPhone 4 to iPhone 5s.  Excited but now I cannot even send any email.  No problem receiving. My setting in my email account is POP, incoming and outgoing mail server is mail.optusnet.com.au which is the same seeting as my pre

  • IE load is OK, Problem with Firefox...tries to load from local drive

    My own website, in Firefox, opens fine when it is the Home (index.php) page OR a link from my homepage to a third party website. BUT, links from the home page to my other pages do not bring up images, possibly do not use the css (not sure), and try t

  • How do I read & update email group subscription status for a specific contact via api?

    Looking at Eloqua's api documentation, I can find out how to update contacts, and even the global subscription flag, however we want to build a page on our custom website whcih allows users to opt in and out of specific email groups (as can be done m

  • Management packs installation on OEM 12C.

    Hi, In one of my projects i need to apply below Management Packs Plus on OEM12C.could any let me know what is the proccess need to be followed? Oracle Management Pack for WebCenter Oracle Business Intelligence Management Pack Management Pack Plus for

  • Failed load Kernel Modules | fglrx exec Format Error

    Hello after I started my computer this morning it won't load fglrx anymore. Yesterday everything worked just fine. But I can still start the xserver without any problem. systemctl status systemd-modules-load.serivce outputs: systemd-modules-load.serv