Help extracting URL's from PDFs I create

Using Acrobat pro 9.+ I have several PDF catalogs I have created, and what I need t
o do is extract the URL's for each 100 page catalog.
These are added URL's using the Acrobat linking tool
Ideally they would be one URL per row, and have at least a minimum amout of information.
1. URL
2. (if possible, the page the URL is from)
Thank you

maybea two step process? one applescript that extracts to a text file, and then an automator that can read that file (can autoamtor read froma  text file?) and do what you want?
Jason

Similar Messages

  • Extract embedded xml from PDF/A-3b (also creation)

    Hello there,
    in the context of a research project, we are currently trying to extract embedded xml from a PDF/A-3b document via code.
    The project deals with establishing a new invoicing standard (Zugferd: ferd-net.de, only german). Invoices are expressed via xml, which is embedded in PDF/A.
    What we are trying to archive is extraction of the xml via java code. For testing purposes, we are currently using an third party skd to extract the invoice-xml, by calling a .EXE file and then picking up the results in java.
    I currently have only one valid example file that can be processed via this sdk. To get more data, i used the test version of acrobat pro to alter the embedded xml file. To be more specific, i deleted the embedded file, added a new xml file, and used preflight to make the PDF conform to /A-3b. Although the file seems to have the same properties as the original, it can no more be processed via the extraction sdk. Since messing around with acrobat does not seem to get me anywhere, i am now looking into extracting data from the pdf my self.
    Is there any present implementation/library/solution for extracting data in a java context? The few third party tools i found are all based of a .net/windows native environment. I have heard rumors about Adobe giving out tools to extract embedded data from PDF/A?
    How is it the other way around? Is it possible to embedd xml into a PDF via Java? Given there allready is PDF file which we can attach to.
    I really appreciate reading and thanks for any help or input!
    Greetings,
    Florian

    Hi Florian,
    I would look for general purpose PDF libraries that can open a PDF and access data objects in it.
    All in all it is not too difficult to get to the embedded XML, once you have a library that can access and read data structures/data objects inside a PDF file. Some understanding of the inner workings of PDF data structures will help you get the job done (e.g. read the section about embedded files in the PDF standard / ISO 32000-1, as well as the chapter about PDF syntax).
    Olaf
    Am 19 Aug 2013 um 13:19 schrieb xfrapp <[email protected]>:
    Extract embedded xml from PDF/A-3b (also creation)
    created by xfrapp in PDF Language and Specifications - View the full discussion
    Hello there,
    in the context of a research project, we are currently trying to extract embedded xml from a PDF/A-3b document via code.
    The project deals with establishing a new invoicing standard (Zugferd: ferd-net.de, only german). Invoices are expressed via xml, which is embedded in PDF/A.
    What we are trying to archive is extraction of the xml via java code. For testing purposes, we are currently using an third party skd to extract the invoice-xml, by calling a .EXE file and then picking up the results in java.
    I currently have only one valid example file that can be processed via this sdk. To get more data, i used the test version of acrobat pro to alter the embedded xml file. To be more specific, i deleted the embedded file, added a new xml file, and used preflight to make the PDF conform to /A-3b. Although the file seems to have the same properties as the original, it can no more be processed via the extraction sdk. Since messing around with acrobat does not seem to get me anywhere, i am now looking into extracting data from the pdf my self.
    Is there any present implementation/library/solution for extracting data in a java context? The few third party tools i found are all based of a .net/windows native environment. I have heard rumors about Adobe giving out tools to extract embedded data from PDF/A?
    How is it the other way around? Is it possible to embedd xml into a PDF via Java? Given there allready is PDF file which we can attach to.
    I really appreciate reading and thanks for any help or input!
    Greetings,
    Florian
    Please note that the Adobe Forums do not accept email attachments. If you want to embed a screen image in your message please visit the thread in the forum to embed the image at http://forums.adobe.com/message/5606424#5606424
    Replies to this message go to everyone subscribed to this thread, not directly to the person who posted the message. To post a reply, either reply to this email or visit the message page: http://forums.adobe.com/message/5606424#5606424
    To unsubscribe from this thread, please visit the message page at http://forums.adobe.com/message/5606424#5606424. In the Actions box on the right, click the Stop Email Notifications link.
    Start a new discussion in PDF Language and Specifications by email or at Adobe Community
    For more information about maintaining your forum email notifications please go to http://forums.adobe.com/message/2936746#2936746.
    Olaf Druemmer | Managing Director | callas software GmbH | Schoenhauser Allee 6/7 | 10119 Berlin
    Tel +49.30.4439031-0 | Fax +49.30.4416402 | [email protected] | www.callassoftware.com
    Amtsgericht Charlottenburg, HRB 59615 | Geschäftsführung: Olaf Drümmer, Ulrich Frotscher

  • How to extract a still from iMovie to create a jpg

    I'm trying to extract a still from iMovie to create a jpg. I've been able to make a freeze frame, but now I want to export it into iPhoto as a jpg? How?

    I suggest that you use a free app called MPEG Streamclip.
    Detailed instructions are here.
    https://discussions.apple.com/docs/DOC-3231

  • Extracting Tiff Image from PDF document

    Hi Leo,
    I need to extract Tiff images from PDF file in .NET applications. Is there any way to extract images using Javascript, Plug-ins or other APIs.
    If possible can you kindly send some code snipplet

    LiveCycle is a range of products, designed (almost all) to run on
    servers. Except LiveCycle Designer, bundled with Acrobat. These
    provide a Java API.
    http://www.adobe.com/products/livecycle/
    Aandi Inston

  • Php extract url's from recordset

    Hello guys,
    I need to extract url images from a recordset (the recodset is an html page) for build a feed.
    To do this I use regular expression to find url and I get the results with print_r function.
    Good, now I need to replace the array key with xml tags, to do this I need to replace the array key with regular expression ?
    I think that is not the best way...

    Hello again, someone know if exist a regular expression generator tool online ?

  • Extract email addresses from PDF file?

    Hi,
    Does somebody know if there is any -builtin- way to extract email addressed from PDF file in acrobat?
    I tried 'save as' text/excel but this is a laborious task, especially when the pdf is large!
    Thanks

    I've developed a script that does just that. Have a look here:
    http://try67.blogspot.com/2012/02/acrobat-list-all-email-addresses.html

  • Extracting Linked images from PDF

    Good afternoon,
    I have a PDF which I created in illustrator but unfortunately I had a problem with with my computer which meant that the recovered files were corrupted. I found a PDF on disc which opens in Acrobat just fine but when I open it in illustrator it asks for the linked images which of course I do not have.
    The linked images must be in the PDF as they show up ok.
    Is there a way of extracting the images from the PDF so that I can do a repair in illustrator?
    Hope someone can help.
    Thank  you,
    Kirk

    If you have photoshop, open that pdf from it.  File/open, then chose images...
    Another option, if you have Acrobat Pro, go to Advanced/Document procesing/Export images

  • Auto-extracting URL LINKS from webpage?

    is there a way to extract all the linked url's from a webpage?
    i am trying to put together some lists and would like to gather these up and then insert them into a numbers spreadsheet. in some cases there are 30 to 50 of these listed on the left hand side of a webpage and right now I imagine clicking on a link, copying the url from the browser, inserting into  numbers, clicking back, clicking on the next link etc.
    is there a way to save some steps by exporting these to a text document and then copying and pasting from there?
    thanks for any help.

    You create the new bookmark with the javascript replacing the original URL from any old bookmark you have and no longer need, or just create a new bookmark for any thing at all specifically for your links script. Then once you are at a page where you want the links displayed, you just select your new links bookmark. Your page will stay open, but a new window will open as well, with all the links listed. You might want to change the size of the window, the width=400 and height=200 result in a pretty small window. Anyway, I put the Links javascript bookmarklet in my Bookmarks menu, so while I am here on this page I just go to the Bookmarks menu, slide down and select Links, and a new window opens with all the links on this page listed.
    Francine
    PS--It would be more impressive if I had written that script, but I didn't. I found it years ago and saved it with a batch of bookmarklets that do other useful things, such as suppressing animated gifs. I just checked, and a particularly useful collection, Jesse's Bookmarklets, still exists:
    https://www.squarefree.com/bookmarklets/

  • Help with exporting data from pdf form

    I have about 100 pdf forms that I created in adobe forms central and distributed as a pdf form (rather than on the web). I am trying to export the data into a spreadsheet but when I export it, the fields are all jumbled in the csv file, as in they are not in the same order. I need to export the data all together so I'm going to the forms menu and selecting "manage form data" and then selecting "merge data files into spreadsheet". I tried exporting a single file but that gave me something really weird.
    Please help, I have a deadline next week to analyze this data and can't make sense of it once it is exported to a spreadsheet.

    Would you please share your form with me and send me one of your pdf forms and some of the csv files?
    You can share your form by doing the following:
    1. Click on the “Share” icon on the bottom left corner.
    2. Click on “Add Collaborator” on the popup menu.
    3. Enter [email protected] under “People to share with”.
    4. Set subject to "Export data from pdf form"
    5. Click the “Share” button on the bottom right of the dialog.
    Thanks
    Ken

  • How to extract the image from pdf file

         Hai friends........
             Is it possible to extract the images in a page from pdf file.
             If so. please share with me.......
        Thanks in advance,
        abu

    In later versions of Acrobat you can select an Image with the Select tool, then right-hand click for Save options.
    ------------->
    It helps if you quote your exact version of Adobe Acrobat/Reader - choose [Help, About...] to find this.
    Also useful: Version numbers of other software (e.g. Word) if relevant. Age of computer and amount of memory (RAM) available (r/h cllcking on 'My Computer' and choosing Properties gives you this, plus processor speed).

  • Can't create a multiple file PDF from PDF forms created in Designer

    Hi
    I want to create a single PDF which combines 6 pdf forms created in Adobe Designer (all as separate pages).
    When I try to create a single PDF from these multiple files using the "Create a PDF from Multiple files" command, my A-3D won't let me do this returning a dialog box saying
    "The file "filename.pdf" is protected. It cannot be used for this command".
    I can't find any properties that control this, either in the original file or in A-3D. Can I overcome this and how?
    Many thanks in anticipation.
    Phil

    Designer-created files aren't really PDF files any more and cannot be
    edited or combined in Acrobat (including Acrobat 3D). You CAN make a
    package of them, however, in Combine Files.
    Aandi Inston

  • How can I extract single pages from pdf document

    how can I extract a single page from pdf document

    Purchase and install Acrobat XI. 
    Open a multi-page PDF.
    Use the click path of:
    Tools - Pages - Under "Manipulate Pages": Extract
    Be well...

  • How to extract word coordinates from PDF using vc++6.0

    In sdk,i just know how to get coordinate from pdf using javascript,and it will be completed use vb.but i dont know how to get the coordinate througt vc++6.0.anyone can help me?
    thank you advance!

    PDEWordFinder is the usual method for getting words and co-ordinates.
    PDFEdit is not usually used, it is not suitable for getting text.
    It is very hard work to make the two worlds work together (e.g. to
    edit text you find).
    Aandi Inston

  • Extracting Zoomed in views from PDFs to create new ones.

    I recently purchased Acrobat for use with my small business.  We wanted an easy way to go from blueprints in a large  PDF to blown up shots of portions of certain pages (each blown up shot being its own page) in a new PDF.  I really feel like there is an easy way to do this in Acrobat X Pro but the only way that I have figured out how to do this, is it copy a section from the PDF using the selection tool into MS Paint and saving this as a jpeg and combining all the jpegs into a long PDF.  Even this doesn't always work, as sometimes Acrobat will not give me the option to copy, only copy with formatting (I have changed the general option to tell the select tool to use images before text).
    This is extremely clunky and overly time consuming.  Is there a faster way to do this, or is should I look into writing a macro to do it?
    Thank you for your help!

    Sometimes, Acrobat only gives me the option to "Copy with Formatting" which means it does not recognize the selection as an image.  The document has no security, and only being able to use the snapshot tool renders terrible quality when blown up.  The only way I have discovered around this is to zoom way in on the document to make the snapshot get better quality before sizing that down to a normal page.  This is terribly inefficient and a huge time sink when I have to do this 400+ times each month.  Any ideas, alternative ways, or advice would be greatly appreciated!
    Thank you for your effort!

  • How to Extract Text coordinates from PDF

    Hi,
    can anyone tell me how to get coordinates in pdf document using VB or .NET, suppose if some text is written in pdf document then how can i get coordinates of that text. Its very Urgent.
    Thanks in Advance.

    I am trying to use the getPageNthWordQuads information to determine if a word on the page is within a region that I am interested in.
    I have a limited knowledge of javascript and have been looking up text manipulation functions and array manipulation functions in an attempt to figure out how to separate the values that are returned from the Quads routine. The Adobe documentation indicates that the Quads function returns an array, but when I try to access one of the values in the array, it gives me the entire contents of the array as though it is a string. If I use the .length function to try to determine the length of it, it tells me it is length of 1! I obviously am mis-handling this reference, but I have yet to find any specific examples that work with the quads array the way I am trying to work with it....
    Here is my code...I am running it against an open file in batch processing mode(maybe this has something to do with it)...
    var sourceDoc = this
    var tx1=492.5;
    var ty1=761.5;
    var tx4=563;
    var ty4=726.2;
    try {
    for (var j = 0; j < (this.numPages); j=j+2){
    var cnt=0;
    var rcvrnum="";
    cnt = sourceDoc.getPageNumWords(j);
    if (j == 0) {
    try {for (var i = 0; i < cnt; i++) {
    var quads = sourceDoc.getPageNthWordQuads(j,i);
    var x1 = quads[0];
    console.println("Page(" + j + "),Word(" + i + ") = " + sourceDoc.getPageNthWord({nPage: j, nWord: i}));
    console.println("Quads length is " + quads.length);
    console.println("X1 = " + x1);
    if ( x1 >= tx1 & x1 <= tx4 & y1 >= ty4 & y1 <= ty1 ) {
    console.println("Q1 is good");
    console.println("Page(" + j + "),Word(" + i + ") = " + sourceDoc.getPageNthWord({nPage: j, nWord: i}));
    } catch (e) { console.println("Aborted: " + e) };
    } catch (f) { console.println("Aborted: " + f) };
    I have tried several variations of the code above to try to extract my values so that I can compare them, but to no avail. The above code outputs to the console the following...
    Page(0),Word(0) = OTTO
    Quads length is 1
    X1 = 19.350006103515625,782.15087890625,126.51744079589844,782.15087890625,19.350006103515625, 721.5038452148438,126.51744079589844,721.5038452148438
    Page(0),Word(1) =
    Quads length is 1
    X1 = 125.17047119140625,782.15087890625,153.91525268554688,782.15087890625,125.17047119140625, 721.5038452148438,153.91525268554688,721.5038452148438
    and so on...
    x1 becomes the entire output from the array and yet I can not perform a simple split function on x1. If I try to split X1 into an array by splitting on the comma, I get the following error.
    Aborted: TypeError: x1.split is not a function
    Am I supposed to import some libraries or something?
    Thanks for any help....
    Kevin Ailes

Maybe you are looking for

  • Hey my icloud account is disable so how can i reset it please any is help me

    i have iphone5 iOs8.2 wichi is 16 gb so im bedle suffering to restart my phone due to icloud account is disable so please help me

  • 8 issues JavaFX needs to overcome to be truly successful

    JavaFX is excellent technology but I feel it will never be broadly successful until the following issues are addressed. Note, I am referring to JavaFX running inside a browser as an applet in all these remarks. 1. Startup Times They are just way too

  • Black screen upon waking Mac

    Mac Mini 2011 with 2x4GB RAM, 2.3Ghz i5, 500GB 5400RPM HDD So I need some help regarding my Mac and it's driving me mad! Its a bit tempermental and it isnt always the same occurance each time, but basically symptons are (somtimes it occurs, sometimes

  • Date function Retirement in due next 8 month

    Dear I am developing report and developed all query, Infact i want to display all employee records who has only 8 month remain to retirement age (AFTER 60 YEAR EMPLOYEE WILL BE RETIRE). Please guide me how can i do this Please find query : SELECT DIS

  • 100mbps multi-port fiber card for Sol10?

    Does anyone know of a multi-port fiber card (PCI) with Solaris 10 support that supports 100Base-FX ? We have a requirement to provide 4 such ports for a customer, and are having a tough time finding such a beast. We can use either quad-port or 2 dual