Read metadata from PDF via VBA

I wrote a VBA script that reads the custom properties of a collection of Word docs and writes the results to a file which I can then import to Excel.
Now I have to do the same thing for the PDF files created from the Word docs - that is, read the metadata from a collection of PDF files.
The fields that have to be read are always the same.
How can I read the metadata of a CLOSED PDF file from VBA?
For Word, the solution was to use DSOleFile.PropertyReader (part of the DS: OLE Document Properties 1.4 object library.
TIA

I refer you to some of the vendors that I just posted - perhaps there is a low cost solution you can find. Pound Hill has some scripting interfaces that might help.
Otherwise you will need to take the free SDK and hook it up somehow.
Gunar

Similar Messages

  • How to read tags from pdf to print in different printers (PCL or PS)

    Hello from Spain.. I would apreciate some help who to read tags from pdf to discriminate PDF Production tag for send print PDF to printer job using driver PCL or PS. I get trubles when print PDFs files made by PDFcreator (Ghostscript, intermediate PDF from PS) in PCL job printer. I need to eval tags from PDF for send command line os javascript inside Adobe Acrobat to print in the correct job printer: PCL or PS...
    Thanks.

    As noted you cannot access the "Tag" of a PDF but you can access the meta data of a PDF and one of the items recorded in the meta data is the "producer". This meta data tag is supposed contain the name of the program that produced the PDF. One can access this through the "info" property or the "metadata" property of the doc object.
    var cProducer = this.info.Producer; // get the producer application for this PDF;
    console.show();
    console.clear();
    console.println("This PDF was created by " + cProducer);
    Be aware that the user can easily modify the Producer information with Acrobat, JavaScript, or other application after the PDF has been created.

  • Extract metadata from PDF

    Hi all
    Maybe the wrong category to post this question but I am wondering if it is possible to extract the metadata from PDF document as well as we do from images?
    Since both metadata are based on XMP from Adobe it should theoretically be possible. Anybody knows how?
    Thank you.
    Nitai

    XMP extraction from PDF format files is not implemented by interMedia.
    You can learn more about XMP at the Adobe web site. It is possible to create a simple XMP extractor by implementing a byte scanner that looks for the XMP indicator string. However, I believe that the PDF format allows for some object blocks to be marked old or superseded by newer blocks. Thus it may be possible to have more than one XMP block in a PDF file. You would need to know more about PDF format to determine which is the current block.

  • Bug: Keywords behave incorrectly after "Read Metadata from File"

    (Happens in LR3.4 RC and probably also in previous versions... but I am not sure if it happened in LR2)
    The problem seems tro be somthing like de-sync of database status and what we see on the screen.
    1) Let's say we have a photo with keyword "abc"
    2) Click in the Keyword List on the arrow around the "abc" keyword to show only photos with this keyword (single one), leave the filter on this settings
    2) Save Metadata to File
    3) Edit XMP in an external application, add new keyword "def"
    4) Read Metadate from File
    5) Now what happens is: The photos disappers from the screen as if it lost the "abc" keyword (weird), but if you look at the Keyword List, the "abc" keyword has still 1 photo attached to it. Also, if you release the filter and find the photo, it clearly has the "abc" and "def" keywords in the Keywording panel. And you can search for this keyword in the text filter (works correctly), but if you click on the arrow around the "abc" keyword, the photo is not shown.
    6) If you restart the Lightroom, it will work correctly again.
    I have an own application that is able to modify the XMP and the modifications may not be perfect, but in my opinion, this shouldn't happen when reading the metadata. If any developer is reading this, I can supply a file with original and changed metadata for debugging.
    It would be great it this could be resolved.
    Thanks!

    Please do report your find with the details you state (well done!) here: https://www.adobe.com/cfusion/mmform/index.cfm?name=wishform
    Thanks.........

  • How to read HyperLinks from pdf file??

    hi developer's,
    I am in PDF processing... I am having doubt in that Processing.
    How to read Hyperlinks from PDF file?
    I can able to set the hyperlink.. But i cant able to get the hyperlinks..
    The following example program will set the hyperlink to the PDF file using lowagie API..
    import com.lowagie.text.Anchor;
    import com.lowagie.text.Chunk;
    import com.lowagie.text.Document;
    import com.lowagie.text.DocumentException;
    import com.lowagie.text.Paragraph;
    import com.lowagie.text.html.HtmlWriter;
    import com.lowagie.text.pdf.PdfReader;
    import com.lowagie.text.pdf.PdfWriter;
    public class Argu1 {
         public static void main(String[] args) {
              Document document = new Document();
              try {
                   PdfWriter pdf = PdfWriter.getInstance(document,
                             new FileOutputStream("PageLink.pdf"));
    PdfReader pdf_read=new                
                   document.open();
                   document.add(new Paragraph("Hi Everbody....!"));
                   Anchor pdfRef = new Anchor("Click Me");
                   pdfRef.setReference("www.java2s.com");
                   Anchor rtfRef = new Anchor("Touch Me");
                   rtfRef.setReference("www.sun.com");
                   System.out.println(rtfRef.reference());
                   document.add(pdfRef);
                   document.add(Chunk.NEWLINE);
                   document.add(rtfRef);
              } catch (DocumentException de) {
                   System.err.println(de.getMessage());
              } catch (IOException ioe) {
                   System.err.println(ioe.getMessage());
              document.close();
    Help me how to read the Hyperlinks from the PDF file using java ...
    Thanks in advance,
    With Regards,
    J.Imran

    Instead of cross-posting unformatted code you could have taken a look at the API, because there you might have come across a method named getLinks...Even though it's not documented, I really suspect that it will return the Hyperlinks on a given page.

  • Error: Could not read Metadata from Portal Database !

    Hi All!
    My problem is when i am trying to open the portal page (Oracle 10G)
    http://domain/portal/page?_pageid=6,1,6_13&_dad=portal&_schema=PORTAL
    then i have an Error: Could not read Metadata from Portal Database.
    And i do not know why because everything is up and running. I can connect and browse my database schema with portal user. I can connect to pls/orasso and where ever i want.
    [07/Feb/2006:00:57:23 +0100] "GET /pls/portal/htp.p?cbuf=Test HTTP/1.1" 200 5
    plz help i am trying to fix this in the last 2 weeks :)
    thx kEndE

    Hi Andreas,
    I've changed the permissions to 775 and changed the ownership of /sapcd (volume name) to orasid:dba (including all files within the staging area /sapcd/ora-patch).
    So the permissions and ownership seems to be fine.
    I've also ran your suggested commands but the issue still exists.
    I'm now liasing with our UNIX guy to check if there is any underlying permission issues on the volume (sapcd) that may be causing this problem.
    In the mean time, let me know if you have any further thoughts.
    Thanks,
    Chris

  • Reading content from PDF to XI

    Hi All,
    Can I know about, is XI capable of reading the content from PDF. As I came to know that we can achieve this with the adapter module can you please ignite me how to do for this or is there any other option other than this way or show me if already any threads or articles are existed?

    Hi,
    Please find some links on Convertion agent which not only convert the PDF documents. it is used to convert PDF, Word Document, HL7 and more...
    pdf files *
    SAP Network Blog: XI: Read data from PDF file in Sender Adapter
    /people/sap.user72/blog/2005/07/31/xi-read-data-from-pdf-file-in-sender-adapter
    SAP Network Blog: XI: Generate PDF file out of file adapter
    /people/sap.user72/blog/2005/07/27/xi-generate-pdf-file-out-of-file-adapter
    http://help.sap.com/saphelp_nw04/helpdata/en/43/6d95e0ac846fcbe10000000a1553f6/CMGetStart.pdf
    http://help.sap.com/saphelp_nw04/helpdata/en/43/4c38c4cf105f85e10000000a1553f6/content.htm
    Regards,
    Phani

  • Read barcode from pdf

    Hi all,
    Please can any one tell me how to read barcode from pdf in java. This will be very help full for me.
    Thanks,
    Avinash

    Avinash_loves_JAVA wrote:
    Please can any one tell me how to read barcode from pdf in java. This will be very help full for me.JMF doesn't support PDFs or barcode reading.
    If you need further assistance with this issue, you'll need to repost in a generic Java forum, or perhaps do a google search for a barcode-reading library.

  • Bug: "Read metadata from file"

    When I add GPS coordinates to a file (with the program Geotagger) and select "read metadata from file" to import the new metadata, a color adjustment badge appears on the thumbnail (even though I didn't adjust any colors). This only happens with files that have a cropping applied to them in Lightroom. If the file doesn't have a custom cropping the color adjustment badge doesn't appear. It happens with both JPG and CR2 files. Seams like a bug to me.
    I'm on a 24" iMac 2.8 with 4 GB and OSX 10.5.5 / Lightroom 2.1

    It also happens with DNG. Anyway, I don't think it has anything to do with the GPS, but the issue has been around right back through version 1.x Try the following:
    1. Select an image that has no crops or develop adjustments
    2. Switch to Develop module
    3. Apply a crop
    4. Save metadata to file (Cmd+S)
    5. Read metadata back from file (Metadata menu - "Read Metadata from file"
    Please report what happens?
    At my end the Crop badge remains but a Develop Adjustment badge also appears - IMO it shouldn't, but as stated above it has for a VERY long time. I'll bug it again and see what happens.

  • Batch read metadata from file

    Hi Everybody,
    I wonder if Lightroom 3 offers a function that lets me read metadata from file for multiple images in a batch process. The reason is that I have changed the GPS information for quite a couple of my pics using the Geoencoding Support pluging from Jeffrey Friedl and let it write back into the DNG-file. Since I also flushed the shadow data I can't see any GPS information in my metadata table in Lightrrom even though the data are present in the DNG. Now the real question is: Is there a better way to get my GPS data into Lightroom than emptying the Library and reimport all of my pics?
    Oh, and before I forget it: Yes, I have searched this forum - and a couple of others as well - and I found stuff about batch processing date/time information. However, the method presented there apparently does not apply to my problem.
    Thanks for any information.
    Have a nice day,
    f.

    In Library Module, select Grid Mode. Select all images you wish to read metadata from. Go to Metadata>Read Metadata from Files
    Similarly, you can edit time for time zone/ AM-PM issues, etc by going to Metadata>Edit Capture Time

  • Reading Metadata From File Resets the Setting

    I am using Lightroom 4 on MAC OS X Yosemite. I have done some editing on a picture in DNG format and then clicked "Save Metadata to File".  I checked the XMP file with textedit and found that the change is indeed saved. But when I click "Read Metadata From File", it resets the pictures before any editing. Please, I need help to fix this problem.

    jinsheng wrote:
    I have done some editing on a picture in DNG format and then clicked "Save Metadata to File".  I checked the XMP file with textedit and found that the change is indeed saved.
    I'm confused - I'm pretty-sure Lightroom 4 will not save changes to an xmp file (even if DNG is read-only, unlike Bridge) - it embeds the metadata in the DNG file. Did you mean you edited the DNG file with a text editor? Anyway, if you do have a .xmp sidecars to your DNG, I recommend deleting them, or renaming them (until you figure out what's going on anyway), then retry - any difference?
    PS - last I checked, Lr will generate an error when saving metadata if DNG is read-only, but since you're having a strange problem, I'd definitely make sure they are not read-only.
    Note: Lr WILL read metadata from an xmp sidecar, *iff* it seems newer than what is embedded. So, IF you have xmp sidecars that have a recent modification date, or *maybe* (I don't really know) if Lr was NOT able to successfully embed metadata into your DNG, it could be reading some incompatible metadata from the xmp sidecar - thus the reason to get it out of the way to test..

  • Looking for pure Java API to read metadata from an MP4 file.

    Hi, I am looking for a pure Java api that can read MP4 metadata. I have been trying to look online for one but have not found anything
    suitable. If there are no good api's, can someone point me to resources on how to parse metadata from an MP4, tutorials would be great
    as well. I have looked online for that as well but have not found much. Any help would be greatly appreciated.

    Cross posted
    http://www.java-forums.org/new-java/59652-looking-pure-java-api-read-metadata-mp4-file.html
    http://www.coderanch.com/t/580833/Streams/java/Looking-pure-Java-API-read
    http://stackoverflow.com/questions/10568588/looking-for-pure-java-api-to-read-metadata-from-an-mp4-file
    db

  • Read metadata from file cancel modifications !

    Hi,
    I use Lightroom 1.1 in french. I made some photo (in raw but in jpeg the result is the same) and import them in Lightroom with conversion in DNG. In Lightroom, in develop module, I crop, change contrast and som others.
    After that, I geotag my DNG files with HoudahGeo. HoudahGeo write geotag in the file. So back in Lightroom, I make a 'Read metadata from file' ('Lire les métadonnées depuis les fichiers' in french) for having geotag in Lightroom.
    Geotag are now in Lightoom, nice, but in develop history, I have a new line 'Rénitialiser les paramètres' !!! So I can't see all my modifications... Is it normal ?
    Thank you
    PS: Sorry for my english...

    I think you should test this with one or two pictures - import/convert, save metadata to file, geotag, read metadata. Also try import/convert, make small adjustment, save metadata to file, geotag, read metadata. In iView I often write metadata to DNG's that have been adjusted, then read the metadata in Lightroom - sometimes I add more data in iView and do more adjustments.
    I have seen the problem you describe, but it has been with raw files which I have adjusted in Lightroom but haven't saved out the metadata. I have then created an xmp sidecar in iView, and read the metadata in LR. So the sidecar had no ACR/LR instructions. The result was that my adjustments were wiped out. What I suspect is that something similar is happening with your DNGs - that conversion creates them but doesn't write the ACR/LR fields.
    So try a few alternative sequences.
    John

  • Feature Request: save metadata to file, and read metadata from file.

    Please have function to save metadata to file, and read metadata from file.
    This would allow us to write cross-platform plugins that can assure data is up-to-date in the xmp, or modify xmp and have it reinstated in the catalog.
    This would go nicely with the ability to save xmp for virtual copies, dng, and rgb formats.

    Perhaps someone with direct experience with publishing to Smugmug will chime in.  Short of that, some thoughts on your issues:
    Should I not bother to "Save Metadata to File", this seems to be what is messing up the SmugMug publish service?
    I very much doubt that it is messing up the publishing.  What gets saved to the file from the catalog is independent of what gets exported.  Saving metadata to the file is generally a good practice (in case you screw up your catalog backups).  Many, perhaps most, users set the preference Catalog Settings > Automatically Write Changes Into XMP.
    I would have expected it to behave like an export which would write synonyms, exclude keywords that should be excluded, etc, which seems to work ok in point 6 above.
    Yes, that's a reasonable expectation.
    You may have done this already, but double-check: Right-click the Smugmug publishing service in the left-hand column of the Library, select Edit Settings, and in the Metadata section verify that you have Include: All Metadata set.
    Did you download the Smugmug plugin from Smugmug? You might try this Smugmug plugin instead -- the author designed the publishing architecture of LR and has an excellent reputation. 
    Are the publish services more effort than they are worth? So I get SmugMug to work, what happens when I try Flickr?
    I think many users find the publishing services are generally worthwhile.  I publish regularly to Flickr (using the built-in Flickr plugin, which isn't completely reliable) and the Zenfolio plugin from Friedl. 

  • Read Comments from PDF...

    Hello Everyone,
    I need to know that how can we read comments from PDF file programmatically using C++.
    I was thinking was to read from file itself. I thought the PDF file must have some text like "StartComments" and "EndComments". (if we open it in notepad) but didn't found anything like that.
    Any suggestions ?
    Or anyone can refer any free Library which does this thing.
    Thanks
    ...Pankaj

    For API, Application Program Interface, there is the Acrobat SDK forum:
    http://forums.adobe.com/community/acrobat/acrobat_sdk
    For coding in Acorbat JavaScirpt there is ths the Acrobat Scripting forum:
    http://forums.adobe.com/community/acrobat/acrobat_scripting

Maybe you are looking for