Converting a pdf image embedded in an XML file to a PDF file

Hi experts,
I have an XML file which contains a PDF image of a billing document and i want to extract this PDF image and convert it into a PDF file.
Any ideas how to do this?
Thanks
Ally

Answered!!

Similar Messages

  • Embedding data from xml file into metadata of a pdf

    Hi All
    I'm wanting to do the following, but struggling to figure the right way to go about it.
    I want to embedded data from my MIS into a pdf's metadata (as scrnshot). I can create a standalone xml file with all the data I require, but I'm unsure how to automate that being embedded into a pdf's advanced metadata. I know this can be done, as it worked at a previous employer, but I didn't get chance to find out how they did it.
    I'm wanting to do this so I can carry out a more advanced search of the metadata in Bridge.
    Any advice would be appreciated!

    Hi Northern,
        I have modified the modifyingXMP sample for you. After this change, put your xmp file as sample.xml and also put pdf file in the same folder where ModifyXMP executable is. After merging my changes, ModifyXMP file will read the sample.xml and will embed it into pdf file.
       Please follow the following steps
    1. Download XMPToolkit SDK and follow the steps to compile Sample
    2. Open ModifyingXMP file, replace all the content of that file with the below content
    3. Compile the ModifyingXMP file.
    4. The ModifyXMP.exe will be generated in folder (samples\target\windows\Debug), if you have changed the output folder it will be generated there.
    5. In parallel to ModifyingXMP.exe put the sample.xml (the xml file you have) and also the pdf file (say pdf file name is mypdf.pdf)
    6. Go to console and change directory to the directory where ModifyingXMP is and pass the following command
    ModifyingXMP mypdf.pdf
    7. Open the pdf file and check that value/properties
    For your reference, I am putting the content of the sample.xml file too, put this content in sample.xmp and any pdf and you will find subject field is getting added.
    ************** content of the sample.xml file. Create a file name sample.xml and put that content. Put sample.xml in parallel to ModifyingXMP.exe*******
    <rdf:RDF xmlns:rdf='http://www.w3.org/1999/02/22-rdf-syntax-ns#'>
        <rdf:Description rdf:about='' xmlns:dc='http://purl.org/dc/elements/1.1/'>
            <dc:subject>
                <rdf:Bag>
                    <rdf:li>XMP</rdf:li>
                    <rdf:li>SDK</rdf:li>
                    <rdf:li>Sample</rdf:li>
                </rdf:Bag>
            </dc:subject>
            <dc:format>image/tiff</dc:format>
        </rdf:Description>
    </rdf:RDF>
    ******************* MODIFIED CONTENT OF MODIFYING.CPP FILE. ***************************************************************************************** ************
    // ========================================================================================= ========
    // Copyright 2008 Adobe Systems Incorporated
    // All Rights Reserved.
    // NOTICE:  Adobe permits you to use, modify, and distribute this file in accordance with the terms
    // of the Adobe license agreement accompanying it.
    // ========================================================================================= ========
    * Tutorial solution for Walkthrough 2 in the XMP Programmers Guide, Modifying XMP
    * Demonstrates how to open a file for update, and modifying the contained XMP before writing it back to the file.
    #include <cstdio>
    #include <vector>
    #include <string>
    #include <cstring>
    // Must be defined to instantiate template classes
    #define TXMP_STRING_TYPE std::string
    // Must be defined to give access to XMPFiles
    #define XMP_INCLUDE_XMPFILES 1
    // Ensure XMP templates are instantiated
    #include "public/include/XMP.incl_cpp"
    // Provide access to the API
    #include "public/include/XMP.hpp"
    #include <iostream>
    #include <fstream>
    using namespace std;
    * Creates an XMP object from an RDF string.  The string is used to
    * to simulate creating and XMP object from multiple input buffers.
    * The last call to ParseFromBuffer has no kXMP_ParseMoreBuffers options,
    * thereby indicating this is the last input buffer.
    #include <sstream>     
    SXMPMeta createXMPFromRDF()
        string rdf;
        //open the RDF file and put it's content into rdf buffer
        ifstream inFile;
        inFile.open("sample.xml");//open the input file
        if (!inFile.is_open()) {
            cout <<"Couldn't open xml file" <<endl;
            exit(1);
        stringstream strStream;
        strStream << inFile.rdbuf();//read the file
        rdf = strStream.str();//str holds the content of the file
        SXMPMeta meta;
        // Loop over the rdf string and create the XMP object
        // 10 characters at a time
        int i;
        for (i = 0; i < (long)rdf.size() - 10; i += 10 )
            meta.ParseFromBuffer ( &rdf[i], 10, kXMP_ParseMoreBuffers );
        meta.ParseFromBuffer ( &rdf[i], (XMP_StringLen) rdf.size() - i );
        return meta;
    int main ( int argc, const char * argv[] )
        if ( argc != 2 ) // 2 := command and 1 parameter
            cout << "usage: ModifyingXMP (filename)" << endl;
            return 0;
        string filename = string( argv[1] );
        if(!SXMPMeta::Initialize())
            cout << "Could not initialize toolkit!";
            return -1;
        XMP_OptionBits options = 0;
        #if UNIX_ENV
            options |= kXMPFiles_ServerMode;
        #endif
        // Must initialize SXMPFiles before we use it
        if(SXMPFiles::Initialize(options))
            try
                // Options to open the file with - open for editing and use a smart handler
                XMP_OptionBits opts = kXMPFiles_OpenForUpdate | kXMPFiles_OpenUseSmartHandler;
                bool ok;
                SXMPFiles myFile;
                std::string status = "";
                // First we try and open the file
                ok = myFile.OpenFile(filename, kXMP_UnknownFile, opts);
                if( ! ok )
                    status += "No smart handler available for " + filename + "\n";
                    status += "Trying packet scanning.\n";
                    // Now try using packet scanning
                    opts = kXMPFiles_OpenForUpdate | kXMPFiles_OpenUsePacketScanning;
                    ok = myFile.OpenFile(filename, kXMP_UnknownFile, opts);
                // If the file is open then read get the XMP data
                if(ok)
                    cout << status << endl;
                    cout << filename << " is opened successfully" << endl;
                    // Create the XMP object and get the XMP data
                    SXMPMeta meta;
                    myFile.GetXMP(&meta);
                    // Create a new XMP object from an RDF string
                    SXMPMeta rdfMeta = createXMPFromRDF();
                    // Append the newly created properties onto the original XMP object
                    // This will:
                    // a) Add ANY new TOP LEVEL properties in the source (rdfMeta) to the destination (meta)
                    // b) Replace any top level properties in the source with the matching properties from the destination
                    SXMPUtils::ApplyTemplate(&meta, rdfMeta, kXMPTemplate_AddNewProperties | kXMPTemplate_ReplaceExistingProperties | kXMPTemplate_IncludeInternalProperties);
                    // Check we can put the XMP packet back into the file
                    if(myFile.CanPutXMP(meta))
                        // If so then update the file with the modified XMP
                        myFile.PutXMP(meta);
                    // Close the SXMPFile.  This *must* be called.  The XMP is not
                    // actually written and the disk file is not closed until this call is made.
                    myFile.CloseFile();
                else
                    cout << "Unable to open " << filename << endl;
            catch(XMP_Error & e)
                cout << "ERROR: " << e.GetErrMsg() << endl;
            // Terminate the toolkit
            SXMPFiles::Terminate();
            SXMPMeta::Terminate();
        else
            cout << "Could not initialize SXMPFiles.";
            return -1;
        return 0;
    Please let me know if you find any issue/assistance.
    -Sunil

  • Export as PDF with embedded profiles for Grayscale images

    Does InDesign CS6 export pages with Grayscale images as PDF with embedded profiles?
    Possible profiles, for instance:
    – Gamma 2.2
    – Dot gain 20%
    – Black Ink ISO Coated v2 (ECI)
    Export mode:
    – Acrobat 5 or higher
    – No color conversion
    – Embed all profiles
    Test by Acrobat Pro
    Advanced > Print production > Preflight > PDF analysis > List objects using ICC/Lab/Calibrated Color
    [The question is not about the export of Color images as Grayscales]
    Best regards --Gernot Hoffmann

    I wouldn't ever use different RGB profiles and different CMYK profiles and
    different Gray profiles in one doc – it's just necessary for tests.
    If you had a hypothetical case where your InDesign document's assigned CMYK profile and intended output was ISOcoated_v2_300_eci and you recieved grayscales for placement with different gray profiles assigned, I think you would have to make the conversion in Photoshop if you want the grayscales to be converted to your ISOcoated_v2_300_eci output intent space.
    So in this case I have a grayscale image with Dot Gain 10% assigned and you can see the 50% patch is reading as 50% in Info panel:
    If set my Working Gray space to the ISO Coated profile as above and do a Convert to Profile with the Destination set to Working Gray:
    The preview doesn't change but I get converted gray values—50% is now 44%:
    If I place the grayscale in an InDesign doc with ISOcoated_v2_300_eci assigned as the CMYK profile, the preview won't change (you have to turn on Overprint /Sep Preview), and the converted numbers will show in Separation Preview. The preview and numbers will also be unchanged in Acrobat if you export to default PDF/X-4

  • How to convert PDF image to excel sheet

    Hi,
        I am trying to convert the PDF image /Text file to Excel sheet then I need to use that data for my reports
    Can you please let me know
    1) How I can convert the PDF into a excel sheet
    2) Which s/w i need to use for that(adobe s/w)
    3) do need to purchse any other 3rd pary s/w to use this
    s/w - software
    PDF Image - scanned document in the image format
    Regards
    Shrinivas

    You need Acrobat Standard or Pro to convert the scanned image to readable text (OCR) and then save it as a Excel file (if Acrobat recognizes that it's a table, which I doubt it will). Maybe there are some 3rd party plugins that can help you further.

  • Acrobat 9 Converting DOC to PDF Image Issues

    Just a little background:
    I am running Windows XP Pro, Microsoft Office 2007 and Acrobat 9 Pro.
    I have several .doc files that contain text and images. When converting the .doc file into a PDF sometimes the images become distorted, moved around (the image itself inside of the pixels not moved around the .doc)or possibly even big white stripes going through it.
    The text formatting is ok and everything seems the same as if I looked in the .doc file. I have noticed that if the file contains one or two pictures they do not get distorted but if it contains more then that they sometimes get distorted and sometimes now.
    Any clues to what could be causing the issue?

    If you are missing the PDF option in MS Office 2007, you may
    download a plug-in from MS to create a PDF without Acrobat, go
    to the MS article search page and do a search, or go to the MS
    Office update page and search for the file to download
    Search http://search.microsoft.com/search.aspx?mkt=en-US&setlang=en-US
    Office http://office.microsoft.com/en-us/downloads/default.aspx
    General Acrobat Problem Steps
    If you are having problems creating a PDF, there are two steps you should do FIRST
    Go to the appropriate vendor web site and apply all updates to the program you are using (several recent messages have concerned problems with MS Office conversion, with the response that different versions of Office have different BUGS that must be fixed by a download from Microsoft - and MS Office products are not the only ones which may, from time to time, need to be updated to work properly)
    Apply all updates IN NUMBER ORDER
    Acrobat Update http://www.adobe.com/support/downloads/product.jsp?product=1&platform=Windows

  • Corrupt image formatting when creating pdf from embedded visio image in MS word

    Hello, Using Adobe Acrobat 9 standard and trying to create a PDF file from an MS word (version 2003) document.  PDF file created is fine except for an image (embedded visio diagram imported into word).  When created in PDF, the image is missing many of the diagram labels, and other text boxes are in the wrong font and wrong location.  I don't know if it's my Adobe Acrobat or MS Word settings that need to be updated.
    thanks, NB

    Dear fellows,
    the problem is still present.  My system:Windows 7 32-Bit Prof, MS Office 2010 Prof, Acrobat X Pro 10.1.1, Vision 2010 Prof, all updates installed.
    Problem #1: Even the latest Acrobat X 10.1.1 version causes Visio 2010 to start with error messages, origin: the Acrobat Add-In.  Therefore I de-activated the Acrobat Add-In, again, as in the past.
    As I design grafics with Visio 2010, I mark them in the active Visio window, copy the marked parts and then insert them into my Word-Document.  So far, so good and as I want it.
    The (still present) problem: I can produce a PDF file using the "save as Adobe PDF", but the Visio drawing is incomplete and corrupt.  E.g. arrows as line endings have disappeared.
    PS: But using Adobes PDF printer sets the embedded Visio drawing correctly!  But using the printer one looses some features of the "save as Adobe PDF" command.
    Well, probably one day ...
    Regards,
    Rückenlehne.

  • Can not edit Word docs after converting from a ExportPDF file. The PDF file is converted into a non-editable Word Doc image.

    Converting a PDF file (scanned copy Adobe Reader X) to a Word Doc using ExportPDF conversion. The Word Doc created is a image file which is not editable. I need an editable Word Doc file.

    Hi robertw,
    Can you please tell me more about how the PDF was created? (If you don't know, choose File > Properties, and look for the PDF Producer on the Description tab.)
    If the PDF was created by a third-party PDF creator, it may be that it's not well-written, and that can affect the quality of the conversion. See Will Adobe ExportPDF convert both text and form... | Adobe Community.
    That said, you may be able to select the text by triple-clicking in the Word document.
    Please let us know how it goes.
    Best,
    Sara

  • Retrieving Images Embedded In XML (from rss using xsl)

    I am trying to comsuem and rss feed into my site but i dont seem to see an "image" element in the Item node. can someoen tell me how to retrieve Images Embedded In XML dreamwaever. i am using an xsl fragment on a pap server.
    Doh

    Hi there, I am using a jsp page that takes a xml page from the internet (user defines what page it is) it then takes the xsl feed and transforms the xml with a TransformerFactory object to get the results as an html page, the error is received when the jsp page is called
    Ian

  • I am trying to convert an Illustrator cs6 file to a PDF and every time I do it, I lose all text and images. Can someone provide some assistance? Thank you!

    I am trying to convert an Illustrator cs6 file to a PDF and every time I do it, I lose all text and images. Can someone provide some assistance? Thank you!

    It's always a good idea to supply platform and AI version info.
    How are you "converting" the file? Are you choosing "Save As" and selecting PDF from the file format options? This will bring up additional save options. I would suggest "Illustrator Default" setting unless you have specific needs.

  • Purchased the adobe product to convert a pdf image to a RTF file but cannot figure out to accomplis

    Bought the adobe product to convert a pdf image to an RTF file but cannot figure out how to get it to work

    HI wjleshe,
    I've checked your account using the email address that you use here in the forums, but I don't see any order history. Did you perhaps sign up with a different email address? Or did you sign up via the Reader mobile app for iOS or Android? If you got your subscription via an in-app purchase, Adobe wouldn't have record of your order. And, you would need to contact the app store if there is a problem with your subscription.
    Best,
    Sara

  • Gray lines while converting 2010 Word images to pdf

    With the new Acrobat version X1, I've run into a problem every time I try to convert images on a Word 2010 document to a PDF.  While all lines on the document are black, many of the lines including bubble-notes appear very light grey, and difficult to read.  I've searched for months for a solution without any luck, and I tried other computers with different versions to verify it wasn't just my machine. 
    Is there a specific setting that I need to change during conversion so that black lines remain black instead of this weird gray-scale problem?  See below for example.
    Before, snapshot taken from Microsoft Word:
    After conversion in .PDF document format, taken from Acrobat XI:

    Word:
    I attempted to change the picture format, specifically the image control.  Three of the four options - Automatic, Grayscale, Black & White - produced no different results.  The fourth "Washout" option made the problem worse, as the dwg became brighter in both the .doc and the .pdf.
    Autocad:
    Changing the lineweights made no difference either.  Our company standard is black background, white object lines, but even when I reverse it and make the line colors as dark as possible, I get grey lines during conversion.

  • I'm trying to convert a PDF to Word, but it won't copy signatures over, just get gobblygook... any way I can convert with an image of the signatures?

    I'm trying to convert a PDF to Word, but it won't copy signatures over, just get gobblygook... any way I can convert with an image of the signatures? I tried Word, RTF etc but they all come out the same. Help!

    Hi newromancer65,
    Would you send your document as an email attachment? 
    I will check it from my end.Please add the link to this forum post for reference.
    I have sent you my contact details on your email id.
    Regards,
    Florence

  • Pdf image whit text want to convert in  word

    pdf image whit text want to convert in  word

    Please post your question at this link:
    <https://forums.adobe.com/community/acrobat/creating__editing_%26_exporting_pdfs>
    And also explain better because an image is an image and it can't be
    changed to anything else but if you explain your requirements then
    somebody will come out with a solution for you.
    Good luck.

  • Problems converting Word 2007 images to PDF

    Hello everyone,
    I am in desperate need for help. I have been trying to convert my word 2007 files into PDF but my graphs and tables come out not as clear as the orginals, the outline of images is thicker, the characters seem darker/bold and occasionally with distortions.
    I have Microsoft Office Word 2007 and Adobe Acrobat 8 Professional.
    I really hope someone can help me because it is a major issue for me as I need to publish the content of my file very soon.
    Thank you in advance for your help.

    Just to check things here are a few steps.<br /><br />Checking that the process works in basic form:<br />1. Print to the Adobe PDF printer with Print To File selected.<br />2. Open the file in Distiller to convert to PDF.<br />a. WORKS - That means things work fine and we need to look for AcroTray.<br />b. FAILS - Look at the messages that Distiller creates. That typically provides information on the problem.<br /><br />For AcroTray:<br />1. Open Explorer and go to the Distiller folder. It is typically in Program Files>Adobe>Acrobat <your AA number>>Distiller. In that folder you should see AcroTray.exe. (in my case the folder is C:\Program Files\Adobe\Acrobat 7.0\Distillr) Put a link to AcroTray.exe in the Startup Folder of Windows so that it always starts with Windows.<br /><br />Once you have started AcroTray, you should be able to print directly to the Adobe PDF printer without the file option and get a PDF directly. All that AcroTray does is automate the print to file and running of Distiler. PDF Maker is adds an additional step by putting PDF Marks in the printer file that is written before running Distiller.<br /><br />Hope that helps. Bill

  • Convert powerpoint slide with embedded file (word, excel, etc) to pdf

    Hi all,
    I have a word document or excel spreadsheet that I added as an object into the power point slide.
    How can i convert powerpoint slide to PDF but still able to open the file when i click file icon.
    So far the result I have once the slide is in PDF, I can't no longer click the file to open it.
    Thank you.

    Hi Joel,
    The feature you are looking for, is currently not available in Acrobat.
    But you can create an interactive link (to open Word/Excel) using Acrobat. Please look at the following link:
    http://acrobatusers.com/tutorials/creating-and-editing-links
    Regards,
    Anoop

Maybe you are looking for