XML to PDF conversion of large catalogue

I have a bunch of xml content on my website which refer to items being sold. Each item has various spec and description entries. The website is regularly updated but the printed product catalogue is not(it's created separately using adobe indesign)
So I want to be able to do 2 things:
1. create parts of the catalogue in pdf format on the fly on the website. (The catalogue layout, page numbers and index page is important to be correct)
2. create the whole catalogue from my xml data, in PDF format ready to be printed.
Has anyone done anything similar to this?
How should I go on about doing this?

I have a bunch of xml content on my website which refer to items being sold. Each item has various spec and description entries. The website is regularly updated but the printed product catalogue is not(it's created separately using adobe indesign)
So I want to be able to do 2 things:
1. create parts of the catalogue in pdf format on the fly on the website. (The catalogue layout, page numbers and index page is important to be correct)
2. create the whole catalogue from my xml data, in PDF format ready to be printed.
Has anyone done anything similar to this?
How should I go on about doing this?

Similar Messages

  • I want to manage my pdf conversion to word account, but when I click on manage I get an error message saying Heading too Large.  How do I get into manage

    I want to manage my pdf conversion to word account, but when I click on manage I get an error message saying Heading too Large.  How do I get into manage

    Hi,
    May i know which service you are using?
    Would it be possible for you to send me the screenshot of the error?
    Regards,
    Florence

  • Lines in pdf conversion look thick!

    It's an architectural plan linked as eps in indesign the one I'd like to convert to pdf. In Indesign High quality display mode, in a "fit to windows zoom" I see very smooth and thin lines and everything seems perfect. I convert that page in pdf but in zoom at the same level I see very thick lines! I know they just look like that because if I zoom them everything is as it should be but in an aerial view isn't properly rendered.
    Is there an option in the pdf conversion I should check? I can use both print pdf or export...
    Here are the 2 results compared:
    http://farm4.static.flickr.com/3646/3297308538_e02b798333_o.jpg
    let me know please!
    thanks!
    mz

    If the lines are really too thick - as shown by zoom,
    where the linewidth grows proportionally - then the
    information is already in the PDF. Perhaps by applying
    'Fix Hairlines' in AcrobatPro.
    If the original vector lines in the EPS a very thin -
    which happens for instance by downscaling a technical
    drawing - then PostScript systems are handling this case
    like zero linewidth: the thinnest device pixel line is shown.
    In Acrobat use
    Edit > Preferences > Page Display > SmoothLineArt=Off
    The line is shown for all zoom factors by one-pixel
    rendering.
    For SmoothLineArt=On, the line looks considerably thicker,
    but the visual linewidth is still independent of the zoom
    factor.
    So far actually tested by CS2.
    I'm programming plenty graphics directly by PostScript as
    EPS', place in ID or PageMaker and export as PDF.
    It happened never that a sufficiently thick line was rendered
    thicker in the PDF.
    The remedy is IMO: use Fix Hairlines with a sufficiently large
    minimal linewidth, for instance 0.3 mm.
    Best regards --Gernot Hoffmann

  • PDF Conversion Truncated

    Hello, I have an issue when performing the Spool to PDF conversion.  I am using FM CONVERT_ABAPSPOOLJOB_2_PDF to convert an ALV list to a PDF file.  When I run my report in the foreground, everything works correctly, the PDF file is attached and then emailed to the appropriate recipient.  When I schedule the job to run in the background, the contents of the attachment that is created is truncated on the right side cutting off half of the report (the font is also larger).  I am out of ideas on how to solve this issue.  Any help would be appreciated.

    Hi,
    Use this code this work for you
    Please reward me if you find this answer is helpful
    *& Report  YSEND_MAIL_ATTACH
    REPORT  ysend_mail_attach  NO STANDARD PAGE HEADING  LINE-COUNT 65(2)
                                              LINE-SIZE 120.
    *TABLES/ DATA DECLARATION
    TABLES: tsp01,
            kna1.
    DATA: int_email_receivers LIKE zelift_user_3 OCCURS 0 WITH HEADER LINE.
    DATA: int_to_receivers TYPE zeinterface_mail-z_email_to OCCURS 0.
    DATA: wa_to_receivers  TYPE zeinterface_mail-z_email_to.
    DATA: int_mailsend TYPE soos1 OCCURS 0 WITH HEADER LINE.
    DATA: printer LIKE pri_params,
          mc_valid(1)      TYPE c,
          p_linsz LIKE sy-linsz VALUE 100,
          p_paart LIKE sy-paart VALUE  'X_65_132',
          mi_rqident       LIKE tsp01-rqident,
          mi_bytecount     TYPE i,
          prg_name LIKE sy-repid,
          user_name LIKE sy-uname,
          mtab_pdf    LIKE tline OCCURS 0 WITH HEADER LINE,
          mc_filename LIKE rlgrap-filename,
          req_no LIKE tsp01_sp0r-rqid_char,
          req_rc LIKE sy-subrc,
          nom_char(20),
          g_drc_diff.
    sending email stuff
    DATA: object_hd_change  TYPE sood1,
          object_type       TYPE sood-objtp,
          objcont           TYPE soli  OCCURS 0 WITH HEADER LINE,
          objhead           TYPE soli  OCCURS 0 WITH HEADER LINE,
          att_cont          TYPE soli  OCCURS 0 WITH HEADER LINE,
          att_head          TYPE soli  OCCURS 0 WITH HEADER LINE,
          packing_list      TYPE soxpl OCCURS 0 WITH HEADER LINE,
          receivers         TYPE soos1 OCCURS 0 WITH HEADER LINE,
         receivers         TYPE somlreci1 OCCURS 0 WITH HEADER LINE,
          free_recc         TYPE soos1 OCCURS 0 WITH HEADER LINE,
          object_id_new     TYPE soodk,
          sent_to_all       TYPE sonv-flag,
          all_binding_done  TYPE sonv-flag,
          office_object_key TYPE swotobjid-objkey,
          originator_id     TYPE soudk,
          objbin LIKE solisti1 OCCURS 0 WITH HEADER LINE,
          objtxt LIKE solisti1 OCCURS 0 WITH HEADER LINE,
          reclist LIKE somlreci1 OCCURS 0 WITH HEADER LINE,
          doc_chng LIKE sodocchgi1,
          tab_lines LIKE sy-tabix,
          paylist LIKE bapi7004_rl OCCURS 0 WITH HEADER LINE,
          p_info LIKE  pc407,
          pdf_table LIKE tline OCCURS 0 WITH HEADER LINE,
          pdf_fsize TYPE  i,
          pdf_line(134),
          spoolid    TYPE tsp01-rqident.
    ***data declaration for sending the mail.
    ***data declaration for output display.
    DATA:BEGIN OF int_kna1 OCCURS 0,
         kunnr LIKE kna1-kunnr,
         land1 LIKE kna1-land1,
         name1 LIKE kna1-name1,
         END OF int_kna1.
    PARAMETERS p_kunnr LIKE kna1-kunnr.
    *start-of-selection
    START-OF-SELECTION.
    perform display.
    *pdf conversion.
      PERFORM pdf_conversion.
    END-OF-SELECTION.
          FORM get_spool_number *
          Get the most recent spool created by user/report              *
    -->  F_REPID               *
    -->  F_UNAME               *
    -->  F_RQIDENT             *
    FORM get_spool_number USING f_repid
         f_uname
                    CHANGING f_rqident.
      DATA:
        lc_rq2name LIKE tsp01-rq2name.
      CONCATENATE f_repid+0(9)
                  f_uname+0(3)
        INTO lc_rq2name.
      CONDENSE lc_rq2name.
      SELECT * FROM tsp01 WHERE  rq2name = lc_rq2name
    ORDER BY rqcretime DESCENDING.
        f_rqident = tsp01-rqident.
        EXIT.
      ENDSELECT.
      IF sy-subrc NE 0.
        CLEAR f_rqident.
      ENDIF.
    ENDFORM.                               " get_spool_number
    *---getting the spool created by user/report
    *&      Form  PDF_CONVERSION
          text
    -->  p1        text
    <--  p2        text
    FORM pdf_conversion.
    *-- Setup the Print Parmaters
      CALL FUNCTION 'GET_PRINT_PARAMETERS'
        EXPORTING
         authority              = space
         copies                 = '1'
         cover_page             = space
         data_set               = space
         department             = space
         destination            = space
         expiration             = '1'
         immediately            = space
         in_archive_parameters  = space
         in_parameters          = space
         layout                 = space
         mode                   = space
         new_list_id            = 'X'
          no_dialog              = 'X'
          user                   = sy-uname
        IMPORTING
          out_parameters         = printer
          valid                  = mc_valid
        EXCEPTIONS
          archive_info_not_found = 1
          invalid_print_params   = 2
          invalid_archive_params = 3
          OTHERS                 = 4.
    *-- Make sure that a printer destination has been set up
    *-- If this is not done the PDF function module ABENDS
      IF printer-pdest = space.
        printer-pdest = 'LOCL'.
      ENDIF.
    *-- Explicitly set line width, and output format so that
    *-- the PDF conversion comes out OK
      printer-linsz = p_linsz.
      printer-linct = sy-linct.
      printer-paart = p_paart.
      printer-prrel = ' '.
      prg_name = 'YSEND_MAIL_ATTACH'.
      user_name = sy-uname.
      SUBMIT yidoc12 TO SAP-SPOOL WITHOUT SPOOL DYNPRO
                       SPOOL PARAMETERS printer
                       WITH p_kunnr = p_kunnr
                       AND RETURN.
      PERFORM get_spool_number USING prg_name user_name
                CHANGING mi_rqident.
      IF sy-subrc = 0.
       IF sendmail = 'X'.
        req_no = mi_rqident.
        PERFORM spool_pdf_conversion.
        PERFORM sendmail.
       ENDIF.
      ENDIF.
      req_no = mi_rqident.
      CALL FUNCTION 'RSPO_R_RDELETE_SPOOLREQ'
        EXPORTING
          spoolid       = req_no
       IMPORTING
      RC            = req_rc
         status        = req_rc.
    IF req_rc <> 0.
      IF req_rc = 0.
        LEAVE PROGRAM.
      ENDIF.
    ENDFORM.                    " PDF_CONVERSION
    *****pdf conversion----
    *&      Form  SPOOL_PDF_CONVERSION
          text
    -->  p1        text
    <--  p2        text
    FORM spool_pdf_conversion.
      spoolid = req_no.
      CALL FUNCTION 'CONVERT_ABAPSPOOLJOB_2_PDF'
        EXPORTING
          src_spoolid              = spoolid
          no_dialog                = space
          dst_device               = printer-pdest
        IMPORTING
          pdf_bytecount            = pdf_fsize
        TABLES
          pdf                      = pdf_table
        EXCEPTIONS
          err_no_abap_spooljob     = 1
          err_no_spooljob          = 2
          err_no_permission        = 3
          err_conv_not_possible    = 4
          err_bad_destdevice       = 5
          user_cancelled           = 6
          err_spoolerror           = 7
          err_temseerror           = 8
          err_btcjob_open_failed   = 9
          err_btcjob_submit_failed = 10
          err_btcjob_close_failed  = 11
          OTHERS                   = 12.
    ENDFORM.                    " FILE_UPLOAD
    ********converting the spool job to pdf******************************
    *sending the mail as an attachment using the fm
    *&      Form  SENDMAIL
          text
    -->  p1        text
    <--  p2        text
    FORM sendmail.
      DATA : BEGIN OF lt_pdflns OCCURS 0.
              INCLUDE STRUCTURE  solisti1.
      DATA : END OF lt_pdflns.
      DATA: l_ind,
            l_nom_key_o LIKE oijnomi-nomtk.
    General header data.
      MOVE: sy-langu TO object_hd_change-objla,
            'Ext.email' TO object_hd_change-objnam,
            'C' TO object_hd_change-objsns,
            'TXT' TO object_hd_change-file_ext,
            'REQUIRED OUTPUT' TO object_hd_change-objdes.
    Body of the message itself (255).
      MOVE ' REPORT TESTING' TO objcont-line.
      APPEND objcont.
      CONCATENATE 'Time: ' sy-uzeit0(2) ':' sy-uzeit2(2) ':' sy-uzeit+4(2)
                  INTO objcont-line.
      APPEND objcont.
      MOVE: 'RAW' TO object_type.
      LOOP AT pdf_table.
        lt_pdflns = pdf_table.
        APPEND lt_pdflns.
        CLEAR lt_pdflns.
      ENDLOOP.
      CALL FUNCTION 'SX_TABLE_LINE_WIDTH_CHANGE'
        EXPORTING
          line_width_src              = 134
          line_width_dst              = 255
        TABLES
          content_in                  = lt_pdflns
          content_out                 = objbin
        EXCEPTIONS
          err_line_width_src_too_long = 1  "content_out
          err_line_width_dst_too_long = 2
          err_conv_failed             = 3
          OTHERS                      = 4.
    DESCRIBE TABLE objbin LINES tab_lines.
    *get the mail ids for sending the emails.
    Fill email receivers
      IF int_email_receivers[] IS INITIAL.
        CALL FUNCTION 'ZEI_GET_INTERFACE_MAILIDS'
        EXPORTING
          program_name           = 'ZER02230_TSPPRA'
          interface              = 'TS_TMJ_REJ'
          key                    = 'NOMTK'
          value                  = 'FAILURE_RECEIVERS'
          DIRECTION              =
          NOTES                  =
         IMPORTING
          to_mail_recivers       = int_to_receivers
        CC_MAIL_RECIVERS       = int_cc_receivers
         EXCEPTIONS
          no_data_found          = 1
          OTHERS                 = 2
    Appending Email addresses if found.
        IF NOT int_to_receivers[] IS INITIAL.
          LOOP AT int_to_receivers INTO wa_to_receivers.
            MOVE wa_to_receivers TO int_mailsend-recextnam .
            int_mailsend-recesc = 'U'.
            int_mailsend-sndart = 'INT'.
            int_mailsend-sndpri = '1'.
            APPEND int_mailsend.
          ENDLOOP.
        ENDIF.
      ELSE.
        LOOP AT int_email_receivers.
          CONCATENATE   int_email_receivers '@exchange.aramco.com.sa' INTO
               int_mailsend-recextnam.
          int_mailsend-recesc = 'U'.
          int_mailsend-sndart = 'INT'.
          int_mailsend-sndpri = '1'..
          APPEND int_mailsend.
        ENDLOOP.
      ENDIF.
    **************end of the getting emails for sending the mails
      DESCRIBE TABLE objbin LINES tab_lines.
      MOVE: "'X'               TO packing_list-TRANSF_BIN,
             '1'               TO packing_list-head_start,
             '1'               TO packing_list-head_num,
             '1'               TO packing_list-body_start,
             tab_lines         TO packing_list-body_num,
             'RAW'             TO packing_list-objtp,
             'Attachment'      TO packing_list-objnam,
             'PDF' TO packing_list-file_ext.
      packing_list-objlen = tab_lines * 255.
      CONCATENATE 'REPORT TESTING' '--'
         sy-datum4(2) '/' sy-datum6(2) '/' sy-datum+0(4)
         sy-uzeit0(2) ':' sy-uzeit2(2) ':' sy-uzeit+4(2)
      INTO packing_list-objdes.
      APPEND packing_list.
      objhead = 'testing the email for pdf'.
      CALL FUNCTION 'SO_OBJECT_SEND'
        EXPORTING
          object_hd_change           = object_hd_change
          object_type                = object_type
        IMPORTING
          object_id_new              = object_id_new
          sent_to_all                = sent_to_all
          all_binding_done           = all_binding_done
          office_object_key          = office_object_key
          originator_id              = originator_id
        TABLES
          objcont                    = objcont
          objhead                    = objhead
          receivers                  = int_mailsend
          packing_list               = packing_list
          att_cont                   = objbin
          att_head                   = att_head
        EXCEPTIONS
          active_user_not_exist      = 1
          communication_failure      = 2
          component_not_available    = 3
          folder_not_exist           = 4
          folder_no_authorization    = 5
          forwarder_not_exist        = 6
          note_not_exist             = 7
          object_not_exist           = 8
          object_not_sent            = 9
          object_no_authorization    = 10
          object_type_not_exist      = 11
          operation_no_authorization = 12
          owner_not_exist            = 13
          parameter_error            = 14
          substitute_not_active      = 15
          substitute_not_defined     = 16
          system_failure             = 17
          too_much_receivers         = 18
          user_not_exist             = 19
          originator_not_exist       = 20
          x_error                    = 21
          OTHERS                     = 22.
      IF sy-subrc EQ 0.
        COMMIT WORK.
      ELSE.
        MESSAGE i014(ze9) WITH 'Email was not sent to the Recipients'.
      ENDIF.
    ENDFORM.                    " SENDMAIL
    ************sending mail as an attachment
    *for displaying
    form display.
    endform.
    *for displaying

  • Creating a large catalogue with InDesign CC

    I am attempting to create a large catalogue with Adobe InDesign CC, and presume that the data-merge function is the way to do this? If i use the data merge function, will the template I create adjust to varying amounts of data within the data file - eg some items will take up a quarter of a page, other items may take up a full page. Also, if I use the data merge function, can I manipulate the data manually afterwards - eg for mistakes, alignments, etc? Thanks

    How large is large?
    You can use data merge. I would be tempted to run it one record per page and run one of the scripts that will connect each frame to flow into each other. Then make the needed frame adjustments on each page. Not as onerous as it sounds. A script can be had on this site:
    http://www.loicaigon.com/en/solutions-en/downloads/
    Like Gert writes, edits in the merge file will require a new merge unless you purchase one of the commercial plug-ins.
    Set up paragraph and character styles in a sample of the data so when laying out for the merge you can handle most all (or all) of the styles you need.
    If you want a healthy learning curve, and you can obtain the data as an XML file, you can also go that route. Same procedure, make a smaller XML file, create the needed styles, map them and import the XML. Your frames will be whole page already and the data will flow from frame to frame without "stitching" the frames back together.
    Mike

  • OT: XML, XSL, & PDF

    This isn't strictly a Forte question, but I thought that possibly someone
    who was getting into Fusion and/or someone with similar thoughts had run
    into some solutions.
    We've been thinking recently about the idea of having reports in the
    application output XML to express the content of the report and then using
    XSL to map this into the viewable/printable format. This would open up the
    option of multiple XSL maps for different purposes or audiences. The idea
    would probably be to go XML --> PDF --> HTML or XML --> HTML --> PDF with
    the second conversion being an automated one that did not have a
    report-specific mapping. PDF would provide a printable report of high
    quality which could be sent anywhere and printed on just about anything and
    the HTML would provide a browser-viewable version which was readable
    page-at-time with hypertext links to connect various parts of the report in
    the fashion of Actuate.
    Does anyone know of tools to do this kind of mapping?
    =========================================================================
    Thomas Mercer-Hursh, Ph.D email: [email protected]
    Computing Integrity, Inc. sales: 510-233-9329
    550 Casey Drive - Cypress Point support: 510-233-9327
    Point Richmond, CA 94801-3751 fax: 510-233-6950

    Lo mismo que te he dicho antes, optimiza primero las
    im�genes y luego las
    pasas a pdf (Pdf995).
    Saludos,
    Julio Barroso
    "Paulina" <[email protected]> escribi�
    en el mensaje
    news:e7bmcc$lih$[email protected]..
    | Gracias por la informacio,
    | pero en el caso de que siga con la idea de usar PDF (el
    cliente manda)...
    | que resoluciones y pasos me aconsejais?
    | Julio B. escribi�:
    | > Cuando se trata de mostrar una noticia publicada en la
    prensa y que ha
    sido
    | > escaneada para tal fin, lo mejor es usar una imagen
    (jpg � gif) antes
    que un
    | > pdf, que siempre tendr� m�s peso que la
    imagen original.
    | >
    | > Otra cosa ser�a si el texto de la noticia fuera
    eso, texto, y no una
    imagen,
    | > en ese caso la opci�n m�s �ptima
    ser�a el pdf (generado con Pdf995).
    | >
    | > Te recomiendo que una vez tengas la imagen en
    Photoshop recortes todo lo
    que
    | > no sirva y la guardes como gif con la cantidad
    m�nima de colores para
    que se
    | > pueda leer el texto. Acu�rdate de ajustar el
    tama�o y la resoluci�n. As�
    te
    | > aseguras que la noticia tendr� el m�nimo
    peso posible. Tambi�n puedes
    probar
    | > con jpg, pero para bajar el peso tendr�s
    comprimir mucho, y al final se
    | > pierde legibilidad en el texto.
    | >
    | > Saludos,
    | >
    | > Julio Barroso
    | >
    | > "Paulina" <[email protected]>
    escribi� en el mensaje
    | > news:e7bb88$80f$[email protected]..
    | > | Buenos dias,
    | > | necesito consejo.
    | > | Un cliente me solicita tener un apartado en donde se
    puedan ver las
    | > | noticias que han publicado en prensa sobre su
    empresa en PDF.
    | > |
    | > | Alguien me puede decir cual es el proceso mejor para
    mantener buena
    | > | calidad y poco peso? Estoy segura que hay alguna
    combinacion
    | > | idonea...pero no lo consigo.
    | > |
    | > | He probado escaneando desde Photoshop e imprimiendo
    a Cute PDF...pero
    me
    | > | sale muy grande.
    | > | espero vuestras sugerencias
    | > |
    | > | Gracias,
    | > | Paulina
    | >
    | >

  • J2SE adapter PI 7.1 issue with XML to flat conversion and namespace length

    Dear reader,
    We are facing an issue with J2SE Adapter PI7.1 for a number of flows.
    The flow requirements:
    [1] Namespace length for interfaces is up to 100 characters
    [2] The XML message must be converted to Flat on the adapter channel
    Our PI system is at patch level 7 and we implement J2SE adapter on patch level 7 as well.
    We found that the J2SE adapter on patch level 7 does not support long namespaces [1] (as it should since this is an PI 7.1 j2SE adapter) but no issues where found with the XML to flat conversion [2].
    Experimenting with J2SE adapter on patch level 6 we found the long namespaces [1] are supported however an issue is found with the XML to flat conversion [2] as stated in SAP note 1335527.
    An SAP Customer Message is raised on this issue however your input is highly appricated!
    With Kind Regards,
    Harald Kastelijn
    Edited by: Harald Kastelijn on Mar 6, 2010 9:17 AM
    Edited by: Harald Kastelijn on Mar 6, 2010 9:19 AM

    We found that the J2SE adapter on patch level 7 does not support long namespaces [1] (as it should since this is an PI 7.1
    j2SE adapter) but no issues where found with the XML to flat conversion [2]
    I think the restriction of namespace length still remains in design time (IR).....the same however has been extended in configuration and runtime...this SAP note has some information: https://service.sap.com/sap/support/notes/870809

  • Error Message "Some content on the PDF is too large to fit on a single page.

    I get the below error when I try to download a response as a .pdf:
    Some content on the PDF is too large to fit on a single page.
    Please go to the "Design Tab" and adjust the contents, the font-size, or divide the flagged items into multiple elements.
    Any ideas?

    Go to the Design Tab and make sure you switch to the Page View (look the bottom right corner of the screen)
    Once you see the Page View (used to see what the PDF will look like for your response) scroll down and you might see which object is being trucated (it will be covered by a red rectangle). This often happen if you have a element that is too big to fit on one page (like a single or multi choice field).
    You will need to adjust your element so that nothing is trucated.
    Gen

  • Converting from PDF directly to Java Objects/XML (and PDF format questions)

    Hi,
    I posted this originally in the Acrobat Windows forums but was told I might have more luck here, so here goes:
    I am desperately trying to find a tool (preferably open source but commercial is fine also) that will sit on top of a PDF and allow me to query it's text for content and formatting (I don't care about images). I have found some tools that get me part of the way there, but nothing that seems to provide an end-to-end solution but is quite lightweight. My main question is WHY are there so many tools that go from PDF to RTF, and many tools that go from RTF to XML, but NONE that I can find that go PDF to XML.
    To clarify, by formatting I simply mean whether a line/block of text is bold/italic, and its font size. I am not concerned with exact position on the page. The background is that I will be searching PDFs and assigning importance to whether text is a heading/bodytext etc. We already have a search tool in place so implementing a pure PDF search engine is not an option. I need a lightweight tool that simply allows me to either make calls directly to the PDF OR converts to XML which I can parse.
    Some tools I have tried:
    1) PDFBox (Java Library) - Allows the extraction of text content easily, but doesn't seem to have good support for formatting.
    2) JPedal (Java Library) - Allows extraction of text content easily, and supports formatting IF XML structured data is in the PDF (not the case for my data).
    3)  Nitro PDF (Tool) + RTF to XML (script) - This works quite nicely and shows that PDF to XML is possible, but why do I have to use 2 tools? Also, these are not libraries I can integrate into my app.
    4) iText (Java Library) - Seems great at creating PDFs but poor at extracting content.
    I don't really expect someone to give me a perfect solution (although that would be nice!).
    Instead, what I'd like to know is WHY tools support PDF to RTF/Word/whatever retaining formatting, and other tools support RTF to XML with the formatting information retained. What is it about PDF and RTF/Word that makes it feasible to convert that way, but not to XML. Also, as I found in 3) above, it is perfectly feasible to end up as XML from PDF, so why do no tools support this reliably!
    Many thanks for any advice from PDF gurus.

    XML doesn't mean anything - it's just a generic concept for structuring
    information.  You need a specific GRAMMAR of XML to mean anything.  So what
    grammar would you use?  Something standard?  Make up your own?
    However, there are a number of commercial and open source products that can
    convert PDF to various XML grammars - SVG, ABW, and various custom grammars.
    But the other thing you need to understand is that most PDF files do not
    have any structure associated with them (as you saw when using JPEDAL).  As
    such, any concepts of paragraphs/sections/tables/etc. Are WILD GUESSES by
    the software in question.

  • Barcode printing in XML report PDF output

    Hi All,
            I want to print barcode of invoice number / purchase order number in the XML report PDF output.
            Anyone please suggest me with your ideas and experience.

    Hi Bogdan,
    The steps mentioned in the doc is what i did in order.
    I couldn't understand the step # 13 & 14.
    Log in as XML Publisher Administrator
    Navigate to Administration --> Font Files --> Create Font File
    Available fields are Font Name and File
    --> for Font Name, choose any descriptive name
    --> file will browse your PC to locate the font file
    Navigate to Font Mappings -->Create Font Mapping Set
    Mapping name is the name you will give to a set of fonts.
    Mapping code is the internal name you will give to this set
    Type: 'PDF Form' for PDF templates. 'FO to PDF' for all other template types.
    Create Font Mapping (this allows you to add fonts to a set)
    Font Family is the exact same name you see in MS Word under Font. If you don't use the same name the font will not be picked up at runtime.
    Style and weight must also match how you use the font in the RTF or PDF layout template. Normal and Normal are good defaults.
    Language and Territory should remain blank (NULL) unless you have a strong business reason, as these fields can cause the font not to be picked up at runtime.
    Navigate to Configuration General -> FO Processing -->Font Mapping Set. This can also be done at Data Definition and Template level, via the corresponding Edit Configuration button on those pages. The hierarchy is Site-> Data Def -> Template.
    Select your new mapping set.
    Make sure the font is not referenced under File --> Properties --> Custom in the RTF template file.
    Under General, set a Temporary Directory. The font will be stored under a /fonts directory at runtime, initially created the first time the font is used.
    Upload a template that uses your special font and test using preview or by submitting a concurrent request.

  • Automated report/ PDF conversion with AppleScript

    Hello-
    I am attempting to PDF a spreadsheet from Dropbox, and email it to a set list of recipients with AppleScript.  Much of this I have been able to figure out, but I am having some issues with the PDF conversion.  As you can see below, I am still working on sending the xls file, and have not yet been able to get AppleScript to convert a specific tab of the file to aPDF.  I expect scheduling to be handled theough crontab.
    Current script is as follows:
    tell application "Finder"
              set folderPath to folder "Macintosh HD:Users:user:Dropbox:folder:Calculated PO Sheet"
              set theFile to first file in folderPath as alias
              set fileName to name of theFile
    end tell
    set theSubject to "Current PO Report" date
    set theBody to "See attached."
    set theAddress to "recipient email"
    set theAttachment to "CALCULATED PO SHEET.xlsx"
    set theSender to "sender email"
    tell application "Mail"
              set theNewMessage to make new outgoing message with properties{subject:theSubject, content:theBody & return & return, visible:true}
              tell theNewMessage
                        set visibile to true
                        set sender to theSender
      make new to recipient at end of to recipients with properties {address:theAddress}
                        try
      make new attachment with properties {file name:theAttachment} at after the last word ofthe last paragraph
                                  set message_attachment to 0
                        on error errmess -- oops
      log errmess -- log the error
                                  set message_attachment to 1
                        end try
                        log "message_attachment = " & message_attachment
      #send
              end tell
    end tell

    Hi,
    there is a special setup for this requirement. Please see the following support note
    Reference
    Oracle Reports Output For Indian Languages Like Gujarati, Marathi [ID 980554.1]
    Roberto

  • Smart form: Spool to PDF Conversion more than 100 pages.

    Hi folks,
    I have an issue with Function Module (FM) CONVERT_ABAPSPOOLJOB_2_PDF when the report to be converted has more than 100 pages.....i have explained this scenario as below
    We have been using the FM CONVERT_ABAPSPOOLJOB_2_PDF to convert ABAP report to PDF file and then send this via mail using FM SO_DOCUMENT_SEND_API1.....
    PDF conversion using FM CONVERT_ABAPSPOOLJOB_2_PDF works well if the number of report pages is less than 100....i.e the table pdf of the FM will contain the output as expected.....
    If the report has more than 100 pages, the the FM CONVERT_ABAPSPOOLJOB_2_PDF will schedule the conversion job as a background job and returns with a SY-subrc eq 0 and the table parameter pdf will ne empty.
    Could you suggest me a woorkaround in this situtaion...i.e how do I then get the output of the background job
    And  any process to convert Spool to PDF if having more than 100 pages.
    Thanks in Advance............

    Pb,
    If the report has more than 100 pages, the the FM CONVERT_ABAPSPOOLJOB_2_PDF will schedule the conversion job as a background job and returns with a SY-subrc eq 0 and the table parameter pdf will ne empty
    i would suggest you to if pages are more than 100 than you need to send report output to sp01 with spool request number by using fm job_open..job_close..
    than from sp01 you can convert pdf by running RSTXPDFT4 in se38.
    hope this would solve your problem.
    Amit.

  • Acrobat 9 HTML to PDF conversion sets all checkboxes to checked?

    When I convert an HTML file that contains checkboxes to PDF using Acrobat 9 Standard or Pro (fully updated) on Windows XP SP3, all of the checkboxes end up checked in the resulting PDF.  I've looked in settings menus but can't find anything that seems to be a relevant option to prevent this from happening.  I've attached a simple test case .html file to this post that you can use to repeat the problem.
    To convert the file, I right-click on the file in Windows Explorer, and click Convert to Adobe PDF.  I've tried "printing" the document to the Adobe PDF printer, but that introduces other issues and is not really an acceptable solution.
    Has anyone encountered this before, and/or have ideas how to fix it?

    Input elements have no such inheritance on the checked attribute.  Furthermore, the input elements in my test case are not grouped together.  They are each encapsulated within separate list item elements, and so no inheritance should take place after the first input element.
    Just for grins, I changed the order of the elements (moved the checked one below the unchecked one), but that did not make any difference in the Acrobat 9 HTML to PDF conversion.
    I did test this with Acrobat 8 Standard, and the HTML to PDF conversion preserved the correct checked status of the input elements.  It looks to me like this is a bug that was introduced in Acrobat 9.

  • Need help about PDF conversion

    I have red words in a text box using publisher. How come the text show up as black after PDF conversion in Adobe?

    Hi,
    I have windows 8. I am using Microsoft publisher 2013. I want to make a postcard. I made a text box and my text in the box are red. Then I click save as to convert to a PDF file. It works. It opens on Adobe but the text in the text box are black and not red. What can I do to retain the original colors?

  • Why is my word and pdf files so large? How can I reduce the size?

    My document contains mostly photos from a database via a path to the actual jpeg photos on disk.
    There are 39 jpeg photos + a small amount of text with each photo.
    The 39 jpeg photos on the disk add up to be 5.79 mb.
    If I export the crystal reports document as [Microsoft Word (97-2003) Non Editable] the, the file size is 36.8 mb.
    If I export the same data as a [Microsoft Word (97-2003) Editable] the file size is only 3.1 mb.
    That is a difference of 33.7 mb! Why?
    Also, when I export the same data to a PDF file it is much larger than I think it should be; 12.7 mb.
    Why is my word and PDF files so large?
    How can I reduce the size of the word and PDF files? (without reducing the photo quality).
    Are there any tools to post process the Word or PDF files to reduce the file size?

    You can't, extra size is to hold the details

Maybe you are looking for

  • Is there a fix for CS6 Photoshop crash when adding audio track?

    I tried posting this problem yesterday, but have gotten no response: I'm video editing using Photoshop CS6 (NOT Premiere! NOT CC) and I'm consistently experiencing an application crash (i.e. quit) within a second of selecting/loading an audio track t

  • I need to create a new iPhoto library and import 'ORIGINALS' from a time machine backup on an external hard drive.

    I made the mistake of installing OS 10.9 on my older MacBook Air and the machine has become horribly slow and unpredictable. I visited the Apple store and questioned a 'Genius' about getting rid of 10.9 and reinstalling the previous OS 10.6 for which

  • ASP page on OC4J

    Hello, Is posible to run ASP page on OC4J standalone server?

  • Regd NWDI MyComponent SC

    Hi All, i am trying to create a DC, but not getting MyComponent SC in the Local DC's window. I am able to see the imported SC. i have  checked in the .dtc folder path but i am not able to see local development folder. Can any one help me to restore t

  • Unknown Server on Network "www"

    Seeing a strange PC Server in my shared networks. Restarted computer and changed my network configurations and it's still there. Not sure what it is, can't connect to it and I'm sure there are no other computers connected to the network, the only oth