Disassemble PDF based on Content Table

Suppose a PDF file has a table of contents. Is it possible to split this PDF file into multiple smaller PDF files that each contain only one chapter?
Thanks,
P

What I am trying to do is find a programmatic way to split a PDF file into multiple PDF files, each containing only one chapter, based on the table of contents in the PDF file. The bookmarks may not contain the table of contents information. For example, some PDF files have no bookmarks but do have a table of contents. How can this be handled programmatically?
Thanks.
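
One possible starting point, sketched in Python: if the text of the table of contents page can be extracted, each entry can be turned into a page range, and those ranges can then be handed to whatever PDF library does the actual splitting. The line shape assumed here (a title, then dot leaders or at least two spaces, then a page number) and the names `TOC_LINE`, `chapter_ranges`, and `offset` are illustrative assumptions, not something taken from a particular PDF or library.

```python
import re

# Assumed TOC line shape: "<title> <dot leaders or 2+ spaces> <page number>"
TOC_LINE = re.compile(r"^(?P<title>.+?)[\s.]{2,}(?P<page>\d+)\s*$")


def chapter_ranges(toc_text, last_page, offset=0):
    """Turn printed TOC text into (title, first_page, last_page) tuples.

    offset maps a printed page number to the physical page number,
    for example when the front matter is numbered separately.
    """
    entries = []
    for line in toc_text.splitlines():
        match = TOC_LINE.match(line.strip())
        if match:
            title = match.group("title").strip()
            entries.append((title, int(match.group("page")) + offset))

    ranges = []
    for i, (title, start) in enumerate(entries):
        # A chapter ends on the page before the next chapter starts;
        # the final chapter runs to the last page of the document.
        end = entries[i + 1][1] - 1 if i + 1 < len(entries) else last_page
        ranges.append((title, start, end))
    return ranges
```

Lines that do not end in a page number (such as a "Contents" heading) are skipped. Real tables of contents vary a lot, so the pattern would likely need adjusting per document.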

Similar Messages

  • Script to Rename PDFs Based on Content

    I'm trying to create an electronic archive of digitized brokerage statements so that I can get rid of the paper copies.
    Fortunately, I can download statement PDFs from my brokerage firm all the way back to 2006. On the website the statements have reasonably descriptive titles and are organized by date. Unfortunately, when they are downloaded, they are all named 'statement.pdf'!
    I'd like to write a script that goes through these downloaded PDFs and renames each one based on its content. The resulting file name would be something like:
    <brokerage name as string constant><account description><statement date formatted as yyyy.mm.dd>.pdf
    Can an AppleScript look inside a PDF? 
    Can someone point me in the right direction regarding the approach to take, e.g., AppleScript, Automator, etc?
    Thank you!
    - nello

    I'm confused about the details of the request. Can you describe the process more clearly?

  • Creating layout in PDF Based Form to print table content.

    Hi,
    I am having trouble creating the layout of a PDF-based form. I do not need any interactive fields, only the active table in my context, whose data I need to print. I dragged the table from the data view onto the body page and activated it. When I run the form, I get only the table structure, without any data.
    Can anyone help me or point me to a tutorial for this? I have checked SAPnet for PDF-based print forms, but it somehow skips how to create the layout.
    With regards,
    Saurabh Kumar Pandey

    Have a look at help.sap.com:
    Interactive Forms based on Adobe Software: http://help.sap.com/saphelp_erp2005/helpdata/en/b7/64348655fb46149098d95bdca103d0/content.htm
    Inserting a Table or Loop: http://help.sap.com/saphelp_erp2005/helpdata/en/4c/9cc19e5c874091a99790e540b06f3a/content.htm

  • Linking to PDF Files in the Table of Contents

    I know there is a way to link directly to a PDF file from the table of contents, but it is beyond me. Anyone know how to do this?
    Thanks, Kathy

    Hi Kathy
    It's probably advisable to first add the PDF as a Baggage File. Then when you are creating the TOC entry point, you point at the Baggage File.
    If that wasn't what you were hoping to see, you could also do it by simply linking to the PDF as you would a Web Page by specifying the explicit URL that would open the same file if you were doing it using a browser.
    Cheers... Rick

  • DisAssembling  a PDF based on a text string on the page

    I am looking for some guidance (and an example, if at all possible) on how to disassemble a multipage PDF based on text such as "Tax ID" that appears on certain pages. I need to break up a document of 1000 pages, of which 100 pages may contain the text "Tax ID" for 100 different people, and I would like 100 separate PDFs as output, each holding the one page with that person's "Tax ID". In addition, it would be great to extract the value next to the text "Tax ID" so that the PDFs can be named accordingly.
    The challenge is how to get the page numbers that contain the "Tax ID" text, along with the text sitting to the right of it. Once I have that, I can simply feed the information back into Assembler via the DDX for extraction.
    Any help here would be greatly appreciated.

    You've posed an interesting problem. Here is one approach, which requires adding a few steps to your Workbench process:
    1. Invoke the Assemble service with a DDX that extracts text information from the original PDF.
    2. Invoke the XSLT service to convert the extracted text info into a Bookmark file.
    3. Invoke the Assembler with a two-part DDX that imports the Bookmark file into the original PDF and then uses the added bookmarks to disassemble the PDF.
    Invoke the Assemble service with a DDX that extracts text information from the original PDF
    Here is a DDX that extracts text info:
    <DDX xmlns="http://ns.adobe.com/DDX/1.0/">
      <DocumentText result="text">
        <PDF source="myOriginalPDF"/>
      </DocumentText>
    </DDX>
    The result will be an XML file with this appearance:
    <?xml version="1.0" encoding="UTF-8"?>
    <?xml-stylesheet type="text/xsl" href="C:\Adobe\TaxID.xslt"?>
    <DocText xmlns="http://ns.adobe.com/DDX/DocText/1.0/">
        <TextPerPage>
            <Page pageNumber="1">to market to market</Page>
            <Page pageNumber="1">TAX ID 1111 Gee I owe a lot of money to the IRS . How could this be ?</Page>
            <Page pageNumber="2">TAX ID 2222 We all owe lots of money</Page>
            <Page pageNumber="3">TAX ID 3333 We all owe lots of money</Page>
        </TextPerPage>
    </DocText>
    Invoke the XSLT service to convert the extracted text info into a Bookmark file
    Here is an XSLT that converts the text info into a Bookmark file:
    <?xml version="1.0" encoding="UTF-8"?>
    <xsl:stylesheet version="2.0" xmlns:xsl="http://www.w3.org/1999/XSL/Transform" xmlns:textInfo="http://ns.adobe.com/DDX/DocText/1.0/">
        <xsl:output method="xml" version="1.0" encoding="UTF-8"/>
        <xsl:template match="/">
            <Bookmarks xmlns="http://ns.adobe.com/pdf/bookmarks" version="1.0">
            <xsl:apply-templates/>
                </Bookmarks>
        </xsl:template>
        <xsl:template match="textInfo:Page">
        <xsl:variable name="myText" select="text()"/>
        <xsl:if  test='contains( $myText, "TAX ID")'>
            <xsl:variable name="taxID"
                select='substring($myText, 8, 4)'/>
                <Bookmark><Dest>
                <Fit>
                    <xsl:attribute name="PageNum">
                    <xsl:value-of select="@pageNumber"/>
                    </xsl:attribute>
                </Fit>
                </Dest>
                    <Title>
                    <xsl:value-of select="$taxID"/>           
                    </Title>
                </Bookmark>
            </xsl:if>
        </xsl:template>    
    </xsl:stylesheet>
    Here is the result of this XSLT applied against the example text info:
    <?xml version="1.0" encoding="UTF-8"?>
    <Bookmarks xmlns="http://ns.adobe.com/pdf/bookmarks" version="1.0">
        <Bookmark xmlns="">
            <Dest>
                <Fit PageNum="1"/>
            </Dest>
            <Title>1111</Title>
        </Bookmark>
        <Bookmark xmlns="">
            <Dest>
                <Fit PageNum="2"/>
            </Dest>
            <Title>2222</Title>
        </Bookmark>
        <Bookmark xmlns="">
            <Dest>
                <Fit PageNum="3"/>
            </Dest>
            <Title>3333</Title>
        </Bookmark>
    </Bookmarks>
    If you use this XSLT, you should refine it to search for the string "TAX ID" at the beginning of the page rather than anywhere in the page. You should also improve the identification of the TAX ID number to be independent of the length.
    Invoke the Assembler with a two-part DDX
    Write a DDX that imports the Bookmark file into the original PDF and then uses the added bookmarks to disassemble the PDF.
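
    The two refinements suggested above (match "TAX ID" only at the start of the page, and do not assume a fixed ID length) can be sketched outside XSLT as well. The Python below is only an illustration of the matching logic, not part of the Workbench process; it assumes the ID is the first run of non-space characters after "TAX ID", and the names `TAX_ID` and `tax_id_pages` are hypothetical.

```python
import re
import xml.etree.ElementTree as ET

# Namespace used by the extracted text info shown above.
DOCTEXT_NS = {"t": "http://ns.adobe.com/DDX/DocText/1.0/"}

# Anchored at the start of the page text. Assumption: the ID is the
# first run of non-space characters after "TAX ID", of any length.
TAX_ID = re.compile(r"^\s*TAX ID\s+(\S+)")


def tax_id_pages(doctext_xml):
    """Return (page_number, tax_id) pairs for pages that begin with 'TAX ID'."""
    root = ET.fromstring(doctext_xml)
    pairs = []
    for page in root.iterfind(".//t:Page", DOCTEXT_NS):
        match = TAX_ID.match(page.text or "")
        if match:
            pairs.append((int(page.get("pageNumber")), match.group(1)))
    return pairs
```

    Each returned pair corresponds to one Bookmark entry: the page number goes into the destination and the ID into the title.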

  • View the pdf stored in content server

    Hello,
    Can I open a PDF stored in the content server from the web? The web-based application is a non-SAP application.
    The content server is connected to my SAP system, and by using FM ALINK_RFC_DOCUMENTS_GET I get the required link. I am not sure how I can open this link from outside the SAP system (inside the SAP system the PDF can be opened using FM OBJECT_DISPLAY_CONNECTIONS).
    I have also gone through the SAP link below, but no luck.
    http://help.sap.com/saphelp_nw70/helpdata/en/9b/e8c18ceaf811d195580000e82deb58/frameset.htm
    Thank you,
    Bhavana

    Hi
    I'm not sure you can use this URL outside your SAP System and open it without providing any kind of security credentials.
    Anyway what you could do is use ALINK_RFC_TABLE_GET to get your PDF in binary format and compose it in your Web Application
    Regards

  • Producing PDFs with 3D content on Mac OS X

    Hello everybody,
    Point 1: I am looking for the best workflow of producing PDFs with 3D content.
    Point 2: We are a design studio that works entirely on the Mac platform, so there is no chance of having Acrobat Pro Extended in the workflow. (Yes, of course, buying a cheap Windows box or running XP on an Intel-based Mac is an option, but I don't favour such a solution and will not go for it unless there is no other way.)
    The ultimate goal is as follows:
    Insert textured 3D object with transparent background into pre-made and designed PDF background file, place it so it seamlessly integrates with the background. Then optionally, add some interactivity like buttons for switching between design options etc.
    What have I achieved so far:
    1. I create textured 3D object with baked shading etc. in Blender.
    2. I export the data into Collada .dae file
    3. I import .dae file into meshlab
    4. From meshlab I export data into u3d - apart from the 3D file itself, it also creates a .tex file which ...
    5. ... I open in TeXShop (LaTeX GUI frontend), where the 3D PDF is finally compiled and created.
    So, I can make a 3D PDF file, but the object has a white background and I cannot integrate it into the design slide. That's about it.
    Now the questions:
    Is there someone else trying to do the same stuff so we can share the ideas and knowledge?
    Where is the place to mess with the transparency? Is it to be set in the .tex file? Can the Mac OS X version of Acrobat Pro help?
    Can we disable all lighting when displaying the 3D file in PDF and use only the shading from the 3D software that is baked into the textures?
    Ideas?
    Thank you,
    Petr Ludvik

    Dear Petr,
    Transparency - upload a minimal example of what you want (do it in Acrobat first).
    Consult PDF spec, MeshLab and Asymptote forums (Asymptote is a 3D drawing tool for LaTeX,
    people there know how to integrate 3D pictures into PDF LaTeX way).
    Light and textures - make the material for the surface emit white color and set the diffuse and specular colors to black.
    Like at http://www.iaas.msu.ru/tmp/u3d/cloudq.pdf.
    Ideas:
    Writing a U3D exporter for Blender is not harder than writing a VRML one. If you have Blender programming skills, it may be the way to go.
    I can advise you; I have just made a U3D exporter for VTK.
    If Blender/MeshLab/LaTeX are not part of your regular toolchain
    but just a ragtag converter for data that comes from your own program, consider exporting directly to U3D
    (there is a text intermediate format, IDTF, that is relatively simple).
    Tell more about your toolchain in use, what is at the beginning (before Blender) and at the end (how you utilize the PDF with just 3D model in it).
                Sincerely, Michail
    PS. Other wording:
    Transparent background - is it doable in Acrobat?
    If yes - look into PDF and see how (what elements of what dictionaries are set),
    having PDF spec at your side.
    Upload a minimal example of a PDF file with desired features somewhere to allow me
    and others to have a look at it.
    Consult LaTeX sources of information to see if LaTeX with movie15 package is capable to do what you want
    (Asymptote forum may be the best place).
    If they do not - they may be fixed (movie15 is in active development) or a separate utility may be used.
    But first - show and, if possible, describe in technical terms what you want to achieve.
    As to lighting - make the material of the mesh you apply texture to to emit white light and
    set diffuse and specular colors to black.
    If you have programming skills and know Blender well, writing a U3D exporter for Blender
    should not be that hard; there is a simple intermediate text format.
    I can collaborate (I have just made a VTK exporter).
    That may also be an option: use VTK instead of Blender
    and you are almost there (well, my exporter does not have texture support now,
    but I can add it if there is someone to test).
    Did you ask these questions on a MeshLab forum?
    If your data comes from your own program - it may be simpler to write
    into U3D immediately (via that text intermediate format, IDTF).

  • PDF-based forms

    Need help: how do I create PDF-based forms?

    Hi,
    1. Why do you go for PDF Based Forms when you have SAP Scripts/Smart Forms?
    In US and Europe under section 508 each and every business document should be accessible for impaired people.
    Communication is possible using XML.
    We can work off line (when SAP system is down).
    2. The Settings involved in SAP Scripts/Smart Forms.
        All Fonts, output type, Driver programs and printer details.
    Myriad Pro (some font name like that) is not available in PDF form. For other fonts no extra setting is required
    Changes in the driver program:
    •     Get the name of the generated function module:
          CALL FUNCTION 'FP_FUNCTION_MODULE_NAME'
    •     The print job needs to be opened exclusively for forms using the new interface:
          CALL FUNCTION 'FP_JOB_OPEN'
    •     Now call the generated function module (it is generated after the PDF form is created):
          CALL FUNCTION gv_fm_name
    •     Close the spool job:
          CALL FUNCTION 'FP_JOB_CLOSE'
    3. Are the same fonts and printers supported after conversion to PDF-based forms?
                Yes
    4. What difficulties are faced during conversion? What limitations of PDF-based forms come up when converting Smart Forms/SAP Scripts?
         Certain conditions in SAP Scripts and Smart Forms cannot be handled in PDF directly. PDF forms have something called scripting, which can handle such exceptions. But as good programming practice we should use it rarely, because it affects the performance of the form.
    5. Limitations of PDF-based forms as such?
         Nested tables: there is a problem from an accessibility point of view.
         Headers on different pages. This is again an accessibility problem.
         Preprinted forms.
    6. Procedure for assigning output types for PDF-based forms and testing them?
         There is some Basis problem, which is why I am not able to view the layout.
    7. Printer configuration in PDF-based forms and the paper size?
    •     Default printer should be Local
    •     Page Scaling should be None
    •     Choose paper source by PDF page size.
    Points please, if this information is useful to you.
    Regards,
    Raghunath.S

  • Conditional format of one cell based on contents of another cell

    I'm preparing a list of appointments for visitors, with a time for each. There are four possible status categories for visitors, based on projected activity and whether they have shown up or not (A Group Scheduled, A Group Showed, B Group Scheduled, B Group Showed). I've created the calendar in iCal with different calendars, then accessed that through Bento iCal Events. Bento adds the column with the name of the calendar from iCal. The info is now in Numbers (iWork '09).
    It shows Title, Start Date, Location, Calendar. I'm adding more fields using Lookup in Numbers drawing on entire data base brought over from Bento. I'm trying to get what info I need, but none that I don't, so the form isn't too crowded and busy.
    I would like to be able to conditionally format the Time column based on the contents of the Calendar column. If successful, the color of the Time column would tell me all I need to know about visitor status, and I could hide the Calendar column and clean up the form.
    I hope I've described it adequately. Can this be done, and if so, how?
    Thanks! Have a great Thanksgiving!

    jpcranch wrote:
    Does it work to format two cells based on contents of one of them?
    Conditional formatting of a cell depends on the contents of that individual cell, compared with another value, so no.
    But you could use a two-table approach.
    In general terms, the technique is to:
    • make the original table's cell fill transparent,
    • Add a second table to the sheet,
    • Set the second table to copy the control values into a column of cells,
    • Select the cells in that column of Table 2,
    • Apply conditional formatting rules for each possible control value (see below)
    • Set the text opacity for these cells to 0 (full transparency),
    • Click on Table 2 in the sidebar, then use (shift and) the arrow keys to slide Table 2 behind Table 1 to align the highlighted cells with the cells to appear highlighted in Table 1.
    • Set the colour of the cell borders on table 2 to none.
    Using the information you gave regarding the calendar column, you'll need four rules for conditional formatting of this column. All are of the same format:
    Text contains: A Group Scheduled
    With the text changed to match each possible entry.
    Regards,
    Barry
    BTW: by changing the width of the formatted column on table 2, you can appear to highlight as many columns as you wish on Table 1.

  • Query on PDF-BASED FOLIO - INTERNALS - ie .FOLIO files.

    When we create an image-based folio, unzip it, and look inside, we see all the non-interactive elements exported as JPEG and PNG.
    I would like to understand when the non-interactive content is exported as PNG and when as JPEG.
    Along similar lines, when we create a PDF-based folio and unzip it, we see only the base content (in Folio Spec terms, assetRendition) and the assets of ScrollableFrame overlays (in Folio Spec terms, overlayAsset) as PDFs.
    For the rest of the overlays, the non-interactive content is still PNG/JPEG.
    I would like to understand the rationale behind this.
    Or is there a setting to force PDF for all overlays?

    With the exception of MSOs and scrollable content, overlays are rasterized. Scrollable frames are exported as vector by default. MSOs are rasterized by default.

  • Scrollable frame in PDF-based folio prevents page swipe

    Can someone confirm for me that scrollable frames in a PDF-based folio prevent one from being able to use that area to swipe to the next page?
    Thanks,

    I confirm too.
    We publish a weekly magazine for iPad2 and iPad3.
    iPad2 folio is a jpg one, and we use pdf format for iPad3.
    Scrollable full page contents work fine on iPad2 but prevent swiping on iPad3.
    Our solution: creating "cold zones" on the edges of the pages using buttons (setting them to "navto://nowhere"). I think it's the simplest way to make it work.

  • Want to update data in a view based on multiple tables

    Hi
    I am facing a problem: I want to update data in tables using a view. As that view is based on multiple tables, I am unable to update the data. I came to know that a table can be updated through a view only if the view is based on a single table. If anyone knows an alternative, please let me know.
    Thanx
    Devinder

    Devinder,
    The table can be updated through a view based on multiple tables if and only if the table is a "key-preserved" table. Rather than explaining it myself, I avoided the burden of typing by finding the material in the Oracle docs and pasting it for you :-)
    If you want a join view to be updatable, all of the following conditions must be
    true:
    1. The DML statement must affect only one table underlying the join.
    2. For an INSERT statement, the view must not be created WITH CHECK
    OPTION, and all columns into which values are inserted must come from a
    key-preserved table. A key-preserved table is one for which every primary
    key or unique key value in the base table is also unique in the join view.
    3. For an UPDATE statement, all columns updated must be extracted from a
    key-preserved table. If the view was created WITH CHECK OPTION, join
    columns and columns taken from tables that are referenced more than once
    in the view must be shielded from UPDATE.
    4. For a DELETE statement, the join can have one and only one key-preserved
    table. That table can appear more than once in the join, unless the view was
    created WITH CHECK OPTION.
    HTH
    Naveen
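
    To make the quoted definition concrete, here is a small Python illustration that checks whether a base table's key values stay unique in the rows produced by a join. It is only a data-level picture of the idea: Oracle decides key preservation from the schema (the declared keys and the join conditions), not by inspecting data. The function name `is_key_preserved` and the sample column names are made up for the example.

```python
def is_key_preserved(join_rows, key_columns):
    """True if every key value of the base table appears at most once
    in the rows produced by the join."""
    seen = set()
    for row in join_rows:
        key = tuple(row[col] for col in key_columns)
        if key in seen:
            return False  # the same base-table key shows up in two join rows
        seen.add(key)
    return True


# Rows of a hypothetical employee/department join.
rows = [
    {"emp_id": 1, "dept_id": 10},
    {"emp_id": 2, "dept_id": 10},
    {"emp_id": 3, "dept_id": 20},
]
emp_preserved = is_key_preserved(rows, ["emp_id"])    # employee keys stay unique
dept_preserved = is_key_preserved(rows, ["dept_id"])  # department 10 repeats
```

    In this picture the employee table is the one whose columns could be updated through the view, because each of its rows appears only once in the join.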

  • How to change background color of text in pdf based by font name

    Hi
    How can I change the background color of text in a PDF based on font name? Is there an option in JavaScript? For example, if the PDF contains the Arial font, the background of the Arial text should be changed to red on all pages, and likewise for every other font in the PDF, each with a different color.
    Thanks in Advance

    Hi
    1) Is it possible to highlight text in different colors based on font, using JavaScript?
    2) How can I list the fonts used in a PDF using JavaScript?
    3) How can I highlight the text using JavaScript?
    Thanks in Advance

  • How to add a new row in Tabular Form based on a table

    Hi
    I have tabular form based on a table.
    I want the table to have an empty row when there is no data
    in the table so that I can enter data directly.
    But right now, whenever the page is launched, it shows a "no data found" message and I have to press the 'Add Row' button to enter data.
    Can anyone help me out on this?
    Thanks

    Hi Leo
    Your suggestion works fine in the APEX 2.1
    But in 3.0.1 it gives this error :
    Error in add row internal routine: ORA-01476: divisor is equal to zero
    Error Unable to add rows.
    I am not sure why this happens.

  • Exception Handling for a Form Based on a Table

    I created a form based on a table. If a user tries to enter a
    record with key data that matches an existing record, Oracle
    Portal creates a page and displays:
    Error:
    An unexpected error occurred: ORA-00001: unique constraint
    (TIMETRACK.SYS_C007185) violated (WWV-16016)
    How can I capture this exception so that I can display a
    friendlier error message (via a JavaScript alert or other
    means) instead of this page?
    Fran

    James, I tried your suggestion as follows:
    doInsert;--- This is the default handler
    Exception
    when DUP_VAL_ON_INDEX then
    p_session.set_value(
    p_block_name => "_block",
    p_attribute_name => '_STATUS',
    p_value => ' Time has already been entered for this project on
    this date. Click the Back button to return to the MIM Time
    Entry page. ');
    Raise;
    end;
    It did nothing. All I got was the default message.
    I then changed '_STATUS' to 'A_STATUS'. The resulting error
    message was: "Error: (WWV-00000)"
    ON A RELATED NOTE...
    I discovered while testing this form on IE 5 on a Mac, that
    additional error messages are generated. With IE 5 on WindowsNT
    only a message regarding the unique constraint is produced. On
    the Mac there is an additional message "No conversion performed
    for type INTEGER, value is NsNu (WWC-49102)"
    Anyone have any thoughts as to why the output of system
    generated errors should differ between a PC and a Mac?
    Fran
