Split Word Document

I need to split a word document into individual pages/files. How can this be done?

Then export to .pdf will be quickest and easiest.
Open it in *Acrobat Pro*.
+Menu > Document > Split Document… > Numbers of Pages > Maximum 1+
Nominate which folder you want to export to.
The files will be named *Original File Name_Part1.pdf* etc
You can then use an application like Renamer4Mac to rename the files how you would like them or use an Automator action to do the same or with more work append the student name to each file.
It seems to me this may be working on the horse after it has bolted.
Ideally you would want a sequential merge creating the files and possibly using the student name when it merges and saves. That definitely needs scripting and for that Yvan is your man.
Peter

Similar Messages

  • Splitting word document - Java API

    Hi , I would like to split a word doc in to a single document.
    ex: if i have a document with 10 pages i need to split the doc in to each one page.
    Can some one pls let me know how to do it ?? is there any Java API available ???
    Thanks in adv

    pk1234 wrote:
    Hi , I would like to split a word doc in to a single document.
    ex: if i have a document with 10 pages i need to split the doc in to each one page.
    Can some one pls let me know how to do it ?? is there any Java API available ???
    Thanks in advThis is going to be extremely difficult unless you get very lucky. I would look for new requirements or get better defined requirements.
    The problems are basically two in nature.
    1) Reading and writing Word docs sucks in general. It sucks very much for Java. Unless there is a compelling reason otherwise I would recommend using the open office plugin technique others have used rather than POI.
    2) Detecting "page" boundaries is about next to impossible in Word. Unless you are lucky enough to be able to have a document that a user has inserted page breaks into. Then maybe.
    I would in general recommend that your best approach is not to do this. Wherever the word document is being generated do what you need to do at that step.

  • Export and split mail merge to Word Documents

    Hi there,
    I have a mail merge which creates one very long Pages document. I need to split the pages document into separate MS Word documents. Any suggestions?

    I should specify that I am dealing with a large number of documents and am looking for an automated solution. Essentially I want to do a mail merge but end up with individual MS Word documents for each person in the mail merge.
    So if I have 3 people:
    Alex
    Bob
    Cathy
    And I have one document which gets mail merged, I wanted to end up with
    Alex.doc
    Bob.doc
    Cathy.doc

  • Acrobat XI Pro Won't Convert 206 Page Word Document into PDF

    Hi there
    As menetioned above, Acrobat XI Pro Won't Convert my 206 Page Word Document into a PDF.  The Word document was originally a PDF file that I converted to Word and it has split all the text into sections.
    It sounds like converting a PDF into Word isn't the best way to edit, re-format and then save as a PDF again.  I would love to hear your advice on this.
    Thanks very much for your help!
    Fiona

    First before you recreate the PDF from the Word Document.
    In word: Open Document
    Next open a new Blank docment
    switch back to Word click on the ¶ button
    scroll to of go to very end of docment.
    click just to right of the perion in the last sentence.
    now go to very beginning of document
    Hold down the Shift and click to the right of first letter in document.
    Now choose copy.
    Now switch to Blank document
    Choose Paste special.
    Now choose Text only.
    If works all the words will be there spaced correct but with no ¶'s.
    Now insert returns as desired.
    Now save as a docx file under a different name.
    IF you are on a Mac use the following directions:
    go to File menu > Print > PDF Hold down PDF button until Context menu pops up.
    Choose adobe PDF.
    follow steps when the first window opens.
    Save as PDF in desired location.
    Now open the PDF in Acrobat. Document should be properly formatted and ready to go.
    AS you've found The conversion is not seamless. Acrobat doesn't distingish between automatic end of line breaks and Returns and you have to put the pieces back again.  I wish Adobe and MS would get over the jealouscy of each other and share howcode works so Thatapplications could work seamlessly together.  BUt they never will.

  • How can I display the date a word document was last modified on my web page?

    I have lists of links to word documents on my web pages. I
    want to be able to show the date the word document was last
    modified next to each link. My pages are straight html.
    Thanks.

    >but I would have to go to each individual file (I have
    hundreds) and put this code on each link.
    No, you can use the FileSystemObject to get a list of the
    files in the directory. See if you can adapt this for your site:
    <link href="/CSS/dirlist.css" rel="stylesheet"
    type="text/css">
    <%
    Function FormatSize(SizeInBytes)
    if SizeInBytes > 1024^2 then
    FormatSize = formatnumber(SizeInBytes / 1024^2,2) & "
    MB"
    elseif SizeInBytes > 1024 then
    FormatSize = formatnumber(SizeInBytes / 1024,2) & " KB"
    else
    FormatSize = SizeInBytes & " Bytes"
    end if
    end function
    %>
    <%
    dim parts
    dim part
    dim BrowseLine
    dim RelLink
    dim i
    url = Request.ServerVariables("URL")
    Parts = split(URL,"/")
    for part = 1 to ubound(Parts)-1
    ' calc link
    RelLink = ""
    for i = 1 to ubound(Parts) - part - 1
    RelLink = RelLink & "..\"
    next
    If trim(BrowseLine) <> "" then
    BrowseLine = BrowseLine & "/"
    end if
    BrowseLine = BrowseLine & "<A href=" & RelLink
    & ">" & parts(part) & "</A>"
    next
    ' add "back" link
    BrowseLine = Browseline & "  " &
    "<A href='javascript:history.back();'><img
    src='/images/return.gif' border=0></A>"
    Response.Write "<P class = 'NavTop'>" & BrowseLine
    & "</P>"
    Dim fso
    set fso = server.CreateObject("Scripting.fileSystemObject")
    dim fldr
    dim fle
    %>
    <a href="../index.asp"><img
    src="../images/leftbanner.jpg" width="343" height="70"
    border="0"></a>
    <link href="/CSS/dirlist.css" rel="stylesheet"
    type="text/css">
    </p>
    <p></p>
    <table width="85%" border="0" cellspacing="0"
    cellpadding="0">
    <tr>
    <th width="63%"><div
    align="left">   Document
    Name</div></th>
    <th width="14%">Size</th>
    <th width="23%"><div
    align="left">  Date</div></th>
    </tr>
    <%
    ' display folders
    for each fldr in
    fso.GetFolder(server.MapPath(".")).SubFolders
    %>
    <tr>
    <td class="FileName"> <a href='<%=
    fldr.Name %>'><img src="/images/folder.gif" border=0>
    <%= fldr.Name %></a></td>
    <td class="FileSize">  <%=
    FormatSize(fldr.Size) %></td>
    <td class="FileDate"><div
    align="left">  <%=
    FormatDateTime(fldr.DateLastModified,2)
    %></div></td>
    </tr>
    <%
    next
    %>
    <%
    ' display files
    for each fle in fso.GetFolder(server.MapPath(".")).Files
    dim ext
    ext = fso.GetExtensionName(fle.path)
    if (ext <> "asp") and (ext <> "scc") then
    %>
    <tr>
    <td class="FileName"> <a href='<%=
    escape(fle.Name) %>'><img src="/images/file.gif"
    width="14" height="16" border=0>
    <%= fle.Name %></a></td>
    <td class="FileSize">  <%=
    FormatSize(fle.Size) %></td>
    <td class="FileDate"><div
    align="left">  <%=
    FormatDateTime(fle.DateLastModified,2) %></div></td>
    </tr>
    <%
    end if
    next
    Set fso = Nothing
    Set fle = Nothing
    Set fldr = Nothing
    %>
    </table>
    <p><a href="/index.asp">Return to Home
    Page</a></p>

  • Multi-Section Word Document Won't Save as Single pdf

    I'm using MS Word 2004 and have a document split into 3 sections (so to permit landscape printing of the middle section containg a bunch of big tables, portrait on the two others containg text mostly, before and after).
    When I try to create a pdf within the Word Print Dialog, 3 separate pdfs are created -- one for each section. Is there a way to get it to create just 1 pdf?
    I don't do this sort of thing very often, so hope there is some easy, inexpensive way. I thought about copying the tables and then pasting as a picture, rotating it, but the margins for the middle center section are also different (so to fit the tables).
    Any help or ideas are most appreciated.
      Mac OS X (10.3.6)  

    Is this a Word document generated by Adobe's RoboHelp product or just a document generated directly in Word?
    If the latter, you are asking in the wrong place. Even if you are using RoboHelp, this looks more like a question to ask in the Acrobat forums.
    http://forums.adobe.com/community/acrobat
    See www.grainge.org for RoboHelp and Authoring tips
    @petergrainge

  • Inserting Multiple Page PDF in Word Document

    I've been searching for an answer to what I thing ought to be a rather simple question and I can find nothing. I have a couple of users who have attempted to insert a multiple page PDF into a Word document and found that only the first page of the PDF
    shows up in their document. Is there any method to insert multiple page PDFs into Word as multiple pages without splitting the PDF into individual single page files and inserting individually?
    Thanks...Ray

    A quick and dirty way to do this is to open the pdf document in question with word separately.   You get the following warning message.
    I can then select all and copy/paste in to my word document.
    The few pdfs I tried this with pretty much turned in to all images when I did this, but it worked my purposes.
    I did end up having to resize one that didn't seem to want to fit when I pasted it in.  There may be a method to select an image and automatically fit it within the margins of the page it is on that would help that.
    The file size on my resulting word document seemed to grow quite a bit as well.  Again, not really an issue in my case though.

  • Importing the word document in RoboHelp

    Hi,
    I am importing a word document into Robohelp 7 HTML. But after the import is completed, the entire word document is converted into a single HTML page. My requirement is that each chapter should be created into seperate HTML file based on the heading level. This will help in creating the book based on the heading level. I was able to do this task successfully with RoboHelp X5 but not with any other higher versions!!!!!

    Hi there
    The key to this is to ensure the Word document has been styled consistently with headings and other formatting. If it's one of the garden variety where folks just painted across text all willy nilly you will indeed see mixed results.
    Pay careful attention to the intermediate dialogs you see when importing the document. The second one should offer a way to split on style.
    Cheers... Rick
    Click here for Adobe Authorized Captivate and RoboHelp HTML Training
    Click here for the SorcerStone Blog
    Click here for RoboHelp and Captivate eBooks

  • How to Protect a Word Table in a Word Document without Protecting the Word Document

    Is there any way we can Protect a Word Table alone without Protecting the Entire Word Document?
    or
    My requirement is to Protect the Table from Editing but at the same time, the other default word features like, Track Changes, Page Orientation etc., shouldn't be disabled. Because while the document is Protected, I couldn't Accept or Reject Track Changes,
    the menu options are being disabled.
    Any kind of help is much appreciated!
    Thanks.

    Hello,
    This forum is for discussions and questions regarding profiles and Microsoft's recognition system on the MSDN and TechNet sites. It is not for products/technologies.
    As it's off-topic here, I am moving the question to the
    Where is the forum for... forum.
    Karl
    When you see answers and helpful posts, please click Vote As Helpful, Propose As Answer, and/or Mark As Answer.
    My Blog: Unlock PowerShell
    My Book:
    Windows PowerShell 2.0 Bible
    My E-mail: -join ('6F6C646B61726C406F75746C6F6F6B2E636F6D'-split'(?<=\G.{2})'|%{if($_){[char][int]"0x$_"}})

  • Save as a word document or rtf

    Hello
    I have a problem that is driving me crazy.  I have a client that has a book in pdf form.  He needs to do some editing but he doesn't have adobe acrobat, nor can he afford it.  So I saved the pdf as a word document, the problem is, the paragraph formatting changes for the last paragraph on a page and consequently the last line in that paragraph is indented on the left by 2 or 3 character spacings.  When checking the formatting in word, the  last line seems to have been split off and forms a new paragraph.   Any help fixing this would be appreciated.

    The quality of PDF export is dictated by the quality of the PDF. This is the "build" or under-the-hood quality and not what you "see".
    (Perception is not reality).
    The export reflects the input - GIGO is operative.
    You'll have to do clean up of the Word file(s).
    Be well...

  • HT2506 I am trying to convert  word document to a PDF but only half the document accurately converts. Is there a way to fix this?

    I am trying to convert  word document to a PDF but only half the document accurately converts. Is there a way to fix this?

    I have found that complicated document structures causes Word to split the document into parts, but I've always gotten the whole thing, in multiple documents.
    However, with two pages, I can't imagine it is that complicated.
    Any odd fonts or paragraph styles?

  • Cant print online word document

    I am having trouble printing an online word document. I have an HP C6380.
    When I click print the print queue shows as spooling but nothing ever prints.
    Can anyone help.
    Thanks.

    Hi,
    According to your description, I selected the situations which your said above.
    1. A lot of Domain account could print from is IE10 to Onenote, but IE 8 not.
    2.When Domain account print to Onenote, but it hanged.
    3. Word works well if runned as administrator.
    From above, I think the printer setting was Ok, beacuse you could print something by using Onenote to physical printer.
    As far as I know, OneNote stores a raster and a vector format for the printout images. It will first load the raster image (quicker to load) then quickly switch to vector image (hi res/sharper). The image you see for a split second is the raster
    image and then vector format fails to render. I believe you did not lose any data, just that it fails to render due to a configuration issue.
    OneNote uses .NET to render the vector format and I have seen cases where the installed .NET version or not installed .NET updates would be a problem. I recommend you first make sure you have
    Win7 SP1 and Office and .NET updates that show up in Windows Update are all installed and retry after that.
    And then,please check the link:http://www.onenote-tips.com/printfails.html, it may be help for you.
    If the issue exists, Please use process monitor to selcet the log when the issue occurs.
    Regards,
    George Zhao
    TechNet Community Support

  • Linking a Word Document

    Hi Guys,
    Apologies if this is really stupid question, but I'm having problems I can't sort out.  I am trying to educate project managers who report in MS Word and store data in MS Excel - because they like colours and can manipulate simple Excel features.
    My intermediate solution was to interrogate the Excel data in Crystal Reports ver 10 (or BusinessObjects Enterprise XI) and link the basic report, a Word document (as the report header). The idea being that project managers could update the Word document and continue to use Excel - and Crystal would tidily put the two together. 
    However - no probs with Excel data but if I link the Word document into the header (or any other report section) I get ony the first table of the document!  The Word document is a series of tables one of which spans several pages, with the tables formatted to allow rows to split over a page break.  I can link the individual tables - but the version produced always stops at a page break.
    All ideas very gratefully received - either to solve my problem or suggest a better way of working completely.
    Barrie

    Only the 1st page will be displayed. This is by design. For possible work-arounds see the following KBases:
    [1211717 - Crystal Reports only displays first page of multiple-page OLE object|http://www.sdn.sap.com/irj/scn/go/portal/prtroot/docs/oss_notes_boj/sdn_oss_boj_erq/sap(bD1lbiZjPTAwMQ==)/bc/bsp/spn/scn_bosap/notes%7b6163636573733d36393736354636443646363436353344333933393338323636393736354637333631373036453646373436353733354636453735364436323635373233443330333033303331333233313331333733313337%7d.do]
    [1202196 - How to import Microsoft Word documents into Crystal Reports|http://www.sdn.sap.com/irj/scn/go/portal/prtroot/docs/oss_notes_boj/sdn_oss_boj_erq/sap(bD1lbiZjPTAwMQ==)/bc/bsp/spn/scn_bosap/notes%7b6163636573733d36393736354636443646363436353344333933393338323636393736354637333631373036453646373436353733354636453735364436323635373233443330333033303331333233303332333133393336%7d.do].
    Ludek

  • Infopath Multi Textbox and OpenXML to generate word document

     Hi,
    I'm reading multiple InfoPath Textbox (set to display multiple lines) using OpenXML and generating a word document based on a template collecting data from InfoPath. I'm using following code,  the code gives error in
    AddMultiLineText at Paragraph p = sdt.GetFirstChild<SdtContentBlock>().GetFirstChild<Paragraph>(); 
    The code actually works when I use only one Multi Textbox but fails if its more than one. I'm quite new to Infopath and OpenXML, I would appreciate some help ASAP as its holding me in my project:
    Event Receiver
      public override void ItemAdded(SPItemEventProperties properties)
               try
                   this.EventFiringEnabled = false;
                   string siteUrl = properties.WebUrl;
                   SPListItem item = properties.ListItem;
                   SPWeb web = properties.Web;
                   SPList lib = web.Lists["DocLib"];
                   SPFile file = lib.RootFolder.Files["template.docx"];
                   string generatedDoc = "";
                   if (file != null)
                       byte[] templateBytes = file.OpenBinary();
                       using (MemoryStream ms = new MemoryStream())
                           ms.Write(templateBytes, 0, (int)templateBytes.Length);
                           byte[] convertedDocBytes = ConvertInfoPathToWord(ms, item.File);
                           generatedDoc = item["LinkFilename"].ToString().Replace("xml", "docx");
                           SPFile newFile =
                             lib.RootFolder.Files.Add(generatedDoc, convertedDocBytes, true);
                           ms.Close();
     private byte[] ConvertInfoPathToWord(MemoryStream ms, SPFile file)
               byte[] bytes = file.OpenBinary();
               using (MemoryStream msInternal = new MemoryStream(bytes))
                   XmlDocument doc = new XmlDocument();
                   doc.Load(msInternal);
                   XPathNavigator root = doc.CreateNavigator();
                   root.MoveToFollowing(XPathNodeType.Element);
                   string ns = root.GetNamespace("my");
                   XmlNamespaceManager nsMgr = new XmlNamespaceManager(new NameTable());
                   nsMgr.AddNamespace("my", ns);
                    string multiLineText1 = root.SelectSingleNode(
                     "/my:myFields/my:field1", nsMgr).Value;
                   string multiLineText2 = root.SelectSingleNode(
                     "/my:myFields/my:field2", nsMgr).Value;
                    using (WordprocessingDocument myDoc =
                     WordprocessingDocument.Open(ms, true))
                       MainDocumentPart mainPart = myDoc.MainDocumentPart;
                       List<OpenXmlElement> sdtList = InfoPathToWord.GetContentControl(
                         mainPart.Document, "multilinetext1");
                       InfoPathToWord.AddMultiLineText(multiLineText1, ref sdtList);
                       List<OpenXmlElement> sdtList = InfoPathToWord.GetContentControl(
                         mainPart.Document, "multilinetext2");
                       InfoPathToWord.AddMultiLineText(multiLineText2, ref sdtList);
                        myDoc.Close();
                   msInternal.Close();
               return ms.ToArray();
            public static List<OpenXmlElement> GetContentControl(
              Document doc, string name)
                List<OpenXmlElement> list = new List<OpenXmlElement>();
                List<SdtBlock> sdtList = doc.Descendants<SdtBlock>()
                  .Where(s => name.Contains(s.SdtProperties.GetFirstChild<SdtAlias>()
                  .Val.Value)).ToList();
                if (sdtList.Count == 0)
                    List<SdtRun> sdtRunList = doc.Descendants<SdtRun>()
                      .Where(s => name.Contains(s.SdtProperties.GetFirstChild<SdtAlias>()
                      .Val.Value)).ToList();
                    foreach (SdtRun sdt in sdtRunList)
                        list.Add(sdt);
                else
                    foreach (SdtBlock sdt in sdtList)
                        list.Add(sdt);
                return list;
            public static void AddMultiLineText(
              string multiLineText, ref List<OpenXmlElement> sdtList)
                string[] lines = multiLineText.Split(new char[] { '\n' });
                if (sdtList.Count != 0)
                    foreach (OpenXmlElement sdt in sdtList)
                        for (int i = 0; i < lines.Length; i++)
    Paragraph p =
                              sdt.GetFirstChild<SdtContentBlock>().GetFirstChild<Paragraph>(); //this ERRORS
                            if (i == 0)
                                InfoPathToWord.WriteText(lines[i], ref p);
                            else
                                Paragraph pNext = sdt.AppendChild((Paragraph)p.Clone());
                                InfoPathToWord.WriteText(lines[i], ref pNext);

    Additional Info: multiLineText1 and multiLineText2 are "Plain Text Content Control" in the word template used to create the word document.

  • Can JAVA read Microsoft word document?

    Hi
    I am trying to make a program that read a word document, and split the document in to each pages and save each pages with file as it's footer of each pages.
    --- example-----
    test.doc
    Page 1 |
    |
    |
    |
    |
    |
    footer - hi |
    Page 2 |
    |
    |
    |
    |
    |
    footer - test |
    Page 3 |
    |
    |
    |
    |
    |
    footer - still |
    Page 4 |
    |
    |
    |
    |
    |
    footer - hahaha |
    Page 5 |
    |
    |
    |
    |
    |
    footer - lastpage |
    -----------------------> Out put of the program
    hi.doc , test.doc, still.doc, hahaha.doc, lastpage.doc
    Is there any way to import word document and process like that above?
    I have searched about this, and found out 'Jakarta POI' can access to word document but with many restrictions. (I heard it doesn't support word version 2003 and 97)
    Can any one please answer this?
    Thankx

    I'm not really sure that java is the best language to use for this particular project. I haven't looked into writing a standalone application to do this sort of thing, but Microsoft Word has a built in VB editor that allows you to write scripts that will do this very simply, using Word's Document Object Model.
    I would imagine the same sort of the thing could be done from a Stand-alone aplication as well, but you might need some libraries for reading the document and obtaining the object with the same model. I would look into Microsoft's .net solutions for this one. You might even be able to work out a solution in C++, C#, or even J# (which I understand is "similar" to java), if you don't know visual basic.
    It is possible that there is a library out there that will read a Word Document, and create the appropriate Java object with the same object model, but I don't know where it can be found.
    Anyway, if the option is available to you, take a look at Microsoft's .Net. You'll probably have more luck that way. Good Luck!

Maybe you are looking for

  • Compatability Issue: Need Help Disabling Additional Functions - URGENT!

    I am experiencing a problem with running Pro Tools 8.0.4 LE on my HP Pavilion dv7, which is encountered as a DAE error -6006 when loading Pro Tools. Research indicates that this error relates somehow to the firewire connection, and I am able to use P

  • Two problems with displaying images in JPanel

    Hi, I need to display some small rectangular images inside a JPanel. These images also need to be componenents as they will have listeners associated with them. I have never had any experience with images, so I might totally be down the wrong path. I

  • Reinstalling wired desk top to Home Hub 4

    My wired desk top computer has been disconnected for 4 weeks. How do I reinstall broadband to my computer?

  • Integrate Web Service in SQL Server assembly

    Hi, I need to create an assembly in SQL Server 2010 who integrate a web service reference. What I did so far: 1) Create a SQL CLR database project in VS 2010 2) Add a Web Service reference. Service is added correctly and methods are exposed. 3) Add a

  • Itunesprefs.xm.

    How can I set the preferences to C:\Music for all users across the enterprise. Right now it seems to point to the default My Documents\my music folder. I can change it with the itunes software, which changes the itunesprefs.xml file but I cannot then