Split Word Document

I need to split a word document into individual pages/files. How can this be done?

Exporting to .pdf first will be the quickest and easiest way.
Open it in *Acrobat Pro*.
+Menu > Document > Split Document… > Numbers of Pages > Maximum 1+
Nominate which folder you want to export to.
The files will be named *Original File Name_Part1.pdf*, etc.
You can then use an application like Renamer4Mac to rename the files as you like, use an Automator action to do the same, or, with more work, append the student name to each file.
It seems to me this may be closing the stable door after the horse has bolted.
Ideally you would want a sequential merge creating the files and possibly using the student name when it merges and saves. That definitely needs scripting and for that Yvan is your man.
Peter
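
The renaming step described above could also be scripted. This is only a minimal sketch in Python, assuming the split files follow the *_Part1.pdf*, *_Part2.pdf* naming pattern mentioned above and that you already have the student names in the same order as the merged pages. The function name `rename_parts` and the list of names are hypothetical and are not part of Acrobat, Renamer4Mac, or Automator.

```python
import re
from pathlib import Path

# Matches the "_PartN.pdf" suffix that the split files are assumed to carry
PART_RE = re.compile(r"_Part(\d+)\.pdf$", re.IGNORECASE)

def rename_parts(folder, names):
    """Rename *_PartN.pdf files in folder to '<name>.pdf', ordered by N."""
    folder = Path(folder)
    parts = []
    for path in folder.iterdir():
        match = PART_RE.search(path.name)
        if match:
            parts.append((int(match.group(1)), path))
    parts.sort()  # numeric order, so Part10 comes after Part9
    if len(parts) != len(names):
        raise ValueError(f"{len(parts)} files but {len(names)} names")
    renamed = []
    for (_, path), name in zip(parts, names):
        target = path.with_name(f"{name}.pdf")
        path.rename(target)
        renamed.append(target.name)
    return renamed
```

Sorting on the numeric part rather than the file name matters once there are ten or more pages, since a plain alphabetical sort would put Part10 before Part2.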

Similar Messages

Splitting word document - Java API

Hi, I would like to split a Word doc into single-page documents.
For example, if I have a document with 10 pages, I need to split it into ten one-page documents.
Can someone please let me know how to do it? Is there a Java API available?
Thanks in advance.

pk1234 wrote:
Hi, I would like to split a Word doc into single-page documents.
For example, if I have a document with 10 pages, I need to split it into ten one-page documents.
Can someone please let me know how to do it? Is there a Java API available?
Thanks in advance.

This is going to be extremely difficult unless you get very lucky. I would look for new requirements or get better-defined requirements.
The problems are basically two in nature.
1) Reading and writing Word docs sucks in general. It sucks very much for Java. Unless there is a compelling reason otherwise, I would recommend using the OpenOffice plugin technique others have used rather than POI.
2) Detecting "page" boundaries is next to impossible in Word, unless you are lucky enough to have a document into which a user has inserted page breaks. Then maybe.
In general, I would recommend that your best approach is not to do this. Wherever the Word document is being generated, do what you need to do at that step.
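
To illustrate point 2: explicit page breaks are the one kind of boundary that is stored in the file. The sketch below is in Python rather than Java and assumes the newer .docx format, where the body is the XML part `word/document.xml` and a manual page break is a `w:br` element with `w:type="page"`. The function name `split_on_page_breaks` is hypothetical. Pages created by text simply flowing onto the next sheet are not detected, which is exactly the difficulty described above.

```python
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used by word/document.xml in a .docx
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"

def split_on_page_breaks(document_xml):
    """Group paragraph text into pages, splitting only at explicit page breaks."""
    root = ET.fromstring(document_xml)
    pages = [[]]
    for para in root.iter(f"{{{W}}}p"):
        text = ""
        for node in para.iter():
            if node.tag == f"{{{W}}}t":
                text += node.text or ""
            elif node.tag == f"{{{W}}}br" and node.get(f"{{{W}}}type") == "page":
                # Text before the break stays on the current page
                if text:
                    pages[-1].append(text)
                text = ""
                pages.append([])
        if text:
            pages[-1].append(text)
    return pages
```

For a real file, the XML could be read with `zipfile.ZipFile(path).read("word/document.xml")` and passed straight to the function. Paragraphs that use the "page break before" paragraph setting are not handled in this sketch.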

Export and split mail merge to Word Documents

Hi there,
I have a mail merge which creates one very long Pages document. I need to split the pages document into separate MS Word documents. Any suggestions?

I should specify that I am dealing with a large number of documents and am looking for an automated solution. Essentially I want to do a mail merge but end up with individual MS Word documents for each person in the mail merge.
So if I have 3 people:
Alex
Bob
Cathy
And I have one document which gets mail merged, I wanted to end up with
Alex.doc
Bob.doc
Cathy.doc

Acrobat XI Pro Won't Convert 206 Page Word Document into PDF

Hi there
As mentioned above, Acrobat XI Pro won't convert my 206-page Word document into a PDF. The Word document was originally a PDF file that I converted to Word, and the conversion has split all the text into sections.
It sounds like converting a PDF into Word isn't the best way to edit, re-format and then save as a PDF again. I would love to hear your advice on this.
Thanks very much for your help!
Fiona

First, before you recreate the PDF from the Word document:
In Word, open the document.
Next, open a new blank document.
Switch back to Word and click on the ¶ button.
Scroll (or go) to the very end of the document.
Click just to the right of the period in the last sentence.
Now go to the very beginning of the document.
Hold down Shift and click to the right of the first letter in the document.
Now choose Copy.
Now switch to the blank document.
Choose Paste Special.
Now choose Text only.
If it works, all the words will be there, spaced correctly but with no ¶'s.
Now insert returns as desired.
Now save as a .docx file under a different name.
If you are on a Mac, use the following directions:
Go to File menu > Print > PDF. Hold down the PDF button until the context menu pops up.
Choose Adobe PDF.
Follow the steps when the first window opens.
Save as PDF in the desired location.
Now open the PDF in Acrobat. The document should be properly formatted and ready to go.
As you've found, the conversion is not seamless. Acrobat doesn't distinguish between automatic end-of-line breaks and returns, and you have to put the pieces back together again. I wish Adobe and MS would get over their jealousy of each other and share how their code works so that applications could work seamlessly together. But they never will.
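
If the text has already been exported as plain text, putting the pieces back together can be partly automated. This is a rough sketch only, assuming that a blank line marks a real paragraph break and every other line ending is just a wrap; the function name `rejoin_lines` is hypothetical, and a converted document may need a different rule.

```python
def rejoin_lines(text):
    """Merge hard-wrapped lines into paragraphs; blank lines separate paragraphs."""
    paragraphs = []
    current = []
    for line in text.splitlines():
        stripped = line.strip()
        if stripped:
            current.append(stripped)
        elif current:
            # Blank line: close the paragraph collected so far
            paragraphs.append(" ".join(current))
            current = []
    if current:
        paragraphs.append(" ".join(current))
    return paragraphs
```

The result still needs a read-through, since a conversion that drops blank lines entirely will collapse everything into one paragraph.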

How can I display the date a word document was last modified on my web page?

I have lists of links to word documents on my web pages. I
want to be able to show the date the word document was last
modified next to each link. My pages are straight html.
Thanks.

>but I would have to go to each individual file (I have
hundreds) and put this code on each link.
No, you can use the FileSystemObject to get a list of the
files in the directory. See if you can adapt this for your site:
<link href="/CSS/dirlist.css" rel="stylesheet"
type="text/css">
<%
Function FormatSize(SizeInBytes)
    ' Format a byte count as Bytes, KB, or MB
    if SizeInBytes > 1024^2 then
        FormatSize = formatnumber(SizeInBytes / 1024^2,2) & " MB"
    elseif SizeInBytes > 1024 then
        FormatSize = formatnumber(SizeInBytes / 1024,2) & " KB"
    else
        FormatSize = SizeInBytes & " Bytes"
    end if
end function
%>
<%
dim parts
dim part
dim BrowseLine
dim RelLink
dim i
' Build a breadcrumb line from the folders in the current URL
url = Request.ServerVariables("URL")
Parts = split(URL,"/")
for part = 1 to ubound(Parts)-1
    ' calc link
    RelLink = ""
    for i = 1 to ubound(Parts) - part - 1
        RelLink = RelLink & "../"
    next
    If trim(BrowseLine) <> "" then
        BrowseLine = BrowseLine & "/"
    end if
    BrowseLine = BrowseLine & "<A href='" & RelLink & "'>" & parts(part) & "</A>"
next
' add "back" link
BrowseLine = Browseline & "  " & "<A href='javascript:history.back();'><img src='/images/return.gif' border=0></A>"
Response.Write "<P class = 'NavTop'>" & BrowseLine & "</P>"
Dim fso
set fso = server.CreateObject("Scripting.fileSystemObject")
dim fldr
dim fle
%>
<a href="../index.asp"><img src="../images/leftbanner.jpg" width="343" height="70" border="0"></a>
<link href="/CSS/dirlist.css" rel="stylesheet" type="text/css">
</p>
<p></p>
<table width="85%" border="0" cellspacing="0" cellpadding="0">
<tr>
<th width="63%"><div align="left">   Document Name</div></th>
<th width="14%">Size</th>
<th width="23%"><div align="left">  Date</div></th>
</tr>
<%
' display folders
for each fldr in fso.GetFolder(server.MapPath(".")).SubFolders
%>
<tr>
<td class="FileName"> <a href='<%= fldr.Name %>'><img src="/images/folder.gif" border=0> <%= fldr.Name %></a></td>
<td class="FileSize">  <%= FormatSize(fldr.Size) %></td>
<td class="FileDate"><div align="left">  <%= FormatDateTime(fldr.DateLastModified,2) %></div></td>
</tr>
<%
next
%>
<%
' display files, skipping the script and source-control files
for each fle in fso.GetFolder(server.MapPath(".")).Files
    dim ext
    ext = fso.GetExtensionName(fle.path)
    if (ext <> "asp") and (ext <> "scc") then
%>
<tr>
<td class="FileName"> <a href='<%= escape(fle.Name) %>'><img src="/images/file.gif" width="14" height="16" border=0> <%= fle.Name %></a></td>
<td class="FileSize">  <%= FormatSize(fle.Size) %></td>
<td class="FileDate"><div align="left">  <%= FormatDateTime(fle.DateLastModified,2) %></div></td>
</tr>
<%
    end if
next
Set fso = Nothing
Set fle = Nothing
Set fldr = Nothing
%>
</table>
<p><a href="/index.asp">Return to Home Page</a></p>
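
For readers not on classic ASP, the same idea (walk the folder, skip the script files, report size and last-modified date) can be sketched in Python. This is an illustration under assumptions, not a port of the page above: `format_size` and `list_documents` are hypothetical names, the date format is arbitrary, and the rows would still need to be written into your HTML by whatever generates the pages.

```python
import os
from datetime import datetime

def format_size(size_in_bytes):
    """Format a byte count as Bytes, KB, or MB."""
    if size_in_bytes > 1024 ** 2:
        return f"{size_in_bytes / 1024 ** 2:.2f} MB"
    if size_in_bytes > 1024:
        return f"{size_in_bytes / 1024:.2f} KB"
    return f"{size_in_bytes} Bytes"

def list_documents(folder, skip_exts=("asp", "scc")):
    """Return (name, size, last-modified date) for each file in folder."""
    rows = []
    for entry in sorted(os.scandir(folder), key=lambda e: e.name):
        if not entry.is_file():
            continue
        ext = os.path.splitext(entry.name)[1].lstrip(".").lower()
        if ext in skip_exts:
            continue  # same filter as the ASP page: hide script files
        info = entry.stat()
        modified = datetime.fromtimestamp(info.st_mtime).strftime("%Y-%m-%d")
        rows.append((entry.name, format_size(info.st_size), modified))
    return rows
```

Either way, the date comes from the file system rather than from the Word document itself, so copying or re-uploading a file changes the date shown.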

Multi-Section Word Document Won't Save as Single pdf

I'm using MS Word 2004 and have a document split into 3 sections (to permit landscape printing of the middle section, which contains a bunch of big tables, with portrait for the two other sections, before and after, which contain mostly text).
When I try to create a pdf within the Word Print Dialog, 3 separate pdfs are created -- one for each section. Is there a way to get it to create just 1 pdf?
I don't do this sort of thing very often, so I hope there is some easy, inexpensive way. I thought about copying the tables, pasting them as a picture, and rotating it, but the margins for the middle section are also different (to fit the tables).
Any help or ideas are most appreciated.
Mac OS X (10.3.6)

Is this a Word document generated by Adobe's RoboHelp product or just a document generated directly in Word?
If the latter, you are asking in the wrong place. Even if you are using RoboHelp, this looks more like a question to ask in the Acrobat forums.
http://forums.adobe.com/community/acrobat
See www.grainge.org for RoboHelp and Authoring tips
@petergrainge

Inserting Multiple Page PDF in Word Document

I've been searching for an answer to what I think ought to be a rather simple question, and I can find nothing. I have a couple of users who have attempted to insert a multiple-page PDF into a Word document and found that only the first page of the PDF shows up in their document. Is there any method to insert multiple-page PDFs into Word as multiple pages without splitting the PDF into individual single-page files and inserting them individually?
Thanks...Ray

A quick and dirty way to do this is to open the pdf document in question with Word separately. You get a warning message.
I can then select all and copy/paste into my Word document.
The few pdfs I tried this with pretty much turned into all images when I did this, but it worked for my purposes.
I did end up having to resize one that didn't seem to want to fit when I pasted it in. There may be a method to select an image and automatically fit it within the margins of the page it is on that would help with that.
The file size of my resulting Word document seemed to grow quite a bit as well. Again, not really an issue in my case though.

Importing the word document in RoboHelp

Hi,
I am importing a Word document into RoboHelp 7 HTML. But after the import is completed, the entire Word document is converted into a single HTML page. My requirement is that each chapter should be created as a separate HTML file based on the heading level. This will help in creating the book based on the heading level. I was able to do this successfully with RoboHelp X5, but not with any higher version!

Hi there
The key to this is to ensure the Word document has been styled consistently with headings and other formatting. If it's one of the garden variety where folks just painted across text all willy nilly you will indeed see mixed results.
Pay careful attention to the intermediate dialogs you see when importing the document. The second one should offer a way to split on style.
Cheers... Rick

How to Protect a Word Table in a Word Document without Protecting the Word Document

Is there any way to protect a Word table alone without protecting the entire Word document?
My requirement is to protect the table from editing while other default Word features, such as Track Changes and Page Orientation, remain enabled. While the document is protected, I can't accept or reject tracked changes because the menu options are disabled.
Any kind of help is much appreciated!
Thanks.

Hello,
This forum is for discussions and questions regarding profiles and Microsoft's recognition system on the MSDN and TechNet sites. It is not for products/technologies.
As it's off-topic here, I am moving the question to the "Where is the forum for..." forum.
Karl

Save as a word document or rtf

Hello
I have a problem that is driving me crazy. I have a client who has a book in pdf form. He needs to do some editing, but he doesn't have Adobe Acrobat, nor can he afford it. So I saved the pdf as a Word document. The problem is that the paragraph formatting changes for the last paragraph on a page, and consequently the last line in that paragraph is indented on the left by 2 or 3 character spaces. When checking the formatting in Word, the last line seems to have been split off to form a new paragraph. Any help fixing this would be appreciated.

The quality of PDF export is dictated by the quality of the PDF. This is the "build" or under-the-hood quality and not what you "see".
(Perception is not reality).
The export reflects the input - GIGO is operative.
You'll have to do clean up of the Word file(s).
Be well...

HT2506 I am trying to convert word document to a PDF but only half the document accurately converts. Is there a way to fix this?

I am trying to convert word document to a PDF but only half the document accurately converts. Is there a way to fix this?

I have found that complicated document structures cause Word to split the document into parts, but I've always gotten the whole thing, in multiple documents.
However, with two pages, I can't imagine it is that complicated.
Any odd fonts or paragraph styles?

Can't print online Word document

I am having trouble printing an online word document. I have an HP C6380.
When I click print the print queue shows as spooling but nothing ever prints.
Can anyone help?
Thanks.

Hi,
Based on your description, I have summarized the situation as follows:
1. Many domain accounts can print from IE 10 to OneNote, but not from IE 8.
2. When a domain account prints to OneNote, it hangs.
3. Word works well if run as administrator.
From the above, I think the printer settings are OK, because you could print from OneNote to the physical printer.
As far as I know, OneNote stores a raster and a vector format for the printout images. It first loads the raster image (quicker to load), then quickly switches to the vector image (hi-res/sharper). The image you see for a split second is the raster image, and then the vector format fails to render. I believe you did not lose any data; it just fails to render due to a configuration issue.
OneNote uses .NET to render the vector format, and I have seen cases where the installed .NET version, or missing .NET updates, caused a problem. I recommend you first make sure Win7 SP1 and all Office and .NET updates that show up in Windows Update are installed, then retry.
Then please check this link: http://www.onenote-tips.com/printfails.html. It may be helpful.
If the issue persists, please use Process Monitor to collect a log when the issue occurs.
Regards,
George Zhao
TechNet Community Support

Linking a Word Document

Hi Guys,
Apologies if this is really stupid question, but I'm having problems I can't sort out. I am trying to educate project managers who report in MS Word and store data in MS Excel - because they like colours and can manipulate simple Excel features.
My intermediate solution was to interrogate the Excel data in Crystal Reports ver 10 (or BusinessObjects Enterprise XI) and link the basic report, a Word document (as the report header). The idea being that project managers could update the Word document and continue to use Excel - and Crystal would tidily put the two together.
However - no probs with Excel data, but if I link the Word document into the header (or any other report section) I get only the first table of the document! The Word document is a series of tables, one of which spans several pages, with the tables formatted to allow rows to split over a page break. I can link the individual tables - but the version produced always stops at a page break.
All ideas very gratefully received - either to solve my problem or suggest a better way of working completely.
Barrie

Only the 1st page will be displayed. This is by design. For possible work-arounds see the following KBases:
[1211717 - Crystal Reports only displays first page of multiple-page OLE object|http://www.sdn.sap.com/irj/scn/go/portal/prtroot/docs/oss_notes_boj/sdn_oss_boj_erq/sap(bD1lbiZjPTAwMQ==)/bc/bsp/spn/scn_bosap/notes%7b6163636573733d36393736354636443646363436353344333933393338323636393736354637333631373036453646373436353733354636453735364436323635373233443330333033303331333233313331333733313337%7d.do]
[1202196 - How to import Microsoft Word documents into Crystal Reports|http://www.sdn.sap.com/irj/scn/go/portal/prtroot/docs/oss_notes_boj/sdn_oss_boj_erq/sap(bD1lbiZjPTAwMQ==)/bc/bsp/spn/scn_bosap/notes%7b6163636573733d36393736354636443646363436353344333933393338323636393736354637333631373036453646373436353733354636453735364436323635373233443330333033303331333233303332333133393336%7d.do].
Ludek

Infopath Multi Textbox and OpenXML to generate word document

Hi,
I'm reading multiple InfoPath text boxes (set to display multiple lines) using OpenXML and generating a Word document based on a template, collecting data from InfoPath. I'm using the following code. The code gives an error in AddMultiLineText at Paragraph p = sdt.GetFirstChild<SdtContentBlock>().GetFirstChild<Paragraph>();
The code actually works when I use only one multi-line text box but fails if there is more than one. I'm quite new to InfoPath and OpenXML; I would appreciate some help ASAP as it's holding me up in my project:
Event Receiver
public override void ItemAdded(SPItemEventProperties properties)
{
    try
    {
        this.EventFiringEnabled = false;
        string siteUrl = properties.WebUrl;
        SPListItem item = properties.ListItem;
        SPWeb web = properties.Web;
        SPList lib = web.Lists["DocLib"];
        SPFile file = lib.RootFolder.Files["template.docx"];
        string generatedDoc = "";
        if (file != null)
        {
            byte[] templateBytes = file.OpenBinary();
            using (MemoryStream ms = new MemoryStream())
            {
                ms.Write(templateBytes, 0, (int)templateBytes.Length);
                byte[] convertedDocBytes = ConvertInfoPathToWord(ms, item.File);
                generatedDoc = item["LinkFilename"].ToString().Replace("xml", "docx");
                SPFile newFile =
                  lib.RootFolder.Files.Add(generatedDoc, convertedDocBytes, true);
                ms.Close();
            }
        }
    }
    finally
    {
        // Assumed: the end of the try block was not included in the post
        this.EventFiringEnabled = true;
    }
}

private byte[] ConvertInfoPathToWord(MemoryStream ms, SPFile file)
{
    byte[] bytes = file.OpenBinary();
    using (MemoryStream msInternal = new MemoryStream(bytes))
    {
        XmlDocument doc = new XmlDocument();
        doc.Load(msInternal);
        XPathNavigator root = doc.CreateNavigator();
        root.MoveToFollowing(XPathNodeType.Element);
        string ns = root.GetNamespace("my");
        XmlNamespaceManager nsMgr = new XmlNamespaceManager(new NameTable());
        nsMgr.AddNamespace("my", ns);
        string multiLineText1 = root.SelectSingleNode(
          "/my:myFields/my:field1", nsMgr).Value;
        string multiLineText2 = root.SelectSingleNode(
          "/my:myFields/my:field2", nsMgr).Value;
        using (WordprocessingDocument myDoc =
          WordprocessingDocument.Open(ms, true))
        {
            MainDocumentPart mainPart = myDoc.MainDocumentPart;
            List<OpenXmlElement> sdtList = InfoPathToWord.GetContentControl(
              mainPart.Document, "multilinetext1");
            InfoPathToWord.AddMultiLineText(multiLineText1, ref sdtList);
            // Reuse the variable; declaring sdtList a second time does not compile
            sdtList = InfoPathToWord.GetContentControl(
              mainPart.Document, "multilinetext2");
            InfoPathToWord.AddMultiLineText(multiLineText2, ref sdtList);
            myDoc.Close();
        }
        msInternal.Close();
    }
    return ms.ToArray();
}

public static List<OpenXmlElement> GetContentControl(
  Document doc, string name)
{
    List<OpenXmlElement> list = new List<OpenXmlElement>();
    List<SdtBlock> sdtList = doc.Descendants<SdtBlock>()
      .Where(s => name.Contains(s.SdtProperties.GetFirstChild<SdtAlias>()
      .Val.Value)).ToList();
    if (sdtList.Count == 0)
    {
        List<SdtRun> sdtRunList = doc.Descendants<SdtRun>()
          .Where(s => name.Contains(s.SdtProperties.GetFirstChild<SdtAlias>()
          .Val.Value)).ToList();
        foreach (SdtRun sdt in sdtRunList)
            list.Add(sdt);
    }
    else
    {
        foreach (SdtBlock sdt in sdtList)
            list.Add(sdt);
    }
    return list;
}

public static void AddMultiLineText(
  string multiLineText, ref List<OpenXmlElement> sdtList)
{
    string[] lines = multiLineText.Split(new char[] { '\n' });
    if (sdtList.Count != 0)
    {
        foreach (OpenXmlElement sdt in sdtList)
        {
            for (int i = 0; i < lines.Length; i++)
            {
                Paragraph p =
                  sdt.GetFirstChild<SdtContentBlock>().GetFirstChild<Paragraph>(); //this ERRORS
                if (i == 0)
                {
                    InfoPathToWord.WriteText(lines[i], ref p);
                }
                else
                {
                    Paragraph pNext = sdt.AppendChild((Paragraph)p.Clone());
                    InfoPathToWord.WriteText(lines[i], ref pNext);
                }
            }
        }
    }
}

Additional Info: multiLineText1 and multiLineText2 are "Plain Text Content Control" in the word template used to create the word document.

Can JAVA read Microsoft word document?

Hi
I am trying to write a program that reads a Word document, splits it into individual pages, and saves each page as a file named after that page's footer.
--- example-----
test.doc
Page 1 |
|
|
|
|
|
footer - hi |
Page 2 |
|
|
|
|
|
footer - test |
Page 3 |
|
|
|
|
|
footer - still |
Page 4 |
|
|
|
|
|
footer - hahaha |
Page 5 |
|
|
|
|
|
footer - lastpage |
-----------------------> Output of the program
hi.doc , test.doc, still.doc, hahaha.doc, lastpage.doc
Is there any way to import word document and process like that above?
I have searched for this and found that 'Jakarta POI' can access Word documents, but with many restrictions. (I heard it doesn't support Word versions 2003 and 97.)
Can any one please answer this?
Thankx

I'm not really sure that java is the best language to use for this particular project. I haven't looked into writing a standalone application to do this sort of thing, but Microsoft Word has a built in VB editor that allows you to write scripts that will do this very simply, using Word's Document Object Model.
I would imagine the same sort of thing could be done from a stand-alone application as well, but you might need some libraries for reading the document and obtaining an object with the same model. I would look into Microsoft's .NET solutions for this one. You might even be able to work out a solution in C++, C#, or even J# (which I understand is "similar" to Java), if you don't know Visual Basic.
It is possible that there is a library out there that will read a Word Document, and create the appropriate Java object with the same object model, but I don't know where it can be found.
Anyway, if the option is available to you, take a look at Microsoft's .Net. You'll probably have more luck that way. Good Luck!
