Possible to compare Word 2007 XML documents?

I am trying to compare two word 2007 XML documents (.doc) using XMLUnit but I keep getting an assertion fail on the CDATA fields:
org.custommonkey.xmlunit.Diff [different] Expected text value '28/05/2009 22.50' but was '27/05/2009 15.02'This are the corresponding parts of the word xml files that differ:
     <w:t><![CDATA[28/05/2009 22.50]]></w:t>
<w:t><![CDATA[27/05/2009 15.02]]></w:t>
I have tried to enable:
    XMLUnit.setIgnoreDiffBetweenTextAndCDATA(true);But it still treats the sections as differences. When I compare the two files I have tried:
// test 1
    InputSource xml1 = new InputSource("c:\\test\\actual.xml");
    InputSource xml2 = new InputSource("c:\\test\\expected.xml");
    Diff myDiff = new Diff(xml1, xml2);
     myDiff.overrideElementQualifier(new RecursiveElementNameAndTextQualifier() );
     assertXMLEqual("Not similar", myDiff, true);
// test 2
    assertXMLEqual(expectedOutPutFileAsString, actualOutPutFileAsString);But the assertion fails in both cases.
Is it even possible to compare two word 2007 files like this?

Telling someone that they can convert a Powerpoint to PDF and send that to their phone is NOT a "third party hack."
Which BTW is what you want to do because it will preserve all the pagination, orientation, resolution, etc, all in one nice tidy file. And, (if you email it to your phone through an IMAP enabled mail server like GMail) you can then forward the PDF file directly from the iPhone to someone else.

Similar Messages

  • Word 2007 XML Format

    Good Afternoon,
    I am playing around with VBA and I came across something that I tried doing some time ago. Add a custom XML file into the Word 2007 XML Format and then access it programmatically.
    For instance; I have a list that I would like to load into a dropdown ribbon control. If I can create a small XML file with this information then I could read the information from the XML file and load it into the control.
    Hope I am being clear.
    Gus
    Gustavo Miller

    Hi Gus,
    Do you mean you want to use VBA code to add a DropDownList Ribbon control for the word document and populate some items from a custom XML file?
    As far as I know, you can't modify the custom ui xml for Word document in VBA code. I recommend that you move to VSTO Word Add-In, use managed add-in project to create/read/write custom xml file, add DropDownList control to the Ribbon and edit the items.
    https://msdn.microsoft.com/en-us/library/bb386097.aspx
    Or use CustomUIEditor to manually add Ribbon controls for the Word document:
    https://msdn.microsoft.com/en-us/library/cc508991(office.11).aspx#UsingtheCustomUIEditor2_AddingTemplatestotheCustomUIEditor
    By the way, to manipulate XML files, these documents should help:
    A Beginner's Guide to the XML DOM
    Editing Data in an Excel 2007 Open XML File with VBA
    We are trying to better understand customer views on social support experience, so your participation in this interview project would be greatly appreciated if you have time. Thanks for helping make community forums a great place.
    Click
    HERE to participate the survey.

  • Can't open Word 2007 (.docx) document on my Mac

    I have Microsoft Office 2003 (which I purchased) on my Mac. I am unable to open a document that was emailed to me - it is a .docx file (Word 2007 on a PC). What is going on? Is there an update/download I need? I called Apple Support and they were totally stumped. What have those idiots at Microsoft done now? Are you telling me a document created in Word 2007 can't be opened in Word 2003? That is absurd. Is there even an update for Mac users who use Office 2003?

    Please search the discussions before posting, as this question has been asked and answered many times already, and you could have answered your own question and saved you the time of waiting for a response.
    Until Microsoft releases the file converters for Office 2004, you'll need to tell whoever saved the file to re-save it in an Office 2003 or older format.

  • I blog on TypePad using Word 2007 and documents freeze when I'm looking at them in Windows explorer; why do I get the error message that Mozilla is looking for a script?

    I have now had two error messages - the first one looked for this script: Script: resource://gre/modules/XPCOMUtils.jsm:258. The second one (today) looked for Script: resource://gre/components/nsPrompter.js:68.
    I have reset my word template, also reset word registry for data and options (as per Microsoft's help pages) for both those 'fixes' there was an initial improvement, but the problem happened again after I reposted on my blog today and then went to Windows explorer to view my files. It seems to 'seize' as I select a file, so that the preview does not appear (for up to 15-20mins).
    I see on the internet using the second script as a search just now that one other has had a similar problem and one also since Mozilla 4 upgrade. My problems have also been pretty much since I upgraded.
    It happens (apart from my fixes) every time I go to look at a word document via Windows explorer. Excel files are less affected.
    One web suggestion is to clear the cache, but I am hesitant to lose all my passwords unless I have to.

    I still have the problem although I uninstalled Mozilla (clearing the cache didn't work and neither did Word diagnostics), so have ended up thinking it could be Norton (the manual update worked x1) or Windows. Today I found an answer: disable the preview pane in Windows Explorer - there's a history of problems with this view.

  • Word 2007 Corrupt Document Recovery

    Hi All,
    I might be a little off topic in this category but am hoping you can help or point me in the right direction, i have a corrupt .doc document, basically it was a doc i resumed working on which was launched off a usb hdd, midway through the document i accidentally unplugged the hdd.
    After plugging it back in i tried to relaunch the document in the hope of just starting from where i originally resumed, however the text has now converted into all sorts of code which cannot be translated into English, i tried all the other MS docs explaining how to launch and repair the document to no avail,
    Can anyone please point me in the right direction? =(
    Thanks in advance.
    Ovi.

    Did you use MS Word's "Open and Repair" function to open those corrupted .docx files? If files are really very crucial and severely corrupted, then you have to buy a third-party application. This is the only way to recover those files as there's
    no freeware, as far as I think. I have also faced the similar situation of corruption in some of Word .docx files. I tried
    Word Repair Toolbox software to repair my corrupt .docx files.
    You may know more at
    http://www.word.repairtoolbox.com/

  • Word 2007 document presented without editing notes in wors 2013 pro plus

    hi,
    A word 2007 edited document which contains editing notes such as specific words in a sentance that are crossed and then the editor inserted his or her words to replace the words that were crossed.
    when trying to open the document in word 2007 all is well and the edited document appears with all the editing notes and replaced words.
    when opening the same document in word 2013 olny the replaced words appear in each place with no sign on the crossed out words.
    is there any way to make word 2013 display the document as in Word 2007 (i.e. with both the crossed words and the words that are written right after those crossed words)?
    THX

    Hi,
    I'm marking the reply as answer as there has been no update for a couple of days.
    If you come back to find it doesn't work for you, please reply to us and unmark the answer.
    Thanks
    George Zhao
    Forum Support
    Come back and mark the replies as answers if they help and unmark them if they provide no help.
    If you have any feedback on our support, please click "[email protected]"

  • Word 2007: Error when saving file that contains equations

    Using Microsoft Word 2007, version 12.0.6425.1000 with Service Pack 2
    I am a High School Math teacher, and when working on a document that contains a number of math equations. When I try to save the document, Word refuses to save. It displays the following error: "A file error has occurred. <filename>. OK",
    and I am unable to save the file. 
    Reading the forum, I have found this to be a very common issue going back to 2009, but I have not seen a definitive solution or patch. 
    I have found a workaround:
    Creat a new document
    Copy and paste from the corrupted doc each page, saving the new document after each page
    Until I hit the error, then I know the page that has the issue, and can usually narrow it down to a specific math equation. Nothing fancy just some basic math. 
    Manually rewrite the equation in the new document and it will successfully save.
    But the issue has been occurring more and more frequently now and it is becoming very frustrating. Any help would be greatly appreciated! 

    There has been a long time known issue with the equation editor.  it causes document corruption by "losing" XML tags.
    Apparently fixes for some causes of these problems have been rolled out in Windows updates. But the problem still exists.  You've got it easy. A lot of people running into this problem are students working on documents, like a thesis, that is on a short
    deadline ...
    One identified cause of these problems is editing equations
    after you created them.  DON'T DO THAT! 
    If you have to change an existing equation, create a new correct one and delete the old one.
    Simpler fix. Don't use Word (or Office) for creating equations.  Find some other product, like a pencil.
    Over in the "Answers" forum there are a couple volunteers, Tony Jollans and Jeeped
    , who have been manually fixing this type of error if you post a new question or add a reply to one of the ongoing discussions on this issue.
    Sorry.
    If you want to invest the time to figure out what your error looks like you may be able to come up with a fix other than copy /paste method.
    Can't open Word File because of end tag/start tag mismatch error... XML Tag 
    - XML Error – Fix It tool - “The name in the end tag of the element must match the element type in the start tag”
    This error is caused when Word either “forgets” to write an XML tag, or writes them in the wrong order.
    Tony Jolans was the first person that I heard of with home made tool to fix the problem. Now MS has released a Fix It for one specific variation of the problem.
    If the tools don’t fix your problem, the file will have to be fixed manually, repairing the tag order.
    The Fix It article notes that the document is still in a fragile state. You have to do some addition fixing to avoid repeats of the problem.
    https://blogs.technet.com/b/wordonenotesupport/archive/2011/03/24/error-when-opening-a-word-2007-or-2010-document.aspx
    http://support.microsoft.com/kb/2528942- FIX IT
    This fix it will work for one specific tag error where there are equations and graphics in the same paragraph AND Office 2010 SP1 has not been applied.
    Preventative suggestions
    <snip Jeeped>
    I don't think that anyone can completely stop editing equations, but pre-planning them should avoid unnecessary edits.
    While I have no concrete, reproducible evidence that editing equations is a cause, I have made these empirical observations:
    I cannot state precisely what many of the DOCX XML tags do, but basic XML syntax rules would suggest that code like:                   <m:oMath><m:e><m:ctrlPr/></m:e></m:oMath>
    ... does nothing at all since it closes everything it opens and offers no content. It looks to me that this is a result of deleting one or more characters from an equation.
    While Word 2010 reports these as a problem at:                  Line: 2  Column: 0 ... Word 2007 will still report the actual position, e.g.:                
     Line: 2  Column: 2726981 I keep a copy of Word 2007 side-by-side Word 2010 for no other purpose. It's the only Office 2007 program on my computer. This seems like it is actually a step backwards from a resolution since Word 2010 no longer seems
    to be able to parse its own error.
    Whether the syntax is truly useless and non-effective for any intent or purpose is actually beside the point. The syntax passes conformity tests and the DOCX
    should launch. Why an 85 page document is 'broken' due to a few empty XML formatting tags while retaining legal syntax structure is beyond me.
    I try to pass along the area that the problem was in and several times people have remarked that the area I report was the place they were last making modifications/deletions to equations (not additions) when the no-load corruption
    occurred.
    When no indication from the OP is offered on what was worked on last, the Line: 2 Column: 0 corruption often comes within a formula at the very end of the unfinished Word/Document.xml file and it may be inferred
    that this was the last place being worked upon.
    The corrupt DOCX files I've worked on are very commonly at the last stages of development with a large complex document. While there couldn't be a worse time for a corruption to appear, it would seem that small edits to existing content are causing
    it and not large scale new content generation.
    Not one of these points is a definitive 'smoking gun', but put together they seem to indicate
    formula editing and not formula writing as a cause for the Line: 2 Column: 0 corruption. If it looks like a duck and quacks like a duck, it is most often a duck.
    </snip>
    Copy “True Autosave Macros for Office” to this place in reply
    Let me fix it myself
    If you are familiar with editing XML, you can try to fix the problem yourself by correcting the sequence of the mismatched oMath tags in the document. See the following example:
    Incorrect tags:
    <mc:AlternateContent>
    <mc:Choice Requires=”wps”>
    <m:oMath>
    </mc:AlternateContent>
    </m:oMath>
    Correct tags:
    <m:oMath>
    <mc:AlternateContent>
    <mc:Choice Requires=”wps”>
    </mc:AlternateContent>
    </m:oMath>
    Note: You will have to use an application such as Notepad to edit the XML.
    Manual Technique
    <snip>  A DOCX document is actually a .ZIP file that contains many internal components. There is an internal folder called word which will always contain a document.xml file. This file is the basis of the document's layout
    and content.
    Assuming that the DOCX archive structure has not been corrupted, it can be opened in an archive utility. I use
    WinRAR for this. Once opened in
    WinRAR, you can drill down into the
    word folder and see the document.xml file. I use
    Notepad++ as a text editing tool and have the .xml file extension associated with this program so i can simply double-click
    document.xml in WinRAR to open an editing session.
    Notepad++ has cursor positioning in the right-hand side of the status bar and finding the position of the error (supplied by a failed open in Word) is pretty straightforward. I look for empty formatting
    tags first and only remove content if removing empty tags does not allow the document to be opened. I try to note existing content in the area that I make modifications in order that I can supply position reference information to the owner of the DOCX.
    When editing, I assume the philosophy that kess is more. My target is to get the document to open in Word with as little modification as possible and let the original owner of the DOCX make any necessary adjustments.
    I should mention that after making a change to word/document.xml you need to save it in
    Notepad++ then go to
    WinRAR and acknowledge that you want to update the file in the DOCX archive. Once the archive is updated, you can attempt to open the DOCX in Word to see if your efforts are successful.
    WinRAR                    
    Notepad++                 
    </snip>
    Further Fixes
    The Fix it solution in this article should let you recover your Word document. However, the symptoms will reappear when you make any further edits to the document unless the core problem in the structure of the document is resolved.
    To try to correct the core problem, follow one of these workarounds:
    Install Office 2010 Service Pack 1
    Office 2010 Service Pack 1 resolves this issue for new files. It will also prevent the problem from recurring with any files that were recovered with the Fix it solution in this article.
    To download Office 2010 Service Pack 1, follow the steps provided in this Microsoft knowledge base article:
    2460049 - Description of Office 2010 SP1
    Grouping Objects
    The steps provided work best under Word 2010:
    After you open the recovered document, turn on the Selection pane. This can be found in the
    Home tab of the ribbon. The editing group of the
    Home tab has a dropdown button named Select.
    Click the Select button, and then click
    Selection Pane...
    Press the Ctrl button on your keyboard and then click each text box in the selection pane.
    Click the Group button under the Format tab. This will group all the objects together.
    As soon as you have all objects grouped on each page, save the document under a new name.
    Save the document in the .RTF file format
    The steps provided work for both Word 2007 and Word 2010:
    After you open the recovered document, click File and select
    Save (for Word 2007 click the Office button and select
    Save As)
    In the Save As dialog box, click "Save as type:" dropdown and select
    Rich Text format (*.rtf).
    Click Save.
    Click to view this
    blog for more information about this issue.
    Bonus tip: Win7 Win8 Math Equation Input Panel / Math input Panel
    http://www.lytebyte.com/2009/07/24/guide-to-math-input-panel-in-windows-7/
    http://www.7tutorials.com/windows-7s-tablet-input-panel-text-entry-and-handwriting-recognition
    http://www.7tutorials.com/training-tablet-input-panel-work-even-better
    http://www.7tutorials.com/do-math-easy-way-math-input-panel
    Not an answer to the problem, just a bonus that may make it easier to input formula’s in Win7.
    Here’s another one of those didn’t-know-it-existed-until-I-clicked-it-by-accident tools in Win7. It’s called the
    Math Input Panel.
    To access it, simply click Start, and in the Search Box that appears above, type in
    Math Input Panel.
    The Window should look like this:
    Let the fine folks at Microsoft explain what it’s used for:
    “Math Input Panel uses the math recognizer that’s built into Win7 to recognize handwritten math expressions. You can then insert the recognized math into a word-processing or computational program. “
    Tony Jolan’s Automatic Fix
    Download  http://www.wordarticles.com/temp/Rebuilder.dotm Microsoft Office Word Macro-Enabled Template (.dotm) and open it.
    Click Options button on the Security warning and select Enable this content.       
    Click the Broken Documents tab at the far right of the ribbon.      
    Click the Rebuild button in the left-hand side      
    Locate and open your corrupt document in the file open dialog.
    That's it. The process will repair your document if possible and create a new document with (Rebuilt)
    appended to the filename. Be patient as it may take a few minutes. If a repair is not possible, you can then post to a public file area and someone here can attempt a manual repair.
    Manual Fix
    XML Maker V1.1 is free. It will allow you to open the document.xml file and edit it. It also marks errors and warnings. 
    I just didn’t have much luck working with it.
    A poster used XML Maker V2.1 (US$125, 30day free trial, enough for average person to fix a file)
    Notepad ++ is a good, free editor for this type of task
    Make a copy of the file
    Rename the copy from DOCX to ZIP
    Open … .ZIP/word/document.xml in notepad
    Copy the contents of the file to clipboard
    Open Word
    Paste a copy of the copied XML into Word
    (optional) the XML is one long string too hard to read, you can replace some tags, with that tag plus a para mark to break up the text to make it more people readable.
    Open an XML validator, ie this site on the internet:
    http://www.w3schools.com/xml/xml_validator.asp
    Paste another copy of the XML into the “Syntax Check Your XML” input window
    Click on “validate” button
    Copy the missing tag, ie </mc:Fallback>  (yours will be different)
    Return to word Find: mc:Fallback>  (without the </ so you find both open and closing tags). 
    Repeat find until you hit 2 open tags in a row.  Then you just have to figure out where to put the closing tag between them. 
    Look for other tags before and after a proper closing tag so you can match the problem area to a good area.
    Discussion by many affected people, a couple in discussions are also fixing problem if Tony’s fix doesn’t work
    http://social.answers.microsoft.com/Forums/en-US/wordcreate/thread/581159d0-9ebc-4522-b30c-53e33e8268e1
    Document Recovery
    http://www.wordarticles.com/Shorts/Corruption/Formats.php
    This page has the most readable description of Word file structures, DOC and DOCX, I have seen so far
    The logical structure of a Word 97‑2003 format document is one of a series of elements arranged in a hierarchy, much like a mini file system. As an example, here is the structure of a simple Word 97‑2003 (.doc) format document:
    MyDocument.doc
    1Table
    *CompObj
    Word Document
    *SummaryInformation
    *DocumentSummaryInformation
    The physical structure of the complete file bears little relation to the logical structure; it is, again, of a proprietary design, a compound, or structured storage, file. Briefly, and loosely, the separate logical elements of the file are broken up into
    blocks; these blocks are treated as individual units, which units are then organised without regard for their logical arrangement, and catalogued, catalogue and organisation detail being held alongside the blocks themselves, to enable recombination into logical
    components when necessary.
    Just to give you a flavour, here are some views of three small parts of such a document, viewed in a hex editor:
    Views of a Word 97-2003 format Document
    The logical structure of a Word 2007 format document is one of a series of elements arranged in a hierarchy, much like a mini file system. As an example, here is the structure of a simple Word 2007 (.docx) format:
    MyDocument.docx
    _rels
    rels
    docProps
    app.xml
    core.xml
    word
    _rels
    document.xml.rels
    theme
    theme1.xml
    document.xml
    settings.xml
    fontTable.xml
    webSettings.xml
    styles.xml
    [Content_Types].xml
    As briefly as before, the [Content_Types] file and the _rels folders, along with the subordinate files therein, contain information about the logical structure, and the two files in the docProps folder contain much the same as the two Information files
    in the old format. The document.xml element within the word folder holds the bulk of the document content and the other files within that same folder hold formatting details.
    So, you might say, the internal structure of a document has changed a little, so what? There are, however, other changes that make a bigger difference. The first is that, although both logical formats are conceptually similar, they are wrapped up in
    completely different ways to make a single file. Instead of the proprietary physical structure used for Word 97‑2003 format documents, a fairly standard, and open, Zip Archive format is used for Word 2007 format documents. The second change is that instead
    of using obscure binary codes, everything in Word 2007 format documents, well almost everything, is held in XML format.
    All data held as XML? In a standard Zip Package? It should be much easier to work with, then? Judge for yourself; here are some views of parts of a Word 2007 format document taken from a hex editor:
    Views of a Word 2007 format Document
    FreeFileViewer – reads 100+ text, Office, audio, video format file types – Can open some XML tag error files
    http://www.freefileviewer.com/formats.html
    Can't open Word file due to undeclared prefix, Location: Part: /word/document.xml, Line:91, Column: 49921
    The most likely cause of your particular problem is that you are missing a
    schema prefix reference within the opening <document ...> XML tag (usually the second one). Different
    schema references are required for various types of specialty content. Here is a sample opening <document ...> tag with a large number of various schema prefix codes. If I remove one or two of these, i can reproduce
    your error message.
    <w:document xmlns:ve="http://schemas.openxmlformats.org/markup-compatibility/2006" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:r="http://schemas.openxmlformats.org/officeDocument/2006/relationships" xmlns:m="http://schemas.openxmlformats.org/officeDocument/2006/math"
    xmlns:v="urn:schemas-microsoft-com:vml" xmlns:wp="http://schemas.openxmlformats.org/drawingml/2006/wordprocessingDrawing" xmlns:w10="urn:schemas-microsoft-com:office:word" xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main" xmlns:wne="http://schemas.microsoft.com/office/word/2006/wordml">
    It's impossible to blindly say exactly what you are missing but if you post an unadulterated copy of your document to a public file area (as noted above) and post the location back here, someone may be able to help. If you are going to go this route, send
    a copy that has not already been subjected to repair attempts.
    FWIW, while it may be hard to determine exactly which schema you may be missing, I do not believe that having extras causes any problems.
    BTW, you don't actually have to rename the DOCX file extension if you have an archiving program. I use WinRAR and simply
    <right-click>, Open With... to expose the archive.

  • Can Screen Tips be output as Alt Text from RH HTML to Word 2007?

    I have a large documentation project that is single-sourced in RH HTML to create WebHelp and printed documentation. My expectation is that if I put screen tips in the image properties, these will be converted to alternate text both in WebHelp and printed documentation. WebHelp is fine. I output to Word 2007 because document cleanup is always required. In my Word file, the screen tip is gone, replaced with the document file path. When converted to PDF, that's what appears as alternate text. Is there any way to get this done short of entering the text twice (in RH HTML and again the Word doc)?

    When they upgrade you, RoboHelp will have to be reinstalled and that may fix the problem. You are creating the link using the Rh menu rather Word's menus?
    See www.grainge.org for RoboHelp and Authoring tips
    @petergrainge

  • Navigate XML document as tree

    Hi Otn,
    Can you help me! if is possible to navigate a specifed XML document as editable Tree / table tree using ADF components or using any another JSF component.
    Thanks in advance :)))

    hi user13326790
    "A URL service data control is an Oracle ADF data control for URL services, such as XML files that can be accessed through a URL. ...", see "Use a URL Service Data Control"
    at http://www.oracle.com/technetwork/developer-tools/jdev/ccset10-all-091120.html
    But "H.1 Introduction to Data Controls" in the Developer's Guide for ADF says about the "URL Service Data Control":
    "... This type of data control is not updateable. ..."
    at http://download.oracle.com/docs/cd/E14571_01/web.1111/b31974/appendix_datacontrols.htm#ADFFD22257
    Possibly the other types of Data Controls can be "programmed" to provide updateable access to an XML document, but can't immediately suggest a "preferred" approach.
    (tip : You can use "Your Control Panel" to make your name visible in forum posts.)
    regards
    Jan Vervecken

  • Updating XML document

    After creation of a XML document through DOM is it possible to update the same XML document with new tags
    Thanks
    Sasanka

    s, it is possible to xmldocument with new tags.for example u need to create a new node of requirement and import that node to required document and use replace to update.

  • Corrupted .docx file. Word 2007. Can't open the document. Tags mismatch. Help?

    The Office Open XML file *.docx cannot be opened because there are problems with the contents.
    Details: The name at the end tag of the element must match the element type in the start tag.
    Location: Part: /word/document.xml, Line: 2, Column: 3487212
    Hmm, I guess I should give some background info and what I've tried so far, right?
    The document is in word 2007, Windows 7. Last night, I was in a hurry, and got a lot of things open. I was opening the document to do a few quick spontaneous revisions, but it was so laggy and I was late for an appointment, so I was feeling panicky
    and frustrated. In hindsight, it probably wasn't the best option to just force shut the PC down cause word stopped responding in the middle of opening the document. Still, while I'm no Sherlock, I knew stopping it while it's in the middle of saving the document
    would bebad, I didn't think stopping it while opening the document (and not really modifying the document at all, at least that was what I thought before), could have drastic consequences! The document's quite
    large, a few hundred words, and about 500 or so pages. It's this really big project I was doing at work, for months now really, and because it's somewhat business confidential in nature, I can't exactly share it freely, or I would've uploaded a copy, sorry.
    Anyway, I woke up this morning and opened it, and the error came up. By the way, the document was saved in a 2TB external hard with a few disk errors in the past, if that helps, even if I'm pretty much sure its not a problem with the hard. The error
    came up... and yeah. I first made a copy, which I've been trying everything on, in case I do end up making things worse. I have no previous versions of it, just a relatively very old backup. Anything less than 80% recovery would set me back weeks with a wage
    cut. The good news is that I was able to open the document in WordPad, managing to recover the first 287 pages (131,543 words), with no errors or data loss, and saved it in a separate file. Apparently, according to information I obtained later, MSWord tends
    to not open at all when it encounters an error, but Wordpad tends to stop reading the rest of the code once it encounters an error. So, naturally, I assumed (so correct me if I'm wrong), that behind a few sentences I might lose due to the error, the rest could
    also be salvageable. I then looked up the problem all over the internet. Read a microsoft article on troubleshooting/recovering corrupted documents (Open & Repair, Draft Mode, Creating Link, Recover text from any file converter, etc). No dice.
    I first saw a similar question on answers.microsoft, and tried to use Tony Jollan's Rebuilder, macros enabled and all. Sadly, no luck.
    I managed to make my first breakthrough when I found out that .docx was just a .zip file, and could be renamed as such, with the document.xml extracted and manually fixed using an XML editor (not that I knew how to do that, but I was desperate and
    willing to learn). So, made another copy, changed extensions, and tried to specifically extract the document.xml. I believe that it is the main body text, right? That is the only thing really necessary for me, since everything I've done so far is entirely
    spartan, with no fancy fonts, formatting, header/footer/notes, media objects, formulas, tables, bullet points, numbered lists, etc. All pure sans-serif text, with some Japanese Kanji thrown in. 500 pages of pure text.
    I then hit a snag, when WinRAR encountered an error on extracting document.xml, stating that "CRC failed in word\document.xml. The file is corrupt". So I tried to fix this using several ZIP repair programs and stuff. Nothing worked. At
    least not so far. Managed to extract an incomplete version of document.xml using WinRAR's 'Keep Broken Files' option when extracting. The extracted
    document.xml came up to 3.31 MB while the original in archive is 7.53 MB. Viewed it in the Windows XML Editor which opened up the text in Internet Explorer, a jumble of text with no line breaks or paragraphs. Still, it actually extracted a few pages less than
    the open-using-Wordpad method tried earlier. So trying to fix the archive again...
    So I decided to give up the manual route for the meantime and focus on readymade solutions. I came across yet another microsoft article, but this one was at least more relevant than the last. It had this auto FixMe thing. I ran it, didn't
    work. Apparently, as I found out later, "This fix will work for one specific tag error where there are equations and graphics in the same paragraph AND Office 2010 SP1 has not been applied."
    Tried several (read: dozens) corrupt Word recovery software, from freeware to pro trials, to varying degrees of effect, although unsuccessful
    in goal. Some failed to read it, saying it was too corrupted for them to handle, the best managed to recover about three-pages-worth less data compared to the Wordpad method. So yeah, anybody with a similar problem, always open it with wordpad first and recover
    what you can. That doesn't mean I'm giving up though.
    So here I am, tearing my hair out in frustration. Whew, I feel like I told you guys my life story. I guess worst case scenario, I report this to my boss, so I personally or from the company, I/we'll hire a team of professionals to deal with it.
    But that's not an ideal scenario. It's gonna be out of my salary either way (the company has a firm policy of 'You reap what you sow'), along with a wage cut for making such an amateurish mistake even with continuous reminders to back it up every two sentences,
    so I'd rather avoid that.
    I'm currently looking to see whether there's a way to recover a previous version of an overwritten document somehow using a third-party software or something. (I didn't have Windows Backup enabled, so no previous version on Windows). So far, no
    autosaved documents on MSWord AutoRecover, even though I have it enabled set to every 3 mins (or maybe I'm just not seeing it since I'm trying to look for it manually?). Or are the temporary files wiped on shutdown? I don't have 'Always save backup copy' option
    enabled on word either.
    So yeah, an auto fix, or a lengthy answer detailing what exactly I should do (from the very very basics), or a link to a site with such info, would be much appreciated. Seriously :D
    Please. Thanks for at least making an effort :)

    When a Word document file is corrupted, then you can try several methods to recover it:
    1. First of all, you can try the recovery function integrated with Microsoft Word, as follows:
    1) On the File menu, click Open.
    2) In the Look in list, click the drive, folder, or Internet location that contains the file that you want to open.
    3) In the folder list, locate and open the folder that contains the file.
    4) Select the file that you want to recover.
    5) Click the arrow next to the Open button, and then click Open and Repair.
    You may find more information about this at:
    http://office.microsoft.com/en-us/word-help/recover-the-text-from-a-damaged-document-HP005189610.aspx (for Word 2003)
    http://support.microsoft.com/kb/893672/en-us (for Word 2007/2010/2013)
    2. If you have multiple corrupt Word documents, then you can use the VBA macro provided in article
    http://support.microsoft.com/kb/893672/en-us so that all the files will be opened in "Open and Repair" option automatically.
    3. There are also free tools from third-parties that can open and read Microsoft Word documents, for example,
    3.1 OpenOffice at http://www.openoffice.org. This is a very famous open source project that is designed to support Office file formats, including Word documents. The software can run under Windows.
    3.2 LibreOffice at http://www.LibreOffice.org. Another free office suite.
    3.3 AbiWord at http://www.abisource.com. This is a cross-platform tool that works under Unix and Windows.
    3.4 Google Drive at https://drive.google.com/ also support to load Word document files.
    Sometimes when Word fails to open your document, these tools may be able to open it successfully. If that is the case, then after the document is opened, you can just save it as a new document which will be error-free.
    4. For docx files, they are actually a group of files compressed in Zip file format. Therefore, sometimes, if the corruption is only caused by the Zip file, then you can use Zip repair tools such as WinRAR at
    http://www.rarlab.com to repair the file, as follows:
    4.1 Assuming the corrupt document is a.docx, then you need to rename it to a.zip
    4.2 Start WinRAR, go to "Tools > Repair Archive" to repair a.zip and generated a fixed file a_fixed.zip.
    4.3 Rename a_fixed.zip back to a_fixed.doc
    4.4 Using Word to open a_fixed.doc.
    There may still be some warnings when opening the fixed file in Word, just let ignore it and Word will try to open and repair the fixed file. If the file can be opened successfully, then you can just save the contents into another error-free file.
    5. If all above methods does not work, then you may try third-party tools such as DataNumen Word Repair at
    http://www.datanumen.com/word-repair/
    I have used it to repair some word documents successfully. It provides a free demo version so that you can try to see if the data you want can be recovered or not.
    Good luck!

  • Problem with the XML in Word 2007 (Word Template)

    Hi Experts,
    i am new on CRM 2007 and i have a problem with the XML Structure of the Word Template.
    First i built a Web Service Design Tool. Then i saw on the Testpage, that it works.
    So i started the Document Templates and created a new Template. Object Type was BUS2000126 - CRM Business Activity. Web Service was my created and tested Web Service Tool.
    As i opened the Word 2007 with the XML-Structure, i recognized, that there was something wrong.
    The Responce on my Testpage from the Web Service Tool had the following structure:
    response (test.types.p1.CrmostZlaWord5ReadResponse)
       Output (test.types.p1.CrmostZla010RoszlaWord5001)
            ZlaWord5 (test.types.p1.CrmostZla010Rosbtorder)
                Administrativeheaderoforder (test.types.p1.CrmostZla010Rosbtorderhea001)
                     Partiesinvolvedofheader (test.types.p1.CrmostZla010Rosbtheaderpa001)
                         Allpartiesinvolved (test.types.p1.CrmostZla010Rosbtpartnera002[]) Displaying 3 elements of 3
                              element1 (test.types.p1.CrmostZla010Rosbtpartnera002)
                                   Btpartneraddress (test.types.p1.CrmostZla010Rosbtpartnera001)
    My Problem is now, that the XML-Structure got not that point "element1".
    Instead of "element1" there is the point "item" in my XML-Structure in Word 2007.
    I guess that is the Problem why i am not getting the fields of the Btpartneraddress filled in my Word.
    Can anyone help me? Or put me in the right direction that i can change the XML?
    Thanks for your help
    André

    Hi andré, I guess the issue is coming from the fact that you selected "AllPartiesInvolved" and that may contain any numbers of entries. So when you test your webservice, you put a key and then get a result for that key, and in that case you might get "element1" until "element3" for example if there was 3 partners involved in you activity.
    But, when you design your template, you don't have a key at that moment, so in the Web Service structure, you have "items" which stands for all the possible entries you might retrieve at runtime. I guess you could use an index in your template to specify which item you need, but this is quite hasardeous, so i would be you, I would not design my web service to use "AllPartiesInvolved" but rather a specific Partner type like contact person for instance.
    Regards,
    Xavier

  • Good day. Is there into Pages the possibility to compare two documents? In order to reveal changes.

    Good day. Is there into Pages the possibility to compare two documents? In order to reveal changes.

    Hi iMayner,
    I don't know of a way to easily compare two documents in Pages. Maybe there is an app to do this, but I don't know of one.
    What you can do in Pages is to open both documents, resize the windows and drag them side-by-side on the screen. Pages 5 has a limit on how narrow you can make the windows, so this may be a bit of a pain.
    The first step would be to show the Word Count in both documents. (Pages 5.5.1 Menu > View > Show Word Count, or click on the View button in the ToolBar and Show Word Count). If the Word Counts are the same, you have **some** confidence that the documents are identical, but no guarantee.
    An example of two short documents with different word counts:
    Document A (33 words)
    The quick brown fox jumps over the lazy dog.
    Don't count your chickens until they hatch.
    Don't cross your bridges until you come to them.
    An apple a day keeps the doctor away.
    Document B (34 words)
    The quick brown fox jumps over the lazy dog.
    Don't count your chickens until they have hatched.
    Don't cross your bridges until you come to them.
    An apple a day keeps the doctor away.
    To home in on that difference, copy and paste each document into a column in a spreadsheet such as Numbers or Excel, so that each paragraph becomes a cell in a column. Turn on Wrap text in cell.
    The different heights of each cell (paragraph) become apparent.
    Happy Christmas!
    Regards,
    Ian.

  • Do you know if it's possible to generate Word 2003 files from RoboHelp if we're using Office 2007 and RoboHelp 2007?

    Do you know if it's possible to generate Word 2003 files from
    RoboHelp if we're using Office 2007 and RoboHelp 2007? We are
    thinking of upgrading, but have customers that would still require
    Word 2003 formats because they won't have 2007 installed.
    Thanks!

    Hi NewtoRobohelp
    Unfortunately I don't have Office 2007 in front of me to test
    with. But I'm thinking that as long as Office 2007 still produces
    the same formats as 2003, you could do it this way.
    From RoboHelp 7 and using Office 2007, generate Printed
    ouptut. Open said printed output in Office 2007 and perform a Save
    As. Save the document as a RTF (Rich Text Format) document.
    RTF is more universal. So I'm thinking this may be a possible
    way around it. There is also the possibility that Word 2007 offers
    up an ability when saving a file to save in an older format.
    Cheers... Rick

  • XML Publisher Patch for MS Word 2007 to get the AddInns menu

    Hi,
    What patch i need to download from metalink for XML Publisher.
    I use MS Word 2007, for creating rtf document, i need the AddInns menu in the word.
    Thanks.

    Why yes. How did you know that I saw the way, the truth and the light?
    I just had so many people ask me how to do this, that I thought I might share it with as many people as I could and help the masses To bad that Apple just did not have it as a simple "Save As:" Maybe in the next update.
    -Apple //GS

Maybe you are looking for

  • Can you have more than one itunes account on a mac

    is it possible to have more than one itunes account on a mac??

  • Can't print to networked hp laserjet 1320tn

    hi there. so i have printer driver updates using apple software update, but haven't been able to print to a networked hp laserjet 1320tn since upgrading snow leopard to 10.6.2. it keeps "looking for printer;" this printer queue worked on 10.6.1. if i

  • Upgrading from CR 8.5 to CR 2008, along with Oracle DB from 9 to 11G

    We currently use Crystal Reports 8.5, yeah its ancient, but suited our needs.  Our web guys upgraded our server to oracle 11g R2, and now we have lil issues with the changes and how the old CR works.  So figured it was time to upgrade it as well.  My

  • Upload excel data into Internal table

    Hi, I'm trying to upload excel data into internal table, well the excel file layout will be different on each run of the report. Excel file will have 60 columns and 500 record limit. I can upload the excel data using 'ALSM_EXCEL_TO_INTERNAL_TABLE' an

  • Delay in AJP Connector jBoss/TomCat Solaris 10

    I have: - IIS 6, Windows 2003 Server on dual PIII with jk 1.2.15 installed and working - Solaris 10 on Sun E420R with Quad Processors with jBoss / Tomcat version 4.0.3SP1 There is a significant delay in page loading that occurs randomly (every 10-20