Conversion of Microsoft .doc to .txt

Hello there,
Could anyone give me some ideas on how to use Java to open a Microsoft Word document (".doc") and then save it (or convert it) as plain text (".txt")? I know you can do it in MS Word via "Save As", but that would be quite a hassle if you are opening and closing a thousand documents. I want to do some document conversion and regeneration. Any help would be very much appreciated.
Thanks
Sam
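
For the thousand-document case described above, the looping part can be kept separate from the extraction part. Here is a minimal Java sketch that assumes only the standard library. `DocBatchConverter` and its `TextExtractor` hook are hypothetical names, not from any library. The hook is where a real .doc reader would be plugged in (Apache POI's `WordExtractor` is one commonly used option, though whether it suits your documents is an assumption).

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Hypothetical batch driver: walks a folder and writes one .txt per .doc. */
final class DocBatchConverter {

    /** Hypothetical hook for the real extraction step (for example a library call). */
    interface TextExtractor {
        String extract(Path docFile) throws IOException;
    }

    /** Maps "report.doc" to "report.txt" in the same folder. */
    static Path targetFor(Path docFile) {
        String name = docFile.getFileName().toString();
        String base = name.substring(0, name.length() - ".doc".length());
        return docFile.resolveSibling(base + ".txt");
    }

    /** Converts every *.doc under root and returns the files that were written. */
    static List<Path> convertAll(Path root, TextExtractor extractor) throws IOException {
        List<Path> docs;
        try (Stream<Path> files = Files.walk(root)) {
            docs = files.filter(Files::isRegularFile)
                        .filter(p -> p.getFileName().toString().toLowerCase().endsWith(".doc"))
                        .sorted()
                        .collect(Collectors.toList());
        }
        List<Path> written = new ArrayList<>();
        for (Path doc : docs) {
            Path target = targetFor(doc);
            // The output encoding (UTF-8) is an assumption; pick whatever the next step expects.
            Files.write(target, extractor.extract(doc).getBytes(StandardCharsets.UTF_8));
            written.add(target);
        }
        return written;
    }
}
```

Calling `DocBatchConverter.convertAll(root, extractor)` with a folder and an extractor of your choice would write each .txt next to its .doc, so Word never has to be opened by hand.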

Hello there,
The reason I was trying to convert .doc to .txt is that I want to manipulate the data further. In fact, I am trying to convert .doc to .html. MS Word has a function that can "Save As" an HTML document, but the formatting is really weird: everything ends up all over the place. So I thought that if I could convert it into .txt, it would be easier to add HTML tags and format it nicely and automatically, and then maybe use FrontPage or Dreamweaver to further "decorate" the .html.
Someone suggested using a macro in FrontPage to do the job, but I am not familiar with VBS, so I thought of Java. But I am going to do LOTS of document conversion, and I am not sure whether using POI is convenient or appropriate. Any advice would be great!
Thanks,
Sam
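
Once the plain text is available, wrapping it in HTML tags can be done without any library. Here is a minimal Java sketch, assuming paragraphs in the text are separated by blank lines. `TextToHtml` is a hypothetical helper name, and the page skeleton it emits is only an illustration.

```java
/** Hypothetical helper: turns blank-line-separated plain text into simple HTML. */
final class TextToHtml {

    /** Escapes the characters that have special meaning in HTML text. */
    static String escape(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            switch (c) {
                case '&': out.append("&amp;"); break;
                case '<': out.append("&lt;"); break;
                case '>': out.append("&gt;"); break;
                case '"': out.append("&quot;"); break;
                default: out.append(c);
            }
        }
        return out.toString();
    }

    /** Treats blocks separated by blank lines as paragraphs and wraps each in <p>. */
    static String toHtml(String title, String plainText) {
        StringBuilder html = new StringBuilder();
        html.append("<html><head><title>").append(escape(title))
            .append("</title></head><body>\n");
        // \R matches any line break, so Windows and Unix line endings both work.
        for (String block : plainText.split("\\R\\s*\\R")) {
            String trimmed = block.trim();
            if (!trimmed.isEmpty()) {
                html.append("<p>").append(escape(trimmed)).append("</p>\n");
            }
        }
        html.append("</body></html>\n");
        return html.toString();
    }
}
```

The resulting file is plain enough to open in FrontPage or Dreamweaver for further styling. Headings, lists, and tables would need extra rules that this sketch does not attempt.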

Similar Messages

  • Won't convert a Microsoft doc.

    When I try to open a Microsoft doc so I can convert it to PDF, I get this message:
    "Because it is either not a supported file type or because the file has been damaged (for example, it was sent as an e-mail attachment and wasn't correctly decoded)."
    The doc was not sent as an e-mail attachment.
    I bought this thing so I could quickly and easily convert a Microsoft Word doc to PDF format. HELP! Please!

    Adobe Reader is a free product that is unable to produce PDF files from any kind of source file.
    If you bought Adobe Reader, you were scammed. If in fact you are talking about Adobe Acrobat, then you're posting in the wrong forum.
    Post in one of the Acrobat forums (for example http://forums.adobe.com/community/acrobat/creating__editing_%26_exporting_pdfs), but provide more information, like your OS, exact version of Office (I highly doubt it's 3, since that was released in the early 90's) and Acrobat. Also be aware that you can't just open a Word file in Acrobat, you have to convert it first to a PDF, either from within Word or from Acrobat itself.

  • I used to be able to convert PDFs to Microsoft docs, but now I get "an error occurred..."

    I used to be able to convert PDFs to Microsoft docs, but now I get "an error occurred..."

    Hi the duce,
    Please see this document: "Error occurred when trying to access this service" when logging on to Acrobat.com
    If you're still having trouble logging in, please let us know.
    Best,
    Sara

  • Why can I not Convert a Microsoft Office Document to a PDF using the Context Menu?

    Why can I not convert a Microsoft Office 2013 document to a PDF using the options found in the context menu (e.g. Convert to PDF, Combine Supported Files into PDF)?
    I updated to Acrobat XI Pro recently, but now I'm unable to combine or convert Microsoft Word docs to PDF.
    In Adobe Acrobat X I had this feature, and it would combine Microsoft Office documents into a single PDF. Now I no longer have this option in Adobe Acrobat XI Pro. It seems it was a program named Adobe Elements that was running the conversion.

    Ajlan. That page is showing as not available. Would the fix apply to Adobe Acrobat X and XI?
    Zach Moses

  • Converting the WebI reports to .txt format

    I have developed reports using Web Intelligence, and the requirement now is to convert these WebI reports to .txt format.
    I am able to do it after downloading the report in .CSV format, but I want to convert these reports directly to .txt format and schedule them using a third-party tool (ASG Zena).
    Can anybody please respond to this query if they have any ideas or have done it already?
    Thank you in advance.
    Regards,
    Saradhi.

    Hi,
    You can export WebI documents only in .csv format, not .txt.
    Regards
    -Seb.

  • I do not get a readable or editable PDF file that has been converted to Microsoft Word.

    In the past I have been able to convert PDF files to Microsoft Word, and edit and save the documents. Now the converted files are unreadable and uneditable. How do I correct this problem?

    Thanks for the email, Stacy!
    I don't have an old PDF file that I converted to Microsoft Word. However, the new file that I am trying to convert and edit is the same kind of file that I've converted and edited before: an invoice from the same company. Is there a possibility that a download, the Microsoft Compatibility Pack for the 2007 Office System, onto a Microsoft Windows XP Home Edition Version 2002 computer might be causing the problem? Should I download to (.doc) or (.docx)? Any help will be greatly appreciated!
    Thanks,
    Joseph Renn Little
    On Fri, 21 Mar 2014 08:48:32 -0700, StacySison wrote in Adobe ExportPDF:
    Hi Joseph,
    I'd like to assist!
    Have you tried converting one of your old PDFs that worked in the past to see if it still works?
    Are these files scanned to PDF files?
    You may want to try utilizing the OCR solution.
    Let me know if that works.
    Looking forward to hearing back from you!
    Kind regards, Stacy

  • After scanning my document and converting to Microsoft Word the size of characters are different

    After scanning my document and converting it to Microsoft Word, the sizes of the characters are different and things like punctuation are distorted. How do I get the uniformity of the original?

    Of course what lands in the Word file will differ from the viewed picture/image of text created by the scanner.
    (The output of all scanners is always an image file. For an image of textual content the best output file format is TIFF.)
    So you scan the hardcopy of text.
    The scanner output image (picture) is brought into PDF.
    At this point the only PDF page content that you can export to Word is the image (nope, no "text" just the image).
    Consequently you use Acrobat's OCR feature to do OCR of the image of text.
    With a decent paper source, proper resolution and a black and white image you'll get acceptable accuracy of recognition of the pictures of the characters.
    (the Optical Character Recognition)
    Acrobat's Searchable Image and Searchable Image (Exact) provide output that uses text rendering mode 3 (no fill, no stroke for the glyphs).
    So, invisible / hidden text.
    The third OCR method is ClearScan.
    You could play with each of the three to see what goes into a Word file.
    Might try export to RTF, DOC and DOCX as well.
    Anyway -- What is exported is the OCR output; Not the image of text.
    And, of course, the image of the text is not the imprint on the paper that was scanned.
    Each step of the way you have some deviation.
    Once you have the exported PDF content in a Word file you can use Word to clean up as desired / needed.
    OR
    Prop up the hardcopy and transcribe to a Word file.
    Be well...

  • Export pdf garbles conversion to microsoft doc

    I used ExportPDF to convert a PDF to a Microsoft .doc. It ended up that I could not edit the document: it would keep jumping around on the screen when I tried to insert info. This product needs to be recalled. I want a refund!
    Patrick Pierson

    Hi
    Thanks for reporting this issue.
    We would like to take a look at your file, can you please share this file with us?
    You can upload your file using this form, https://adobeformscentral.com/?f=qJiclooYWGGNFtWfj8g3wg#.
    Thanks
    -sarabjit

  • Content in Jsp to be converted to Word Doc

    I have a .jsp page with some generated content. On that page there is an option "Convert to Word DOC PAGE". When the link is clicked, the content in the JSP page has to be converted to a Word doc. How can I do this?

    <%@ page language="java" %>
    <%@ page import="java.util.*" %>
    <%@ page import="java.io.*" %>
    <HTML>
    <HEAD>
    <script language="JavaScript">
    // ActiveXObject works only in Internet Explorer on Windows with ActiveX enabled.
    var fso = new ActiveXObject("Scripting.FileSystemObject");
    var wdApp = new ActiveXObject("Word.Application");

    // Returns the contents of the expected text file, or an empty string for any other path.
    function readFromFile(fileName) {
        var result = "";
        if (fileName == "C:\\Award_Ltr.TXT") {
            var fs = fso.OpenTextFile(fileName);
            result = fs.ReadAll();
            fs.Close();
        }
        return result;
    }

    // Shows Word's File Open dialog so the user can pick the template to modify.
    function readFromWord() {
        // Backslashes must be doubled inside a JavaScript string literal.
        alert("PLEASE SAVE THE FILE AS C:\\PPY Letter for Annuities to Retirees and Alternate Payees wi_temp.doc");
        var pause = 0;
        var wdDialogFileOpen = 80;
        var dialog = wdApp.Dialogs(wdDialogFileOpen);
        var button = dialog.Show(pause);
    }
    </SCRIPT>
    </HEAD>
    <BODY>
    <FORM NAME="formName">
    <INPUT TYPE="file" NAME="fileName">
    <INPUT TYPE="button" VALUE="show"
    ONCLICK="this.form.fileContent.value = readFromFile(this.form.fileName.value)">
    <BR>
    <TEXTAREA NAME="fileContent" ROWS="20" COLS="90" WRAP="off"></TEXTAREA>
    <BR>
    <INPUT TYPE="button" VALUE="SaveExtract" >
    <BR>
    <INPUT TYPE="button" VALUE="Modify Template" onClick = "readFromWord()">
    </FORM>
    </BODY>
    </HTML>

  • How can I convert a new Microsoft Office Word document to Adobe?

    How can I convert a new Microsoft Office Word document to Adobe?

    Hi itchigo,
    You can use Microsoft Word's built-in feature for converting the doc to PDF: choose Save As and select the PDF format from the drop-down.
    Please refer to: http://office.microsoft.com/en-001/word-help/save-as-pdf-HA010354239.aspx#BM11
    Adobe Reader does not have the capability of converting docs to PDF or vice versa. It can only be used to read PDF files.
    Regards,
    Rave

  • Converting docx to doc files using wordconv.exe

    Hello,
    I have a requirement to convert docx files to doc files. I looked around and found the "Microsoft Office Compatibility Pack for Word, Excel, and PowerPoint 2007 File Formats" (http://www.microsoft.com/downloads/details.aspx?FamilyId=941B3470-3AE9-4AEE-8F43-C6BB74CD1466&displaylang=en). Installing it places a bunch of files in the C:\Program Files\Microsoft Office\Office12\ directory. This folder also contains an executable named "Wordconv.exe" which, according to http://www.oooninja.com/2008/02/office-compatibility-pack-review.html, converts docx files to doc files if you execute something like the following at the command prompt:
    "C:\Program Files\Microsoft Office\Office12\wordconv.exe" -oice -nme <input file> <output file>
    I downloaded the compatibility pack and tried the above command. Nothing happens: no error message, no output, nothing. I wonder what the problem is. At the above link, some people have suggested downloading the latest updates from Windows Update.
    I tried this on my Windows XP machine (with Service Pack 2), and the only thing left to install is Service Pack 3. Is that required? This Windows XP machine does not have Office 2007 installed. Is it required?
    I also tried this on a Windows Server 2003 machine that has the compatibility pack, and the result is the same. This machine does have Office 2007 installed.
    Am I missing anything? If so, please let me know, as I am kind of stuck on this. I don't want to use a commercial product like "Aspose.Words" for this.
    Is there any other tool available from Microsoft to convert docx to doc files? Please let me know.
    I am currently looking into the Office tools.
    Thanks in Advance,
    Ashish

    Hello!
    I've got the same problem. I've tried to google it, and found this topic.
    Have you found the solution?
    Thanks, Victor.
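
    When a command-line tool appears to do nothing, running it from code and checking the exit code and error stream can at least show whether it ran. Here is a minimal Java sketch using the command line quoted in the post. `WordconvRunner` is a hypothetical name, the install path is the one from the post and may differ on your machine, and passing absolute paths is an assumption rather than a documented requirement. It does not explain the silent failure described above.

    ```java
    import java.io.IOException;
    import java.nio.file.Path;
    import java.util.Arrays;
    import java.util.List;

    /** Hypothetical wrapper that launches wordconv.exe and reports its exit code. */
    final class WordconvRunner {
        // Install location quoted in the post; adjust for the local machine.
        static final String WORDCONV =
                "C:\\Program Files\\Microsoft Office\\Office12\\wordconv.exe";

        /** Builds the argument list; ProcessBuilder handles quoting of paths with spaces. */
        static List<String> command(Path input, Path output) {
            return Arrays.asList(WORDCONV, "-oice", "-nme",
                    input.toAbsolutePath().toString(),
                    output.toAbsolutePath().toString());
        }

        /** Runs the converter, forwards its output to this console, returns the exit code. */
        static int run(Path input, Path output) throws IOException, InterruptedException {
            Process process = new ProcessBuilder(command(input, output))
                    .inheritIO()
                    .start();
            return process.waitFor();
        }
    }
    ```

    A nonzero return value from `run`, or an `IOException` when the executable is not found, would narrow down where the conversion stops.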

  • Need to convert PDF to doc in Russian; why does the program not recognize it?

    Need to convert PDF to doc in Russian; why does the program not recognize it?

    Hi alsu22,
    The OCR system that converts documents does not recognize any Cyrillic-script languages, such as Russian. If the text is already renderable (selectable), you may want to try converting your document without the OCR function enabled. You can find steps here: http://forums.adobe.com/docs/DOC-3062
    -David

  • When I convert my pdf doc to word, the fonts go really weird and it also puts some text into boxes. when I try to select the text and change the font, it does not change it properly?

    When I convert my pdf doc to word, the fonts go really weird and it also puts some text into boxes. when I try to select the text and change the font, it does not change it properly? This is making it impossible to amend.

    Hi Janedance1,
    If the PDF that you converted already has searchable text, please try disabling OCR as described in this document: How to disable Optical Character Recognition (OCR) when converting PDF to Word or Excel. (If the PDF was created from a scanned document and doesn't already have searchable text, disabling OCR isn't a great option, as the text won't be searchable/editable in the converted Word doc.)
    Please let us know how it goes.
    Best,
    Sara

  • How do I convert a word doc to pdf?

    I cannot seem to convert a Word doc to a PDF. It keeps taking me back to the subscription page, and I have already signed up for that. HELP!

    Hello Saundra,
    I'm sorry to hear you're having trouble.  Could you post a screenshot of what you see when you try to log in here: http://createpdf.acrobat.com/signin.html ?
    -David

  • How do I convert a WORD doc to pdf on my Mac

    How do I convert a WORD doc to PDF?

    Easiest way if you have Word installed: from Word's File menu, choose "Save As..." and you will find a PDF option in the "Format" pull-down.
    The example is from Office:Mac 2008; should be similar in Office 2011.
