Convert doc to txt

hi,
i need to convert one .doc document to simple text (.txt). can you help me how to do this.
the problem is that there are bookmarks in the word document and i want to know where are they and what is their name.
regards

www.microsoft.com

Similar Messages

  • Applescript batch convert DOC to TXT with line breaks

    Hey guys, I recently got stuck at work having to convert over 1,000 DOC files to TXT files with line breaks.
    I've found online several different Applescripts that work great at converting DOC files to TXT files but I can't find one that will do the TXT files with line breaks.
    If anyone has a script that can do this I would be crazy grateful.
    Converting these one by one with Word is taking forever to do.
    Thanks for any help you can give me.

    Excuse me for a moment for speaking harshly to you.  You are causing yourself utterly unnecessary headaches by not being clear with us and not stopping to think, and it's high time you learned that that is an incorrect way to approach anything on a computer.  Consider:
    you don't know what you're doing (in the sense that you don't know what 'text with line breaks' means)
    you don't know (or at least haven't explained) why this needs to be done
    (therefore) you don't know if this needs to be done at all
    (and yet) you are doing it anyway, in a mindlessly repetitive fashion, driving yourself batty and irritating me
    At least for the time being, humans are the ones who think and computers the ones who grunt away mindlessly; try to reverse those roles and everything gets done badly and slowly. Stop, look, think, plan ahead - that's what your brain is good at if you give it a chance.
    Now, as far as I can tell from poking around the web, 'text only with line breaks' means that the document  is saved as a plain-text file, but with a carriage return linefeed combination (CR/LF) as a paragraph delimiter (this is a Windows format - unix uses a single linefeed, Macs might use a single carriage return or a single linefeed). I don't know why anyone would want that format - most software will convert that seamlessly (or at least can be told to convert that).  Are you trying to feed this into some dinosaur of a database?  At any rate, if that's what you want, this script should do it. caution, this script overwrites the original files; I suggest you make a copy of one or two files in a separate folder, and run the script on them first to check that the output works for whatever reason you're doing this:
    set baseFolder to choose folder with prompt "Choose a folder of files to process"
    tell application "Finder"
              set fileList to (every file of baseFolder whose name extension is "txt") as alias list
    end tell
    repeat with thisFile in fileList
              set itsText to read thisFile
              if (offset of (return & linefeed) in itsText) = 0 then
      -- file is not already formatted with CR/LF, so convert
                        set itsChunks to tid(itsText, {return, linefeed})
                        set itsNewText to tid(itsChunks, return & linefeed)
                        set fp to open for access thisFile with write permission
                        set eof of fp to 0
      write itsNewText to fp as text
      close access fp
              end if
    end repeat
    on tid(input, delim)
      -- handler for text items conversions
              set {oldTID, my text item delimiters} to {my text item delimiters, delim}
              if class of input is list then
                        set output to input as text
              else
                        set output to text items of input
              end if
              set my text item delimiters to oldTID
              return output
    end tid

  • Convertion of Microsoft .doc to .txt

    Hello there,
    Could any give my some ideas on how to use Java to open a Miscroft document ".doc" then save it (or convert it) as a plain text ".txt"? I know you can do it using MS " word" via save as. But it would be quite a hassel if you are opening and closing a thousand documents. I want to do some document convertion and re-generation. Any help would be very much appericated.
    Thanks
    Sam

    Hello there,
    The reason I was try to convert .doc to .txt is because I want to further manipulate the data.... in fact I am trying to convert .doc to .html. The MS word has a function can "save as" a html doc.... but the formatting is really WIRED. Everything was flying around all over the places. SO I thouhg if you can convert it into .txt...then it would be easier to add html tags and format it nicely and automatically..... then maybe using forntpage or dreamwaver to further "decorate" the .html....
    Someone suggets to use macro in frontpage to do the job, but the thing is...I am not familer with VBS. So I thought of Java. But then I am going to do LOTS of document convertion and I am not sure using the POI is convenient or apporiate....Any advice would be great!
    Thanks,
    Sam

  • Error in converting *.doc to *.pdf

    Running winXP home sp2, adobe pro 7.1.0
    This problem has started suddenly.
    When converting *.doc to *.pdf a window titled AcroDist.exe-Application Error comes up:
    "0x00441ae2" referenced memory at "0x0218ebbc". The memory could not be "read".
    Acrobat stops and can only be stopped from the Task Manager. Try to print something in *.pdf -- same thing happens.
    Anyone, any suggestions? I would be thankful.
    joeZy

    Gloria Mc,
    This occurs with all .doc, Excel, .rtf, .txt (even when trying to print .pdf from a txt software like notepad. It does not occur with picture format files like .jpg or .tiff. I also can combine .pdf files.
    As far as JR's question, if I generate a .prn file and drop it in AcroDist (Distiller), it tries to open Distiller and the same error message comes up.
    All other functions of AA7 seem to work, only when trying to convert document files to .pdf is there a problem.
    If I try to bring up Distiller by itself (no file), the same error occurs.
    I'm still willing to try any suggestions. Thanks for your interest.
    joeZy

  • How to convert Flat file(.txt) data to an Idoc format(ORDERS05)

    Hi,
    How to convert Flat file(.txt) data to an Idoc format(ORDERS05). If any FM does the same work please let me know.
    thanks in advance,
    Chand
    Moderator message : Duplicate post locked. Read forum rules before posting.
    Edited by: Vinod Kumar on Jul 26, 2011 11:11 AM

    Hi,
            For more information, please check this link.
    http://sdn.sap.com/irj/servlet/prt/portal/prtroot/docs/library/uuid/46759682-0401-0010-1791-bd1972bc0b8a
    Have a look at the FM IDOC_XML_FROM_FILE. May be it helps...
    Regards

  • How to convert doc to text

    how to convert doc to text

    from what app to what app? open a document and save a .txt? or do you mean cutting and pasting into a text message?
    need a little more info please.

  • .doc to .txt

    Hi
         when I am converting .doc file to .txt file, I am losing the format of .doc file in txt file.like, If I want to convert a table in .doc file to .txt file, I am not getting the table in my txt file. How can I convert .doc file to .txt file with out losing its format?
             Thanks,
             Vipul

    Hi Mike,
                             I've attached one document here. I think now you can understand my problem better.
    Thanx,
    Vipul
    Attachments:
    modem.doc ‏29 KB

  • I can not open adobe files. mainly ones that end in doc. I purchased the 19.99 to convert .doc files

    I can not open adobe files that end in .doc. I purchased the 19.99 to convert doc to pdf and I still cant open. I get an error reading.

    Hi,
    Which Adobe Service did you purchased?
    If you have purchased CreatePDF please visit: https://createpdf.acrobat.com/SignIn.html
    Sign in with your Adobe ID and password, and then convert your word doc into PDF.
    Please let me know if that works.
    If you have Adobe Reader, you cannot convert .doc file to .PDF by drag and drop.
    ~ Aditya

  • How to convert .doc file into .rtf file in Java?

    Hello All,
    I want to convert doc file into rtf format in java and for the same i am not getting any help so pls suggest some solution for that.
    Thanks and Regards
    only1Vinay

    MS-Word formats (DOC) are notorious for not being standardized from one version to another, so what ever you get will be version specific. If you must do the conversion, I suggest you do a MS-Script in Word to do it or one of the .Net languages. As stated the Word format from version to version is not standardized.

  • How can i open a DOC or TXT file and insert the data into table?

    How can i open a DOC or TXT file and insert the data into table?
    I have a doc file . the doc include some columns and some rows.(for example 'ID,Name,Date,...').
    I'd like open DOC file and I'd like insert them into the table with same columns.
    Thanks.

    Use the SQL*Loader utility or the UTL_FILE package.

  • How can i retrieve documents(.doc,.pdf, .txt) using forms from the database.

    How can i retrieve documents(e.g .doc,.pdf, .txt etc) using forms from the database.
    i inserted the documents using sql*loader, below is the control and data files.
    -- control file
    LOAD DATA
    infile 'load.txt'
    INTO TABLE husman
    APPEND
    FIELDS TERMINATED BY ','
    (id integer external,
    fname FILLER CHAR(50),
    docu LOBFILE(fname) TERMINATED BY EOF)
    --data file
    1,../husman/dell.doc,
    2,../husman/me.pdf,
    3,../husman/export.txt,
    in the form i have a text field to display the id and an OLE container to display the document as an icon. but when i execute query, i only get the id number and not the document.
    any help will be appreciated.
    Thanks
    Hussein Saiger

    Step by step
    1. Erase all contents and settings
    2. You'll be asked twice to confirm
    3. You'll see Apple logo and progress bar
    4. You'll see a big iPad logo on screen
    5. Configuration start
    6. Set language
    7. Set country
    8. Enable Location Service
    9. Select network, enter password and join network
    10. You'll be given 3 options (a) Setup as New iPad (b) Restore from iCloud Backup (c) Restore from iTune Backup
    11. Selected Restore from iCloud Backup
    12. You'll be required to enter Apple ID and Password
    13. Agree to Terms and Conditions
    14. Select Backup file
    15. You'll see progress bar
    16. Red slider will appear; slide to unlock; step #1 to #16 is fast
    17. Pre-installed apps will be restored first
    18. Message: Purchased apps and media will now be automatically downloaded
    19. You'll see a pageful of apps with Waiting/Loading/Installing
    20. Message: Some apps cannot be downloaded, please sync with computer

  • Javascript in .PDF's - Extracting text from .doc or .txt

    Hello All,
    I am very new to javascript in .pdfs -- but I seem to find my around doing misc. work with forms. What I need:
    I need a Form with a Submit button that locates and extracts the text from a file and places it into another field.
    Example:
    on Server:
    one.txt or one.doc
    two.txt or two.doc,
    ...etc
    You type one in the form and submit -- it pulls all of the txt from one.txt off the server and places it into a field.
    Also if there is anyway to do this with tables to avoid multiple files that would be even better.
    I know I am a newbie, but this would be a game-changer for what I do.
    Thank you.

    Thanks for the advice
    It is accessing a shared file server (among employees) and it is to be a .pdf used in Adobe Acrobat Professional
    Basically I want it to be a form that pulls txt based on what was in the typed box or drop-down menu from a .txt or .doc

  • How to convert .doc files to .docx in a sharepoint library programmatically.

    Is there any possibility to Convert .doc files to .docx in a sharepoint document library.
    I have thousands and lakhs of .doc files and I need to automate to convert those .doc files to .docx with an automation script or powershell script or doing it programmatically.
    Can someone help me get through this.
    Thanks
    Gayatri

    Hello Gayatri,
    You can convert files from doc to docx using following options
    Option 1 
    in bulk using  Office File Converter (OFC) and Version Extraction Tool. Please refer below url for reference - http://technet.microsoft.com/en-us/library/cc179019.aspx
    Option 2 - PowerShell
    please refer url -http://blogs.msdn.com/b/ericwhite/archive/2008/09/19/bulk-convert-doc-to-docx.aspx
    Convert DOC to DOCX using PowerShell
    I was tasked with taking a large number of .DOC and .RTF files and converting them to .DOCX. The files were then going to be imported into a SharePoint site. So I went out on the web looking for PowerShell scripts to accomplish this. There are plenty to
    choose from.
    All the examples on the web were the same with some minor modifications. Most of them followed this pattern:
    $word = new-object -comobject word.application
    $word.Visible = $False
    $saveFormat = [Enum]::Parse([Microsoft.Office.Interop.Word.WdSaveFormat],”wdFormatDocumentDefault”);
    #Get the files
    $folderpath = “c:\doclocation\*”
    $fileType = “*doc”
    Get-ChildItem -path $folderpath -include $fileType | foreach-object
    $opendoc = $word.documents.open($_.FullName)
    $savename = ($_.fullname).substring(0,($_.FullName).lastindexOf(“.”))
    $opendoc.saveas([ref]“$savename”, [ref]$saveFormat);
    $opendoc.close();
    #Clean up
    $word.quit()
    After trying out several I started to convert some test documents. All went well until the files were uploaded to SharePoint. The .RTF files were fine but even though the .DOC fiels were now .DOCX files they did not allow for all the functionality of .DOCX
    to be used.
    After investigating a little further it turns out that when doing a conversion from .DOC to .DOCX the files are left in compatibility mode. The files are smaller, but they don’t allow for things like coauthors.
    So back to the drawing board and the web and I found a way to set compatibility mode off. The problem was that it required more steps including saving and reopening the files. In order to use this method I had to add a compatibility mode object:
    $CompatMode = [Enum]::Parse([Microsoft.Office.Interop.Word.WdCompatibilityMode], “wdWord2010″)
    And then change the code inside the {} from above to:
    $opendoc = $word.documents.open($_.FullName)
    $savename = ($_.fullname).substring(0,($_.FullName).lastindexOf(“.”))
    $opendoc.saveas([ref]“$savename”, [ref]$saveFormat);
    $opendoc.close();
    $converteddoc = get-childitem $savename
    $opendoc = $word.documents.open($converteddoc.FullName)$opendoc.SetCompatibilityMode($compatMode);
    $opendoc.save()
    $opendoc.close()
    It worked, but I didn’t like it. So back to the web again and this time I stumbled across the real way to do it. Use the Convert method. No one else seems to have used this in any of the examples but it is a much cleaner way to do it then the compatibility
    mode setting. So this is how I changed my code and now all the files come in to SharePoint as true .DOCX files.
    $word = new-object -comobject word.application
    $word.Visible = $False
    $saveFormat = [Enum]::Parse([Microsoft.Office.Interop.Word.WdSaveFormat],”wdFormatDocumentDefault”);
    #Get the files
    $folderpath = “c:\doclocation\*”
    $fileType = “*doc”
    Get-ChildItem -path $folderpath -include $fileType | foreach-object
    $opendoc = $word.documents.open($_.FullName)
    $savename = ($_.fullname).substring(0,($_.FullName).lastindexOf(“.”))
    $word.Convert()
    $opendoc.saveas([ref]“$savename”, [ref]$saveFormat);
    $opendoc.close();
    #Clean up
    $word.quit()

  • Converting .DOC, .XSL filo to PDF

    Hi all,
    currently I am looking for a Java API which would be able to convert DOC and XSL files to PDF file + adding bookmark information into both files also. I am not interested in solutions using hidden instances of some applications like Jacob (using MS Word instance) or OpenOffice SDK (using OpenOffice.org appl.) as it is the problem I am trying to avoid.
    Thanks a lot for any advice.
    Frank
    P.S.: I am a newie in here so I hope I didn't "do" anything against rules... :)

    You can not. the best you can do is use something like POI to give an approx. render of the page, and spit it out to PDF, or using a native word view, and script it in some way to print to a PDF.

  • How to convert Doc file into image

    hello frnds
                     Can any body guide me how to convert doc file into image and show into swf loader.
    actually i have to convert doc files into swf files in runtime so that i have to use this flow.
    is it possible to convert doc file into byte array and than convert into image.
    Thanks And Regards
        Vineet Osho

    You can convert any DisplayObject to byeArray using this function ImageSnapshot.captureBitmapData().getPixels()

Maybe you are looking for

  • Text Frame Options

    Hi, I am using CS6 in Saudi-Arabia. When I put a new text frame with more than one column on a page, the cursor automatically jumps to the right column. In English texts I can't get it to start on the top of the left column. Who can help?

  • Fusion drive lack of response and system freeze

    I have a 2013 iMAC with fusion drive. I find that I get very slow response when I open a folder in finder - it takes anything up to 10 seconds for the contents of that folder to be displayed.  I also get the system just hang when I do simple photosho

  • Deprecated Keyword?

    It seems to me that the support for deprecation in the Java language could use a little help. As it stands, the compiler looks for the @deprecated JavaDoc tag to tell if a type, method, or field has been deprecated or not. If it is, it places a Depre

  • Default date as first of month

    Hi, I have a report parameter $Rundate which is used as a reference date and as a parameter in my Query, Although i have the option to get it to default to a particular date, or to current date (using sysdate). is there a way i can get it to default

  • Creating Attribute Dimensions

    I have a Cost Center dimension with 10,500 members. I want to quickly tag these cost centers with 2 attributes. 1st is the first 3 characters of the string(8 in total) as company and the 4th position as dept. How can I do this in an automated way sin