How to read from a MS Word Document

Hi,
I need some help in reading the content from the MS Word 2000 document, through Java code.
I have used Java streams and tried to fetch the data into a stream and parse it for my customized need. I am getting some code characters in the beginning and at the end of the file. This might be because of some formatted information in the properties of that document.
How to get rid of that and get only the data from the ".doc" file?
If any one can guide me thru the conversion of a ".doc" file to PDF in java code, it will be of much more help to me.
Regards,
Subbu

if you are only intrested in the content of the file (text content) then save the *.doc file as *.txt file from your java code and then read the content of this *.txt file.... later you can also delete the *.txt file if you want.
good luck!

Similar Messages

  • I am working in Adobe Acrobat 9 Pro and just created a pdf form from a MS Word document. I need to find out how to have a date field in my form which will update automatically. Can some one out there help me?

    I am working in Adobe Acrobat 9 Pro and just created a pdf form from a MS Word document. I need to find out how to have a date field in my form which will update automatically.

    Update automatically under which circumstances, exactly?

  • How do I copy and paste text from a converted Word document?

    I just converted a pdf file into word.  I want to copy and paste the informatiuon into another document.  How do I do that?

    Hi Sara.  Tanks for responding.  If I cannot figure this out (I  was up
    until 2am last night), I want to cancel my subscription.
    Here are two files you can check on/help me with.  When I try again,  it
    comes up 'an error has occurred when trying to access the service.'
    Please advise.
    Pat Tomassi
    In a message dated 11/1/2014 12:55:23 A.M. Eastern Daylight Time, 
    [email protected] writes:
    How  do I copy and paste text from a converted Word document?
    created by Sara.Forsberg (https://forums.adobe.com/people/Sara.Forsberg) 
    in Adobe Acrobat.com Services - View the full  discussion
    (https://forums.adobe.com/message/6888908#6888908)

  • How to duplex print from a MS Word document in HP 1020 Plus printer

    Two days before I have acquired a HP 1020 Plus Laserjet printer . How to duplex print from a MS Word document and Corel Draw document in HP 1020 Plus printer? I tried manual duplex printing by printing the odd pages first and then even pages ....it came out correct ,...but whenit comes to Page numbers continuity it is not properly printed ....why?? Can anyone highlight on this and give me a step by step instructuion to print Duplex and thereby help me to create 20-----------page or 30 page booklets.....from MS word typed document or corel draw document......Expecting a speedy solution
    thanks & regards,
    K Narayanan
    This question was solved.
    View Solution.

    @immo22269  
    The laserjet 1020 only has manual duplexing capabilities. Please refer to this document for a complete set of instructions.
     Print on both sides of the paper (manual duplexing)
    I am an HP employee.
    Say Thanks by clicking the Kudos Star in the post that helped you.
    Please mark the post that solves your problem as Accepted Solution

  • How to read each and every word from a string.

    Hi all,
       I have a string which is having many label numbers. if the string is lv_str, its value is like, 11111111111111##22222222222222##3333333333333.
    I need to move the values alone into internal table. each value should be updated as a single row into one internal table. How to read each and every word of the string and move to an internal table.
    the internal table should be like this.
    11111111111111
    22222222222222
    3333333333333
    Can any one give me a suggestion in this regard.
    POINTS PROMISED.
    Regards,
    Buvana

    Hi,
    If you know the format and length of the data
    Use split at '#' so that you will get the individual values.
    Thean append it to internal table.
    Reward iof helpful.

  • How can I change a Microsoft Word document file into a picture file?

    How can I change a Microsoft Word document file into a picture or jpeg file? I am wanting to make the image I created my background on my macbook pro.

    After I had the document image the way I wanted it, I saved it as a web page and went from there. Below are the steps starting after I did the "save as" option in Word:
    1) Select "Save As Web Page". I changed the location from documents to pictures when the window came up to save it as a web page.
    2) Go to "Finder" on you main screen, or if it's on your main toolbar at the bottom.
    3) Click on the "Pictures" tab and find the file you just re-saved as a web page. (I included "web page" or something similar in the new title so I could easily find the correct file I was looking for)
    4) Open the correct file and then "right click" on the actual image. (Use 2 fingers to do so on a Mac)
    5) Select 'Use Image As Desktop Picture", and voilà! The personally created image, or whatever it is that you wanted, is now your background.
    **One problem I encountered while doing this is that the image would show up like it was right-aligned in relation to the whole screen. The only way I could figure how to fix this was to go back to the very original document in Word, (the one before it was saved as a web page), and move everything over to the left.
    I hope this helps someone else who was as frustrated as I was with something that I thought would have been very simple to do! If you have any tips or suggestions of your own, please feel free to share. : )

  • I cannot cut and paste .rtf from a MS Word document to a web page.

    I am running a MAC desktop with version 10.5. I use MS Office for MAC. When I try and cut and paste from a .rtf file to an amateur writing site, I always get a timeout. Help please!

    Hi Sara.  Tanks for responding.  If I cannot figure this out (I  was up
    until 2am last night), I want to cancel my subscription.
    Here are two files you can check on/help me with.  When I try again,  it
    comes up 'an error has occurred when trying to access the service.'
    Please advise.
    Pat Tomassi
    In a message dated 11/1/2014 12:55:23 A.M. Eastern Daylight Time, 
    [email protected] writes:
    How  do I copy and paste text from a converted Word document?
    created by Sara.Forsberg (https://forums.adobe.com/people/Sara.Forsberg) 
    in Adobe Acrobat.com Services - View the full  discussion
    (https://forums.adobe.com/message/6888908#6888908)

  • Generate topics from a linked Word document failes

    RH 9: Every time I try to generate topics from a linked Word document, RH failes and has to be shut. How do I solve this problem? or is this a bug in the application?

    There is no Conversion Settings screen.  Here are the steps I took:
    1. Created a new RH9 project.  
    2. Right-clicked in the Project Manager and selected "Link | Word Document."  (The message "scanning" appeared and then the MS Word document I selected appeared as an MSWord doc in the project folder.)
    3. Once all editing was finalized, I right-clicked on the MSWord doc in the Project Manager and selected "Update."
    There was no Conversion Settings screen.  The MSWord doc now appeared as a single HTML topic in Project Manager.

  • How to restrict Firefox to open word document in browser window with href

    How to restrict firefox to open word document from a website link to open in the browser window, instead it should open in native program.
    1.I have tried other website, and have clicked on similar links they are opening word document correctly in native program
    2. MS word document from one specific site is opening in browser causing junk character in place of word document.
    Could you please help on this.

    Thanks for your response...
    But As i said, I have tried with other website on the same browser, there it is opening word document in native program. Also the above mentioned options have been tried earlier.
    I have some questions related to this?
    How is this href link is behaving differently opening in browser for mozilla only. Technically, Can we make it open it native program only? If so, then how?

  • Read,write and create word document with list data

    Hi,
    My requirement is that I have a custom list called List1 and which has a 4 (suppose XName, location, phone, email)columns and also have a Template document(.dot). If I click on save button , new document has to create from the Template document(.dot) and
    should modify the content depends on list columns.
    For that I need to read the document , find out the text where XName , location,,phone, email and replace with the list item data(user entered data). 
    Can anybody please refer links for read,write and create word document?
    Thanks in advance.

    Yes, you can using Office Open XML. I found it to be a lot more cumbersome and in the end not a money saving approach:
    https://msdn.microsoft.com/en-us/library/office/bb448854.aspx?f=255&MSPPError=-2147217396
    Kind regards,
    Margriet Bruggeman
    Lois & Clark IT Services
    web site: http://www.loisandclark.eu
    blog: http://www.sharepointdragons.com

  • Receiving Mail Sent From Inside A Word Document

    I have a work colleague who is using a Windows environment and sending me MS Word documents by emailing from inside the Word document. When the email arrives in Mail the Word document shows as an attachment with the name winmail.dat. I have not been able to find a way of opening the file in a readable format. I am running OS X 10.4.6 and using Mail. I also have Office for Mac 2004 installed and have tried using Word on the winmail.dat without success. Entourage does not seem to have a means of finding files stored elsewhere. Preview does not cope and nor does GraphicConverter which is what Finder chooses as the default application to open winmail.dat.
    Can anyone suggest a solution?

    Hello Colin.
    This indicates the sender is using Windows Outlook as their email client which is the cause of this problem.
    See Dealing with the winmail.dat file: the problem and the solutions.
    You can try using TNEF's Enough.

  • How to copy NCLOB value(Contains Word Document) into file system

    How to copy NCLOB value(Contains Word Document) into file system or display in sqlplus

    The UTL_FILE package will write it only to text file not(NCLOB Value[containts images as well as text])

  • How to read and upload microsoft word file into database using forms9i

    Hi,
    How to read and upload microsoft word file into oracle database using forms9i. I appretiate if anyone can send me example or atleast a sujjetion.
    Thanks in advance
    Mahesh Ragineni

    The webutil package includes the ability up upload from the client to the database. See otn.oracle.com/products/forms and click on webutil for more details.
    Regards
    Grant Ronald
    Forms Product Management

  • Insert SAP ICONS into from SAP into Word document (OLE)

    Hi ,
    I want to Insert SAP ICONS into from SAP into Word document (OLE) .
    Please let me know how can I do it? I would really appreciate it.
    Regards,
    Sanjeev

    Hello,
    Try this out:
    (1) Copy program DD_ADD_PICTURE into your own version called ZDD_ADD_PICTURE. Make sure
    you select all the checkboxes (including GUI Status and screens).
    (2) Paste the modified code at the end of this reply into your ZZ_ADD_PICTURE program.
    (3) Run ZZ_ADD_PICTURE for a range of Icons (e.g. enter Icon name ICON_IN* on the selection screen)
    (4) When you get the result list, type in ok-code EXPO directly in the ok-code
    field (you could also add a button for this function in the GUI status).
    (5) Download all the displayed icons as .gif files into a Windows folder
    that you have created to hold the icon .gif files (e.g. C:SAPICONS)
    (6) Now you can work with the icon files as you would any .gif file. (e.g. In a Word doc, use menu path
    Insert -> Picture -> From file.)
    Here is the code:
    REPORT dd_add_picture.
    TYPE-POOLS: sdydo.
    DATA: do TYPE REF TO cl_dd_document.
    DATA: is_displayed.
    TABLES: icont.
    DATA: BEGIN OF icontab OCCURS 0.
            INCLUDE STRUCTURE icon.
    DATA: END OF icontab.
    select-options: s_icon for icontab-name obligatory.
    SELECT * FROM icon INTO TABLE icontab WHERE locked NE 'X'
                        AND name in s_icon.
    * Event Handler Definition, handling changes of GUI fonts, colors,...
    CLASS cl_my_event_handler DEFINITION.
      PUBLIC SECTION.
        METHODS:
          use_new_resources FOR EVENT resources_changed OF cl_gui_resources.
    ENDCLASS.
    DATA: my_handler TYPE REF TO cl_my_event_handler.
    CREATE OBJECT my_handler.
    * Call Screen
    CALL SCREEN 100.
    *&      Module  STATUS_0100  OUTPUT
    MODULE status_0100 OUTPUT.
      IF is_displayed IS INITIAL.
        SET PF-STATUS 'BRP'.
        SET HANDLER my_handler->use_new_resources.
    * create document
        CREATE OBJECT do.
    * fill document
        PERFORM dd_add_icon USING do.
    * merge document
        CALL METHOD do->merge_document.
    * display document .
        CALL METHOD do->display_document
                           EXPORTING  container          = 'HTML'
                           EXCEPTIONS html_display_error = 1.
                                           " do some exception handling ...
        is_displayed = 'X'.
      ENDIF.
    ENDMODULE.                             " STATUS_0100  OUTPUT
    MODULE user_command_0100 INPUT.
      CASE sy-ucomm.
        WHEN 'BACK'.                       "Beenden
          LEAVE PROGRAM.
        WHEN 'PRN'.
          CALL METHOD do->print_document
                  EXPORTING reuse_control = 'X'.
        WHEN 'PRN_NEW'.
          DATA text TYPE sdydo_text_element.
          CALL METHOD do->initialize_document.
          text = 'Dies Dokument wurde speziell fürs Drucken erstellt!' &
                            ' Druckdatum: '(500).
          CALL METHOD do->add_text EXPORTING
                               text         = text
                               sap_fontsize = cl_dd_area=>large.
          WRITE sy-datum TO text DD/MM/YYYY.
          CALL METHOD do->add_text EXPORTING text = text .
          CALL METHOD do->new_line EXPORTING repeat = 2.
          PERFORM dd_add_icon USING do.
          CALL METHOD do->merge_document.
          CALL METHOD do->print_document.
        WHEN 'EXPO'.
          CALL METHOD do->export_document EXPORTING to_filesystem = 'X'.
      ENDCASE.
      CLEAR sy-ucomm.
    ENDMODULE.                             " USER_COMMAND_0100  INPUT
    *&      Form  DD_ADD_ICON
    *       text
    FORM dd_add_icon USING p_do TYPE REF TO cl_dd_document.
      DATA ta TYPE REF TO cl_dd_table_element.
      DATA col1 TYPE REF TO cl_dd_area.
      DATA col2 TYPE REF TO cl_dd_area.
      DATA col3 TYPE REF TO cl_dd_area.
      DATA text TYPE sdydo_text_element.
    * set Heading
      text = ' Bilder in Dynamischen Dokumenten'(001).
      CALL METHOD p_do->add_text EXPORTING text = text
                                      sap_style = 'heading'.
      CALL METHOD p_do->new_line.
      CALL METHOD p_do->new_line.
      CALL METHOD p_do->add_table EXPORTING with_heading    = 'X'
                                          no_of_columns     = 3
                                          width             = '100%'
                                          IMPORTING table   = ta.
    * set columns
      text = 'Ikone'(011).
      CALL METHOD ta->add_column EXPORTING heading  = text
                                 IMPORTING column   = col1.
    * fill table
      LOOP AT icontab.
        SELECT SINGLE * FROM icont WHERE langu = sy-langu
                                   AND   id    = icontab-id.
        CALL METHOD col1->add_icon EXPORTING sap_icon = icontab-name
                                             sap_color = 'LIST_GROUP'.
      ENDLOOP.
    ENDFORM.                               " DD_ADD_ICON
    * CLASS cl_my_event_handler IMPLEMENTATION.
    CLASS cl_my_event_handler IMPLEMENTATION.
      METHOD use_new_resources.
        IF is_displayed EQ 'X'.
    * initialize document
          CALL METHOD do->initialize_document.
    * fill document
          PERFORM dd_add_icon USING do.
    * merge document
          CALL METHOD do->merge_document.
    * display document
          CALL METHOD do->display_document
                              EXPORTING reuse_control        = 'X'
                                        reuse_registration   = 'X'.
        ENDIF.
      ENDMETHOD.
    ENDCLASS.
    Regards,
    Vasanth

  • How can i open microsoft office word documents on my ipad 4 and edit it.

    How can i open microsoft office word documents on my ipad 4 and edit it. How to transfer my scaned documents on it

    There are a number of apps that are compatible with Office documents. Apple makes iOS versions of their own iWork apps (Pages, Keynote, Numbers) that are available free if you purchase a new iOS device and that can be purchased separately for older devices. There are also 3rd party combined suites that cost less than the separate Apple apps. Ones often recommended include Documents 2 Go,  Quickoffice and Office2 HD.
    If you can accept needing a network connection whenever you want to use the apps and have an Office 365 subscription, Microsoft has just released their Office Mobile app, though it has significant limitations. Or a server-based solution such as CloudOn might be an option for you.
    Regards.

Maybe you are looking for