Writing html page into word document

HI,
I want to write the HTML document using java code in to a word document.
When I save the webpage as word document. It's properly saving the font...
I almost did it with following code but the font is missing it is just writing all the text in single font.I want the exact replica of the html page in to the word document.Plz help me out with some input.
Thanx in advance.
Regards,
Ashok.
public String readContent(){
String contText="";
EditorKit kit = new HTMLEditorKit();
Document doc = kit.createDefaultDocument();
// The Document class does not yet
// handle charset's properly.
doc.putProperty("IgnoreCharsetDirective",Boolean.TRUE);
try {
// Create a reader on the HTML content.
Reader rd = getReader(fileName);
// Parse the HTML.
kit.read(rd, doc, 0);
// Iterate thsrough the elements
// of the HTML document.
//ElementIterator it = new ElementIterator(doc);
//javax.swing.text.Element elem;
//while ((elem = it.next()) != null) {
//SimpleAttributeSet s = (SimpleAttributeSet)elem.getAttributes().getAttribute(HTML.Tag.A);
// code to read the content
int nleft = doc.getLength();
Segment text = new Segment();
int offs = 0;
text.setPartialReturn(true);
while (nleft > 0) {
contText+=doc.getText(offs,nleft);
System.out.println(contText);
// do someting with text
nleft -= text.count;
offs += text.count;
if(nleft > 0)break;
/*if (s != null) {
//System.out.println(s.getAttribute(HTML.Attribute.HREF));
System.out.println(s.getAttribute(HTML.Attribute.HREF));
} catch (Exception e) {
e.printStackTrace();
//System.exit(1);
return contText;
} //end of reading content method
static Reader getReader(String uri)
throws IOException {
if (uri.startsWith("http:")) {
// Retrieve from Internet.
URLConnection conn = new URL(uri).openConnection();
return new InputStreamReader(conn.getInputStream());
} else {
// Retrieve from file.
return new FileReader(uri);

Maybe Im missing something here, but where are you doing anything using MS Word? I see you write the text to std out but where does it go to a word document? How do you create the word document, using JNI or is it a java API?

Similar Messages

  • How can I convert my Open Source document files into Word document files? I cannot download Pages since my Macbook Air does not have the most recent software.

    How can I convert my Open Source document files into Word document files? I cannot download Pages since my Macbook Air does not have the most recent software. I downloaded open office to my mac to try and save money. It worked well for a while. Now I get this pop-up message that asks me to "Reopen" and when I select the option, nothing happens. I cannot save my documents anymore and I cannot convert them to word. Help!

    dwb wrote:
    Does OpenOffice output Word documents by default or do you have to select it manually?
    You have 17 options to save as in Open Office, one of which is .doc  files,  yes it needs to be saved manually.
    You may be able to default to DOC, but have not tried same.
    Since Open Office is 99% same as Word, I use it, or Word, either one.  Open Office is a bit less buggy than Word 11'

  • External html page into flash

    how to load external html page into flash? (AS3)

    You cannot load an html page into Flash, though you can try to load the html code that it contains into a textfield.  Flash supports only a very limited number of html tags, so it is highly likely you won't get what you are after if you are reading an html file that hasn't been edited for Flash specifically.  Just look up the TextField.htmlText property in the AS3 help documents and it lists the html tags that are supported. 
    For AS3 you need to look into the URLLoader class as far as loadng an extrernal text file goes.

  • Save a JSF page as word document

    Hi All,
    I have a simple JSF page with some generated content. In that page, there is a button as "save as word". When the link is clicked , i need the save dialog to appear and prompt the user to save the content in the JSP page as word document.How can i do this in JSF.
    Please guide me in this regard.
    Thanks & Regards,
    Pals

    I don't think this is easily achieveable, if even possible
    A web page as you see it is the result of a web browser rendering the html in its own way. Different browsers can display the same html differently. So what gets displayed to the user is dependent on their browser.. In order to produce an image of what they see, you would need to somehow render the image differently depending on the value of the User-Agent header.
    Maybe by hacking around the hotjava browser you could start to do something like this, but I'd imagine it would be quite messy and again different to what the user views.

  • Exporting smartform into word document

    how to export smartform into word document . i tried converting into pdf and then download but it showing run time error that conversion is not possible and also it tells that otf command // missing. is it not possible to export directly to word document instead of pdf.plz give clear description of what to be done exactly with sample codes.
    marks will be rewarded.

    Hi Lavanya,
    Converting the output from Spool to Word is possible.
    Here is the sample code.
    I cut pasted a code from a link i got in the website, see if it helps.
    ZSPOOL2WORD
    Genera un fichero Word a partir de una orden de spool
    MÓDULO : FI *
    TIPO : Listado *
    TITULO : Generación fichero Word
    DESCRIPCION : Genera un fichero Word a partir de una orden de spool
    AUTOR: Andres Picazo FECHA: 24/03/2003 *
    MODIFICACIONES *
    FECHA NOMBRE DESCRIPCION *
    REPORT ZSPOOL2WORD
    NO STANDARD PAGE HEADING
    LINE-COUNT 065
    LINE-SIZE 080.
    INCLUDE OLE2INCL.
    *----TABLAS/ESTRUCTURAS--
    *----TABLAS INTERNAS--
    DATA I_BUFFER(132) OCCURS 1000000 WITH HEADER LINE.
    *----VARIABLES--
    *----PARAMETER/SELECT-OPTIONS EN PANTALLA--
    SELECTION-SCREEN BEGIN OF BLOCK BLK_PAR WITH FRAME TITLE TEXT-SEL. "Pará
    PARAMETERS: P_SPOOL LIKE TSP01-RQIDENT OBLIGATORY.
    SELECTION-SCREEN END OF BLOCK BLK_PAR.
    SELECTION-SCREEN BEGIN OF BLOCK BLK_WOR WITH FRAME TITLE TEXT-WOR.
    PARAMETERS: P_WORD AS CHECKBOX DEFAULT 'X'.
    PARAMETERS: P_FWOR LIKE RLGRAP-FILENAME DEFAULT 'C:MAYOR.DOC'.
    PARAMETERS: P_PLAN LIKE RLGRAP-FILENAME
    DEFAULT 'D:DATOSAPISMAYORPLANTILLA LIBRO MAYOR.DOC'.
    SELECTION-SCREEN END OF BLOCK BLK_WOR.
    SELECTION-SCREEN BEGIN OF BLOCK BLK_FIC WITH FRAME TITLE TEXT-FIC.
    PARAMETERS: P_CTXT AS CHECKBOX DEFAULT ''.
    PARAMETERS: P_FTXT LIKE RLGRAP-FILENAME DEFAULT 'C:MAYOR.TXT'.
    SELECTION-SCREEN END OF BLOCK BLK_FIC.
    LOGICA DEL PROGRAMA
    INITIALIZATION
    INITIALIZATION.
    START-OF-SELECTION.
    START-OF-SELECTION.
    PERFORM LEER_SPOOL.
    IF NOT P_CTXT IS INITIAL.
    PERFORM GRABA_FICHERO.
    ENDIF.
    IF NOT P_WORD IS INITIAL.
    PERFORM LANZA_WORD.
    ENDIF.
    FORMS ADICIONALES
    *& Form LEER_SPOOL
    Lee la orden de spool en el buffer
    FORM LEER_SPOOL.
    CALL FUNCTION 'RSPO_RETURN_ABAP_SPOOLJOB'
    EXPORTING
    RQIDENT = P_SPOOL
    FIRST_LINE = 1
    LAST_LINE = 9999999
    TABLES
    BUFFER = I_BUFFER
    EXCEPTIONS
    NO_SUCH_JOB = 1
    NOT_ABAP_LIST = 2
    JOB_CONTAINS_NO_DATA = 3
    SELECTION_EMPTY = 4
    NO_PERMISSION = 5
    CAN_NOT_ACCESS = 6
    READ_ERROR = 7
    OTHERS = 8.
    IF SY-SUBRC NE 0.
    MESSAGE E398(00) WITH 'Error' SY-SUBRC
    'al leer la orden de spool' P_SPOOL.
    ENDIF.
    ENDFORM. " LEER_SPOOL
    *& Form GRABA_FICHERO
    Graba el contenido del spool a fichero de texto.
    FORM GRABA_FICHERO.
    CALL FUNCTION 'WS_DOWNLOAD'
    EXPORTING
    BIN_FILESIZE = ' '
    CODEPAGE = ' '
    FILENAME = P_FTXT
    FILETYPE = 'ASC'
    MODE = ' '
    WK1_N_FORMAT = ' '
    WK1_N_SIZE = ' '
    WK1_T_FORMAT = ' '
    WK1_T_SIZE = ' '
    COL_SELECT = ' '
    COL_SELECTMASK = ' '
    NO_AUTH_CHECK = ' '
    IMPORTING
    FILELENGTH =
    TABLES
    DATA_TAB = I_BUFFER
    FIELDNAMES =
    EXCEPTIONS
    FILE_OPEN_ERROR = 1
    FILE_WRITE_ERROR = 2
    INVALID_FILESIZE = 3
    INVALID_TABLE_WIDTH = 4
    INVALID_TYPE = 5
    NO_BATCH = 6
    UNKNOWN_ERROR = 7
    GUI_REFUSE_FILETRANSFER = 8
    OTHERS = 9.
    IF SY-SUBRC NE 0.
    MESSAGE E398(00) WITH 'Error' SY-SUBRC
    'al grabar el fichero' P_FTXT.
    ENDIF.
    ENDFORM. " GRABA_FICHERO
    *& Form LANZA_WORD
    Abre la plantilla de Word y pega el contenido del portapapeles.
    FORM LANZA_WORD.
    DATA: WORDAPP TYPE OLE2_OBJECT,
    DOCUMENT TYPE OLE2_OBJECT,
    SELECTION TYPE OLE2_OBJECT.
    Copia el contenido del buffer en el portapeles
    CALL FUNCTION 'CLPB_EXPORT'
    TABLES
    DATA_TAB = I_BUFFER
    EXCEPTIONS
    CLPB_ERROR = 1
    OTHERS = 2.
    Abre Word
    CREATE OBJECT WORDAPP 'word.application'.
    IF SY-SUBRC NE 0.
    MESSAGE E398(00) WITH 'No se ha podido abrir el Word'.
    ENDIF.
    Lo pone en visible
    SET PROPERTY OF WORDAPP 'Visible' = 1.
    Cogemes el objeto documento
    CALL METHOD OF WORDAPP 'Documents' = DOCUMENT.
    Abrimos el fichero plantilla
    IF P_PLAN IS INITIAL.
    CALL METHOD OF DOCUMENT 'Add'.
    ELSE.
    CALL METHOD OF DOCUMENT 'Open' EXPORTING #1 = P_PLAN.
    IF SY-SUBRC NE 0.
    MESSAGE E398(00) WITH 'Error al leer el fichero plantilla'.
    ENDIF.
    ENDIF.
    Coge el objeto selección
    CALL METHOD OF WORDAPP 'Selection' = SELECTION.
    Pega el contenido del portapapeles
    CALL METHOD OF SELECTION 'Paste'.
    IF SY-SUBRC NE 0.
    MESSAGE E398(00) WITH 'Error al pegar contenido del portapapeles'.
    ENDIF.
    Graba el fichero
    CALL METHOD OF WORDAPP 'ActiveDocument' = DOCUMENT.
    CALL METHOD OF DOCUMENT 'SaveAs' EXPORTING #1 = P_FWOR.
    IF SY-SUBRC NE 0.
    MESSAGE E398(00) WITH 'Error al grabar el nuevo documento'.
    ENDIF.
    Cierra Word
    CALL METHOD OF WORDAPP 'Quit'.
    IF SY-SUBRC NE 0.
    MESSAGE E398(00) WITH 'Error al cerrar Word'.
    ENDIF.
    ENDFORM. " LANZA_WORD
    check this also.................
    By using FM RSPO_RETURN_ABAP_SPOOLJOB you will be able to get the ASCII text of your Spool, which you can download to your local HD and open with M$ Word.
    Check the function module
    CALL FUNCTION 'RSPO_DOWNLOAD_SPOOLJOB'
         EXPORTING
              id    = p_spool
              fname = p_file.
    Give the file extn as .DOC. it will downlaod it as a
    Word doc.But I fear you wont get the table formats and
    all.
    ~~Guduri

  • Insert SAP ICONS into from SAP into Word document (OLE)

    Hi ,
    I want to Insert SAP ICONS into from SAP into Word document (OLE) .
    Please let me know how can I do it? I would really appreciate it.
    Regards,
    Sanjeev

    Hello,
    Try this out:
    (1) Copy program DD_ADD_PICTURE into your own version called ZDD_ADD_PICTURE. Make sure
    you select all the checkboxes (including GUI Status and screens).
    (2) Paste the modified code at the end of this reply into your ZZ_ADD_PICTURE program.
    (3) Run ZZ_ADD_PICTURE for a range of Icons (e.g. enter Icon name ICON_IN* on the selection screen)
    (4) When you get the result list, type in ok-code EXPO directly in the ok-code
    field (you could also add a button for this function in the GUI status).
    (5) Download all the displayed icons as .gif files into a Windows folder
    that you have created to hold the icon .gif files (e.g. C:SAPICONS)
    (6) Now you can work with the icon files as you would any .gif file. (e.g. In a Word doc, use menu path
    Insert -> Picture -> From file.)
    Here is the code:
    REPORT dd_add_picture.
    TYPE-POOLS: sdydo.
    DATA: do TYPE REF TO cl_dd_document.
    DATA: is_displayed.
    TABLES: icont.
    DATA: BEGIN OF icontab OCCURS 0.
            INCLUDE STRUCTURE icon.
    DATA: END OF icontab.
    select-options: s_icon for icontab-name obligatory.
    SELECT * FROM icon INTO TABLE icontab WHERE locked NE 'X'
                        AND name in s_icon.
    * Event Handler Definition, handling changes of GUI fonts, colors,...
    CLASS cl_my_event_handler DEFINITION.
      PUBLIC SECTION.
        METHODS:
          use_new_resources FOR EVENT resources_changed OF cl_gui_resources.
    ENDCLASS.
    DATA: my_handler TYPE REF TO cl_my_event_handler.
    CREATE OBJECT my_handler.
    * Call Screen
    CALL SCREEN 100.
    *&      Module  STATUS_0100  OUTPUT
    MODULE status_0100 OUTPUT.
      IF is_displayed IS INITIAL.
        SET PF-STATUS 'BRP'.
        SET HANDLER my_handler->use_new_resources.
    * create document
        CREATE OBJECT do.
    * fill document
        PERFORM dd_add_icon USING do.
    * merge document
        CALL METHOD do->merge_document.
    * display document .
        CALL METHOD do->display_document
                           EXPORTING  container          = 'HTML'
                           EXCEPTIONS html_display_error = 1.
                                           " do some exception handling ...
        is_displayed = 'X'.
      ENDIF.
    ENDMODULE.                             " STATUS_0100  OUTPUT
    MODULE user_command_0100 INPUT.
      CASE sy-ucomm.
        WHEN 'BACK'.                       "Beenden
          LEAVE PROGRAM.
        WHEN 'PRN'.
          CALL METHOD do->print_document
                  EXPORTING reuse_control = 'X'.
        WHEN 'PRN_NEW'.
          DATA text TYPE sdydo_text_element.
          CALL METHOD do->initialize_document.
          text = 'Dies Dokument wurde speziell fürs Drucken erstellt!' &
                            ' Druckdatum: '(500).
          CALL METHOD do->add_text EXPORTING
                               text         = text
                               sap_fontsize = cl_dd_area=>large.
          WRITE sy-datum TO text DD/MM/YYYY.
          CALL METHOD do->add_text EXPORTING text = text .
          CALL METHOD do->new_line EXPORTING repeat = 2.
          PERFORM dd_add_icon USING do.
          CALL METHOD do->merge_document.
          CALL METHOD do->print_document.
        WHEN 'EXPO'.
          CALL METHOD do->export_document EXPORTING to_filesystem = 'X'.
      ENDCASE.
      CLEAR sy-ucomm.
    ENDMODULE.                             " USER_COMMAND_0100  INPUT
    *&      Form  DD_ADD_ICON
    *       text
    FORM dd_add_icon USING p_do TYPE REF TO cl_dd_document.
      DATA ta TYPE REF TO cl_dd_table_element.
      DATA col1 TYPE REF TO cl_dd_area.
      DATA col2 TYPE REF TO cl_dd_area.
      DATA col3 TYPE REF TO cl_dd_area.
      DATA text TYPE sdydo_text_element.
    * set Heading
      text = ' Bilder in Dynamischen Dokumenten'(001).
      CALL METHOD p_do->add_text EXPORTING text = text
                                      sap_style = 'heading'.
      CALL METHOD p_do->new_line.
      CALL METHOD p_do->new_line.
      CALL METHOD p_do->add_table EXPORTING with_heading    = 'X'
                                          no_of_columns     = 3
                                          width             = '100%'
                                          IMPORTING table   = ta.
    * set columns
      text = 'Ikone'(011).
      CALL METHOD ta->add_column EXPORTING heading  = text
                                 IMPORTING column   = col1.
    * fill table
      LOOP AT icontab.
        SELECT SINGLE * FROM icont WHERE langu = sy-langu
                                   AND   id    = icontab-id.
        CALL METHOD col1->add_icon EXPORTING sap_icon = icontab-name
                                             sap_color = 'LIST_GROUP'.
      ENDLOOP.
    ENDFORM.                               " DD_ADD_ICON
    * CLASS cl_my_event_handler IMPLEMENTATION.
    CLASS cl_my_event_handler IMPLEMENTATION.
      METHOD use_new_resources.
        IF is_displayed EQ 'X'.
    * initialize document
          CALL METHOD do->initialize_document.
    * fill document
          PERFORM dd_add_icon USING do.
    * merge document
          CALL METHOD do->merge_document.
    * display document
          CALL METHOD do->display_document
                              EXPORTING reuse_control        = 'X'
                                        reuse_registration   = 'X'.
        ENDIF.
      ENDMETHOD.
    ENDCLASS.
    Regards,
    Vasanth

  • Scanning multiple pages into single document

    Is there a way to scan multiple pages into single document?  I am using the following: HP pavilion laptop with windows 8,  HP photosmart C6380 all in one printer scanner copier.

    Hi,
    Please try
    Double click printer icon on desktop,
    Select Scan a Document or Photo,
    Put the first page on the glass (face down),
    Check options (size, dpi ...), and select Scan document to file,
    Click Scan - machine will scan the first page
    Remove the first page on the glass, put the second page,
    Click + (plus sign) It sits on the left hand side of a red x
    Machine will scan the second page, put 3rd page on the glass and click + again ..... to the end then click Save
    Click Done after Save
    Regards.
    BH
    **Click the KUDOS thumb up on the left to say 'Thanks'**
    Make it easier for other people to find solutions by marking a Reply 'Accept as Solution' if it solves your problem.

  • How do I scan multiple pages into one document?

    How do I scan multiple pages into one document?
    HP Photosmart 5514-e-All-in-One Printer B111h
    Model CQ 183 A

    Hi,
    Have you tried this (using PDF format).
    Launch the scan from the HP Solution Center
    Click Change Settings.
    Select the desire Scan Shortcut, then click Save to file save options
    Make sure Create One File per scanned page... is Unchecked, Approve and perform the Scan.
    Regards,
    BH
    **Click the KUDOS thumb up on the left to say 'Thanks'**
    Make it easier for other people to find solutions by marking a Reply 'Accept as Solution' if it solves your problem.

  • How do I scan multiple pages into one document using the CanoScan LiDE 200?

    How do I scan multiple pages into one document using the CanoScan LiDE 200?
    I can't seem to find a way to get them to scan continuously, or a way to stitch them together afterwards.

    Hi dagda24,
    You can scan multiple pages into a single document with the scan to PDF option.  Use the following steps to do so:
    1.  Open MP Navigator.
    2.  Click One Clcik.
    3.  Click Save to PC.
    4.  Change the File Type from PDF to PDF (multiple pages).
    5.  Make any other changes as needed, then click scan.
    Did this answer your question? Please click the Accept as Solution button so that others may find the answer as well.

  • How to get number of pages in word document

    I am developing an OCR application where I need to get number of pages of word documents. I am currently assuming that each page consists of 46 lines but some how, I am not getting convinced with this logic. Is there any simplest way to get this information?
    Let me tell you that in the OCR system I am converting tiff iamges to .txt file and again reading txt file and counting number of pages based on number of lines.
    regards,
    Ranjan

    The number of pages depends on the font size, length of each line and any page break in between.
    You could just say, one page per 46 lines, it depends on how accurate you must be.
    Why not count the number of pages you scanned?

  • I've become increasingly frustrated with the ipad adobe app. I've subscribed to be able to convert my pdf files into word documents and it has yet to work I've paid for a service that does not work which in turn makes me a disgruntled customer to say the

    I've become increasingly frustrated with the ipad adobe app. I've subscribed to be able to convert my pdf files into word documents and it has yet to work I've paid for a service that does not work which in turn makes me a disgruntled customer to say the least very disappointed with such horrible service

    Which service did you subscribe to?  Adobe PDF Pack?
    Once I know the service that you subscribed to, I can move this post to the right forum so that you can get in touch with the folks who can assist you.

  • How do you convert a jpeg file into word document so i can edit it?

    How do you convert a jpeg file into word document so i can edit it?

    http://office.microsoft.com/en-us/mac-word-help/training-edit-pictures-in-office -for-mac-2011-RZ103709558.aspx

  • Insert one html page into another html page

    Hi there,
    I wonder if sombody can help. I am trying to insert one short
    html page into another page. I could not find any option or feature
    in Dreamweaver that allow me to insert the page at all. The page I
    want to insert has the links and it is a short page, just like a
    banner. When I update the links on that page it will update all
    other pages in the website. I do not have to open many pages to
    update. I have been using FrontPage and I am now converting to
    Dreamweaver. Some codes from FrontPage does not work in
    Dreamweaver. I would very much appreciate if somebody can help with
    the codes.
    This is my website so that you can understand what I mean. At
    the top of the screen there are many links that are from one page I
    inserted into index.html. I use FrontPage. But, Dearmweaver does
    not work that way.
    Thank you. Kevin

    Be aware that IFrames carry all the disadvantages that frames
    do, for both
    you and your client's visitors.
    Murray --- ICQ 71997575
    Adobe Community Expert
    (If you *MUST* email me, don't LAUGH when you do so!)
    ==================
    http://www.projectseven.com/go
    - DW FAQs, Tutorials & Resources
    http://www.dwfaq.com - DW FAQs,
    Tutorials & Resources
    ==================
    "polarl light" <[email protected]> wrote in message
    news:g06ad0$2rm$[email protected]..
    >
    >> I wonder if sombody can help. I am trying to insert
    one short html page
    >> into
    >> another page. I could not find any option or feature
    in Dreamweaver that
    >> allow
    >> me to insert the page at all. The page I want to
    insert has the links
    >> and it
    >> is a short page, just like a banner. When I update
    the links on that
    >> page it
    >> will update all other pages in the website. I do not
    have to open many
    >> pages
    >> to update. I have been using FrontPage and I am now
    converting to
    >> Dreamweaver.
    >> Some codes from FrontPage does not work in
    Dreamweaver. I would very
    >> much
    >> appreciate if somebody can help with the codes.
    >
    > Depending on what you want to do you can use SSIs or an
    Iframe. SSIs are
    > good for things such as headers, menus and nav bars that
    you want to stay
    > the same across a range of pages. Iframes let you load
    an external HTML
    > file into a predefined area of your page so you can
    display different
    > content while staying on the same page.
    >

  • Turning PDF Documents into Word Documents

    Can I use my Adobe Creative Suite 6 to turn PDF documents into Word Documents?

    Acrobat comes with most versions of Creative Suite and it is capable of converting PDFs to Word documents. The quality of the results depend on a number of things. If the PDF was originally generated from a Word source and it's well tagged, you will get much better results. More complicated layouts will generally have worse results and there will be considerable cleanup in Word needed. Exporting to text is sometimes better.
    For the future, there is a better forum for this type of question: http://forums.adobe.com/community/acrobat/creating__editing_%26_exporting_pdfs?view=discus sions

  • How to download graph output into word document?

    Hello experts!
    I have plotted graphs using some data.
    The graph is actually output of a function module into a container.
    Can anybody tell me , how can i download this graphs into a word document?
    I am able to download text data into word documents, but not graphs.
    Regards,
    Rahul

    Steelers, unfortunately your reply didn't get through. Perhaps you tried to send an attachment or screen shot by email? Please try again (but not sending attachments or screen shots by email).

Maybe you are looking for