Convert  html to word document

convert html to word document ,
I tried poi-3.0.2-FINAL,Apache POI - HWPF - Java API to Handle Microsoft Word Files
it is not working...

My actual goal is convert html file into word document,
i posted into forum, some people are suggested HWPF just look,
I tried one by one program i not getting any answer for example one program,
HWPFDocument     doc = new HWPFDocument (new FileInputStream ("c:\\temp.doc"));
               Range r = doc.getRange();
          System.out.println("Example you supplied:");
          System.out.println("---------------------");
          for (int x = 0; x < r.numSections(); x++)
          Section s = r.getSection(x);
          for (int y = 0; y < s.numParagraphs(); y++)
          Paragraph p = s.getParagraph(y);
          for (int z = 0; z < p.numCharacterRuns(); z++)
          //character run
          CharacterRun run = p.getCharacterRun(z);
          //character run text
          String text = run.text();
          // show us the text
          System.out.print(text);
          // use a new line at the paragraph break
          System.out.println();
          }catch(NullPointerException exception){
               exception.printStackTrace();
          } catch (FileNotFoundException e) {
               // TODO Auto-generated catch block
               e.printStackTrace();
          } catch (IOException e) {
               // TODO Auto-generated catch block
               e.printStackTrace();
java.io.IOException: Invalid header signature; read 5789751444030890300, expected -2226271756974174256

Similar Messages

  • How do I download a Adobe PDF file and get it converted to a Word document?

    How do I download a Adobe PDF file and get it converted to a Word document?

    Download: you just download it as you would any other document.
    Convert to Word: either using the ExportPDF online service, or Acrobat.

  • How do I maintain my hyperlinks when converting a Microsoft Word Document to a PDF file?

    How do I maintain my hyperlinks when converting a Microsoft Word document to a PDF file?

    Hi Marbarrose,
    When you make word file including hyperlinks then convert in pdf.
    It will automatically maintain hyperlinks.
    Regards,
    Florence

  • Convert PDF to Word Document

    I'm trying to convert a PFD file into a word processing Word Document. Don't know if this is the right place, but if anyone can tell me how to do it, or direct me a software program, I would appreciate it.
    Thank you
    Peter

    Hello Peter,
    Perhaps this utility can assist you with this task.
    http://echoone.com/filejuicer/pdf-to-word
    Or this one.
    http://www.nemopdf.con/products/pdftoword.html
    A simple Google search for "convert PDF to Word" search should give you a plethora of sites as well.
    Hope this helps.
    B-rock

  • Convert Smartform in WORD Document

    Hi Gurus
    How can i convert a smartform Output in a word document ??? and  download into PC..
    Regards
    Gregory

    There is no standard possibility to do Word outputs in SAP.
    Anyway you can develop connection of Mail-merge technology of Microsoft and SAP ABAP class i_oi_document_proxy.
    http://help.sap.com/saphelp_sm32/helpdata/EN/e9/0be980408e11d1893b0000e8323c4f/content.htm
    with interface get_mail_merge_interface:
    http://help.sap.com/saphelp_sm32/helpdata/en/6e/8fc2e3dd0d11d2bdba080009b4534c/content.htm
    Simple example I found here:
    http://www.sap2word.de/abap.html
    Hope, it helps
    Edited by: Dimy IT dev on Sep 7, 2009 2:58 PM

  • Convert a Microsoft Word document to a publishing wikipage

    I have added a site collection of type Publishing site using the Enterprise Wiki template. now i have many word documents that contains the following contents; table of content, numbering, images, etc. Currently i want to copy a whole word document inside
    the wiki page's rich content editor, but if i simply open the word document, copy its content,past the content inside the rich text i will loose the following:-
    1. images.
    2. numbering sequence.
    3. table of content will not refer to the correct section.
    so my question is whether there is a way to convert a word document to a wiki page , without loosing any of the word document contents?
    Thanks

    There is no easy way to convert word documents directly to a wiki page in SharePoint.  SharePoint 2013 is better at handling cut and paste from a word document to a Wiki page while maintaining formatting.  But images will need to be uploaded and
    re-inserted separately and links from things like a TOC will also need to be rebuilt.  Here's a BLog post that discusses some of the problems and suggests an alternate solution.
    http://sharepoint-revolution.blogspot.com/2013/07/copy-word-documents-including-images.html
    There are also third party tools that automate the process.  For example this one:
    http://www.kaboodlekonnect.com/Pages/Word-to-Wiki.aspx
    Paul Stork SharePoint Server MVP
    Principal Architect: Blue Chip Consulting Group
    Blog: http://dontpapanic.com/blog
    Twitter: Follow @pstork
    Please remember to mark your question as "answered" if this solves your problem.
    the first option requires many steps ,, but the third party tool seems promising. but I have some questions about the tool (if you used the tool before):-
    1. i have some mandatory managed metadata fields such as wiki category, etc. so will i be able to convert a word document to a wiki page , even if i need to fill some mandatory fields?
    2. will this tool automatically build a table of content that reference the correct section within the wiki pages?
    3. can i download a trial version of this tool ?

  • Acrobat XI Pro Won't Convert 206 Page Word Document into PDF

    Hi there
    As menetioned above, Acrobat XI Pro Won't Convert my 206 Page Word Document into a PDF.  The Word document was originally a PDF file that I converted to Word and it has split all the text into sections.
    It sounds like converting a PDF into Word isn't the best way to edit, re-format and then save as a PDF again.  I would love to hear your advice on this.
    Thanks very much for your help!
    Fiona

    First before you recreate the PDF from the Word Document.
    In word: Open Document
    Next open a new Blank docment
    switch back to Word click on the ¶ button
    scroll to of go to very end of docment.
    click just to right of the perion in the last sentence.
    now go to very beginning of document
    Hold down the Shift and click to the right of first letter in document.
    Now choose copy.
    Now switch to Blank document
    Choose Paste special.
    Now choose Text only.
    If works all the words will be there spaced correct but with no ¶'s.
    Now insert returns as desired.
    Now save as a docx file under a different name.
    IF you are on a Mac use the following directions:
    go to File menu > Print > PDF Hold down PDF button until Context menu pops up.
    Choose adobe PDF.
    follow steps when the first window opens.
    Save as PDF in desired location.
    Now open the PDF in Acrobat. Document should be properly formatted and ready to go.
    AS you've found The conversion is not seamless. Acrobat doesn't distingish between automatic end of line breaks and Returns and you have to put the pieces back again.  I wish Adobe and MS would get over the jealouscy of each other and share howcode works so Thatapplications could work seamlessly together.  BUt they never will.

  • Acrobat Adobe 9; editing converted pdf to word document

    I have acrobat adobe 9 standard and I have converted my document from pdf to microsoft word where
    I am suppose to be able to edit it as a word document.  It shows up as an image and I am not able to edit it.  Does anyone know what I am doing wrong
    or can help me.

    How did you convert?
    Acrobat → http://forums.adobe.com/community/acrobat/creating__editing_%26_exporting_pdfs
    ExportPDF → http://forums.adobe.com/community/exportpdf

  • Adobe Acrobat 8 Standard Question - Converting PDF to word document

    Question:  I currently have Adobe Acrobat 8.  I need to convert a PDF to word document.  I know how to do that but the outcome of the word document sometimes varies as to retaining the exact formatting.  How can I retain exact formatting?  Is there something that I'm not doing that I need to be doing in order to retain the formatting.  Also do newer versions (Adobe Acrobat 11 (Standard or Pro?)) do a better job of converting and retaining formatting?

    Retaining the exact formatting is not possible in practice or in theory because Word documents are nothing like PDFs. For example, Word will cheerfully reflow text onto new lines or pages, while this will never happen with a PDF.
    That said, Adobe keep trying to get closer to what people need. Sometimes this results in complex parts of the file being made into an uneditable graphic or text box.
    Bottom line is you can get the basics into Word and then (according to your time, experience, and the abilities of Word) you might be able to reconstruct.
    On no account convert official forms to Word.

  • How do I convert a microsoft word document to a pdf?

    I need help to submit an assignment for school but I can't figure out how to convert word to a pdf.  My old computer was easy.  All I had to do was right click and convert, not with this computer. Can anyone shed some light? Thank you

    Hey cherelle,
    You might need to open the word document and choose 'PDFMaker' from the menubar or 'Adobe PDF' printer option from the Files menu for the same.
    Hope this helps.
    Regards,
    Anubha

  • How do I convert pdf to word document

    I have a pdf file and I want to convert it to a word document. How do I do this.

    Hi Auntie Faye,
    If you have a subscription to ExportPDF, the process is very straightforward. Here's some getting started info: Getting Started with ExportPDF
    If you'd like information about subscribing, please see Acrobat Pro, PDF Pack, Export PDF & More | Acrobat Document Solutions.
    Best,
    Sara

  • How do I edit a scanned file converted to a word document?

    How do you take a scanned document to a word document & then edit it?

    Many scanners have the ability to save as PDF.  If you scan your document to a PDF file, you can then use the ExportPDF service to convert that file to a Word document.  The ExportPDF service will perform optical character recognition (OCR) on your document to try to identify the text as actual text instead of an image.  You should then be able to edit the file.
    Please let us know if you have any questions.
    -David

  • Converting pdf into word document

    I have just signed up for adobe. how do i convert a pdf into a word document?

    Hi Ali,
    Welcome to ExportPDF!
    Here is a handy 'getting started' guide that should assist you.
    Let me know if that helps.
    Looking forward to hearing back from you.
    Regards, Stacy

  • I do not have the ability to edit my .pdf once converted into a word document. Did I purchase the wrong version?

    When I covert my pdf into a word document it does not allow me to edit the actual text. I can add random text to the document, not following the format of the document (not aligned, not the same font, etc.). I can add text, but I cannot what is already in a sentence or paragraph.
    Did I purchase the wrong version?
    Thanks,
    Cindy.

    Hi cindyjay,
    You have the right tool for converting PDF files to Word. For starters, please try triple-clicking in the text block that you want to edit. If that doesn't work, we'll need to take a closer look at how the PDF that you converted was created (did it start out as a scanned document, for example). If it did start as a scanned document, it's important that OCR (optical character recognition) was performed during conversion to convert scanned text to editable text. OCR is on by default when you convert via the ExportPDF website, but can be turned off if you convert via Reader.
    If triple-clicking doesn't work, let me know, and then we'll take a closer look at your file.
    Best,
    Sara

  • Trouble converting pdf to word document with ocr

    I just tried converting a pdf to a word document. I've done this successfully before. I checked the box indicating I wanted character recognition in English. The conversion I received was only a picture of the pdf file. No OCR. What's wrong?

      Thanks for your reply. My file is confidential, so I tried creating a similar file, 1 page long,that wasn't confidential. I found that Adobe easily converted the similar file using OCR.
    My original confidential file is 20 pages long, with 5 columns on each page. I thought maybe its length was the problem, so I tried converting only one of the pages to Word and it worked with OCR recognition. There were many, many mistakes though.  Also, it wasn't possible to copy and paste the columns into a new document. When I did that, the data reorganized itself into a list format with irregular formatting--as if I were pasting in text format. It seems like the OCR process only partially worked.
    I thought that maybe enough of the conversion was accurate for me to be able to work with it. when I looked at it closely I realized it wouldn't be possible.
    Unlike me, my colleague has the full version of Adobe. When she returns tomorrow (Thurs.), I will see if she is able to change any of the writing on the original document to make it non-confidential. If she can, I will include a screenshot.

Maybe you are looking for