Word to HTML

Hi,
I use DAD to upload and download DOC files from a BLOB in JSPs. How can I convert DOC format to HTML format when downloading, and display the files in the IE browser?
Best regards,
Yanming Xu

There is no easy way.
You can use the MS Word SaveAs command.
You may execute an OLE automation object (MS Word in your case) from Oracle Server.
For an example, see using COM Automation in Oracle:
http://download-west.oracle.com/docs/cd/B10501_01/win.920/a95499/toc.htm

Similar Messages

  • Convert word into html in unix environment

    We are in the process of developing a coldfusion web
    application which allows the users to upload resumes in word
    format. Once the word document is uploaded we need to convert the
    word document into html on the fly. We know it can be accomplished
    using com object in the windows platform. But ours is a unix
    environment. We need some coldfusion coding which can convert the
    uploaded word file into html. Please take note coldfusion server
    runs on unix platform in our case.

    For step 3 call a 3rd party component such as Aspose.Words to
    handle converting test.doc to test.html. I'd recommend wrapping
    this logic inside a CFC.
    For example using Aspose.Words java object appears to be as
    easy as the Java code:
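        import com.aspose.words.Document;
        import com.aspose.words.SaveFormat;

        // Load the Word file, then save it back out as HTML.
        Document doc = new Document(getMyDir() + "Document.doc");
        doc.save(getMyDir() + "Document.ConvertToHtml Out.html", SaveFormat.HTML);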
    Note: I've used the .NET version of Aspose.Words, not the
    Java version, but I've been pleased with the product.
    Aspose.Words formats supported
    http://www.aspose.com/documentation/file-format-components/aspose.words-for-.net-and-java/com/aspose/words/saveformat.html
    You might also investigate Apache POI
    http://poi.apache.org/
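On a Unix host where COM automation is not available, another route is to shell out to a command-line converter. The following is only a minimal sketch: it assumes LibreOffice is installed and that its `soffice` binary is on the PATH, neither of which is mentioned in the thread, and the function names and paths are hypothetical.

```python
import subprocess
from pathlib import Path


def build_convert_command(doc_path, out_dir):
    """Build the LibreOffice command line that converts one file to HTML."""
    return [
        "soffice",               # assumption: LibreOffice binary on PATH
        "--headless",            # run without a GUI
        "--convert-to", "html",  # target format
        "--outdir", str(out_dir),
        str(doc_path),
    ]


def convert_to_html(doc_path, out_dir):
    """Run the conversion and return the path of the expected HTML file."""
    subprocess.run(build_convert_command(doc_path, out_dir), check=True)
    return Path(out_dir) / (Path(doc_path).stem + ".html")
```

Keeping the command construction separate from the call makes it easy to log or inspect the command before running it. An application server could probably launch the same command line with whatever process-execution facility it offers.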

  • CF function/UDF to clean Word-generated HTML?

    Hello, everyone.
    Is there, out there somewhere, a CF function or UDF that will take a string of Word-generated HTML and remove all the cruft from it?
    I'm using TinyMCE as the rich-text editor for a CMS, and the people who are creating/modifying the content are using Word - and when they paste into the TinyMCE editor, most of the time there is no problem.  However, every once in a while something gets pasted that, no matter what one does, entire paragraphs will come out in bold (even though they weren't bolded in Word) and other stuff. I'd like to clean that out BEFORE it goes to the database.
    Thanks,
    ^_^

    Damn right. I hate it when places just say that "it's policy" and can't be done.
    Luckily where I work people are reasonable and the boss is an ex-programmer. Therefore if I go along and say "I want to base our entire SNMP switch monitoring system around these open-source classes I got from the internet", I'm trusted enough that it'll be allowed, and it's assumed that I've done an assessment and have deemed it safe.
    In general, it's simple enough to get the source code for these projects, go through it with them if they're that bothered and make sure there's nothing obviously dodgy going on. If people are really that fussed, look for ways to tighten up security on the box. Rather than installing the Java libs on your production box, how about having a separate system that has no access to your data, installing the libs on there and just writing a little API that does the work?
    I've had situations where I've had to go to managers and put it in their hands; "yes I can do what you want, but only if we install/use this product. Your call".
    If they're not willing to compromise then they can't always get the result they would like. Tough crap.
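For the original question about stripping Word cruft before it reaches the database, the usual approach is a handful of regular expressions run over the pasted string. The sketch below is illustrative only: the pattern list is an assumption about what Word typically emits (conditional comments, `o:`/`v:`/`w:` tags, `Mso*` classes, inline styles), it is not exhaustive, and the function name is hypothetical.

```python
import re

# Patterns for common Word-specific markup (illustrative, not exhaustive).
_CONDITIONAL_COMMENT = re.compile(
    r"<!--\[if.*?<!\[endif\]-->", re.DOTALL | re.IGNORECASE
)
_OFFICE_TAG = re.compile(r"</?[ovw]:[^>]*>", re.IGNORECASE)
_MSO_CLASS = re.compile(r'\s+class="?Mso\w+"?', re.IGNORECASE)
_STYLE_ATTR = re.compile(r'\s+style="[^"]*"', re.IGNORECASE)


def clean_word_html(html):
    """Strip Word conditional comments, Office-namespace tags
    (the tags only, not their content), Mso classes and inline styles."""
    html = _CONDITIONAL_COMMENT.sub("", html)
    html = _OFFICE_TAG.sub("", html)
    html = _MSO_CLASS.sub("", html)
    html = _STYLE_ATTR.sub("", html)
    return html
```

Removing every inline `style` attribute is what gets rid of the unexpected bold paragraphs, at the cost of also dropping formatting the author wanted. A real HTML parser would be sturdier than regular expressions on malformed input.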

  • There is no Clean up Word with HTML.js in my DW CS3

    I posted a few days ago about photos breaking and an error message about HTML.js.
    Now I find there is NO HTML.js found within my DW CS3.
    I tried to RE-INSTALL Dreamweaver from my install disc and it went about 1/4th of the way through and then asked me to put a PS Elements disc in my DVD drive. WEIRD!!!!  I don't even have PS Elements.  I have the CS3 Design Suite.
    SO, now I am stuck. I don't know what to do and why I cannot re-install DW without uninstalling the whole suite.?
    I was trying to re-install so as to be able to download any missing element that may have gotten lost in the installation the first time. (JOKE???)
    I used to be able to do that with Photoshop when something got lost in a new OS installation.
    Anyway, I installed this Suite after purchasing my new computer.
    Also, under "SITE", synchronize sitewide stays GREY  and I cannot use it. I cannot use COMMANDS, as the HTML.js  "error 2 " comes up. I have to click it 5 times to make it disappear from the screen. There are 5 photos on the web page.
    Any ideas?

    >>> Arunachalagirl <[email protected]> 6/09/2012 3:51 PM >>>
    Actually, when I went to Preferences, Compatibility, I did find 95-98. Hmmmm. But not sure if it would save in that. I have a big celebration tonight and I need to get back to preparations. I will check back tomorrow and see if I can fix this.
    BUT, the issue is, I never saved a word.doc and tried to put it into Dreamweaver, that I know of. So, I was totally surprised at what happened.
    R U saying that if there IS a doc and I change it in the DW site files, this issue may go away???

  • Word to HTML converter

    Probably asking for the impossible here!
    My client has spent a long time preparing a large document with extremely complex formatting which he was hoping to be able to paste directly into a web page. I am now looking for a Word to HTML converter. Dreamweaver does a good job cleaning up the HTML but leaves all of the styles inline. Is there a converter which will automatically "collect" any similar styles in the document, and automatically place these in the header?
    Any Word to HTML converters I have found so far online, have stripped all of the styles which will mean creating a large array of new styles and re-applying them later on.
    Ideas??? thanks!!
    Pixelwarrior

    mgrist wrote:
    Dreamweaver does a good job cleaning up the HTML but leaves all of the styles inline.
    Pixelwarrior
    If this is the case then you can use a free Dreamweaver Extension that can create a separate CSS file that can be linked in the <head> section of your page.  The free extension is at this link:
    <http://www.dmxzone.com/go?4087>
    You may need to register to download it so please use a free email from hotmail/yahoo/gmail or anything that you use to store all your spam messages.
    Good luck.
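The "collect similar inline styles into the header" step can also be scripted. A minimal sketch, assuming the cleaned HTML uses double-quoted `style` attributes and that the styled elements carry no existing `class` attribute; the function name and the generated class names (`s1`, `s2`, ...) are hypothetical.

```python
import re

_STYLE_ATTR = re.compile(r'\sstyle="([^"]*)"')


def hoist_inline_styles(html, prefix="s"):
    """Replace each distinct inline style with a class; return (css, html)."""
    classes = {}  # style text -> generated class name

    def to_class(match):
        style = match.group(1).strip()
        # Reuse the class if this exact style text was seen before.
        name = classes.setdefault(style, f"{prefix}{len(classes) + 1}")
        return f' class="{name}"'

    body = _STYLE_ATTR.sub(to_class, html)
    css = "\n".join(f".{name} {{ {style} }}" for style, name in classes.items())
    return css, body
```

The returned CSS text can be pasted into a `<style>` block in the page head or saved as a separate stylesheet. Styles are matched by exact text, so `color:red` and `color: red` would end up as two classes.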

  • Problem in converting word to html- file get error msg 'This command is not available because no document is open'

    Hi,
    I wrote some ASP code that saves a Word file from the client machine to the server and converts it into an HTML file.
    It works fine when I debug the code in Visual Studio, but when I deploy it on IIS I get the error 'This command is not available because no document is open' when I try to save the file in HTML format.
    I have tried many times, including giving the IIS user full access rights, among other things.
    Can anyone help me? I am completely frustrated by this problem.
    All I need is to upload the Word document from the client to the server machine and convert it into an HTML document.
    Only IIS gives me the problem; please suggest what settings I need to change in IIS.
    Please help me, this is a humble request.

    Hi,
    In this forum we mainly discuss questions and feedback about the Office client; issues related to coding are not supported here.
    Based on the description, although the question is about converting Word documents, it is more likely a permission issue, so it would be better posted in the IIS.NET forum:
    http://forums.iis.net/
    The reason why we recommend posting appropriately is you will get the most qualified pool of respondents, and other partners who read the forums regularly can either share their knowledge or learn from your interaction with us. Thank you for your understanding.
    Regards,
    Melon Chen
    TechNet Community Support

  • Display the Word or html doc  which is stored as blob intable and SearchApplication

    Hi All,
    I have two questions, as follows.
    1) I created a table:
    create table documents(id number,doc blob);
    I created a form based on the above table. It has two fields: one to enter the id and the other to upload the document. The default buttons are insert, query, update, reset ....
    I inserted a Word document into the table with id = 1. When I query the table with id = 1, the Word document is not displayed, whereas when I insert a .bmp file I can see the image when I query.
    How can I display the Word document when I query the table? Please suggest a way to do it.
    What is the method to use? And if the document (blob) is an HTML document, how can I display it?
    2) I have to create a search application:
    A text box: to enter a word
    Search button: on click, it should search the documents in the documents table (as mentioned above)
    and display the list of documents that match the criteria. On clicking a result item (link), the doc or HTML should be displayed.
    Thanks in Advance...
    Sreedhar

    Oliver -
    TestStand does not have any hooks to allow a test system developer to override the internal searching for a file on disk. The only simple option that I see is to query the database and download the latest sequences ahead of time. This is similar to a Source Code Control mechanism. Keep in mind that once an execution loads a sequence file, the file is typically not released until the last execution completes so you cannot load an updated copy of the sequence file while executing, especially the client sequence file.
    Scott Richardson
    National Instruments

  • Highlight words in html page set on editorpane

    Hi everyone,
    Please help me quickly.
    I have a Java application.
    I can display HTML with hyperlinks in a JEditorPane,
    but my problem is that
    I want to find a specific word in this page,
    highlight it, and display the HTML page with the word highlighted.
    Can anyone solve my problem?
    Thanks

    Thank you for the reply. I can understand if it was changed intentionally for some reason, though I can't think what that reason might be.
    I know that it definitely works fine in Firefox 21, and changed in Firefox22 and 23 (I tested the beta, aurora and nightly versions and the behavior is the same as 22 & 23 there too)
    I have attached two screen shots of what it looks like when I open a new tab on firefox 21 vs firefox 23. When I use a completely new profile (or even on a different operating system), the behavior is consistent as well, so I think this is definitely something that changed from 22 onwards. (link titles changed)
    I should add, this quite drastically changes the way I use Firefox, because I am no longer able to use middle click to paste URLs or search terms into the URL bar. I might select the output of an error message in a terminal and then just middle click to paste in the bar and hit enter to go from there.

  • How to convert word documents to html page in sharepoint online 2013

    Hi,
    I am new to SharePoint and still learning it.
    I have been tasked to do the following on office 365 E3 SharePoint 2013 Online edition.
    1) I have to create a Web page in asp.net
    2) This page needs to show document from a given SharePoint folder and bind them in a grid or dropdown on the asp .net web page
    3) On selecting the document from the drop down or grid (on asp .net webpage), I need to show the SharePoint word document as HTML on the webpage (something like word to html) Note: These SharePoint word documents may contain images, bullets, tables etc.
    What I have been able to do till now
    1) I have been able to connect to SharePoint from ASP .net application.
    2) I have been able to retrieve document from a specific SharePoint folder.
    3) Read the document from SharePoint folder and bind them to a drop down on the asp .net page.
    What is missing?
    I am not aware of any API that SharePoint Online provides to convert a Word document to HTML. Any code sample or reference on how to do this would be much appreciated.
    I am also not sure what the best way of achieving this functionality is.
    Thanks 
    Krishna

    If this were SharePoint Server it would be easy; in O365, however, you need to create an app that uses the Word Automation Service. Below is PowerShell that you can use for the conversion:
    # This script will convert Docx to PDF using word automation and similarly it can be used to convert to HTML
    $wordFile="http://contoso/kick.docx"
    $pdfFile="http://contoso/kick.pdf"
    $wasp = Get-SPServiceApplicationProxy | where { $_.TypeName -eq "Word Automation Services Proxy" }
    $site = Get-SPSite "http://contoso"
    $ConvertJob = New-Object Microsoft.Office.Word.Server.Conversions.SyncConverter($wasp)
    $ConvertJob.UserToken = $site.UserToken
    $ConvertJob.Settings.UpdateFields = $false
    $ConvertJob.Settings.OutputFormat = "PDF"
    $ConvertJob.Convert($wordFile, $pdfFile)

  • Mapping Styles from Word 2010 to RoboHelp HTML 10

    Hello all,
    I have created customized styles for myself and my team to use for authoring content, which I then import to RoboHelp, as I am the only individual who understands RoboHelp and HTML/CSS. I created a corresponding CSS style sheet in RoboHelp to correspond with the names of the Word styles. When I go to import the files in RoboHelp, I am running into an issue with paragraph tags and heading tags.
    When I import a file from a Word document, I have not found a way to map a <p> tag that is parsed from a Word document to an <h> tag, as heading tags do not appear as a selectable item in the Conversion Settings  window when importing Word files. I would be able to keep the formatting if I left the headings as <p> tags; however, it is my understanding that RoboHelp 10 search relies heavily on the heading tags for weighting search results. I have about 400 topics, and search is used heavily by those using my documentation. I would prefer to keep the <h> tags in place in the RoboHelp files and retain the search functionality, if at all possible, while finding an easier import option from Word files.
    At this point, I have to manually change the HTML for each <p> tag to match the correct heading style. While Find and Replace works well for this, I would prefer to have a seamless import from Word (which bloats an HTML file horrendously, I know) by mapping from one style to the next. Has anyone found a way to map a <p> tag to an <h> tag so that the search functionality does not suffer? Or am I just approaching this the wrong way?

    You are correct in your assumption that I am referring to paragraph styles when speaking of <p> tags. The paragraph styles come over as <p> tags, and I was just thinking of my style sheet. Sorry for the confusion.
    Speaking of my workflow, I'm attempting to map the styles for one topic at a time when importing a new Word file. I set up a test project to test out my style sheet, as I am upgrading from RoboHelp 7 right now. I need to get one document to map correctly to save the settings for future imports.
    From the Import window, I click the Edit... button in the Word Document section. After RoboHelp scans the Word file, a list of all the styles from the Word document display. When I attempt to select from the available styles I want to map to from my RoboHelp style sheet, I don't have the option to select the heading styles I created in my RoboHelp CSS file. Does this provide you enough information to give you context on how I am attempting to map the styles?
    I found this article that leads me to believe that RoboHelp does not support mapping to a customized heading style from a CSS file. I realize that this is around the printed output, but it states the following:
    If custom heading styles aren’t named in the format Heading <number>, they are not treated as headings.
    If it doesn't work going from HTML to printed output, I'm guessing that there is not support going from Word to HTML. Am I wrong in this assumption? Do I just need to alter my style sheet to only use the standard <h> tags to fix this issue?
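The manual Find and Replace pass described in this thread could be scripted as a post-import step. A minimal sketch, assuming the imported paragraphs arrive as `<p class="StyleName">` with exactly that attribute layout; the style names in the mapping are hypothetical placeholders, not names from the poster's style sheet.

```python
import re

# Hypothetical mapping from Word paragraph-style class names to heading levels.
STYLE_TO_LEVEL = {"TopicTitle": 1, "SectionHead": 2}

_P_BLOCK = re.compile(r'<p class="([^"]+)">(.*?)</p>', re.DOTALL)


def promote_headings(html, mapping=STYLE_TO_LEVEL):
    """Rewrite <p class="X">...</p> as <hN>...</hN> when X is in the mapping."""

    def swap(match):
        level = mapping.get(match.group(1))
        if level is None:
            return match.group(0)  # not a heading style: leave unchanged
        return f"<h{level}>{match.group(2)}</h{level}>"

    return _P_BLOCK.sub(swap, html)
```

Note that the class name is dropped from the promoted element, so the result is a plain `<h1>` or `<h2>` that a standard heading rule in the style sheet would then format.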

  • "RoboHelp for Word" vs "RoboHelp HTML"

    Hi all,
    I note there are two flavors for RoboHelp (wrt also working
    with Word + importing + exporting):
    - RoboHelp for Word
    - RoboHelp HTML
    Questions:
    1. Am I right that it seems you can do everything with
    RoboHelp HTML that you can do with RoboHelp for Word?
    2. If not (1, above), what are the relative advantages and
    disadvantages of each for starting a new RoboHelp project?
    3. Is there a reason for ever trying to change application
    for an existing RoboHelp project, and would there be expected any
    particular conversion problems?
    Tia,
    - avi

    Welcome to our community, Avi
    I see you posted the question here as well as the TechShoret
    list. I further see you have at least one reply over on TechShoret.
    Here are my thoughts. Yes, you can do most of what RoboHelp
    for Word can do using RoboHelp HTML. I say "Most" here because
    there are a couple of things you can't do. You can't easily create
    a .HLP file using RoboHelp HTML. RoboHelp HTML doesn't offer tabs
    like you can use with RoboHelp for Word.
    Now I realize that may sound a bit as if I'm saying that
    RoboHelp for Word is superior to RoboHelp HTML. I'm not. As with
    anything in life, there are trade offs and things to consider. In
    this case, I would say you should consider the output type.
    Most everything is moving away from WinHelp (.HLP files) and
    toward an HTML based format. I noted the TechShoret reply mentioned
    that with RoboHelp for Word, everything is stored in a single Word
    file, vs having multiple HTML files with RoboHelp HTML. While this
    is true, I would say that if your output is going to be HTML based,
    you should seriously consider using an editor that was designed
    from the ground up to support HTML. Let's face it. Anything HTML
    coming from Word is a kludge, as Word was designed for print. There
    are also some things that will constantly frustrate you if you
    elect to use RoboHelp for Word and are creating HTML output.
    So while it may seem an attractive option to use, I'd suggest
    avoiding RoboHelp for Word and going with RoboHelp HTML.
    Sincerely... Rick

  • RoboHelp for Word vs for HTML

    Hi there,
    I am currently evaluating RoboHelp 7 and am wondering what
    the advantage may be to using the HTML edition over the Word
    edition. We currently use RoboHelp x5 for Word, but that doesn't
    necessarily mean I have to stay with it.
    Part of me wants to stay with Word because I am familiar with
    it, and because I think I will need to use its track changes
    feature as a means of identifying changes for reviews and for
    translation purposes (i.e. identifying what has been changed).
    Part of me wants to move to HTML though as I have dabbled a
    little in that environment in the past and feel I am missing out by
    remaining in Word. Also, (and I might be wrong here) but I think
    from a source control perspective it is easier to identify changes
    comparing HTML files than Word files.
    I'm just wondering what others in this forum think. Why do
    you use the Word or HTML edition? And if you have changed from one
    to the other, what was your deciding factor in doing so?
    I appreciate your time in helping me decide which way to go.
    Cheers
    Heather

    Hi Heather
    I'd say that it depends a bit on your output. If your output
    is anything HTML based, RoboHelp HTML hands down. After all, Word
    does horrible HTML. Why not use something designed with HTML in
    mind?
    If you stick with Word, you aren't doing yourself or anyone
    else any favors. There are things that Word will do that will
    frustrate you.
    Then again, that's just my opinion. Personally, I would have
    voted long ago to banish the RoboHelp for Word application.
    Cheers... Rick

  • Link one solution manager word document into another solution manager HTML

    Hi,
    I have requirement in solution manager that there are two documents uploaded into solution manager.
    1. One word document in one of the project folder
    2. HTML page in another folder of same project
    I have to set up a hyper link of word document in the HTML web page.
    If anyone has gone through this type of requirement, please let me know the details of how did you set up the link between word and HTML document.
    Appreciate the early response.
    Thanks,
    Siva

    Go to the properties of the file you want to link your word document to. (attributes icon)
    There you will find a button "Generate Document URL (F9)". Clicking this button will buffer the URL into clipboard.
    Then you open your Word document in edit mode and paste the URL there.
    source: Inserting a document into another in Solution Manager

  • Convert smart quotes and other high ascii characters to HTML

    I'd like to set up Dreamweaver CS4 Mac to automatically convert smart quotes and other high ASCII characters (m-dashes, accent marks, etc.) pasted from MS Word into HTML code. Dreamweaver 8 used to do this by default, but I can't find a way to set up a similar auto-conversion in CS 4.  Is this possible?  If not, it really should be a preference option. I code a lot of HTML emails and it is very time consuming to convert every curly quote and dash.
    Thanks,
    Robert
    Digital Arts

    I too am having a related problem with Dreamweaver CS5 (running under Windows XP), having just upgraded from CS4 (which works fine for me) this week.
    In my case, I like to convert to typographic quotes etc. in my text editor, where I can use macros I've written to speed the conversion process. So my preferred method is to key in typographic letters & symbols by hand (using ALT + ASCII key codes typed in on the numeric keypad) in my text editor, and then I copy and paste my *plain* ASCII text (no formatting other than line feeds & carriage returns) into DW's DESIGN view. DW displays my high-ASCII characters just fine in DESIGN view, and writes the proper HTML code for the character into the source code (which is where I mostly work in DW).
    I've been doing it this way for years (first with GoLive, and then with DW CS4) and never encountered any problems until this week, when I upgraded to DW CS5.
    But the problem I'm having may be somewhat different than what others have complained of here.
    In my case, some high-ASCII (above 128) characters convert to HTML just fine, while others do not.
    E.g., en and em dashes in my cut-and-paste text show as such in DESIGN mode, and the right entries
        &ndash;
        &mdash;
    turn up in the source code. Same is true for the ampersand
        &amp;
    and the copyright symbol
        &copy;
    and for such foreign letters as the e with acute accent (ALT+0233)
        &eacute;
    What does NOT display or code correctly are the typographic quotes. E.g., when I paste in (or special paste; it doesn't seem to make any difference which I use for this) text with typographic double quotes (ALT+0147 for open quote mark and ALT+0148 for close quote mark), which should appear in source code as
        &ldquo;[...]&rdquo;
    DW strips out the ASCII encoding, displaying the inch marks in DESIGN mode, and putting this
        &quot;[...]&quot;
    in my source code.
    The typographic apostrophe (ALT+0146) is treated differently still. The text I copy & paste into DW should appear as
        [...]&rsquo;[...]
    in the source code, but instead I get the foot mark (both in DESIGN and CODE views):
    I've tried adjusting the various DW settings for "encoding"
        MODIFY > PAGE PROPERTIES > TITLE/ENCODING > Encoding:
    and for fonts
        EDIT > PREFERENCES > FONTS
    but switching from "Unicode (UTF-8)" to "Western European" hasn't solved the problem (probably because in my case many of the higher ASCII characters convert just fine). So I don't think it's the encoding scheme I use that's the problem.
    Whatever the problem is, it's caused me enough headaches and time lost troubleshooting that I'm planning to revert to CS4 as soon as I post this.
    Deborah
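Where the editor no longer does the conversion, the text can be run through a small script before pasting. A minimal sketch using Python's standard `html.entities` table; the function name is hypothetical, and it assumes the input is already a Unicode string rather than raw bytes.

```python
from html.entities import codepoint2name


def to_named_entities(text):
    """Replace every non-ASCII character with a named (or numeric) HTML entity."""
    out = []
    for ch in text:
        code = ord(ch)
        if code < 128:
            out.append(ch)                           # plain ASCII passes through
        elif code in codepoint2name:
            out.append(f"&{codepoint2name[code]};")  # e.g. U+201C -> &ldquo;
        else:
            out.append(f"&#{code};")                 # no HTML 4 name: numeric fallback
    return "".join(out)
```

ASCII characters such as `&` and `"` are deliberately left alone here, so existing markup in the text survives; escaping those would be a separate step.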

  • Converting non-ascii characters generated by MS word

    Hello,
    I've encountered some files that were originally exported from MS Word as html. The problem is they contain some characters that fall into the 128 to 255 range. Some appear to be fancy quotes and apostrophes, but others I just can't figure out. On a mac or Firefox on windows they appear as:
    Ö ë í ì î ñ ô † © Æ ∑ ∆ “ ÷ › · Î Ï Ì Ó Ô Ò Ù
    The decimal values of the above chars are:
    133 145 146 147 148 150 153 160 169 174 183 198 210 214 221 225 235 236 237 238 239 241 244
    As character entities they appear as:
    … ‘ ’ “ ” – ™ © ® · Æ Ò Ö Ý á ë ì í î ï ñ ô
    Before I try to reinvent a square wheel, I thought I'd ask here if anyone knows of an existing command line tool that might help with this.
    Cole
    15 PB   Mac OS X (10.3.9)  

    Thanks for all the replies. I think I've solved the problem. It indeed was a problem with high bit WinLatin1 (cp 1252) characters. Here's a technote that discusses the problem. So I wrote a short perl script based on this table:
        #!/usr/bin/perl -wpi
        # Unicode code points for the Windows-1252 bytes 128 through 159.
        # Undefined characters are marked as 0.
        my @uni = (
            8364, 0, 8218, 402, 8222, 8230, 8224, 8225,
            710, 8240, 352, 8249, 338, 0, 381, 0, 0,
            8216, 8217, 8220, 8221, 8226, 8211, 8212,
            732, 8482, 353, 8250, 339, 0, 382, 376
        );
        # Bytes 128 through 159 map to scattered Unicode code points,
        # so look them up in @uni. Undefined bytes in this range are deleted.
        s/([\x80-\x9f])/ $uni[ord($1)-128] ? sprintf("&#%d;", $uni[ord($1)-128]) : ""/eg;
        # Bytes 160 through 255 can be used as is.
        s/([\xa0-\xff])/sprintf("&#%d;", ord($1))/eg;
    I only hope that perl is clever enough to not create the @uni array for each line. Anyone happen to know?
    Thanks for any tips.
    Cole
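For comparison, the same byte-to-entity conversion can be sketched in Python, where the standard `cp1252` codec already holds the lookup table. This is a rough equivalent rather than a port of the script above; the function name is hypothetical.

```python
def cp1252_to_entities(raw):
    """Convert Windows-1252 bytes to ASCII text with numeric HTML entities."""
    out = []
    for byte in raw:
        if byte < 128:
            out.append(chr(byte))  # ASCII passes through unchanged
            continue
        try:
            # The cp1252 codec maps each high byte to its Unicode code point.
            code = ord(bytes([byte]).decode("cp1252"))
        except UnicodeDecodeError:
            continue  # undefined bytes (0x81, 0x8D, 0x8F, 0x90, 0x9D) are dropped
        out.append(f"&#{code};")
    return "".join(out)
```

As in the Perl version, bytes that Windows-1252 leaves undefined are deleted rather than passed through.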
