Convert "html to xml"

Does anyone know if HomeSite or Dreamweaver has the ability
to convert HTML documents to XML documents? This would save alot
instead of buying another program that does this.
PG

does htmltidy do the conversion you need?
homesite integrates with it, though htmltidy itself is open
source.

Similar Messages

  • Is it possible to convert HTML to XML in as2?

    Hi,
    I am trying to develop a stock widget to use in my mobile using flashlite 2.x.
    I have chosen "http://charting.nasdaq.com" to obtain 5days chart /6months chart..
    But I am not able to use XML object for this and parse.. The values are in table format.. So I have to use LoadVars.. but writing a AS2 code
    to obtain values like High/Low price of the stock or volume is very lengthy since the data obtained are not in XML format..
    i tried this
    var example_xml:LoadVars = new LoadVars();
    example.ignoreWhite = true;
    example_xml.load("http://charting.nasdaq.com/ext/charts.dll?2-1-14-0-0-75-03NA000000www-&SF:4|5-WD=539-HT=39 5--XTBL-");
    example_xml.onLoad = function(success) {
         if (success == true) {
              trace(example_xml);
    Is it possible to convert them to xml and use relevent methods or xml properties?
    If I use example_xml:XML = new XML() then tracing example_xml (as above) doesn't give any data
    Any help?

    I had a project a few years ago where I converted 10s of thousands of TestStand HTML files from TS 1.0 and 3.0.
    Most of what I did is specific to our internal data structures, but I'll try to help get you started.
    The general idea is to pick out the report header and data sections from each file, then eliminate the HTML formatting.
    The attached file will extract the Report Header and data sections.
    To use this, you will need to download this additional VI that I posted here in the regex board a few years ago.
    http://forums.ni.com/t5/BreakPoint/Regular-Expressions-Board/m-p/1269088#M14343
    Here is one more file that will parse out each data record and dtermine it's Name, status, type value, limits and duration/timestamp info
    Now is the right time to use %^<%Y-%m-%dT%H:%M:%S%3uZ>T
    If you don't hate time zones, you're not a real programmer.
    "You are what you don't automate"
    Inplaceness is synonymous with insidiousness
    Attachments:
    Parse TestStand HTML.vi ‏42 KB
    Parse TestStand HTML Data Section.vi ‏55 KB

  • Converting HTML to XML

    Does any one know if there is a
    plug-in for HomeSite to convert HTML documents to XML
    documents? This would save alot instead of buying another program
    that does this. Because Adobe just finished a online tutorial on
    how to use AJAX which made it very easy. Now I need to switch my
    HTML docs to XML.
    PG

    In my previous post I have added a singleton tag img with the closing of it.
    Normally the tags which are singleton tags in HTML can beclosed by using /> at the ned of the tag
    Like following:
    <img src="abcd.gif" />
    <input type="button" />
    However, I agree with your point about Quotes. In HTML it is not mandatory to put quotes around a value.
    But if it is possible then you  can make it a habit to put quotes around all the HTML attribute values.
    [This is also a standard practice]

  • How to convert HTML into XML

    I know I can parse XML into some HTML, but is there any tools or methods existed to parse HTML into XML?
    I have a not well-formed HTML with a lot data fields, including a lot not closed tags. This HTML is generated by some XML(as I can see), but I can't find a way to reform it into a XML, and eventually stored the data into another database.
    Anyone can help me? I appreciate!
    KIB

    As SAm has told you, you can use jTidy, for the purpose, a sample code , which can convert an html file to xml file is given at following url:
    see the documentation as well.
    http://sourceforge.net/docman/display_doc.php?docid=1298&group_id=13153
    gaurav_k1

  • HTML to XML converter

    please who knows where one can download a java HTML-to-XML converter class where all that is needed is to supply any http link and it will output XML to the outputstream or whatever
    thanks

    You must realize that there is no possible way all valid HTML can be made into valid (well-formed) XML - right?
    html can have over lapping tags (not real tags here, but you'll see):
    <tag1>
    <tag2>
    <tag1>
    <tag2>
    That's valid html, but totally invalid xml (xml doesn't let you overlap tags).
    If you're using XHTML, then your html is already xml.
    If you're going from XML to HTML, then you can use XSTL; but it won't work in the other direction.

  • Converting PDF Files to Html or Xml

    how can i tranfrom the pdf file to html or xml using Acrobat's API? The software already have the function(http://tv.adobe.com/watch/learn-acrobat-x/converting-pdf-files-to-other-file-formats/). In C# ,I can use the acrobat's dll open the pdf file  and  can invoke the  MenuItem SaveAs;
    like this:
                AcroApp.Show();
                AcroAVDoc.Open(@"D:\xpdf\a.pdf","aaaa");
                AcroApp.MenuItemExecute("SaveAs");
                AcroApp.CloseAllDocs();
                AcroApp.Exit();
    But this is not automatic.

    Try the forum for Acrobat SDK.

  • Export pdf to html/txt/xml

    Hi,
    I downloaded "adobe acrobat x pro" for trying the "save as"/export functionality to xml/htm/text etc. and the result was exactly what I was looking for in terms of output, keeping formatting etc.
    However, I am building an application which need to have an embeded library in order to do pdf to html/txt/xml conversion on the fly keeping formatting.
    I have tried a number of libraries for pdf to html/txt/xml conversion an none of them deliver anything near what adobe acrobat x pro does in terms om keeping format/tables etc.
    So, my question is how can I get access to the "save as"/export functionality in adobe acrobat x pro in any official adobe library, sdk, service, product etc. since I assume acrobat x pro does not expose any api for convert functionality or may be used serverside?
    Best regards,
    Rick

    It sounds like you want to use Acrobat as a web service. Rather than pursue this route, you may want to note that such a use of Acrobat is not permitted under the license. Thus it may not worth pursuing. Why convert to HTML is a possible question anyway, at least on a regular basis? On occasions I can understand the need.
    For programmable features you should probably check in the SDK forum.

  • HTML to XML Conversion ?

    Developed a content presentation java servlet implmenting xmlparser2.jar classes, works well. We're storing content (in XML) format as blob, then using parser we are able to do the transformation of the xml file to HTML for presentation.
    stream = null;
    String result = null;
    URL URLStream = new URL(xmlIn);
    ByteArrayOutputStream xbaos = new ByteArrayOutputStream();
    if(mStylesheet.startsWith("http"))
    stream = getURLInputStream(mStylesheet);
    else
    stream = new FileInputStream(mStylesheet);
    XSLProcessor processor = new XSLProcessor();
    DOMParser parser = new DOMParser();
    parser.setValidationMode(false);
    parser.setPreserveWhitespace(true);
    parser.parse(in);
    xdoc = parser.getDocument();
    XSLStylesheet xss = new XSLStylesheet(stream, URLStream);
    processor.processXSL(xss, xdoc, xbaos);
    result = xbaos.toString();
    parser.reset();
    return result; -- HTML conversion
    We are evaluating using xslt to convert the XML to a form based medium for content maintenance. Wondering if once a XML document is parsed to HTML (DOM) can it be parsed back to XML for subsequent update to stored value in blob column. Specifically interested in conversion (parser) from HTML to XML
    Simply can HTML (in DOM format validated against a xsd) be transformed back to XML ?

    Do you know of a method in the xdk that takes a well formed HTML doc and using xsd / xslt convert back to original xml spec?
    Because you created (and as long as you create) the HTML from XML it will be well formed (every tag will be ended with an end-tag) and you can therefore transform it back into XML.
    Most times it will not be possible to convert HTML found on the 'internet' into XML because this HTML is not well formed. For example, many people forget to end a paragraph of text within HTML with the </p> tag.
    We are evaluating using xslt to convert the XML to a form based medium for content maintenance. Wondering if once a XML document is parsed to HTML (DOM) can it be parsed back to XML for subsequent update to stored value in blob column. Specifically interested in conversion (parser) from HTML to XML
    Simply can HTML (in DOM format validated against a xsd) be transformed back to XML ?

  • Converting audio to xml form

    hi
    i need to know how to convert audio to xml form.. can any one help me on this...

    take a look at this article:
    http://www.javaworld.com/javaworld/javatips/jw-javatip117.html?tip

  • [Q] convert DOM to XML Document

    Hi,
    Is there any class or library which convert DOM to XML document?
    ----- java program ------------
    import org.w3c.dom.*;
    import org.apache.crimson.tree.XmlDocument;
    public class Sample {
      public static void main(String args[]) {
        // create a document and root element
        Document doc = new XmlDocument();
        Element root = doc.createElement("html");
        doc.appendChild(root);
        // append a data
        Element body = doc.createElement("body");
        root.appendChild(body);
        body.appendChild(doc.createTextNode("Hello"));
        // convert DOM to XML Document
    }----- expected result ----------
    <html>
    <body>
      Hello
    </body>
    </html>Could you help me?

    Hi,
    Look at the package javax.xml.transform
    or
    try the following code:
    I tried the following code.
    ----- java program ------------
    import javax.xml.parsers.*;
    import org.w3c.dom.*;
    import javax.xml.transform.*;
    import javax.xml.transform.dom.*;
    import javax.xml.transform.stream.*;
    import java.io.OutputStream;
    public class Sample {
      public static void main(String[] args) {
        try {
          // create a document and root element
          Document doc = DocumentBuilderFactory.newInstance()
                                .newDocumentBuilder()
                                .newDocument();
          Element root = doc.createElement("html");
          doc.appendChild(root);
          // append a data
          Element body = doc.createElement("body");
          root.appendChild(body);
          body.appendChild(doc.createTextNode("Hello"));
          Element ul = doc.createElement("ul");
          String[] list = {"foo", "bar", "baz"};
          for (int i=0; i<list.length; i++) {
           Element li = doc.createElement("li");
           li.appendChild(doc.createTextNode(list));
         ul.appendChild(li);
    body.appendChild(ul);
    // convert DOM to XML Document
    TransformerFactory factory = TransformerFactory.newInstance();
    Transformer transformer = factory.newTransformer();
    //OutputStream stream = new FileOutputStream("output.xml");
    OutputStream stream = System.out;
    transformer.transform(new DOMSource(doc), new StreamResult(stream));
    } catch (Exception ex) {
         ex.printStackTrace();
    ----- result ------------
    <html>
    <body>Hello<ul>
    <li>foo</li>
    <li>bar</li>
    <li>baz</li>
    </ul>
    </body>
    </html>Great! Very thanks!

  • JTIDY Html to XML convertor

    Hi this is nauman , and i want a tool or API which can convert or help to convert Html directly into the XML .. so plz tell me about this tool JTIDY, whether it converts HTML directly into XML or not..
    Regards.
    Nauman

    Hi nauman, let me introduce you to Google. You can use it to search the Internet. Go there and type the word JTIDY in the box and click on search.

  • Html to Xml translation

    Hi!
    Are there any good 'ready-made' translators out there?
    I'm parsing an xml feed consisting of a lot of elements with CDATA-sections.
    Stuff like:
    <p>
    </br> is common.
    I need to translate the html entities to xml entities since I will put the data in a new xml file - without CDATA
    sections.
    Well, best case scenario would be to translate. Second best would be to strip out the html entities.
    /best regards, Håkan Jacobsson - System developer in Stockholm, Sweden

    There are tools for converting HTML to XHTML out there, yes. Some of them are written in Java and some aren't but I'm not sure that should matter to you.
    I didn't understand what the part about CDATA meant, but generally parsers will convert CDATA sections to the equivalent text elements without even having to be asked.

  • How to convert HTML files to PDF files.

    Hi,
    I have an urgent requirement explained below:
    I need to convert my HTML files to PDF file from JAVA program.
    Please help me with your suggestions.
    Your help is highly appreciated.
    Thanks &Regards
    Ramavardhan

    Try FOP and XSL style sheets. Although RPWithey is
    correct that there is nothing to magically change a
    viewable HTML file to PDF you can convert a single
    HTML file to XML and use FOP to create a PDF. You will
    need to manage the transition though.
    If you want to change a random HTML file into PDF
    dynamically this would take quite a bit more work. You
    might be able to use JTidy to convert the HTML to XML
    and then FOP to get to PDF but it will likely not be
    formatted the way you wish.That's true - FOP does do that, doesn't it. It must have to make certain assumptions about the appearance of the output. If the output from the stylesheet is an HTML file, you might be able to bypass that bit altogether. I haven't really looked at FOP, although I've read the overview. Check it out at http://xml.apache.org/fop/index.html.
    RObin

  • Convert HTML to PDF or AFP

    As part of the project we have to convert html documents to PDF or AFP. We tried with different tools like HTMLDOC and we are not able to get the perfect matching tool. Any help on finding the best tool for conversion of HTML to PDF or AFP will be appreciated.
    My basic requirement is
    1) The conversion process needs to be automated
    2) the tool has to run on Linux.
    3) Everything in the page (text, image etc) should be extracted in a single file
    Background
    A batch job which runs on Q&R cache servers to run every evening. The job has a list of 1500 symbols which iterates through and does an http get of the Stock Summary page for each ticker in the list. The next step is to launch HTMLDOC or another tool to convert to PDF, APF, or other format.
    Regards,
    Jags.

    I'm not sure it'll help you, but take a look at
    http://xml.apache.org/fop/index.html
    maybe you can go this way
    XHTML->XML->FOP->PDF
    ???

  • Help converting html/xhtml to pdf with bfo pdf lib or other tools?

    Hi,
    I need to convert from xhtml to pdf. I tried parsing it with xerces and crimson but wouldnt work. So the samples coming with big faceless pdf library dosent work. Is there any other method to convert html/xhtml to pdf - any libraries for direct conversion. BFO pdf report generator takes xml as i/p to convert to pdf.
    Thanks for any help

    Look here:
    http://www.adobe.com/sea/products/server/pdfgenerator/

Maybe you are looking for