HTML to speech converter

I want to implement an HTML-to-speech converter. I have already implemented a text-to-speech converter, but I don't know how to proceed further. Please help me.

HTML is closely related to XML, so an XML parser is a reasonable starting point.
However, a lot of HTML web pages are not well-formed XML and may fail to parse.
A while back, I remember reading about "HTML Tidy", which takes an HTML file that may not be well-formed XML and cleans it up so that it is.
From there, you can parse it as XML.
(There is plenty of documentation on parsing XML in Java.)
Working out which tags within the HTML file contain the text you want to output is another matter. There will be plenty of JavaScript, comments, and other tags that complicate matters.
regards,
Owen
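
To make the last step concrete, here is a minimal Java sketch of pulling speakable text out of markup that has already been cleaned up into well-formed XML. It uses only the JDK's built-in DOM parser. The class name SpeakableText and its extract method are made up for illustration, and the sketch assumes the input has no DOCTYPE and that skipping script and style elements is enough filtering for your pages.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper, not part of any library.
class SpeakableText {

    /** Parses well-formed XHTML and returns the text a speech engine could read. */
    static String extract(String xhtml) {
        try {
            DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
            Document doc = builder.parse(new InputSource(new StringReader(xhtml)));
            StringBuilder out = new StringBuilder();
            collect(doc.getDocumentElement(), out);
            // Collapse runs of whitespace left over from the markup layout.
            return out.toString().replaceAll("\\s+", " ").trim();
        } catch (Exception e) {
            throw new IllegalArgumentException("Input is not well-formed XML", e);
        }
    }

    /** Walks the tree depth-first, keeping text and skipping script, style and comments. */
    private static void collect(Node node, StringBuilder out) {
        switch (node.getNodeType()) {
            case Node.TEXT_NODE:
            case Node.CDATA_SECTION_NODE:
                out.append(node.getNodeValue()).append(' ');
                break;
            case Node.ELEMENT_NODE:
                String name = node.getNodeName().toLowerCase();
                if (name.equals("script") || name.equals("style")) {
                    return; // code and styling are not meant to be spoken
                }
                NodeList children = node.getChildNodes();
                for (int i = 0; i < children.getLength(); i++) {
                    collect(children.item(i), out);
                }
                break;
            default:
                break; // comments and processing instructions are ignored
        }
    }

    public static void main(String[] args) {
        String page = "<html><body><!-- hidden --><h1>News</h1>"
                + "<script>var x = 1;</script><p>Read <b>this</b> aloud.</p></body></html>";
        System.out.println(extract(page));
    }
}
```

The returned string can then be handed to whatever text-to-speech code you already have. Appending a space after every text node keeps words from adjacent elements apart, at the cost of splitting a word that is only partly wrapped in a tag.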

Similar Messages

  • Html to text converter

    I want to implement an HTML-to-speech converter, and for that I need an HTML parser. How do I implement an HTML parser? Please send me any link related to it.

    It looks like you are experiencing a character encoding mismatch. Your browser can use one default character encoding while your conversion uses another.
    It looks like you are doing your conversion with the UTF-8 encoding and then displaying the result with a different encoding. That is why you need to explicitly set your character encoding to UTF-8 so the characters are displayed properly.
    You can either specify a character encoding in your page, as you are doing, or you can force your converter to use the character encoding of your choice, such as ISO-8859-1, as follows:
    htmlstring.getBytes("ISO-8859-1");
    Thanks,
    Justyna
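
    A small Java sketch of the mismatch described above: text is encoded to bytes with one charset and decoded with another. The class name EncodingDemo and the roundTrip helper are invented for this illustration; only String.getBytes, the String(byte[], Charset) constructor and StandardCharsets come from the JDK.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class, not taken from the thread.
class EncodingDemo {

    /** Encodes text with one charset and decodes the resulting bytes with another. */
    static String roundTrip(String text, Charset encodeWith, Charset decodeWith) {
        byte[] bytes = text.getBytes(encodeWith);
        return new String(bytes, decodeWith);
    }

    public static void main(String[] args) {
        String original = "caf\u00e9"; // "cafe" with an acute accent on the e

        // Same charset on both sides: the text survives.
        String ok = roundTrip(original, StandardCharsets.UTF_8, StandardCharsets.UTF_8);
        System.out.println(original.equals(ok)); // prints true

        // UTF-8 bytes read as ISO-8859-1: the accented letter turns into two characters.
        String garbled = roundTrip(original, StandardCharsets.UTF_8, StandardCharsets.ISO_8859_1);
        System.out.println(original.equals(garbled)); // prints false
    }
}
```

    Whichever charset you pick, the point is that the side that produces the bytes and the side that displays them have to agree on it.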

  • Text to speech converter

    Hello,
    I'm doing a project on a text-to-speech converter using Java and I'm very confused. What speech engine should I use, and how do I use it with Java? I've downloaded the javax package, but when I run the Hello World sample code from the Java Speech API, it doesn't work: I get compile errors saying that package javax.speech does not exist.
    Please, I badly need help to clear up my confusion.
    thanks
    muskaan

    I would like to add a feature to my website http://www.artikelpedia.com, which is in German, so that pages can be translated online from German to English. I have some pages now in English: http://www.artikelpedia.com/artikel/englisch/9/englisch5.php German: http://www.artikelpedia.com/artikel/deutsch/27/deutsch2.php and French: http://www.artikelpedia.com/artikel/kunst/2/kunst5.php
    but I would like to offer my visitors the ability to translate these pages from one language to another.
    For any suggestions about price and other facilities for using Java for this, please reply.

  • HTML to XML converter

    Does anyone know where one can download a Java HTML-to-XML converter class, where all that is needed is to supply any HTTP link and it will write XML to an output stream or similar?
    Thanks

    You must realize that there is no possible way all HTML that browsers accept can be made into well-formed XML, right?
    HTML can have overlapping tags (not real tags here, but you'll see):
    <tag1>
    <tag2>
    </tag1>
    </tag2>
    Browsers will render that HTML, but it is totally invalid XML (XML doesn't let you overlap tags).
    If you're using XHTML, then your HTML is already XML.
    If you're going from XML to HTML, then you can use XSLT; but it won't work in the other direction.
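
    The overlap problem is easy to check with the XML parser that ships with the JDK. The sketch below is only an illustration: the class name WellFormedCheck and its isWellFormed method are made up, and it treats any SAX parse failure as "not well-formed".

```java
import java.io.IOException;
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper, not part of any library.
class WellFormedCheck {

    /** Returns true if the markup parses as XML, false if the parser rejects it. */
    static boolean isWellFormed(String markup) {
        try {
            DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
            // DefaultHandler rethrows fatal errors instead of printing them to stderr.
            builder.setErrorHandler(new DefaultHandler());
            builder.parse(new InputSource(new StringReader(markup)));
            return true;
        } catch (SAXException e) {
            return false; // the parser refused the input
        } catch (IOException | ParserConfigurationException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        System.out.println(isWellFormed("<p><b><i>text</b></i></p>")); // prints false
        System.out.println(isWellFormed("<p><b><i>text</i></b></p>")); // prints true
    }
}
```

    A converter therefore has to repair the nesting first (which is what tidy-style tools do) before any XML tooling can be applied.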

  • Sorry about first try : how to avoid html-text tag converting & to & amp;

    subject
    How to avoid the html:text tag converting "&" to "& amp;"?
    body
    Hi,
    I have some values in the DB like "& #351;", and when I use html:text to
    show the binding's value, html:text converts "&" to "& amp;". In the generated
    HTML, it looks like "& amp;#351;".
    How can I avoid this conversion?
    Thanks...
    Ayhan Güngör
    Note: I put white space inside the special character sequences because otherwise the browser renders them (e.g. "& amp;" becomes "&").

    Hi, I use the property attribute of html:text.
    The property is declared in the UIModel XML file.
    I mean, I don't use something like
    <html:text value="data"/>
    I use just
    <html:text property="bindingName"/>
    and the value is shown in the generated HTML input tag as its value attribute.
    html:text has no attribute like filter.
    I think I should override the html:text tag and create a new tag that checks whether the value includes data of the "& #351;" type. If it does, it would not convert "&" to "& amp;".
    Any suggestions?
    Thanks...
    Ayhan
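
    The check described above could look roughly like the following Java sketch. It is not Struts API: AmpersandEscaper and escape are hypothetical names for a helper that a custom tag might call, and it assumes the only sequences to protect are numeric character references such as &#351; or &#x15F; (written here without the extra spaces used in the posts).

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of Struts.
class AmpersandEscaper {

    // Matches an '&' that does NOT start a decimal or hexadecimal character reference.
    private static final Pattern BARE_AMPERSAND =
            Pattern.compile("&(?!#[0-9]+;|#[xX][0-9A-Fa-f]+;)");

    /** Escapes bare ampersands and leaves numeric character references untouched. */
    static String escape(String value) {
        return BARE_AMPERSAND.matcher(value).replaceAll("&amp;");
    }

    public static void main(String[] args) {
        System.out.println(escape("Tom & Jerry"));
        System.out.println(escape("&#351;eker & bal"));
    }
}
```

    Note that this only deals with ampersands. A real tag would still have to escape characters such as <, > and quotes, and named references like &amp; would be escaped a second time by this pattern.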

  • HTML to PDF convertion tool

    Hi.
    I need a tool that lets me convert HTML pages to PDF documents. It would be great if I could find a Java library for that (free or paid). Does anyone know of this kind of tool? Once again: I'm interested in HTML-to-PDF conversion, NOT in creating PDFs from pure Java code.
    Any help would be great...

    I don't think that you'll find a native HTML to PDF converter for Java. If you want something Windows-based, I think you can take the HTML to PDF converter library for .NET from http://www.dotnet-reporting.com or from http://www.winnovative-software.com and build an ASP.NET 2.0 web service that you can then call from your Java application.
    All the conversion can be done in a few lines of C# code:
           // Create the PDF converter. Optionally you can specify the virtual browser
            // width as parameter. 1024 pixels is default, 0 means autodetect
            PdfConverter pdfConverter = new PdfConverter();
            // set the license key
            pdfConverter.LicenseKey = "P38cBx6AWW7b9c81TjEGxnrazP+J7rOjs+9omJ3TUycauK+cLWdrITM5T59hdW5r";
            // set the converter options
            pdfConverter.PdfDocumentOptions.PdfPageSize = PdfPageSize.A4;
            pdfConverter.PdfDocumentOptions.PdfCompressionLevel = PdfCompressionLevel.Normal;
            pdfConverter.PdfDocumentOptions.PdfPageOrientation = PDFPageOrientation.Portrait;
            pdfConverter.PdfDocumentOptions.ShowHeader = false;
            pdfConverter.PdfDocumentOptions.ShowFooter = false;
            // set to generate selectable pdf or a pdf with embedded image
            pdfConverter.PdfDocumentOptions.GenerateSelectablePdf = selectablePDF;
            // Performs the conversion and get the pdf document bytes that you can further
            // save to a file or send as a browser response
            byte[] pdfBytes = pdfConverter.GetPdfFromUrlBytes(urlToConvert);

  • Preserving HTML links when converting Word docs

    I created a document in MS Word for Mac 2011 (ver 14.5.2)
    When I used Adobe Pro 11 to convert the document into pdf format, all the HTML links are lost. 
    I have tried converting from MS Word, opening the word version with Adobe Pro, printing from MS Word to pdf.  Nothing works. Even opened the MS Word document using Apple's Pages software, re-formatted and then converted to pdf.  Still no live links.
    Best workaround so far is to convert and then use the edit feature in Adobe Pro to re-insert the HTML links as invisible rectangles on top of the still-blue-and-underlined text. So to the user it looks like the HTML links are still live, but what a pain for the editor.
    I have seen this issue raised in other posts, but none of the answers seem to work. And the workaround described above is clearly less than ideal.
    Very curious: reading other posts, the issue apparently does not arise when the Word doc is converted on a Wintel computer. But I can't imagine Adobe writes software one way for Intel and another for Mac.

    Imagine it. This has been a 15-year feud between Microsoft and Adobe. Adobe claims that Mac Office doesn't have the proper hooks for URLs. Microsoft says the fault is with Adobe.
    If a Word-created file is opened in the Windows version and saved as a PDF, the links work just fine. Opening the file there and converting it to PDF without saving does nothing to the actual Word file.
    They had it fixed last year in Acrobat X if you dropped the file onto Acrobat directly, but broke it again with the upgrade to XI.
    If you have iWork, open the Word file in Pages, export it as a Word .docx file, and then create the PDF; the links will become active. Likewise, if you open the file in OpenOffice and export it as a .docx file, then open that file in Word and create the PDF, the links will become active.
    If you have neither, you will have to open the PDF and add the links. Note that the links will be hot (active), but they will not turn blue and be underlined.
    Mac Office 2011 is a conversion of the Office 2010/2007 code, so there should be no problem.
    Also, don't use the Save As... PDF method. Instead, go to the Print menu, click on PDF, wait for the context menu, then choose Adobe Quality PDF or Adobe PDF (which uses Adobe's PDF engine).
    Wait for the next screen, which shows the quality; leave it as set unless you need specific job options, and click OK. The next screen asks for the file name: rename as necessary, browse to the desired location, then click Save. Or you can drop the saved Word document (with Word quit) onto Acrobat, and after a minute or so the PDF will be created. (Using this method in Acrobat X would actually show URLs or mailtos as active. They broke this in Acrobat XI: the PDF can still be created, but hot links no longer work.)
    Well, it seems it does work on occasion; see: http://www.screencast.com/t/cib2kcYG

  • Import Html text and convert to InDesign tags

    Hello everybody,
    The script receives JSON with text (HTML markup: headers <h1>, <b></b>, newline characters, ...).
    I have to save the converted text into a temp text file and then call .place() on a text frame. That's the only way it can recognise the new tags.
    // text and contentArea are defined previously.
    var convertedText = convertTags(text);
    var convertedTextFile = new File(FILE_PATH + "/Temp/convertedTextFile.txt");
    convertedTextFile.open("w");
    convertedTextFile.write(convertedText);
    convertedTextFile.close();
    contentArea.place(convertedTextFile);
    function convertTags(htmlTaggedText) {
        // Regular expressions with the g flag replace every occurrence, not just the first.
        var inDesignTaggedText = "<ASCII-MAC>\n" + htmlTaggedText.replace(/<h1>/g, "<ParaStyle:Heading 1>").replace(/<\/h1>/g, "");
        return inDesignTaggedText;
    }
    My general question is whether there is a more elegant solution to this.
    Thanks in advance

  • How to Export InDesign Interactivities into HTML 5 without converting it through any tool?

    I created some simple InDesign interactive elements such as drag and drop, text input fields, drawing, multiple check boxes, radio selection, drop-downs, etc.
    When I export these interactive elements to HTML, I get an HTML page with the interactivity in a Flash file. I expected the interactivity as JavaScript and CSS files. I also looked at other suggestions on the Adobe InDesign forum, such as the ones below:
    1. http://forums.adobe.com/message/4962514
    2. http://forums.adobe.com/message/4447318#4447318
    3. http://forums.adobe.com/message/4907133#4907133
    They suggested exporting the interactivity to .fla/.swf and converting it to JavaScript and CSS using online tools like Wallaby or Swiffy.
    Is there any direct way in InDesign to get proper HTML output with JavaScript and CSS that is also compatible with devices like the iPad, iPhone, smartphones, and other mobile devices?

    They suggested exporting the interactivity to .fla/.swf and converting it to JavaScript and CSS using online tools like Wallaby or Swiffy.
    Last I checked, Swiffy and Wallaby have limited ActionScript capabilities; they work best with simple timeline animations. I doubt you'd have much success converting ID .swf exports.

  • HTML to PDF converter

    Hi All,
    Is there any tool that we can use with ColdFusion to generate PDFs from an HTML page or web page?
    I have seen many toolkits where, to create a PDF from a web page, we need to click "find" in the browser and select the tool, and the PDF is then created.
    Instead, can we use an API for that and implement the same functionality ourselves?
    All I need is a link or button on my web page; when the user clicks it, a PDF should be generated with the current page as the content.
    I hope my question is clear.
    Your help is highly appreciated.
    Thanks in advance!

    In your current pages (assuming they are .cfm):
    <form action="makepdf.cfm" method="post">
    <input type="hidden" name="makeapdf" value="yes">
    <cfoutput><input type="hidden" name="desiredpage" value="#ListLast(script_name, "/")#"></cfoutput>
    <!--- Forward any form fields this page received, once each --->
    <cfset FormVariables = "">
    <cfif IsDefined("Fieldnames")>
        <cfloop index="variable" list="#Fieldnames#">
            <cfif ListFind(FormVariables, variable) is 0>
                <cfoutput>
                    <input type="hidden" name="#variable#" value="#Form[variable]#">
                </cfoutput>
                <cfset FormVariables = ListAppend(FormVariables, variable)>
            </cfif>
        </cfloop>
    </cfif>
    <input type="submit" value="Create as PDF">
    </form>
    <!--- and on makepdf.cfm --->
    <cfif isdefined("makeapdf")>
        <cfdocument format="pdf" pagetype="letter" pagewidth="8.5" pageheight="11">
            <cfinclude template="#desiredPage#">
        </cfdocument>
    </cfif>

  • Html to struts converter

    Are there any tools to convert html to struts?
    Thanks

    This is like asking "How to convert a Volkswagen Beetle to a Porsche 911?".
    Just learn Struts and write it yourself.

  • Requirements for HTML to get converted into masterpage by Design Manager

    Hi All, I am a new SharePoint developer. I am not a designer and don't know much about CSS. I am getting the HTML from outside. We have prepared the wireframe in Visio showing how the top and bottom should look. My question is: is there anything that I need to give to the designer, who doesn't know SharePoint? Are there any guidelines for preparing the HTML? Should I give them a SharePoint CSS chart or something similar? Or would it be just plain basic HTML and CSS that Design Manager converts by itself?
    Please advise.
    Regards,
    Khushi

    Khushi
    If your designer has not used SharePoint before, then the only thing you need to tell him/her is to take care of the HTML/XML validation. The HTML should be W3C compliant. They can use any online tool to validate the HTML; the W3C offers a validation tool as well. All tags should have ending tags. Once you have the validated HTML, SharePoint will take care of the rest. Design Manager will convert your default HTML file into a master page. The remaining HTML pages will have to be converted into SharePoint pages manually (create pages in the Pages library). Ask your designer to lay out the site using three sections: header, content, and footer. It will make your life much easier if you have these three sections in the master page. Then all you will need to do is add content in the content area of each of the remaining site pages.
    Btw, I have written a complete chapter on this topic in my new book on SharePoint (See my signature below for link). It is available in Kindle edition also if you want to get an immediate access to it.
    Saifullah Shafiq
      Latest Book:
    A Practical Guide to SharePoint 2013

  • Using .docx-to-.html dashboard widget converter

    I've downloaded docx-converter.wdgt from the apple site in order to be able to read .docx files created by windows office 2007 and sent to me as attachments. The widget has installed fine on the dashboard. However, it seems that to use the widget, I need to drag the file onto the widget icon in the dashboard, and the dashboard fades when I click on the desktop to grab & drag the file icon. I've tried dragging the icon directly onto the dashboard icon with no luck.
    Thanks in advance for any advice.

    I've got the same problem. I found the PF12 method in the "how-to" on the converter's web page. However, when I push PF12 on my Mac, the "eject" symbol appeared, not the dashboard. Dashboard preferences are set to PF12 being dashboard. No helper keys helped. So, I set the Dashboard preferences temporarily to use the middle mouse button for Dashboard and then was able to get the file to drag to the widget. The conversion didn't work, however. I think that the .docx file I was sent isn't any good. Thanks Microsoft.
    JanG

  • Problem with converting html to pdf using LiveCycle ES Java API

    I am using this code to convert HTML to PDF.
    /*
     * JAR files used:
     * 1. adobe-generatepdf-client.jar
     * 2. adobe-livecycle-client.jar
     * 3. adobe-usermanager-client.jar
     * 4. adobe-utilities.jar
     * 5. wlclient.jar
     */
    import java.io.File;
    import java.util.Properties;
    import com.adobe.idp.Document;
    import com.adobe.idp.dsc.clientsdk.ServiceClientFactory;
    import com.adobe.idp.dsc.clientsdk.ServiceClientFactoryProperties;
    import com.adobe.livecycle.generatepdf.client.GeneratePdfServiceClient;
    import com.adobe.livecycle.generatepdf.client.HtmlToPdfResult;
    public class ConvertHTML {
        public static void main(String[] args) {
            try {
                // Set connection properties required to invoke LiveCycle ES
                Properties connectionProps = new Properties();
                connectionProps.setProperty(ServiceClientFactoryProperties.DSC_DEFAULT_EJB_ENDPOINT, "t3://localhost:7001");
                connectionProps.setProperty(ServiceClientFactoryProperties.DSC_TRANSPORT_PROTOCOL, ServiceClientFactoryProperties.DSC_EJB_PROTOCOL);
                connectionProps.setProperty(ServiceClientFactoryProperties.DSC_SERVER_TYPE, "WebLogic");
                connectionProps.setProperty(ServiceClientFactoryProperties.DSC_CREDENTIAL_USERNAME, "administrator");
                connectionProps.setProperty(ServiceClientFactoryProperties.DSC_CREDENTIAL_PASSWORD, "password");
                // Create a ServiceClientFactory instance
                ServiceClientFactory factory = ServiceClientFactory.createInstance(connectionProps);
                // Create a GeneratePdfServiceClient object
                GeneratePdfServiceClient pdfGenClient = new GeneratePdfServiceClient(factory);
                // Get an HTML document to convert to a PDF document
                String inputFileName = "http://www.adobe.com";
                //String inputFileName = "C:\\Documents and Settings\\venkat\\Desktop\\Adobe.htm";
                String securitySettings = "No Security";
                String fileTypeSettings = "Standard";
                System.out.println("one");
                // Convert HTML content to a PDF document
                HtmlToPdfResult result = pdfGenClient.htmlToPDF2(inputFileName, fileTypeSettings, securitySettings, null, null);
                System.out.println("two");
                // Get the newly created document
                Document createdDocument = result.getCreatedDocument();
                // Save the PDF document as a PDF file
                createdDocument.copyToFile(new File("C:\\test.pdf"));
            } catch (Exception e) {
                System.out.println("Error OCCURRED: " + e.getMessage());
                e.printStackTrace();
            }
        }
    }
    I am able to compile this class, but when I run it I get the error below.
    Error OCCURRED: Internal error.
    ALC-DSC-000-000: com.adobe.idp.dsc.DSCRuntimeException: Internal error.
            at com.adobe.idp.dsc.provider.impl.ejb.EjbMessageDispatcher.doSend(EjbMessageDispatcher.java:160)
            at com.adobe.idp.dsc.provider.impl.base.AbstractMessageDispatcher.send(AbstractMessageDispatcher.java:57)
            at com.adobe.idp.dsc.clientsdk.ServiceClient.invoke(ServiceClient.java:208)
            at com.adobe.livecycle.generatepdf.client.GeneratePdfServiceClient.htmlToPDF2(GeneratePdfServiceClient.java:666)
            at ConvertHTML.main(ConvertHTML.java:84)
    Caused by: java.rmi.RemoteException: Remote EJBObject lookup failed for 'ejb/Invocation'; nested exception is:
            org.omg.CORBA.COMM_FAILURE:   vmcid: SUN  minor code: 203  completed: No
            at com.adobe.idp.dsc.provider.impl.ejb.EjbMessageDispatcher.initialise(EjbMessageDispatcher.java:101)
            at com.adobe.idp.dsc.provider.impl.ejb.EjbMessageDispatcher.doSend(EjbMessageDispatcher.java:130)
            ... 4 more
    Caused by: org.omg.CORBA.COMM_FAILURE:   vmcid: SUN  minor code: 203  completed: No
            at com.sun.corba.se.impl.logging.ORBUtilSystemException.writeErrorSend(Unknown Source)
            at com.sun.corba.se.impl.logging.ORBUtilSystemException.writeErrorSend(Unknown Source)
            at com.sun.corba.se.impl.transport.SocketOrChannelConnectionImpl.writeLock(Unknown Source)
            at com.sun.corba.se.impl.encoding.BufferManagerWriteStream.sendFragment(Unknown Source)
            at com.sun.corba.se.impl.encoding.BufferManagerWriteStream.sendMessage(Unknown Source)
            at com.sun.corba.se.impl.encoding.CDROutputObject.finishSendingMessage(Unknown Source)
            at com.sun.corba.se.impl.protocol.CorbaMessageMediatorImpl.finishSendingRequest(Unknown Source)
            at com.sun.corba.se.impl.protocol.CorbaClientRequestDispatcherImpl.marshalingComplete1(Unknown Source)
            at com.sun.corba.se.impl.protocol.CorbaClientRequestDispatcherImpl.marshalingComplete(Unknown Source)
            at com.sun.corba.se.impl.protocol.CorbaClientDelegateImpl.invoke(Unknown Source)
            at com.sun.corba.se.impl.protocol.CorbaClientDelegateImpl.is_a(Unknown Source)
            at org.omg.CORBA.portable.ObjectImpl._is_a(Unknown Source)
            at weblogic.corba.j2ee.naming.Utils.narrowContext(Utils.java:126)
            at weblogic.corba.j2ee.naming.InitialContextFactoryImpl.getInitialContext(InitialContextFactoryImpl.java:94)
            at weblogic.corba.j2ee.naming.InitialContextFactoryImpl.getInitialContext(InitialContextFactoryImpl.java:31)
            at weblogic.jndi.WLInitialContextFactory.getInitialContext(WLInitialContextFactory.java:41)
            at javax.naming.spi.NamingManager.getInitialContext(Unknown Source)
            at javax.naming.InitialContext.getDefaultInitCtx(Unknown Source)
            at javax.naming.InitialContext.init(Unknown Source)
            at javax.naming.InitialContext.<init>(Unknown Source)
            at com.adobe.idp.dsc.provider.impl.ejb.EjbMessageDispatcher.initJndiContext(EjbMessageDispatcher.java:213)
            at com.adobe.idp.dsc.provider.impl.ejb.EjbMessageDispatcher.getJndiContext(EjbMessageDispatcher.java:226)
            at com.adobe.idp.dsc.provider.impl.ejb.EjbMessageDispatcher.initialise(EjbMessageDispatcher.java:87)
            ... 5 more
    Can you please suggest a way to do the conversion?

    Yes, sir. Thanks for your suggestion.
    But I didn't find an exact solution. I found some, but they did not work the way I required. I just need to convert HTML to PDF using the iText API for Java. I have already used some classes in it, like HTMLParser.
    So, anything else? Can anyone help me with this?

  • Use PHP to convert html page to a pdf file for email

    I want to use my PHP scripts to send an email (with a PDF attachment) of the web page that is being viewed.
    My problem is that I have no way to convert the HTML page to a PDF file without the end user being prompted to (1) name the file and (2) save it.
    Is there a program or a way I can save the HTML as an "auto_name.pdf" file on the server without user input?
    I have Adobe Acrobat 6.0 Professional on the server.

    There are lots of good HTML-to-PDF conversion tools for developers, both free and paid. I'd suggest taking a look at the PD4ML HTML-to-PDF converter: it has lots of features (CSS support, page layout presets, etc.) and can be obtained for free for non-commercial usage. There is also a topic on Stack Overflow with a list of such converters.
