Meaning of 'Text is HTML' option?

Illustrator CS3
iMac 10.4.11
2GB RAM
I have selected an object and created a slice via Object > Slice > Create from Selection. On selecting this slice & bringing up the Slice Options dialog and changing the Slice Type to 'No Image', I see a checkbox 'Text is HTML'. I wanted to know what does checking ON this checkbox do & how is it different from when it is unchecked (default)? This checkbox is also available for Text object slices created with the Object > Slice > Make command.

It means exactly what it says.If you check it the text will appear as a slice that has text like in any browser using the HTML version of text. it will be live that is i will be scaleable and selectable, you will be able to copy it to a text editor or to an graphic or word processing application and you will be able to edit it.
If you uncheck i you have a ordinary slice that is it will more then likely be a jpeg and any text in the sliced would be a part of the raster image file that is the slice.
One is a picture of the text and the other is actual type.

Similar Messages

  • How to add a value(text) from javascript to html:option tag .....

    How to populate the <html:option> tag dynamically
    I have a tree view, in tree view tag ( <layout:maeuItem> tag i have written "onclick " event )
    <layout:menuItem key="${agrovoc.AGURI}" link="#"
                                       onClick="return copyVal(this)">
    <script language="javascript" type="text/javascript">
                   function copyVal(obj){
                   str= obj.innerHTML; 
                   l = str.length-4;
                   xyz= str.substring(12,l);
                   document.getElementById('agrovoc').value =document.getElementById('agrovoc').value + xyz+'\n';
                   </script>Here "xyz" contains the Term ( the term clicked in the tree view) now i want add these terms ( clicked in tree view) into <html: option> tag.... how to do this.. plz give me some ideas
    <html:select  property="agrovocbuffer" size="10" >
                        <html:option value="" styleId="agrovoc1"></html:option>
                        </html:select>

    Hi,
    i have tried with this, it's works fine... but when i click on the next page ( i mean 2 or 3 page etc ) the content of <html:option> are erasing (deleting).. i wanted those terms wht i have selected in the 1st page to remain in <html:option> when i click on 2 nd page also...
    Tree view code
    <layout:menuItem key="${agrovoc.AGURI}" link="#"
                                       onClick=" copyVal(this)">JavaScript
    <script language="javascript" type="text/javascript">
                    var i=0;     
                    var myArray=new Array();
                   function copyVal(obj){
                        var flag=0;     
                   str= obj.innerHTML; 
                   l = str.length-4;
                   xyz= str.substring(12,l);
                   while (xyz != null) {
                   for( var l=0;l<myArray.length;l++){
                   if( myArray[l]==xyz){flag=1;alret("The Term is already selected.");}
                   if(flag!=1){
                   document.getElementById('agrovoc').options[i] =new Option(xyz,xyz);
                   myArray=xyz;
                   i++;
                   //alert(i);
                   xyz=null;
                   function del(){
                                  var oc=document.getElementById('agrovoc').options;
                                  //alert(oc.length);
                                  firstBox = document.getElementById('agrovoc');
                                  selectedOption = firstBox.options[firstBox.selectedIndex].value;
                                  selected = firstBox.options[firstBox.selectedIndex].index;
                                  //alert(selected);
                                  for( var l=0;l<myArray.length;l++){
                   if( myArray[l]==selectedOption){myArray[l]="";}
                                  firstBox.options[firstBox.selectedIndex] = null;
                                  var oc=document.getElementById('agrovoc').options;
                                  //alert(oc.length);
                                  for(var x=selected;x<oc.length;x++,selected++)
                                  document.getElementById('agrovoc').options[x] =new Option(firstBox.options[selected].value);
                                  i=oc.length;
                   </script>
    <html:select> code<div align="left" id="hopt">     
                   <td valign="top">
                   <logic:present name="hcount">
                        <html:select property="agrovocbuffer" size="15" styleId="agrovoc" multiple="true" style="width:200px;" >
                        <html:option value="" ></html:option>
                        </html:select><html:button property="button" onclick="del()"> DEL</html:button>
                   </logic:present>
                   </td>
                   </div>
    And all the above code is in <html:form> tag & i have written a custom pagination page
    Now i want  all terms which are in <html:option> remain  in options when i goto 2 nd page..
    how to do this? In the DOM inspector i'm getting the options, but when see the source HTML there are no <html:options>.. when click on the 2 page link the terms which are in <html:option> are not going to the "Action" class ..  :(
    how to do this .. any idea plz..?                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               

  • Trying to dynamically populate html:options

    I am using iframe.
    on the first frame the user inputs letters and the database is searched for anything that starts with those letters.
    <html:text property="usersInput"/>
    <html:submit value="search" onclick="middle.document.forms[0].property='list'"/>
    the second frame "middle" is suppose to display the results of the search.
    html:select property="list">
    <html:options property="list"/>
    </html:select>
    of course with this code the second frame "middle" gives an err msg, because list is null.
    Is there anyway to get around this so that initially the select list that appears on the pages is empty and then when the search is done (button is clicked) the list will populate with the results?

    I guess there must be something wrong with my question :( no one is helping me. Well let me ask this...
    With formbeans and iframes...
    Is the information process while in the <html:form>...</html:form> or once it leaves the form? because now I'm doing the search in one frame and then onclick="..." I'm calling another frame to display the results of the search using <html:select><html:options>...</html:select>, but I keep getting an err msg "null pointer".
    It seems like the information from the first iframe, either has not been "processed" or got "deleted" by the formbean, when the second frame tries to access it.
    Can someone please give me an idea of what's going on. I'VE HIT A WALL!!!!

  • How to convert a Word document to text or html in an ABAP program

    Hi,
    At my client's site, for the recruitment system, they have the word processing system set to RTF, instead of SAP Script. This means that all the correspondence is in Word format. A standard SAP program takes the word letter, loads word, does the mail merge with the applicant's info and then sends the document to a printer.
    The program name is RPAPRT05. The program creates a document proxy (interface I_OI_DOCUMENT_PROXY) and manipulates the document using the methods of the interface.
    Now what we want to do is to instead of sending the document to a printer, we want to email the document contents to the applicant. But I don't know how to get the content from the Word document into text or html format so that I can make an email from it.
    I know I can send an email with the word document as an attachment, but we'd prefer not to do that.
    I would appreciate any help very much.
    Thanks

    Ok, here's what I ended up doing:
    First of, in order to call FM 'CONVERT_RTF_TO_ITF' you need the RTF document in a table with line length 156. The document is returned from FM 'DP_CREATE_URL' in a table with line length 132. So first I convert the table:
        Transform data table from 132 character lines to
        256 character lines
          LOOP AT data_table INTO dataline.
            IF newrow = 'X'.
            Add row to new table
              APPEND INITIAL LINE TO xdatatab ASSIGNING .
              newrow = space.
            ENDIF.
          Convert the raw line of old table to characters
            ASSIGN dataline TO .
          Check line lengths to determine how to add the
          next line of old table
            newlinelen = STRLEN( newline ).
            ADD addspaces TO newlinelen.
            linepos = linemax - newlinelen.
            IF linepos > datalen.
            Enough space available in new table line for all of old table line
              newline+newlinelen = oldline.
              oldlinelen = STRLEN( oldline ).
              addspaces = datalen - oldlinelen.
              CONTINUE.
            ELSE.
            Fill up new table line
              newline+newlinelen(linepos) = oldline(linepos).
              ASSIGN newline TO .
              newrow = 'X'.
            Save the remainder of old table to the new table line
              IF linepos < datalen.
                oldlinelen = STRLEN( oldline ).
                addspaces = datalen - oldlinelen.
                CLEAR newline.
                newline = oldline+linepos.
              ELSE.
                CLEAR newline.
              ENDIF.
            ENDIF.
          ENDLOOP.
        Write the last line to the table
          IF newrow = 'X'.
            APPEND INITIAL LINE TO xdatatab ASSIGNING .
    Next I call FM 'CONVERT_RTF_TO_ITF' to get the document in SAPScript format:
        Convert the RTF format to SAPScript
          CALL FUNCTION 'CONVERT_RTF_TO_ITF'
            EXPORTING
              header            = dochead
              x_datatab         = xdatatab
              x_size            = xsize
            IMPORTING
              with_tab_e        = withtab
            TABLES
              itf_lines         = itf_table
            EXCEPTIONS
              invalid_tabletype = 1
              missing_size      = 2
              OTHERS            = 4.
    This returns the document still containing the mail merge fields which needs to be filled in:
          LOOP AT itf_table INTO itf_line.
            WHILE itf_line CS '«'.
              startpos = sy-fdpos + 1.
              IF itf_line CS '»'.
                tokenlength = sy-fdpos - startpos.
              ENDIF.
              token = itf_line+startpos(tokenlength).
              REPLACE '_' IN token WITH '-'.
              ASSIGN (token) TO .
              ENDIF.
              MODIFY itf_table FROM itf_line.
            ENDWHILE.
          ENDLOOP.
    And finally I use FM 'CONVERT_ITF_TO_ASCII' to convert the SAPScript to text. I set the line lengths to 60, since that's a good length to format emails to.
        Convert document to 60 char wide ascii document for emailing
          CALL FUNCTION 'CONVERT_ITF_TO_ASCII'
            EXPORTING
              formatwidth       = 60
            IMPORTING
              c_datatab         = asciidoctab
              x_size            = documentsize
            TABLES
              itf_lines         = itf_table
            EXCEPTIONS
              invalid_tabletype = 1
              OTHERS            = 2.
    And then the text document gets passed to FM 'SO_NEW_DOCUMENT_ATT_SEND_API1' as the email body.

  • Text type 'HTML' for the Channel 'Mail (Letter)'

    Dear Experts,
    I'm unable to use the text type 'HTML' for the Channel 'Mail (Letter)' in mail forms, as a result of which fonts like arial and images like jpeg could not be used in mail forms created for letters.
    Is there a way to use HTML type or to have more font options, insert JPEG images, tables etc in mail forms for letters?
    Regards,
    Kathir

    In case, if you are using custom excel export functionality, you should use fileDownloadActionListener with in a button.
    Here, you could specify the content type.
    <af:commandButton text="Say Hello">
                  <af:fileDownloadActionListener filename="hello_txt"
                                            contentType="text/plain; charset=utf-8"
                                            method="#{bean.sayHello}"/>
                </af:commandButton>For more details, check here:
    http://docs.oracle.com/cd/E16162_01/apirefs.1112/e17491/tagdoc/af_fileDownloadActionListener.html

  • Is it possible to turn off CSS and design text with HTML only?

    I'm trying to design text with HTML only ie: Font, Color, Size. The only method I'm aware of is to highlight the text and then go to Insert> Html> Text Objects. That method is tedious and time consuming.
    I'm not looking for arguments or reasons as to why I should use CSS, I'm simply looking for a solution to disable CSS and design with HTML. I'm using Dreamweaver CS5.
    Thank you,
    Paul

    Murphy,
    Thank you for your response. I checked the preferences as you advised and could not find any option to turn off CSS. I don't understand why there doesn't seem to be a straightforward solution to this, it seems so simple. The reason is that the code will be inputed into an eBay listing and therefore CSS code is not supported.
    Any other input for a seemingly simple yet complex solution to this is welcome and appreciated.

  • How to retrive the font color, style and size of the copied text from html

    I have requirement, where I need to retrive font size and style of the copied text from html page. Here copied text I mean, the text we select and copy using either windows copy command or using Ctrl+C.
    Please help me to get the solution for this req.
    Thanks in advance,
    Amodnk.

    You can also try this, especially if you've got the Text Inspector and the Color Picker open already.
    Select the text to be colored (note that if the text is already multiple different colors the swatch under Color & Alignment section of the Text Inspector still only shows one out of the several)
    Find the color you want in the Color Picker
    Click and drag from the Color Picker into the swatch under Color & Alignment in the Text Inspector
    That will also change all the selected text to the chosen color.
    Also, regarding web safe colors, that SHOULD come as a part of the Color Picker. With the Color Picker open, select the third icon at the top (If you mouse over it, it should indicate Color Palettes. Click the popup menu button and you should see Web Safe Colors as one of the choices. With this and the Text Inspector open, you can drag and drop your way to identical colors in no time!
    That same drag and drop trick works for text on the slide as well. If you just created a bit of text and you want to apply a color, scroll until you find the color you want, then drag and drop over to the text (it will highlight in blue showing you what you're about to color).

  • Help Implementing HTML OPTION with HTML

    Hi:
    I have a HTML page with the folowing code:
    </SELECT></TD>
    </TR>
    <TR>
    <TD width="156">Mode</TD>
    <TD width="86"></TD>
    <TD width="206"><SELECT name="mode">
    <OPTION value="regmode" selected>REG Mode</OPTION>
    <OPTION value="testmode">TEST Mode</OPTION>
    <OPTION value="devmode" selected>Dev Mode</OPTION>
    </SELECT></TD>
    I need help to implement the OPTION using JSP and JavaBean,
    For a simple text entry I am using the following code:
    <input size=20 type=text name=userID value=<jsp:getProperty name="_welcomeJSPBean" property="userID"/>>
    I am looking for a similar way to get all the posibles values for the HTML OPTION,
    Thank in Advance.
    Leonardo

    This is why many of the tag libraries and frameworks exist. I use Struts, and it has html taglib that includes tags for automating select/option to a very high degree.
    If the list is dynamic, you can use:
    <html:select property="commtypeno">
      <html:options collection="commtypeList" property="commtypeno" labelProperty="commtype" />
    </html:select>
    Or, if the list is static, you can use:
    <html:select property="commtypeno">
      <html:option value="1">Type 1</html:option>
      <html:option value="2">Type 2</html:option>
      <html:option value="3">Type 3</html:option>
    </html:select>
    Anthony

  • Set maximum size in Text Form Field Options for a field in bi publisher RTF

    Hi All,
    How to set maximum size in Text Form Field Options for a field in bi publisher RTF.
    I have a RTF whch is having a field in that i need to add some validation condition but after adding certain condition in Add help text tab ,it is not accepting after certain length, how i can increase the length to unlimited,please help me on this
    Thnaks

    Form fields have some restrictions if your are using version lower than 11g.
    They can accommodate only 393 chars. You can add the text in both status bar and help key, which can in total consume 393 chars.
    If your code logic is more than that, it can be split into multiple form fields as Avinash suggested or you can use sub template logic and handle coding over there. Again in sub template code can be within/outside form fields.
    So there is no option for user to increase the size of form field.

  • No text message forwarding option in my iphone 5 ios 8.1

    Please help!
    I have downloaded the latest ios 8.1 on my iphone 5, and osx yosemite in my macbook. But i still dont have a "TEXT MESSAGE FORWARDING" option in my iphone 5.
    I have already included my icloud email in the send and receive options. but i still dont see any text message forwarding

    Hi there omengomeng,
    You may want to make all of the settings are configured properly on your devices. Take a look at the article below for more information. 
    Connect your iPhone, iPad, iPod touch, and Mac using Continuity
    To use Continuity for SMS and MMS on your iPad, iPod touch, and Mac: 
    Update your iPhone to iOS 8.1 (when available).
    Enable iMessage on your Mac and iOS devices.
    On your iPhone, go to Settings > Messages and turn on Text Message Forwarding.
    Your iPad, iPod touch, or Mac will display a code you can enter on your iPhone to verify the SMS and MMS feature.
    -Griff W.

  • Using Item UIDRef, can we get data as set in "text on path option" dailog box for each "text on path" item

    Hi all,
    I have a Item UIDRef but facing a problem for getting data as set in "text on path option" dailog box for each "text on path" item.
    What I did:
    - Getting the  pointer "IMainItemTOPData" using item UIDRef as
      InterfacePtr<IMainItemTOPData> mainItemTOPData(shapesUIDRef, UseDefaultIID());
    - This interface has the method GetTOPOptionData () which return ITOPOptionsData pointer
    - But SDK don't have "ITOPOptionsData" class implementation.
    Second Approch:
    - Used "ITextOnPathSelectionSuite" and getting the correct result for Desktop plugin.But I want the correct result in server plugin also.
    Anyone who has an idea how to get this using UIDRef, please let me know.
    Regards,
    Jitendra Kumar Singh

    Hi Pulse,
    Unfortunately, Flex doesn't currently support text on a path. However, I binged it and found this:
    http://blog.tsclausing.com/post/49
    That might be useful.
    -Adam

  • ITS 620 - how to create a service file for SAPGUI for HTML option

    Hi,
    We have the following configuration :
    ITS 620 installed
    IIS webserver
    NTLM Authentication on webserver
    SAP 46C
    We have some reports that use the IACs option of creating ITS reports and we are able to create the service files using webstudio where we can save a service id and password so that when the ITS page is called, it logs on the user directly into the system.
    Now, we are trying to create a new ITS page for a custom Z transaction and would like to use the SAPGUI for HTML option so that we do not have to use Webstudio to create and modify the HTML pages.
    When I call the ITS page for the new transaction, it brings up the logon screen for the user, which we want to avoid. If I create a new service file for this Z transaction without any HTML pages then I get an error.
    Is there a way to copy the WEBGUI.SRVC file to a custome SRVC file, I tried to create to copy it to a test file called ZTEST.SRVC for my test transaction called ZTEST and now I get an error.
    thanks in advance
    N.S.

    Hello N.S.
    You can create a service file, like ZSERVICE.SRVC.  The parameters should be:
    ~webgui 1
    ~transaction ztransaction
    ~client  001
    ~login   username
    ~password  password
    ~language  EN
    The "~webgui 1" allows the ITS to use the Webgui to dynamically generate the page (no IAC needed.)
    The "~transaction" is the transaction code of your service.
    The "~client" is the client number.
    The "~login" is the username.
    The "~password" is the user's password.
    the "~language" is the language for the users.
    Note that the last four are required for automatic logon.  If end users use a variety of languages though the ~language can be prepended to the URL to support different languages.  (Leave it blank in the srvc file then.)
    Edgar

  • How to include text as HTML elements (see DOMElement)

    I am working with Flash PRO CC v. 14.0.  to convert my Flash website to HTML5 / javascript
    I have converted a file to the HTML5 Canvas
    I am very happy that the new Flash Pro has the feature to convert to HTML5 canvas
    HOWEVER:
    In my original .FLA file project I use only one font: Copperplate Bold.  I use several sizes of that font within the project / scene
    In the original file for all text I use static text, Letter spacing, AntiAlias, AutoKern and single line (Linetype)
    - none of which the HTML5 canvas seem to allow / support?
    How do I maintain the FONT look that I have chosen in my original FLASH project, after I convert to HTML5 canvas?
    Is there a way in the HTML canvas to maintain the FONT look that I want?
    HTML5 canvas will not allow Font embedding
    The device font destroys the LOOK of my Copperplate Bold font.
    How do I include text as HTML elements (see DOMElements)?
    WARNINGS generated when I convert the original file into an HTML Canvas:
    Warnings generated while copying/importing in 140827a HTML test.fla:
    * AntiAlias is not supported in HTML5 Canvas document, and has been converted to DeviceFonts in an instance of Text.
    * AutoKern is not supported in HTML5 Canvas document, and has been removed in an instance of Text.
    * Frame Scripts have been commented
    * LetterSpacing is not supported in HTML5 Canvas document, and has been converted to 0.0 in an instance of Text.
    * LineType is not supported in HTML5 Canvas document, and has been converted to MultiLineNoWrap in an instance of Text.
    * Some artwork contains Hairline stroke, which is not supported in HTML5 Canvas document, and has been converted to Solid.
    * StaticText is not supported in HTML5 Canvas document, and has been converted to DynamicText in an instance of Text.
    New HTML Canvas Document created.
    NOTE:  So far the only way I have been able to maintain the font look is to convert the fonts to .png files
    This is painstaking work that I would like to avoid.
    Even then I still get a WARNING when I test my scene - (no doubt because I left the original FONT text  in guide layers)
    After conversion ON TEST SCENE:
    WARNINGS:
    Frame numbers in EaselJS start at 0 instead of 1. For example, this affects gotoAndStop and gotoAndPlay calls. (18)
    Only circular (not oval) radial gradients are supported. (85)
    Text support is limited. It is generally recommended to include text as HTML elements (see DOMElement). (6)
    Color effects are published as a filter and subject to the same limitations. (4)
    Filters are very expensive and are not updated once applied. Cache as bitmap is automatically enabled when a filter is applied. This can prevent animations from updating. (2)
    Content with both Bitmaps and Buttons may generate local security errors in some browsers if run from the local file system.
    HOW CAN I MAINTAIN the FONT LOOK that I have chosen for my project?
    How do I include text as HTML elements (see DOMElements)?
    ANY HELP will be appreciated
    A good, in depth, tutorial on the subject (FONTS) would be a BIG help to many using the convert to HTML5 canvas features.

    GOOGLE HAS
    https://www.google.com/fonts
    choose a font from above site
    then:
    google generates instructions on how to embed that font
    Montserrat
    3. Add this code to your website:
    <link href='http://fonts.googleapis.com/css?family=Montserrat:400,700' rel='stylesheet' type='text/css'>
    4. Integrate the fonts into your CSS:
    The Google Fonts API will generate the necessary browser-specific CSS to use the fonts. All you need to do is add the font name to your CSS styles. For example:
    font-family: 'Source Sans Pro', sans-serif;
    font-family: 'Ubuntu', sans-serif;
    font-family: 'Montserrat Alternates', sans-serif;
    font-family: 'Montserrat', sans-serif;
    font-family: 'Open Sans', sans-serif;

  • "Recognize Text Using OCR" Option Grayed Out in Acrobat 9 Pro (9.5.1)

    Running Adobe Acrobat 9 Pro.  I'm working with electronically filed court documents.  I regularly use the OCR tool (Document -> OCR Text Recognition -> Recognize Text Using OCR...) on these court documents.
    Problem is, every once in awhile, I'll run into a document where the "Recognize Text Using OCR" option is inexplicably grayed out.  I have no idea what is causing this.  I have checked the Document Properties and confirmed there are no security restrictions for the document.  It happens inconsistently, in that OCR will work with a document filed by an attorney in one case, but it won't work in the same kind of document filed by the same attorney in a different case.
    Any help getting OCR to work on these few rogue documents is appreciated!

    Form created with LiveCycle Designer are XML forms in a PDF wrapper and many of the usual PDF properties are not available. This is like embedded rich media in a PDF. If you want to research this, Adobe and ISO have the PDF Reference manual available as a free download.

  • Problem to extract text from HTML document

    I have to extract some text from HTML file to my database. (about 1000 files)
    The HTML files are get from ACM Digital Library. http://portal.acm.org/dl.cfm
    The HTML page is about the information of a paper. I only want to get the text of "Title" "Abstract" "Classification" "Keywords"
    The Problem is that I can't find any patten to parser the html files"
    EX: I need to get the Classification = "Theory of Computation","ANALYSIS OF ALGORITHMS AND PROBLEM COMPLEXITY","Numerical Algorithms and Problem","Mathematics of Computing","NUMERICAL ANALYSIS"......etc .
    The section code about "Classification" is below.
    Please give any idea to do this, or how to find patten to extract text from this.
    <div class="indterms"><a href="#CIT"><img name="top" src=
    "img/arrowu.gif" hspace="10" border="0" /></a><span class=
    "heading"><a name="IndexTerms">INDEX TERMS</a></span>
    <p class="Categories"><span class="heading"><a name=
    "GenTerms">Primary Classification:</a></span><br />
    � <b>F.</b> <a href=
    "results.cfm?query=CCS%3AF%2E%2A&coll=ACM&dl=ACM&CFID=22820732&CFTOKEN=38147335"
    target="_self">Theory of Computation</a><br />
    � <img src="img/tree.gif" border="0" height="20" width=
    "20" /> <b>F.2</b> <a href=
    "results.cfm?query=CCS%3A%22F%2E2%22&coll=ACM&dl=ACM&CFID=22820732&CFTOKEN=38147335"
    target="_self">ANALYSIS OF ALGORITHMS AND PROBLEM
    COMPLEXITY</a><br />
    � � � <img src="img/tree.gif" border="0" height=
    "20" width="20" /> <b>F.2.1</b> <a href=
    "results.cfm?query=CCS%3A%22F%2E2%2E1%22&coll=ACM&dl=ACM&CFID=22820732&CFTOKEN=38147335"
    target="_self">Numerical Algorithms and Problems</a><br />
    </p>
    <p class="Categories"><span class="heading"><a name=
    "GenTerms">Additional�Classification:</a></span><br />
    � <b>G.</b> <a href=
    "results.cfm?query=CCS%3AG%2E%2A&coll=ACM&dl=ACM&CFID=22820732&CFTOKEN=38147335"
    target="_self">Mathematics of Computing</a><br />
    � <img src="img/tree.gif" border="0" height="20" width=
    "20" /> <b>G.1</b> <a href=
    "results.cfm?query=CCS%3A%22G%2E1%22&coll=ACM&dl=ACM&CFID=22820732&CFTOKEN=38147335"
    target="_self">NUMERICAL ANALYSIS</a><br />
    � � � <img src="img/tree.gif" border="0" height=
    "20" width="20" /> <b>G.1.6</b> <a href=
    "results.cfm?query=CCS%3A%22G%2E1%2E6%22&coll=ACM&dl=ACM&CFID=22820732&CFTOKEN=38147335"
    target="_self">Optimization</a><br />
    � � � � � <img src="img/tree.gif" border=
    "0" height="20" width="20" /> <b>Subjects:</b> <a href=
    "results.cfm?query=CCS%3A%22Linear%20programming%22&coll=ACM&dl=ACM&CFID=22820732&CFTOKEN=38147335"
    target="_self">Linear programming</a><br />
    </p>
    <br />
    <p class="GenTerms"><span class="heading"><a name=
    "GenTerms">General Terms:</a></span><br />
    <a href=
    "results.cfm?query=genterm%3A%22Algorithms%22&coll=ACM&dl=ACM&CFID=22820732&CFTOKEN=38147335"
    target="_self">Algorithms</a>, <a href=
    "results.cfm?query=genterm%3A%22Theory%22&coll=ACM&dl=ACM&CFID=22820732&CFTOKEN=38147335"
    target="_self">Theory</a></p>
    <br />
    <p class="keywords"><span class="heading"><a name=
    "Keywords">Keywords:</a></span><br />
    <a href=
    "results.cfm?query=keyword%3A%22Simplex%20method%22&coll=ACM&dl=ACM&CFID=22820732&CFTOKEN=38147335"
    target="_self">Simplex method</a>, <a href=
    "results.cfm?query=keyword%3A%22complexity%22&coll=ACM&dl=ACM&CFID=22820732&CFTOKEN=38147335"
    target="_self">complexity</a>, <a href=
    "results.cfm?query=keyword%3A%22perturbation%22&coll=ACM&dl=ACM&CFID=22820732&CFTOKEN=38147335"
    target="_self">perturbation</a>, <a href=
    "results.cfm?query=keyword%3A%22smoothed%20analysis%22&coll=ACM&dl=ACM&CFID=22820732&CFTOKEN=38147335"
    target="_self">smoothed analysis</a></p>
    </div>

    One approach is to download Htmlparser from sourceforge
    http://htmlparser.sourceforge.net/ and write the rules to match title, abstract etc.
    Another approach is to write your own parser that extract only title, abstract etc.
    1. tokenize the html file. --> convert html into tokens (tag and value)
    2. write a simple parser to extract certain information
    find out about the pattern of text you want to extract. For instance "<class "abstract">.
    then writing a rule for extracting abstract such as
    if (tag is abstract ) then extract abstract text
    apply the same concept for other tags
    Attached is the sample parser that was used to extract title and abstract from acm html files. Please modify to include keyword and other fields.
    good luck
    import java.io.BufferedReader;
    import java.io.FileReader;
    import java.io.IOException;
    import java.io.InputStream;
    import java.io.InputStreamReader;
    import java.util.ArrayList;
    import java.util.List;
    public class ACMHTMLParser
         private String m_filename;
         private URLLexicalAnalyzer lexical;
         List urls = new ArrayList();
         public ACMHTMLParser(String filename)
              super();
              m_filename = filename;
          * parses only title and abstract
         public void parse() throws Exception
              lexical = new URLLexicalAnalyzer(m_filename);
              String word = lexical.getNextWord();
              boolean isabstract = false;
              while (null != word)
                   if (isTag(word))
                        if (isTitle(word))
                             System.out.println("TITLE: " + lexical.getNextWord());
                        else if (isAbstract(word) && !isabstract)
                             parseAbstract();
                             isabstract = true;
                   word = lexical.getNextWord();
              lexical.close();
         public static void main(String[] args) throws Exception
              ACMHTMLParser parser = new ACMHTMLParser("./acm_html.html");
              parser.parse();
         public static boolean isTag(String word)
              return ( word.startsWith("<") && word.endsWith(">"));
         public static boolean isTitle(String word)
              return ( "<title>".equals(word));
         //please modify according to the html source
         public static boolean isAbstract(String word)
              return ( "<p class=\"abstract\">".equals(word));
         private void parseAbstract() throws Exception
              while (true)
                   String abs = lexical.getNextWord();
                   if (!isTag(abs))
                        System.out.println(abs);
                        break;
         class URLLexicalAnalyzer
           private BufferedReader m_reader;
           private boolean isTag;
           public URLLexicalAnalyzer(String filename)
              try
                m_reader = new BufferedReader(new FileReader(filename));
              catch (IOException io)
                System.out.println("ERROR, file not found " + filename);
                System.exit(1);
           public URLLexicalAnalyzer(InputStream in)
              m_reader = new BufferedReader(new InputStreamReader(in));
           public void close()
              try {
                if (null != m_reader) m_reader.close();
              catch (IOException ignored) {}
           public String getNextWord() throws IOException
              int c = m_reader.read();   
              if (-1 == c) return null; 
              if (Character.isWhitespace((char)c))
                return getNextWord();
              if ('<' == c || isTag)
                return scanTag(c);
              else
                   return scanValue(c);
           private String scanTag(final int c)
              throws IOException
              StringBuffer result = new StringBuffer();
              if ('<' != c) result.append('<');
              result.append((char)c);
              int ch = -1;
              while (true)
                ch = m_reader.read();
                if (-1 == ch) throw new IllegalArgumentException("un-terminate tag");
                if ('>' == ch)
                     isTag = false;
                     break;
                result.append((char)ch);
              result.append((char)ch);
              return result.toString();
           private String scanValue(final int c) throws IOException
                StringBuffer result = new StringBuffer();
                result.append((char)c);
                int ch = -1;
                while (true)
                   ch = m_reader.read();
                   if (-1 == ch) throw new IllegalArgumentException("un-terminate value");
                   if ('<' == ch)
                        isTag = true;
                        break;
                   result.append((char)ch);
                return result.toString();
    }

Maybe you are looking for

  • Ipod 60 GB Video and I Tunes 7 no longer friends

    Recently my system (C) drive failed. I had to rollback to a earlier windows restore point my iTunes folder was on my c: drive. all my music is intact, my iTunes itl and xml file are there too. BUT, when i open iTunes it's acting like i never had an i

  • How do I install Oracle Configuration Manager in oracle Fusion - discoverer 11i

    Hello Gurus, I need some guidance to install Oracle Configuration Manager. I have just successfully installed Oracle Fusion - following "Fusion Middleware Installation Guide for Oracle Portal, Forms, Reports and Discoverer 11g Release 1 (11.1.1.6.0)

  • Netbook and Wireless Router

    I am a bit technology-challenged, so I apologize in advance if this is a stupid question. I have a Toshiba Mini NB255-N245. Can I simply purchase a wireless router; hook it up to my cable modem; and I'm "good to go?" With another laptop, I had to ins

  • TC Initial Backup OK - Now unable to complete - error while copying

    Time Capsule I want this to work well. It's like magic, when it works. I did the initial backup successfully - ethernet cable - and I can enter Time Machine and go back to March 6. But it will not back up now with the "Time Machine Error: Unable to c

  • Javadoc, what are the benefits?

    i'm new to java and have just heard about the javadoc tool. What are the benefits of using this tool rather than just commenting your code well?