Parsing url from HTML file

hello
I have a html page converted to text format. I need to parse the url's present in that file.
I got all the url's but I want to extract only a specific url i.e., url in the anchor tag
e.g., like <a href = "http://www.java.sun.com"> Java </a>
In that file, if I need only the url in the anchor tag... what would be the method ?
Any help would be great
Thanks in advance
k

Thank you very much for your prompt reply and for the link u have provided.

Similar Messages

  • How to read .html file and store values into oracle table  from html file

    Hi all ,
    How to read .html file and store values into oracle table from html file using pl/sql
    Please Help.....

    Hi,
    Kindly find following sample html code ,i want to store every value in different column in database .
    <html><body><p/>
    <div style="position:absolute;top:47px;left:37px;font-family:'Times New Roman';font-size:10pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:47px;left:680px;font-family:'Times New Roman';font-size:10pt;white-space:nowrap;">  
    </div>
    <div style="position:absolute;top:94px;left:151px;font-family:'Times New Roman';font-size:1pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:1080px;left:115px;font-family:'Times New Roman';font-size:8pt;white-space:nowrap;">4497743
    </div>
    <div style="position:absolute;top:1079px;left:442px;font-family:'Times New Roman';font-size:9pt;white-space:nowrap;"> Miclyn Express Offshore Pre-Quotation Disclosure
    </div>
    <div style="position:absolute;top:1079px;left:680px;font-family:'Times New Roman';font-size:9pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:1079px;left:723px;font-family:'Times New Roman';font-size:9pt;white-space:nowrap;">page 5
    </div>
    <div style="position:absolute;top:1083px;left:151px;font-family:'Times New Roman';font-size:1pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:107px;left:151px;font-family:'Times New Roman';font-size:10pt;white-space:nowrap;"><b>Attachment 2 ¿ indicative statement of 20 largest shareholders </b>
    </div>
    <div style="position:absolute;top:139px;left:262px;font-family:'Times New Roman';font-size:10pt;white-space:nowrap;"><b>Name </b>
    </div>
    <div style="position:absolute;top:131px;left:415px;font-family:'Times New Roman';font-size:10pt;white-space:nowrap;"><b>Number of Shares </b>
    </div>
    <div style="position:absolute;top:147px;left:458px;font-family:'Times New Roman';font-size:10pt;white-space:nowrap;"><b>Held </b>
    </div>
    <div style="position:absolute;top:131px;left:560px;font-family:'Times New Roman';font-size:10pt;white-space:nowrap;"><b>Percentage of </b>
    </div>
    <div style="position:absolute;top:147px;left:567px;font-family:'Times New Roman';font-size:10pt;white-space:nowrap;"><b>shares held </b>
    </div>
    <div style="position:absolute;top:179px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Macquarie Capital Group Limited 92,378,000
    </div>
    <div style="position:absolute;top:179px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:179px;left:618px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">34.00%r
    </div>
    <div style="position:absolute;top:179px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:212px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">HSBC Custody Nominees (Australia)
    </div>
    <div style="position:absolute;top:227px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Limited
    </div>
    <div style="position:absolute;top:220px;left:464px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">36,458,220
    </div>
    <div style="position:absolute;top:220px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:220px;left:618px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">13.42%
    </div>
    <div style="position:absolute;top:220px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:260px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Ray Rider Limited 27,170,000
    </div>
    <div style="position:absolute;top:260px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:260px;left:618px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">10.00%
    </div>
    <div style="position:absolute;top:260px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:300px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:300px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">7.96%
    </div>
    <div style="position:absolute;top:300px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:333px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">National Australia Bank Custodian
    </div>
    <div style="position:absolute;top:348px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Services
    </div>
    <div style="position:absolute;top:341px;left:464px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">12,866,550
    </div>
    <div style="position:absolute;top:341px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:341px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">4.74%
    </div>
    <div style="position:absolute;top:341px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:381px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Citigroup Nominees Pty Ltd 6,942,541
    </div>
    <div style="position:absolute;top:381px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:381px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">2.56%r
    </div>
    <div style="position:absolute;top:381px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:421px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:421px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">2.14%r
    </div>
    <div style="position:absolute;top:421px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:462px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">UBS Securities Australia Ltd 4,806,760
    </div>
    <div style="position:absolute;top:462px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:462px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">1.77%
    </div>
    <div style="position:absolute;top:462px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:494px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Merrill Lynch Equities (Australia)
    </div>
    <div style="position:absolute;top:510px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Limited
    </div>
    <div style="position:absolute;top:502px;left:472px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">4,325,000
    </div>
    <div style="position:absolute;top:502px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:502px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">1.59%
    </div>
    <div style="position:absolute;top:502px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:550px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Equities Ltd
    </div>
    <div style="position:absolute;top:542px;left:472px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">4,150,000
    </div>
    <div style="position:absolute;top:542px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:542px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">1.53%
    </div>
    <div style="position:absolute;top:542px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:575px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Bond Street Custodians Limited - A/C
    </div>
    <div style="position:absolute;top:590px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Institutional
    </div>
    <div style="position:absolute;top:583px;left:472px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">2,750,000
    </div>
    <div style="position:absolute;top:583px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:583px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">1.01%
    </div>
    <div style="position:absolute;top:583px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:623px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Cogent Investment Operations Pty Ltd 2,599,321
    </div>
    <div style="position:absolute;top:623px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:623px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">0.96%
    </div>
    <div style="position:absolute;top:623px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:663px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Skeet Nominees Pty Ltd 2,276,736
    </div>
    <div style="position:absolute;top:663px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:663px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">0.84%
    </div>
    <div style="position:absolute;top:663px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:704px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Diederik de Boer 1,917,561
    </div>
    <div style="position:absolute;top:704px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:704px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">0.71%
    </div>
    <div style="position:absolute;top:704px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:744px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Ecapital Nominees Pty Limited 1,594,736
    </div>
    <div style="position:absolute;top:744px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:744px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">0.59%
    </div>
    <div style="position:absolute;top:744px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:777px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Neweconomy Com Au Nominees Pty 9
    </div>
    <div style="position:absolute;top:792px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Limited &#60;900 Account&#62;
    </div>
    <div style="position:absolute;top:784px;left:472px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">1,594,7360
    </div>
    <div style="position:absolute;top:784px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:784px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">0.59%
    </div>
    <div style="position:absolute;top:784px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:825px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Sonray Capital Markets Pty Ltd 1,236,842
    </div>
    <div style="position:absolute;top:825px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:825px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">0.46%
    </div>
    <div style="position:absolute;top:825px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:865px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Argo Investments Limited 1,050,000
    </div>
    <div style="position:absolute;top:865px;left:531px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:865px;left:625px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">0.39%
    </div>
    <div style="position:absolute;top:865px;left:663px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;"> 
    </div>
    <div style="position:absolute;top:905px;left:161px;font-family:'Times New Roman';font-size:11pt;white-space:nowrap;">Idameno (No 79) Nominees Pty Limited 724,210</div>
    <div style="position:absolute;top:1103px;">
    </body></html>
    Thanks..........................

  • Safari is sandboxed to not reveal filepath from html file select to Javascript ?

    I know safari is sandboxed not to reveal filepath from html file select control to Javascript via the Value attribute , but i am unable to find any documentation regarding how this is done and since when this is present .. Can somebody please guide me with this ..?

    Since Safari is able to save as HTML file a page if it is opened online (from web), making "Save as...", and it is unable to make so if the webarchive file is on the hard disk...I tried to upload on a web server some .webarchive files.
    Then I went online in the web page with the list of all the uploaded .webarchive files... (ex.: http://....filename1.webarchive, etc...)...
    My goal was to open ONLINE (not on hard disk) the files and then make "save as...",... but... SURPRISE!!! Safari is UNABLE to open a webarchive file from the web, it asks me where to save the file... it can only save it, it writes that it is an "application"...
    I think this happens because .webarchive is not a real web page but a "package"... but my displeasure is this:
    Once opened an hard disk's webarchive file, its content is loaded in ram and it appears like a normal web page (html + images + etc...)... so, in this moment, Safari should be able to save it "as...." HTML file too.... discarding images and other objects....
    Any advice?
    Thank you!!!

  • Pick Image from Html file

    I want to retrieve the image from html file & then save this file in jpg format.
    I have a html file with following contents.
    Patient id = 1
    Dated = 03/06/2008
    Pcr_Volume = 0.5
    Notes:
    Principle :
    after principle heading there is a graph file attached.
    Result:
    Remarks:
    I want to pick this graph file which is present after Principle heading and save it to jpg format.
    Regard's

    I want to pick Image(which is actually a graph image of patient test). This image is present is html file with some contents and this file is auto generated by the machine in which test perform.
    I just open that html file and click on that image which is pasted in between that html file when i clik on that image a toolbar appear with save & some other button & then I press
    save button and save that image in jpg format.
    After that i use this image to display in patient report.

  • Collect email addresses from html file or URL

    Hi guys, it sounds never been disccussed here,
    I have a HTML file which is huge , contains text graphics and email addresses, if I want to copy the email address manually, it will take long time,
    so have anybody thought of a JSP script to do this task ? , just to read the html or the URL and to collect all the email addresses there !

    Hi guys, it sounds never been disccussed here, Probably because most developers dont like to be
    spammed and dont like to make programs that can
    collect email addresses from a website or url?OWNED
    definitely owned.

  • Find Info from HTML file

    I am trying to develop a program to read URLS and extract specific content from the source of the URLS. So far my program
    Returns the HTML of a URL and writes the HTML to a file called Results.txt.
    I now need to write a program that opens up this Results file and extracts the info that appears after certain tags. Some of these files are rather large to say the least and parsing HTML files is no simple task compared to files separated by simple white space.
    Can anyone advise how I can search an HTML file for A particular tag. Is tokenisaing the file the answer? If so How can I define a token since HTML does not separate tokens by white spaces always.
    Thanks for your help
    Ross

    Well ok I agree with you in what you say however I have designed my final year at uni project for parsing HTML and that's what Im commited to doing now. In hindsight I would have done things differently.
    I am having difficulty knowing how to parse the HTML tho. Basically to look at, it's not nice at all. For example the HTML below how would I extract the info after the words "Double Rooms from" ?
    </td></tr>     <tr><td colspan="2"><hr size="1"/>
         <font size="3"><b>Orwell Lodge Hotel, Dalry</b></font>
    (2.6 miles / 3.6 km from the centre of Sighthill)
    </td></tr>
         <tr><td><img src="http://www.activehotels.com/photos/218697/AAB218697.jpg" border="0" width="96" height="72" alt="hotel" /></td>
         <td><font size="2">Single rooms from: &pound;40.00, Double rooms from: &pound;40.00</font>     
         <p /><font size="3"><b>For more details and online booking click here.</b></font>
         <p /><font size="2">Hotel details in other languages:
         <a href="http://www.orwelllodgehotel.activehotels.com/KNW&LANGUAGE=fr&subid=                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                       

  • How can i get all links from html file ?

    I tried some code:
    List<string> metas = new List<string>();
    string text = "";
    string mys = "";
    public List<string> LockedThreads(string filename)
    HtmlWeb web = new HtmlWeb();
    HtmlAgilityPack.HtmlDocument doc = new HtmlAgilityPack.HtmlDocument();
    doc.Load(filename, Encoding.UTF8);
    var metaTags = doc.DocumentNode.SelectNodes("//meta");
    if (metaTags != null)
    foreach (var tag in metaTags)
    if (tag.Attributes["name"] != null && tag.Attributes["content"] != null)
    text = tag.Attributes["name"].Value;
    mys = tag.Attributes["content"].Value;
    var linksOnPage = from lnks in doc.DocumentNode.Descendants()
    where lnks.Name == "a" &&
    lnks.Attributes["href"] != null &&
    lnks.InnerText.Trim().Length > 0
    select new
    Url = lnks.Attributes["href"].Value,
    Text = lnks.InnerText
    foreach (HtmlNode node in doc.DocumentNode
    .SelectNodes("//a/@href | //img/@src"))
    metas.Add(node.Name);
    In the var linksOnPage i'm getting all the links but some of them not only links for example the first item i see:
    [0] = { Url = "/", Text = "&nbsp;&nbsp;&nbsp;&nbsp;" }
    Then the second item:
    [1] = { Url = "http://rotter.net/cgi-bin/forum/dcboard.cgi?az=login", Text = "התחבר" }
    Next i want to add another loop or variable that will contain only the links after im src for example:
    <img
    src="http://mysite.com"
    border="0"></a></TD><TD
    ALIGN="right"
    VALIGN="TOP"
    WIDTH="55%">
    So to get only http://mysite.com
    So in the end i will have two Lists:
    First List will contain all links from the html file.
    Second List only links that are in img src

    Ok i'm using regex now and i can get the links i need:
    List<string> metas = new List<string>();
    List<string> metas1 = new List<string>();
    string text = "";
    string mys = "";
    public List<string> LockedThreads(string filename)
    string[] fall = File.ReadAllLines(filename);
    foreach (string hh in fall)
    if (hh.Contains("http://rotter.net") && hh.Contains("locked")||
    hh.Contains("locked_icon_general") ||
    hh.Contains("locked_icon_anchor") ||
    hh.Contains("icon_anchor") ||
    hh.Contains("locked_icon_fire") ||
    hh.Contains("locked_icon_sport")||
    hh.Contains("locked_icon_camera")||
    hh.Contains("locked_icon_movie"))
    Regex linkParser = new Regex(@"\b(?:https?://|www\.)\S+\b", RegexOptions.Compiled | RegexOptions.IgnoreCase);
    foreach (Match m in linkParser.Matches(hh))
    metas.Add(m.Value);
    So now in the metas i have in this case two links.
    For example the first link is from this block in the file:
    The link i got is : http://rotter.net/forum/Images/hot_icon_anchor.gif
    And this is the right link from this block but now how do i get the text from this block and how i connect how do i know that this text belong to the link ? I need to add this text from this block to a new List. Before all that i counted on my own lines
    and got the text but that's a bad idea.
    I need somehow to get: שרשור תשדירי בחירות מרוכז:
    To get the text between the ToolTip',' and the second ',' but not from all the html file only in places of this two links i extracted.
    <TD ALIGN="RIGHT"
    VALIGN="TOP">
    <body onmousemove="overhere()">
    <a onmouseover="EnterContent('ToolTip','שרשור תשדירי בחירות מרוכז:','<u><span style=color:#000099;>כתב: menahem בתאריך: 03.03.15
    שעה: 08:51</span></u><br>1.<br>http://youtu.be/utUaqyiRkJ8<br><br><br>...'); Activate();"
    onmouseout="deActivate()"
    href="javascript:void(0)">
    <img src="http://rotter.net/forum/Images/hot_icon_anchor.gif"
    border="0"></a></TD><TD
    ALIGN="right"
    VALIGN="TOP"
    WIDTH="55%">
    <FONT CLASS='text15bn'><FONT
    FACE="Arial">
    <a href="http://rotter.net/cgi-bin/forum/dcboard.cgi?az=read_count&om=187966&forum=scoops1"><b>
    <font color="red">שרשור תשדירי בחירות מרוכז:</b>
    </a></font></TD>

  • How to parse contents from XML file in Java

    Hi All,
    I have a scenario like this . I have one xml file with key value pairs of ( name , URL ) . I have retrieved contents from XML file , now I want to parse these contents and store in a bean object.
    How to parse Contents of XML file??
    Thanks in advance,
    Rajendra.

    Hi All,
    I have a scenario like this . I have one xml file with key value pairs of ( name , URL ) . I have retrieved contents from XML file , now I want to parse these contents and store in a bean object.
    How to parse Contents of XML file??
    Thanks in advance,
    Rajendra.

  • How to link javascript files in myApp.app directory in IOS from html file in app StorageDirectory

    I'm using Adobe Air 3 for IOS offline app, and I want to implement a download updates (html files + js,css) feature from within the application. The files would be displayed in a StageWebView component.
    Since Apple doesn't allow saving files inside the installation directory, I must use ApplicationStorageDirectory instead to download files into.
    I want those download html files to link to .js / .css files already installed within the main application directory (e.g. at myApp.app).
    The reason for that is I want to minimize the number of the downloaded (shared) files as much as possible. Is this possible?
    Thank you.

    I think this doesn't work in StageWebView:
    http://help.adobe.com/en_US/air/build/WSfffb011ac560372f7e64a7f12cd2dd1867-8000.html#WS901 d38e593cd1bac354f4f7b12e260e3e67-8000
    "Note: Links in a StageWebView instance cannot open URLs that use a custom URI scheme."
    So I can't for example do this:
    <link rel="stylesheet" href="app:/CSS/Templates.css" type="text/css" media="screen, mobile" title="main" charset="utf-8">
    What I need is something like the following:
    Inside ApplicationStoarageDirectory (/Library/Application Support/):
         test.html
    I need it to link to a css file located inside ApplicationDirectory (installation directory), something like:
         <link rel="stylesheet" href="../../MyApp.app/CSS/Templates.css" type="text/css" media="screen, mobile" title="main" charset="utf-8">

  • XML parser to parse XML inside HTML file

    Hi,
    I wish to know is there any other parsers apart from JAXP to parse xml content present inside HTML file. For example,
    <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
    <html xmlns="http://www.w3.org/1999/xhtml">
    <head>
    <title></title>
    </head>
    <body>
    <form id="j_id_jsp_1394907664_1" name="j_id_jsp_1394907664_1" method="post" action="/msaiphoneportal1.1c/pages/xmlchech.faces;jsessionid=5666F0E1CF0E44B978940F021012AA41" enctype="application/x-www-form-urlencoded">
    <input type="hidden" name="j_id_jsp_1394907664_1" value="j_id_jsp_1394907664_1" />
    <?xml version="1.0" encoding="UTF-8"?>
    <hospital>
    <Users>
    <User id="1" password="x" type="staff" username="x"/>
    <User id="2" password="y" type="staff" username="y"/>
    <User id="3" password="z" type="staff" username="z"/>
    </Users>
    <Survey/>
    <Patients>staaatus</Patients>
    </hospital>
    <input type="hidden" name="j_id_jsp_1394907664_1:j_id_jsp_1394907664_2" /><input type="hidden" name="javax.faces.ViewState" id="javax.faces.ViewState" value="-4298162632826268059:-1507671971163298623" autocomplete="off" />
    </form>
    </body>
    </html>
    I need to read the XML content inside. Is there any way please let me know
    Edited by: DHURAI on Jul 22, 2010 12:59 AM

    DHURAI wrote:
    while reading, we can fetch the starting of XML through <?xml> tag, but how we know the ending of the XML as it seems to be dynamic.1) Extract the document root element which follows the <?xml ... ?>
    2) From this root element , construct the associated root element terminal by inserting a / after the <.
    3) Search for the terminal.
    If the root name can also be the name of an enclosed element then you will have to count the number of terminals.

  • Problem with .PDF files from .html files at pagebreaks

    I am using DW CS5.  When I make a .pdf from the .html file it is not handling the images with captions correctly, or even consistently, at the bottom of the .pdf page.
    Here is the Link to the .html code:
    http://www.schembs.com/0_PDF_PAGE_PROBLEM.htm
    and to the .css, if you need it:
    http://www.schembs.com/0_PDF_files/2011_ZachBuch_No_Lines.css
    The test page repeats the same image with caption several times. 
    See at the bottom of page 1. There is not enough space for Image #3 so it correctly puts it at the top of page 2.  However it puts the caption for Image #3 in the vertical position as if there were no page break and Image #3 were positioned just below the caption for Image #2.  Here is a screenshot of the bottom of page 1:
    It then places image #4 below the caption for Image #3, which results in Image #4 being on top of Image #3.
    At the bottom of page 2 there is not space for Image #6 so it puts it at the top of page 3.  It properly places the caption below Image #6 and properly places Image #7.  Here is the bottom of page 2;
    At the bottom of page 3 there is not space for all the lines of the caption, so it puts the last line of the caption for Image #8 at the top of page 4.  This is o.k.  Here is the bottom of page 3:
    I can live with it breaking up the caption (not ideal) but not with the caption overwriting the image or images overwriting images. 
    Thanks very much for your ideas on how to fix this.
    jds zigzag

    Hello Jim,
    in my IE 8 your website looks great, I don't see any overlaps. To show you what I did, here you can follow and check my proceeding by using my (German) "Adobe Acrobat 9 Pro".
    Step 1:
    Step 2:
    Step 3:
    Step 4:
    I'm sure that you can understand the used German terms, if not, bitte frage mich nachher noch einmal.
    Good luck! (German: Glück auf!)
    Hans-Günter
    P.S.
    Would you say where in one of these countless "St. Alban's" did you find these "Fratzensteine"?

  • Suppressing headers and footers in PDFs made from HTML files

    When I convert a HTML file to PDF with Acrobat 9 Standard, the PDF contains a footer (file path) and header (file name).  I don't want these to appear in my PDF.  I have disabled headers and footers in MSIE 8, but that has no effect.  I can't deleted these objects from within Acrobat, unless I edit each page with the text touch up tool.  Is there a way to suppress the header/footer?  The subject HTML files came from another system.  Thanks.

    Thanks, Bernd, but where are these settings?  While I'm writing, is there a setting to make an html file fit on one page in Acrobat?

  • Unable to Display Background Graphic From HTML File Generated from AI File

    I am trying to figure out if this is a MAC OS X or Safari issue. I have already a message into the author of the plug-in as well as created a posting on the Adobe Forum.
    I am following a tutorial on Lynda.com called "Design the Web: Illustrator to Animated HTML5 Canvas." I am using a plug in called Ai2CanvasMac with Adobe Illustrator, It essentially generates an HTML file from an Illustrator file. The Illustrator file has an embedded PNG file as a background image and a small vector graphic in front and centered. After the plug-in generates the HTML file, it automatically opens the file up in the browser. This is where I am getting mixed results.
    Safari: The small vector does appear and in the center where the embedded background should appear. But the background image isn't displaying, only a white background is showing.
    Firefox and Chrome: The page is totally blank. Not even the vector graphic appears.
    It will work with two different approaches.
    1. If I edit the source image web links in HTML file and use links from any graphics I find off the web. That works fine. The images are displayed in the browser window.
    2. If I embedded PNG images from my desktop into the Illustrator file and generate the HTML file. That works fine too.
    Does anyone have any ideas as to why I can't the graphics to display properly in the browser without doing this workaround?

    Hi,
    Ex. 1 - If the Index file is in the same directory, called "test"
    tales/numbers.jpg
    Ex. 2 - If the image is in a subdirectory, called "test1" within "test"
    test/test1/numbers.jpg
    img src="test/test1/numbers.jpg"
    Regards
    Ayyapparaj

  • How to generate PDF file from HTML file using Acrobat API's

    Hi,
    I want to generate a PDF file from an HTML file on server side(C# .Net).
    Their is a COM interop called "AcrobatWeb2PDF" availaible but could not find any document regarding how to use it.
    I cant use "Adobe live cycle PDF Generator" as we just have license for Adobe Acrobat 8 Professional.
    Please help...
    Thanks and Regards,
    Anand Mahadik.

    > It is hard to believe that Adobe doesn't provide a toolkit for generating PDF files, so many web based applications have vector based content that needs to be converted to PDF!!!!
    They do, it's just not free (A company in business to make money? I'm sure IBM would never think this way... ;)). As mentioned you have Adobe LiveCycle PDF Generator, which you can customize and extend with Java. You also have the Adobe PDF Library SDK, which is written for use with C/C++ although if you license it from Datalogics (the only company in NA Adobe allows to license the PDF Library) you will also get .NET and Java interfaces (part of the DLE - DataLogics Extensions).
    > There must be a way to generate PDF dynamically on a server or from Javascript!
    JavaScript? Not really, no. As far as I'm aware JavaScript has no file system access capabilities without some form of intermediary (like sending the data to a webservice that writes it out to file). How would you create a PDF file with JavaScript?
    The PDF Standard is also in ISOs hands now (ISO 32000-1:2008), it is no longer owned by Adobe - you can download a copy of the specification from them and write your own library based on that as well.

  • Why are signatures from .HTML files no longer appended (v31.1.0)?

    I have been using Thunderbird for some six months now. At the outset I set up an .HTML signature file for each of the accounts, and up to now this option has functioned well.
    At some time over the last couple of days the feature stopped working --no signature information is added to new message, replies, or forwarded messages. I have checked the signature files themselves, and the entries under Tools / Account Settings / account-id ("Attach the signature from a file..." / file path name) and these appear to be unchanged. Has anyone any further suggestions about items to check?
    I am running Thunderbird v31.1.0 under Windows 8.1.

    I am glad to hear that your installation is now functional again.
    I agree it's hard to pass the opportunity to get new shiny features, fixes and of course security fixes when a new version is out. However, with something as critical as email it's also hard to justify not waiting at least a few days / weeks / months and being more conservative. I encourage you to keep an eye on future release notes and news, and perhaps even bug reports if you have the time.
    Of course coming here in my opinion is the easiest, fastest way to bring up issues and Cget a workaround, while we (the Mozilla support volunteers) raise developers attention to it.
    BTW thanks for marking the solution, This will help other users with similar problems find the solution more easily.
    Thank you for contacting Mozilla Support!

Maybe you are looking for