Is there any class to get the head, title, anchors of any page?

hi, i am study at computer science and i have a senior project that is related to Focused Crawler.
1 ) So, i have an issue. I have to get head,title,metas, anchors of any page. But, i don't want to recode it, may be there any classes that performs my willings. For example:
URL a = new URL(....)
string1 = a.getHeadComponent();
string2 = a.getTitleComponent();
string3 = a.getMetas();
string4 = a.getAnchors();
As i written above, is there any code to get these components without extra codding?
2 ) Is it possible to modulate download speed while downloading a page by using URL class of java?
Thanks for responses.

fuatsungur wrote:
hi, i am study at computer science and i have a senior project that is related to Focused Crawler.
1 ) So, i have an issue. I have to get head,title,metas, anchors of any page. But, i don't want to recode it, may be there any classes that performs my willings. For example:
URL a = new URL(....)
string1 = a.getHeadComponent();
string2 = a.getTitleComponent();
string3 = a.getMetas();
string4 = a.getAnchors();
As i written above, is there any code to get these components without extra codding? No.
You have to open a connection. Get the input stream. Read the input stream. Parse the content.
There are HTML parsers out there.
>
2 ) Is it possible to modulate download speed while downloading a page by using URL class of java?
By using URLConnection? No.
If you use raw sockets then increase speed maybe. But "modulate" not really either way.

Similar Messages

  • Is there any report to get the header/item text at GR stage together?

    Hi gurus,
    Is there any report to get the header/item text at GR stage together with a date range?
    at the point of goods receipt?  Or a report that could be run to search for header/item text at GR stage together with a date range?
    Text that was entered at the point of goods receipt?

    Hi
    Check MB51 report with posting dates for materials/vendors and use the display variant document header text (which is GR header text). I hope this is the one you are looking for.
    Thanks

  • When using an external (ergonomic mouse) is there any way to get the wheel to scroll one full page at a time?

    When using an external (ergonomic mouse) is there any way to get the wheel to scroll one full page at a time? Much appreciated.

    I can't use the keyboard to scroll. I need to keep my hand on the mouse, otherwise I'd never get anything done. Since I have no problem getting Firefox to scroll one page at a time, and I'm constantly jumping between 3 browsers, it would be nice to be able to scroll a full page at a time in Safari, just like I can in Firefox. Thanks.

  • The inline line object in the header is offset in facing pages. How can I make it align with the outer margin?

    I have a Pages document with a header that contains both text and a line (a shape, inline) below the text. When I make the document two-sided (facing pages), with a 1" inside margin and a .75" outside margin, the line offsets either left or right in each section of the document. When a section starts with a right-hand page, the line is right justified on ALL the pages in the section (1" from the left and .75" from the right), even though it should be .75" from the LEFT on the LEFT-HAND pages. When a section starts with a left-hand page, the line is left justified on ALL the pages in the section.
    Is there any way to get the header  line to align with the margins properly in facing pages?

    Hi Debra,
    Click on Inspector > Layout > Section > Configuration
    Is there a tick in 'Left and right pages are different'?
    Please post a screen shot of your headers. That will help to show what you are trying to achieve.
    To take a screen shot, hold down the shift and command keys, then type 4. The cursor will change to crosshairs. Release the shift and command keys. Drag over that part of your screen then release the mouse/trackpad. A screen shot will appear on your desktop. In a reply to a message, click on the camera icon in the Toolbar above your reply and
    Choose File > Choose > Insert Image
    You may have to try this twice. Camera icon sometimes needs a wake-up call, but works the second time.
    Remove any personal details before taking the screen shot.
    Regards,
    Ian.

  • Is there any way to get the height/width of an image before importing it in the indesign document.

    Hi All,
    I need to obtain an image's attributes such as dimensions (height, width in pixels) without placing image in indesign document.
    I have full path of the image (say abc.jpg is stored at c:\my pic\abc.jpg).
    I have obtained the IDFile for this image, tried getting size using GetFileSize() which correctly return size in bytes.
    Is there any way to get the height/width of image without importing it in the indesign document.
    Please, give me some hints. I have spent quite a lot time digging in CHM. I have searched in FileUtils, IDFile API's but found no method which serves this purpose.
    At this point I am clueless where to search next.
    Any help will be appriciated.
    Just a point to mention, I am able to get image height and width for an image in indesign doc though Its not my requirement.
    Thnx,
    D.

    You might be able to examine the contents of the PlaceGun after calling kImportAndLoadPlaceGunCmdBoss without actually placing the image in a document. Not sure, but would be worth looking at.
    Otherwise you will probably have to write platform specific code, ideally with a generic platform-independant wrapper (see SDKSamples/paneltreeview/PlatformFileSystemIterator).
    For the Mac, look at CGImageGetWidth() etc., not sure what the best option is for windows.
    Perhaps Quicktime could provide you with a platform independant solution.

  • Is there any way to get the files from other computer ?

    Hai all,
    I have 2 computers connected. I know the IP address of other computer. Is there any way to get the files from other computer. (for e.g. I want to get the files from specific folder. In Java netwroking is it possible? (Any programs)
    regards,
    Namanc

    Bro Take it easy.
    Think you need to send a Image file named ("abc.gif")
    now develop a server using ServerSocket and connect it in local IP and any PORT
    For example i think ur server is connected in "localhost" and 9000 port.
    Now make ur server educated using some commands.
    For example:
    FileName: <File Name String>
    [Means new file is sending by connected client]
    FileSize: <File size long or int value>
    [ So that ur server can determind about the total length of sender file]
    DataModeOpen
    [Now your client will send its image data, this command means your client software is sending data]
    fjadlkfjaofaijojfwoeiurfodkjflsajlfksa
    l23j4lj23lkjlasjfoq23j4rokjelfkjasldkf
    2o3j4lk2j3ljslakdjf2l3j4 l23kjlk23k4j
    DataModeClose
    [Means file sending finished]
    Close
    [Means your client connection is closed]
    Now develop a client supporting this command.
    For example(Server):
    String getFileName=null;
    long getFileSize=0;
    public handleCommand(String getCmd) {
        if(getCmd.startsWith("FileName")) {
                String[] splitData=getCmd.split(":");
                this.getFileName=   splitData[1];
                outPutStream.writeBytes("+OK");
        } else if(getCmd.startsWith("FileSize")) {
                String[] splitData=getCmd.split(":");
                this.getFileSize=Long.parseLong(splitData[1]);
                outPutStream.writeBytes("+OK");
        } else if(getCmd.startsWith("DataModeOpen")) {
               //data mode open so receive data
              receiveData();
              outPutStream.writeBytes("+OK");
        } else if(getCmd.startsWith("DataModeClose")) {
              //close client socket
             getClientSocket.close();
    }For client:
    DataInputStream dataIn=null;
    BufferedOutputStream bout=null;
    public sendFile(bytes[] getBytes) {
            //if ok found then do other or show error message
        if(sendCmd("FileName:"+getFileName)) {
            if(sendCmd("FileSize:"+getFileSize)) {
                if(sendCmd("DataModeOpen")) {
                   bout.write(getBytes);
                } else {
                   showError(3);
            } else {
                   showError(2);
       } else {
            showError(1);
    }i think it will help u

  • I have downloaded a Audio Book from Itunes Store, but rencently i have had to reinstall windows, is there any way to re-download the audio book from Itunes, i know that the audio books are one time downloads, but is there any way to get the audio book

    I have downloaded a Audio Book from Itunes Store, but rencently i have had to reinstall windows, is there any way to re-download the audio book from Itunes, i know that the audio books are one time downloads, but is there any way to get the audio book back as i have already paid for it once and i would not like to pay for it twice.

    If you haven't got a backup copy and if it's still in the store then you can try contacting iTunes support and see if they will grant you a re-download (there is no guarantee that they will) : http://www.apple.com/support/itunes/contact/ - click on Contact iTunes Store Support on the right-hand side of the page

  • Is there any way to get the server app other that through the app store. We do not have our system conneted to a network. I typical downloaded items on a non-apple system.

    is there any way to get the server app other than through the app store. The system that I need it on are not connected to the internet.
    We run a closed-loop network of apple systems and I tipically downloaded updates on a non-apple system that was on the internet.
    Any ideas?

    I did buy the server app, went to the app store to download it. This system I'm using for downloads is Snow Leopard 10.6.8, it shows that I can download it but only as an Install which says I need 10.8. I can't seem to find a way for it to just download as a package or .dmg file so that I can copy it to DVD/CD or flash drive.

  • Is there any way to get the canvases names on the form

    i have a form in Orcle 6i
    i want when the form load i get the names of the Canvases on the Form automatically
    to change their visual attribute
    is there any way to get the name of the canvases on the form automatically
    Thanks in advance

    The amount of time to evaluate each item in a Form is a lot less than you think. You are dealing with milliseconds. Worse case scenario, you might add 1 second to your form load time.
    I think Francois's suggestion is the best solution, but if this is still too much time, you might concider creating strategic When-New-Item-Instance triggers on an item in each of your canvases that would perform what you want to do.
    This will most likely cause problems of its own, but it is an alternative.
    Food to get the thought process flowing... :-)
    Craig...

  • I synced my phone that had an itunes account but no music on it to a computer and it erased all the music on that computer. is there any way to get the music back?

    i synced my phone that had an itunes account but no music on it to a computer and it erased all the music on that computer. is there any way to get the music back?

    See this older post from another forum member Zevoneer covering the different methods and software available to assist you with the task of copying content from your iPod back to your PC and into iTunes.
    https://discussions.apple.com/thread/2452022?start=0&tstart=0
    B-rock

  • Is there any way to get the query used in discoverer from backend??

    Hi All,
    I am new to dicoverer.
    Is there any way to get the query used in discoverer from backend??
    Thanks,
    Sachin

    Hi,
    you can view the SQL from within the discoverer menu:
    in desktop: View-> SQL Inspector
    in Plus: Tools-> Show SQL
    The SQL in the DB tables is encrypted so it's a problem getting it from there.
    You can export the workbook into SQL but it is not working for every worksheet (subquery, inline pivot tables...)
    you can create a batch file to export them all using the output of this script:
    select null,null,'echo off'
    from dual
    union all
    select distinct
    qs.qs_doc_name Workbook_name,
    qs.qs_doc_details Worksheet_Name,
    'start /wait <path>\dis51usr.exe /connect '||'&p_user_name'||'/'||'&p_password'||'@'||'&p_env'||' /apps_user /apps_responsibility "&responsibility" /opendb '||
    chr(34)||qs.qs_doc_name||chr(34)||' /sheet "'||qs.qs_doc_details||'"'||' /export SQL "C:\Export Discoverer SQL\'||qs.qs_doc_name||'-'||qs.qs_doc_details||'.sql" /batch'
    from eul_us.eul5_qpp_stats qs
    where qs.qs_doc_name is not null
    --order by 1,2
    union all
    select null,null,'echo off'
    from dual
    union all
    select null,null,'exit'
    from dual

  • Is there any function module getting the cpu type?

    Is there any function module getting the cpu type?

    I guess the database server...
    I would like to get the SAP system cpu type as done in transaction st06 under system information.
    Thanks.

  • Is there any Vi for getting the CPU and Memory usage of the Local as well as Remote System

    Is there any Vi for getting the CPU and Memory usage of the Local as well as Remote System

    Find the attachment(LV8.5) for local machines. 
    You can use shared variables to monitor the remote machine's usage.
    Attachments:
    Task Monitor 85.vi ‏25 KB

  • Is there any way to get the list of Essbase and planning users  in excel wi

    Is there any way to get the list of Essbase and planning users in excel with last login details.

    Hi,
    This tool might be of some help to you, 'OlapUnderground Advanced Security Manager'. I havent used it personally but went through its features. Give it a try!
    Thanks,
    Junaid

  • Is there any way to get the file for your site from Creative Cloud?

    I lost the file for my Muse site and was wondering if there was any way to get the file from the server or from somewhere in Creative Cloud.
    Thanks.

    Are you Mac or Windows?

Maybe you are looking for