Trace through entire site

I have taken over as webmaster of a site that has outgrown it's roots. It is a static site that I am upgrading to PHP/MySQL. Because space on the server is an issue, I'd like to get rid of files that aren't needed and accessed any longer. I was going to write a java program that parsed the files, starting at index.htm and opening each link it finds and parsing that file and so on and so forth. A list will be generated of every page ( and image ) that is "on" the site. I would imagine that this is a common issue and that someone has already written a program like this. I searched the internet for a while, but didn't really know how to search for such a thing and didn't find much. Does anyone know of such a thing or have a better solution? It doesn't matter what language.
Thanks,
Ben Anderson

what you want is a spider
here is an example that (i think does exactly what you want)
http://www.developer.com/java/other/article.php/1573761
btw use google
searching for "parse entire site check for bad links" came up with http://www.inter7.com/osfree.html as its second hit :-)

Similar Messages

  • Consistent swf display through entire site?

    Hi all,
    This is probably a topic that has been covered already,
    probably too many times, but I've been searching for a post on this
    and have had no luck. Anyway, here is my question.
    I am trying to develop a banner for an event planner, she
    wants me to take the pictures she gave me to put into a Flash
    gallery (autoplay) but as a banner.
    How do I make it so that the transition between my html pages
    does not reload the swf on every page. I need to keep it html or I
    would do a complete flash site.
    Maybe there is a way to load html into a flash page (publish
    flash and load html into the published flash html)?
    Any response is appreciated! Thanks.

    Ok, so I guess I have one more question.
    After looking through some of Adobe's tech notes, I realize
    that with sharedobject almost every note explains how to pass data
    or text between site pages. So my question is this:
    How would I be able to get the swf to set a variable for the
    current frame the movie is on, and then load that same variable in
    the newly loaded page?
    So, how would I attach a current frame variable to something
    like this :
    myLocal_so =
    sharedobject.getLocal("flashcookie","/movies/mymovie.swf");
    (tech note tn_16194)
    Thanks again for your help

  • Rollout recursively through entire site?

    Hi,
    I am wondering if there's a way to rollout the whole site?
    I have tried the option of 'Rollout page and all sub pages' but that did not work. Only the selected page was rolled out.
    I have more than 500 pages and it will take quite a lot of time to rollout one by one.
    Is there anything similar to the 'Actvate Tree' tool?
    Thanks!

    Which version of CQ are you using?
    In CQ5.4 there was some problem rolling out to multiple sites in one go. Only one rollout would be successful and the rest would fail.
    To by pass this, a configuration had to be made for the rollout manager in Apache Felix console.
    http://forums.adobe.com/message/4555262
    For CQ5.5 it should be resolved.
    - Ashish

  • How do I add music to play on my entire site?

    Hi!!
    Pretty green at this stuff so bear with me!!
    Designing a website for my small business and would LOVE to add music that could play through the entire site rather than on the page the media is placed on.. does that make any sense to anyone?
    Thanks Much!!

    Did you try the method described in my demo page?
    Note: Internet Explorer and Firefox often have problems with images with reflections. This tutorial describes how to easily convert the image and it's reflection into a single jpeg file: #7 - Converting Photos w/Frames, Drop Shadows and/or Reflections into a Single JPG Image.
    Message was edited by: Old Toad

  • Create a PDF from webpage using entire site option doesn't works

    OK, guys, this is the problem...
    After YEARS of testing from Acrpbat 4 thru 9 on Create a PDF from webpage using entire site option... it doesn't works properly and doesn't got the entire site. ALWAYS get an error of memory or any other error but FINALLY you NEVER got the entire site. I'm talking for a big website... not like amazon.com, but a big one.
    I make a walkarround to try to capture the website in parts but Acrobat is not "intelligent" to make a resume capture of the site, because ALWAYS start from the begining instead from the resume position of the site...
    My question is, HOW can get the ENTIRE SITE in a PDF document... without getting errors or stopping the capture process...
    Don't have problem of low RAM memory because I'm in a MONSTER MACINTOSH.... 3.2 GHZ with 8 Core and 32 GB of RAM under Mac OS X Leopard 10.5.6 in Acrobat 9 Pro.
    If I'm not wrong, the GET ENTIRE SITE option in Web Capture (Create a PDF from webpage using entire site option) doesn't works from Acrobat 4 thru 9 in Pro version... tested, you'll never got the entire site... I'm talking in capture a huge entire site and not a little one...
    Can someone help me?
    Thanks.

    Ok so I go to a web page and select something to print. Once I've clicked print I switch over from printer/micosoft xps docu writer/fax and select Adobe PDF. All goes through with a 'Create Adobe PDF' coming up progressing through to eventually save and store. Once I open the file it end up as the picture above. I have the same version on another computer and that works fine but this particular Sony laptop it doesn't seem to work properly. 

  • Moving entire site to DW - running into problems

    Ok, I'm moving a pretty big site over to Dreamweaver from *ahem* Netscape Composer. It's long overdue, I know. I built the entire site through Composer, and dumped every single file into one huge folder on my server, which is now causing me headaches for obvious reasons. My local files are scattered around on my hard drive, so I re-did the whole thing.
    I think I've properly categorized my website now in DW with folders and sub-folders, and I'm hoping it'll "mirror" correctly on the server when I upload. I haven't put it on the server yet, since I want to make sure it's correct first - also because I'll have to delete all the existing files on the server.
    This site is my family photo album, and I have thumbnails pointing to the regular-size images. I also have my assorted backgrounds, bars, clip-art, etc on the site. When I open one of my HTML pages (which I've downloaded from the server) in DW, it just gives me broken links for my icons and images still.
    My main concern is - do I have to go in and manually re-link all my existing links, or can/will DW do it when it saves my changes locally? Otherwise - what a headache I'm in for, and a huge expenditure of time it's going to take me.
    Thanks for any input. Btw, I have DM MX, not the newest one(s).

    Make sure you have properly defined your site in Dreamweaver...local site folder, etc., and that all your site files are IN that folder. Then, once you get it all working correctly locally, it will work correctly when you upload it back to your server. Do not upload it prematurely, because you will then have a site that might not work (on the server) and have to correct it in two places.
    To help you along, Dreamweaver will run a list of broken links. I think that your version supports that. Check out the broken links one by one (at least in the new version, you can double-click them and bring up the associated file that needs linking) right from the broken links results page.
    Best of luck to you...
    Beth

  • How do I get my entire site to come up when someone clicks search results of just one page of my sit

    When someone searches for my site in google, sometimes only one page of  my site comes up. If they click it, they may only get my menubar, or a  page, such as my calendar page without a menubar or topbar. How can I  make it so when they click the link my entire site comes up. even if the  search they did just lists one of my pages? Help... Thanks

    Are you using FRAMES?
    If not, you need to post a link to your site.

  • Can I publish one page at a time to the ftp when publishing entire site?

    I am currently using iweb to manage my website and i changed a number of pages and require to re-publish the entire site. When i go to do this it attempts to bundle all the files together and publish, but the site only allows for 8 connections via ftp. Is there a way that when i publish the entire site that i can publish one page at a time?
    Thanks for your help!

    Publish your site to a local folder...
    http://www.iwebformusicians.com/iWeb/Publish-Website.html
    ... and upload the files using an FTP application...
    http://www.iwebformusicians.com/Search-Engine-Optimization/Upload.html
    That way you can upload individual files and folders - one at a time.

  • File, create PDF, from web page, entire site...questions

    I am new to adobe, is this the proper forum for Adobe Acrobat 9 Pro for macintosh?  I think yes...
    Created a PDF from:  File, create PDF, from web page, entire site.
    Is there a way to print this without the background color?  If you printed from a browser, you could choose not to print the back ground color.  I know exactly the color.
    Is there a way to make this PDF look like the web, with no page breakes?  I have tried various things, but the page breaks are always displayed.
    Is there a way to create bookmarks on somthing other than the title tag in the web site?  The title tag is an SEO 1 sentence summary of the page, which makes for very long book mark names.
    Thanks for your help.
    bob
    www.answerstat.net

    I don't use v9, but what I would do is click the FILE--PRINT option, print to PDF, and enter to print one page (default is first page)

  • I am an artist who built my site on iWeb 08. Although I have easily been able to transfer to Godaddy, my ability to publish is limited. I must upload my entire site and it is time consuming. How do I convert to iWeb 09 and is this easy to do?

    Help! I am an artist who had my site professionally designed on iWeb 08. Although I have  been able to transfer to Godaddy, my ability to publish is limited. I must upload my entire site and it is time consuming.  I was advised to update to IWeb 09 (which I could get, I think? from Amazon) because it has a built- in option to publish to any FTP. How do I convert my 08 folder to an 09 folder?  Will it convert correctly?  I know eventually I will need to rebuilt the site, but I would like to use this option for awhile to buy time. Is this feasible? I currently own a Mac Pro.

    Its just a question of installing the later version of iWeb from the iLife disk and opening your website in that version. Its a good idea to have a backup of your Domain.sites2 file before doing this. See this page for its location...
    http://www.iwebformusicians.com/iWeb/iWeb-Tips.html
    Publishing settings are shown here...
    http://www.iwebformusicians.com/iWeb/Publish-Website.html
    If you are using Lion/Mountain Lion, see this page for more info...
    http://www.iwebformusicians.com/iWeb/mountain-lion.html

  • Wifi is not available where I come from. I have broadband connection where data transmission is through cell sites then to USB modem connected to a computer. The modem draws power from the computer. Will this setup work with the ipad?

    Wifi is not available where I come from. I have broadband connection where data transmission is through cell sites then to USB modem connected to a computer. The modem draws power from the computer. Will this setup work with the ipad?

    iPad requires Wifi (or 3G /LTE) to connect to the Internet. You cannot connect a USB modem to the iPad.
    You can create your own WiFi hotspot through your computer for your iPad to connect to, if your computer supports this functionality. All Wifi Macs and many Wifi PCs do. Check your computer manual for how to do it.

  • Applying a new template to entire site?

    Is there an easy way to apply a new template to the entire
    site? I created a new template and I can open each page
    individually and click MODIFY/TEMPLATES/APPLY TEMPLATE TO PAGE but
    there has to be an easier way to apply the new template to the
    entire site without having to open every single page in the
    site.

    That would be true ONLY on pages that are already child pages
    of templates.
    Try it on a page that is not a child page, but that has
    existing content.
    This is what the OP was asking.
    Murray --- ICQ 71997575
    Adobe Community Expert
    (If you *MUST* email me, don't LAUGH when you do so!)
    ==================
    http://www.dreamweavermx-templates.com
    - Template Triage!
    http://www.projectseven.com/go
    - DW FAQs, Tutorials & Resources
    http://www.dwfaq.com - DW FAQs,
    Tutorials & Resources
    http://www.macromedia.com/support/search/
    - Macromedia (MM) Technotes
    ==================
    "TheTechChik" <[email protected]> wrote in
    message
    news:[email protected]...
    > As long as you don't change the title of each editable
    region, it should
    > know
    > exactly where to put everything. When I changed
    templates on individual
    > pages
    > it worked flawlessly, I don't see why doing a batch
    process would
    > introduce any
    > new issues as you suggest.
    >
    > Mandy
    >
    > Originally posted by: Newsgroup User
    > > MODIFY/TEMPLATES/APPLY TEMPLATE TO SITE
    >
    > I don't see how it could be done. Imagine a template
    with 3 or more
    > editable regions. How would it know where to put which
    content.
    >
    > Just for fun, take a page with existing content and
    apply a template with
    > a
    > single editable region to it. How'd it work?
    >
    > --
    > Murray --- ICQ 71997575
    > Adobe Community Expert
    > (If you *MUST* email me, don't LAUGH when you do so!)
    > ==================
    >
    http://www.dreamweavermx-templates.com
    - Template Triage!
    >
    http://www.projectseven.com/go
    - DW FAQs, Tutorials & Resources
    >
    http://www.dwfaq.com - DW FAQs,
    Tutorials & Resources
    >
    http://www.macromedia.com/support/search/
    - Macromedia (MM) Technotes
    > ==================
    >
    >
    > "TheTechChik" <[email protected]> wrote
    in message
    > news:[email protected]...
    > > Thanks Alan!
    > >
    > > I too like Raizel have had things go haywire when I
    tried something
    > > similar in
    > > the past but I'm sure I just selected to update
    pages at the wrong time
    > > and/or
    > > didn't update them when I should have.
    > >
    > > You would thing there would be a tool/feature in DW
    to do this easily
    > > like
    >
    > > MODIFY/TEMPLATES/APPLY TEMPLATE TO SITE. Hint Hint
    Adobe ;-)
    > > Changing
    > > file
    > > names and determining when to update and when not
    to can be dangerous
    > > for
    > > many.
    > >
    > > Mandy
    > >
    >
    >
    >
    >
    >
    >

  • Pre-ordering through apple site  or vzw

    I have been seeing a lot of people say that they will be pre-ordering through apples site..however, I thought the pre-orders were only through vzw. Does anyone know for sure...Its about the only thing I am not sure of, that matters to me. If you can do it on apples site what time will they start..I would assume 3am eastern but, I hate to assume anything when it comes to vzw

    ja1234 wrote:
    I have been seeing a lot of people say that they will be pre-ordering through apples site..however, I thought the pre-orders were only through vzw. Does anyone know for sure...Its about the only thing I am not sure of, that matters to me. If you can do it on apples site what time will they start..I would assume 3am eastern but, I hate to assume anything when it comes to vzw
    I chatted with an Apple CSR on Saturday.  I asked him if the online apple store would be offering the verizon iphone preorder and he said yes.  I asked him if they would be starting at 3am ET as well, and he said they had not been given that information at that time.
    We still can't be 100% sure, but I feel more inclined to believe them over the Verizon CSRs at this point, lol.

  • Can't "get" entire site

    Hello,
    I'm trying to get my entire site using DW 5.5 so that I can work on it locally, but lots of the file are being left behind. I follow the instructions to do so, click yes when prompted if I want to get the entire site and the process begins. It then stops, saying that it's complete, but most of the data is missing. It manages to get about 350mb but the total size of the site is nearly 4gb.
    Any advice much appreciated.
    Cheers in advance,
    Matt.

    Why don't you just access the site by ftp and get the files...........I never use Dreamweaver for putting or getiing anything.....it's useless in that regards.
    Geez just read 4gb.... what is it a hi-res image/video library? I would definitely either use ftp or maybe go into the sites control panel and make a back up of the site files and download them from there. Most control panels send you a link to a download zip file once the server has backed up the files.

  • CS3 Synchronization Problems (Entire Site)

    The following problems occur when I attempt to synchronize my
    entire site (954 files) using DW 8 Windows, DW 8 Mac, or DW CS3
    Mac. (If I select a group of files, synchronization always works as
    advertised.)
    1. If I tell DW to synchronize the entire site (in either
    direction) and specify that it should delete files, synchronization
    eternally builds a file list. DW doesn't hang; I can cancel out of
    it and continue to use DW.
    2. If I tell DW to synchronize the entire site (in either
    direction) without specifying that it should delete files, DW most
    often incorrectly announces that no synchronization is necessary.
    Immediately afterwards, if I use the icon on the top left of the
    files panel and select
    Edit | Select Newer Local or
    Edit | Select Newer Remote, DW correctly identifies the
    files that have changed. I get the same behavior if I do those two
    things in the opposite order. In other words, DW simultaneously
    knows and doesn't know which files are newer.
    Is there a fix or a workaround?

    I dunno. Go into DW8 and EXPORT the site definitions with
    their login info.
    Import them into CS3.
    Murray --- ICQ 71997575
    Adobe Community Expert
    (If you *MUST* email me, don't LAUGH when you do so!)
    ==================
    http://www.dreamweavermx-templates.com
    - Template Triage!
    http://www.projectseven.com/go
    - DW FAQs, Tutorials & Resources
    http://www.dwfaq.com - DW FAQs,
    Tutorials & Resources
    http://www.macromedia.com/support/search/
    - Macromedia (MM) Technotes
    ==================
    "Pat Jones" <[email protected]> wrote in message
    news:f4pkcp$1rb$[email protected]..
    > Hi Murray;
    >
    > DW8 is still there and they're both on the same drive.
    Only the log on
    > info is missing. A while ago, DW8 was having problems
    losing the last used
    > site's log on due to IE7 (as I understand the cause). I
    downloaded a patch
    > for that. Any connection?
    >
    > Thanks;
    >
    > Pat
    >
    >
    > "Murray *ACE*" <[email protected]>
    wrote in message
    > news:f4pjkd$111$[email protected]..
    >> The site defs should have migrated forward. Did you
    install on the same
    >> drive? Did you uninstall DW8 before installing CS3?
    >>
    >> --
    >> Murray --- ICQ 71997575
    >> Adobe Community Expert
    >> (If you *MUST* email me, don't LAUGH when you do
    so!)
    >> ==================
    >>
    http://www.dreamweavermx-templates.com
    - Template Triage!
    >>
    http://www.projectseven.com/go
    - DW FAQs, Tutorials & Resources
    >>
    http://www.dwfaq.com - DW FAQs,
    Tutorials & Resources
    >>
    http://www.macromedia.com/support/search/
    - Macromedia (MM) Technotes
    >> ==================
    >>
    >>
    >> "Pat Jones" <[email protected]> wrote in
    message
    >> news:f4pj04$7h$[email protected]..
    >>> Hi;
    >>>
    >>> I upgraded to the CS3 web premium suite from
    studio 8. My site caches
    >>> were imported but the log on info for each was
    not. Is there a fix for
    >>> this ?
    >>>
    >>> Thanks;
    >>> Pat
    >>>
    >>
    >
    >

Maybe you are looking for

  • Videos no longer sync to ipod touch

    We have ipod touch gen 3 and gen 4.  After update we can no longer sync movies to ipods.  Some of our own videos sync, but none of the videos from the itunes store will sync.  They will play in Itunes.  Ipods are up to date with operating system.  I

  • My audiobooks no longer play on my computer from where I stopped listening on my phone.

    I used to be able to listen to my audiobooks (Audible or other) on my iPhone, sync the phone to my laptop and start listening through iTunes where I stopped on my phone. This doesn't seem to work any longer. The book will start from the beginning or

  • Windows XP lost connection to airport extreme base station

    I had a perfect connection after setting up my Airport Extreme with my Mac Book Pro and my HP laptop and one HP desktop then when trying repair media center. I did install the airport utility in XP but when trying to scan it cannot pick up my base st

  • Problem about user-defined resource in RAC

    i do what doc says ,to create user-defined resource , [oracle@rac1 ~]$ crs_profile -create network1 -t application -a /opt/ora/product/10.2.0/crs_1/bin/usrvip -o oi=eth0,ov=192.168.40.221,on=255.255.255.0 [oracle@rac1 ~]$ crs_register network1 [root@

  • Excel

    Hello masters!! Im encountering a problem when I download a file in excel. Actually I dont know if its abap side or excel thing. the problem is I have a description field example of text in description field. example ::    color blue #house what happ