Help with wget downloading a website
#1
23 Dec 2009 @ 22:06
kk0425
Junior Member
So, I'm having a small problem with wget. I'm trying to download a website but, due to the nature of the site, not all of the images are available all the time. I also can't download just the folder with the images I want because of the robots.txt file (though I found a way around that).
I need wget to act like a crawler, but it should also download the pictures. From my understanding, the --spider option will not download anything; it just crawls the site looking for "stuff".
I'm using the command 'wget -r -e robots=off sitename.com' and that gets some of it, but only the files that are linked at the time. If I try 'wget -r sitename.com/images/' I get a 404.
If I use 'wget --spider -r -e robots=off sitename.com' it doesn't download anything.
Thank you for your time!
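For anyone landing on this thread: a rough sketch of a recursive wget call that keeps only image files, assuming the images are linked from pages on the same host. The helper name image_crawl_cmd and the suffix list are made up for illustration, and sitename.com is the placeholder from the post. The function only prints the command, so nothing is fetched until you run the printed line yourself.

```shell
#!/bin/sh
# Hypothetical helper: prints a wget command for the given site.
# Nothing is downloaded unless the printed command is executed.
image_crawl_cmd() {
    site="$1"
    # -r             recurse through links found in fetched pages
    # -l inf         no depth limit (wget's default is 5 levels)
    # -np            never ascend above the starting directory
    # -e robots=off  ignore robots.txt, as in the original command
    # -A ...         keep only files with these suffixes; HTML pages are
    #                still fetched so their links can be followed, then
    #                deleted afterwards
    printf 'wget -r -l inf -np -e robots=off -A jpg,jpeg,png,gif %s\n' "$site"
}

image_crawl_cmd sitename.com
```

Note that wget can only discover files that are linked from a page it is able to fetch. The 404 on sitename.com/images/ suggests the server does not publish a directory index, so images that are not linked at the moment of the crawl would probably still be missed. Re-running the crawl at different times may be the only way to collect them.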
This discussion thread has been automatically closed, as it hasn't received any new posts during the last 180 days. This means that you can't post replies or new questions to this discussion thread.
If you have something to add to this topic, use this page to post your question or comments to a new discussion thread.

