Sign in or 

wget -H -r --level=1 -k -p -erobots=off -np -NHow to run this script: Replace http://del.icio.us/ginatrapani/software with your del.icio.us username and desired tag. Create a new directory called "del.icio.us archive" and from that directory at the command line, run your edited version of the script. (Even better, copy and paste the command into a text file, tweak it to your own needs, and save it as a script - .bat for Windows users, and .sh for Mac users. Then run the script instead of typing out that long thing every time.) When the command has completed, you'll have directories set up named after each domain in the del.icio.us links, with the files stored within them.
--exclude-domains=del.icio.us,doubleclick.net
http://del.icio.us/ginatrapani/software
wget -H -r --level=1 -k -p -erobots=off -np -NThe only difference between this command and the last is that it includes an "--exclude-directories=ginatrapani" directive, which keeps wget from downloading every tag folder unnecessarily. Archive someone's diggs Say you want to archive all the stories Kevin Rose diggs. The wget command would look something like this (without the line breaks):
--exclude-directories=ginatrapani
--exclude-domains=del.icio.us, doubleclick.net http://del.icio.us/ginatrapani
wget -H -r --level=1 -k -p -erobots=off -np -NSimilar to the command above, this one excludes more ad servers (so you don' t fill your hard drive with banner ad images) and is pointed at kevinrose's dugg page.
--exclude-domains=digg.com,doubleclick.net,doubleclick.com,fastclick.net,fmpub.net,tacoda.net,adbrite.com,sitemeter.com
http://digg.com/users/kevinrose/dugg
wget -k -p -erobots=off -np -N -nd -O aviationquotes-notebook.htmlTips and tricks for archiving the web locally
http://www.google.com/notebook/public/18344006957932515597/BDSKUIgoQ9K_Emdkh
BY CORRANROGUE9 AT 09/13/06 10:34 AM @ phantomdata: Good idea setting it up to run nightly, but if the site goes away, then it will rewrite your current data with the blank site. Then you *won't* have it. I would suggest that just after writing your essay that you need websites archived for, then backup the data.
BY DAVID BURCH AT 09/13/06 11:54 AM Gina, Couldn't you also pass a user name and password, either in the URL or as command-line arguments, to down load private Google Notebooks?
BY BASSAM AT 09/13/06 11:56 AM Great Article Gina! I'll definitely be trying this out. Any ideas on how to use wget to archive private Google Notebooks? I keep most of my notebooks private, and I'd love to be able to download them.
BY GINA TRAPANI, LIFEHACKER EDITOR AT 09/13/06 01:17 PM @David: You'd probably have to pass on your Google cookies with wget to the command to authenticate to see private notebooks. I haven't given this a try, but do wget --help to see the cookie options. And yes, Slogger and Scrapbook are both Firefox-based non command line (so non schedulable) extensions that do this as well, with a much friendly GUI interface.
BY SANDER AT 09/14/06 01:09 AM I just found the following code, it is the most concise way I found in two days to back up delicious bookmarks. wget http://del... --http-user=YOURUSERNAME --http-passwd=YOURPASSWORD --no-check-certificateIt gets all your bookmarks because it accesses the API. Not just your first 100 like some other examples. The only problem i have is that the results are in XML format, if anyone had a automated way of transforming it to a unordered html list that would be great. (or could adjust the command).
BY TOWTRUCK AT 09/21/06 05:52 AM Is there a way with wget to just have it output to the screen (instead of downloading)? I would like to use this for debugging code, so I just want to see the raw code coming back, but dont necessarily want to save it to a file every time.
BY SARAH @ JINJUR AT 09/22/06 11:42 AM Gina, thanks! I was planning to start looking for a way to back up my del.icio.us bookmarks and here it is! You (and all of Lifehacker) are my hero once again. (Since bloglines hosed over 200 clippings and saved items of mine I'm saving everything in del.icio.us now, but having lost my bloglines stuff I've been feeling paranoid about having it all in one place. As soon as I actually do this I'll feel so much better!)
BY SHARJEEL AT 09/22/06 10:44 PM I have tweaked your command to download pages for all links in a del.icio.us bookmarks I currently have 494 bookmarks saved in del.icio.us and here is what I used to download all of them wget -H -r --level=1 -k -p -erobots=off -np -N --exclude-directories=sharjeelsayed --exclude-domains= del.icio.us,doubleclick.net http://del.icio.us/sharjeelsayed?setcount=100?page=1 http://del.icio.us/sharjeelsayed?setcount=100?page=2http://del.icio.us/sharjeelsayed?setcount=100?page=3 http://del.icio.us/sharjeelsayed?setcount=100?page=4http://del.icio.us/sharjeelsayed?setcount=100?page=5 
|
rockyen |
Latest page update: made by rockyen
, Jan 9 2008, 5:42 PM EST
(about this update
About This Update
1 image added view changes - complete history) |
|
Keyword tags:
None
More Info: links to this page
|