milanuk
02-22-2001, 02:01 PM
Hello,
I am trying to keep my LDP HOWTOs as current as possible. The problem I am running into is that sometimes there are multiple updates every day, it seems.
I could mirror the directories that have the individual HTML HOWTOs and mini-HOWTOs, but they (normally) come in tarball form. I have not yet found a good method for unpacking these all in one shot. I can enter the directory and do a 'gunzip *.gz' and that works just fine. But untarring everything the same way doesn't work so well. I don't have a Linux box handy, so I can't really reproduce the error messages.
I can periodically (weekly) just download the Linux-html-HOWTOs.tar.gz file, but at over 17 MB it takes more than 3 hours to download, which seems like kind of a waste just to get a couple dozen pages updated, out of something like 2000 for a fully exploded archive.
What would be a good way to download the newest updates, say, every night? I was thinking of a bash script run from cron that would run wget to mirror the tarballs to a directory, and then maybe use a 'while' loop to unpack the files one at a time?
As you can probably tell from the preceding paragraph, I _kinda_ know what I want to have happen. I just have little-to-no friggin' idea how to do it. Plus, I'm not even sure that there isn't a better way of doing it, or one already in circulation that I just don't know about ;)
Any help, tips, etc. would be appreciated.
Monte
P.S. And as a final brain teaser, how would you do it _simply_ from a Win98 machine as well? My employer is pretty tolerant and lets me use the laptop at my discretion, but I'd rather not go installing stuff like unix-type shells or perl/python on it. I have enough bandwidth on it that I may just download the entire archive once a week manually in that situation. Again, any ideas/suggestions are welcome.
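For what it's worth, the cron + wget + loop idea described in the post could be sketched roughly as below. This is only a sketch under assumptions: the function names fetch_tarballs and unpack_all are made up for illustration, the mirror URL is left as a parameter because the post doesn't name one, and it assumes GNU wget and a tar that understands -z and -C. The likely reason a wildcard untar "doesn't work so well" is that tar takes one archive per invocation; with 'tar xf *.tar' the remaining file names are treated as members to extract from the first archive, so a loop that runs tar once per file avoids that.

```shell
#!/bin/bash
# Sketch: mirror per-HOWTO tarballs, then unpack each one in turn.
# Function names and arguments are hypothetical, not from the post.

# fetch_tarballs URL DIR
# Download the *.tar.gz files found under URL into DIR.
fetch_tarballs() {
    local url=$1 dir=$2
    mkdir -p "$dir"
    # --mirror turns on recursion and timestamp checking, so files that
    # have not changed on the server are not fetched again.
    # --no-parent stays below URL; --no-directories flattens the tree.
    wget --mirror --no-parent --no-directories \
         --accept '*.tar.gz' --directory-prefix="$dir" "$url"
}

# unpack_all SRC_DIR DEST_DIR
# Run tar once per archive, since tar reads only one archive at a time.
unpack_all() {
    local src=$1 dest=$2 f
    mkdir -p "$dest"
    for f in "$src"/*.tar.gz; do
        [ -e "$f" ] || continue   # the glob matched nothing
        tar -xzf "$f" -C "$dest" || echo "failed to unpack: $f" >&2
    done
}
```

Saved as, say, update-howtos.sh with two calls at the bottom (fetch_tarballs "$MIRROR_URL" "$HOME/howto-tarballs" and then unpack_all "$HOME/howto-tarballs" "$HOME/howto-html", all placeholder names), it could be run nightly from a crontab entry such as '30 3 * * * $HOME/bin/update-howtos.sh'. As written it re-unpacks every tarball on each run, which is wasteful but harmless; unpacking only the files wget actually refreshed would take a bit more bookkeeping.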