I would be very interested in how much data was collected in this. I'd also be interested to know whether it was a basic harvest or whether some smart archiving was done for duplicate files, etc.

E.g. a couple of customers have between 4 and 8 or so sites all pointing at the same place.

Also, as a computer tech I put up a download directory to grab programs from for cleaning customers' PCs (e.g. spyware utils, general apps, service packs). A quick look shows 1 gig of data there, and we have that as .co.nz, .net.nz and as different domains, just so that if I tell a customer over the phone to download something to fix a problem they won't make a mistake.

Funny, you'll probably be using 4 gig just on my spyware apps and service packs because somehow it's document heritage to New Zealand... of programs made mostly in the States :)

Ah well, in 30 years I guess someone will be interested to see what the internet looked like back in 2008. It's also probably a cheaper option than our government spending $100 million to hire people to decide what should and shouldn't be kept.

Philip

-----Original Message-----
From: Don Gould [mailto:don(a)bowenvale.co.nz]
Sent: Wednesday, 22 October 2008 1:17 p.m.
To: Gordon Paynter
Cc: nznog(a)list.waikato.ac.nz
Subject: Re: [nznog] NLNZHarvester2008

Gordon,

Are you guys going to tell us how much data was collected in the end?

I also note that the crawler stepped off .nz and went after links that were in .nz pages.

E.g. this page has links in it to pointclark.net which were also crawled.

http://www.crra.org.nz/content/view/17/7/

Cheers
Don

Gordon Paynter wrote:
Hi Tony:
The archived material will be hosted at the National Library in New Zealand (i.e. in our machine room).
I'll update the FAQ with this information when I get a chance.
Gordon
--
Gordon Paynter
Technical Analyst
National Digital Library
National Library of New Zealand
+64 4 474 3114
"Tony Wicks"
21/10/08 8:07 p.m. >>>

I'm sure the good people at Natlib are on top of it, but seeing as this is being discussed to the n'th degree, my question would be: who will hold this archived material long term? I sincerely hope it will be physically (and diversely) stored and served in New Zealand, not stored at a faceless overseas company that can change ownership, collapse or be otherwise invaded.

If we are paying to have this material preserved (which IMHO is entirely appropriate), it needs to be here in a government-owned (or suitably contracted and legally covered private) repository. This is the only way to ensure that it will survive long term.

_______________________________________________
NZNOG mailing list
NZNOG(a)list.waikato.ac.nz
http://list.waikato.ac.nz/mailman/listinfo/nznog