<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/'><id>tag:blogger.com,1999:blog-4503292949532760618.post8370267522404940887..comments</id><updated>2008-12-10T06:34:31.760-08:00</updated><category term='CNI2009spring'/><category term='e-journals'/><category term='ipres2008'/><category term='jcdl2010'/><category term='pda2011'/><category term='format migration'/><category term='blog-science'/><category term='storage costs'/><category term='peer review'/><category term='fast11'/><category term='storage failures'/><category term='ipres2010'/><category term='government information'/><category term='format obsolescence'/><category term='open access'/><category term='memento'/><category term='benchmarks'/><category term='normalization'/><category term='scholarly communication'/><category term='publishing business'/><category term='idcc2008'/><category term='fast2009'/><category term='green preservation'/><category term='digital preservation'/><title type='text'>Comments on DSHR's Blog: How Hard Is "A Petabyte for a Century"?</title><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://blog.dshr.org/feeds/8370267522404940887/comments/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4503292949532760618/8370267522404940887/comments/default'/><link rel='alternate' type='text/html' href='http://blog.dshr.org/2008/01/how-hard-is-petabyte-for-century.html'/><author><name>David.</name><uri>http://www.blogger.com/profile/14498131502038331594</uri><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>2</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>25</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-4503292949532760618.post-7553437286545905217</id><published>2008-01-21T07:03:00.000-08:00</published><updated>2008-01-21T07:03:00.000-08:00</updated><title type='text'>I agree that "disk, tape and other media are remar...</title><content type='html'>I agree that "disk, tape and other media are remarkably reliable". Disk and tape drives contain many layers of astonishing engineering, from the medium though the heads and the signal processing that extracts bits from the noisy analog signal, and the error correcting codes that clean up the bit stream. But these are nowhere near good enough to meet the demands society has for data preservation. I'm not the right person to explain these layers.&lt;BR/&gt;&lt;BR/&gt;So, on top of the actual storage media we have to layer file systems and digital preservation systems to make up for their (measurable) unreliability. They increase the reliability of the bits by another large factor. The problem is not in  our ability to continue to add more and more error detection and correction capabilities to the pile. I have worked in these layers  off and on for many years.&lt;BR/&gt;&lt;BR/&gt;There are three problems. The first is that, perhaps because they are in awe of the astonishing engineering, very few people are measuring the reliability their pile is delivering. The second is that the scanty evidence we have is that the pile is failing to deliver the reliability we would expect. The third is that, even if the pile &lt;I&gt;was&lt;/I&gt; delivering the reliability we need, we could not perform the experiments needed to prove that it was doing so.&lt;BR/&gt;&lt;BR/&gt;So, yes, it is amazing that we can store bits so reliably. But in the context of digital preservation, patting ourselves on the back about this achievement is counter-productive.</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4503292949532760618/8370267522404940887/comments/default/7553437286545905217'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4503292949532760618/8370267522404940887/comments/default/7553437286545905217'/><link rel='alternate' type='text/html' href='http://blog.dshr.org/2008/01/how-hard-is-petabyte-for-century.html?showComment=1200927780000#c7553437286545905217' title=''/><author><name>David.</name><uri>http://www.blogger.com/profile/14498131502038331594</uri><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img2.blogblog.com/img/b16-rounded.gif'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://blog.dshr.org/2008/01/how-hard-is-petabyte-for-century.html' ref='tag:blogger.com,1999:blog-4503292949532760618.post-8370267522404940887' source='http://www.blogger.com/feeds/4503292949532760618/posts/default/8370267522404940887' type='text/html'/><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='blogger.itemClass' value='pid-1605961295'/></entry><entry><id>tag:blogger.com,1999:blog-4503292949532760618.post-2782075341987450379</id><published>2008-01-21T02:46:00.000-08:00</published><updated>2008-01-21T02:46:00.000-08:00</updated><title type='text'>Hi David, I think you somewhat misrepresent my com...</title><content type='html'>Hi David, I think you somewhat misrepresent my comment on the other post. I was ONLY addressing the "bit half life 100M times the age of the universe" factoid. For the record, I do believe that keeping a petabyte for a century is hard (I know someone who is trying to do it), and I accept many of your other evidence and arguments. However, it's clear that we can keep data which represents bit half lives &gt;&gt; the age of the universe, even after only 60 years of trying (I don't think I quite emphasised how hard a GB for a decade would have seemed in 1968, when I first met a 8 MB disk drive the size of a fridge).&lt;BR/&gt;&lt;BR/&gt;I would like to understand HOW we can achieve such miraculous-seeming results; I guess it's through distributing checks widely across many layers and many hardware devices.</content><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/4503292949532760618/8370267522404940887/comments/default/2782075341987450379'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/4503292949532760618/8370267522404940887/comments/default/2782075341987450379'/><link rel='alternate' type='text/html' href='http://blog.dshr.org/2008/01/how-hard-is-petabyte-for-century.html?showComment=1200912360000#c2782075341987450379' title=''/><author><name>Chris Rusbridge</name><email>noreply@blogger.com</email><gd:image xmlns:gd='http://schemas.google.com/g/2005' rel='http://schemas.google.com/g/2005#thumbnail' width='16' height='16' src='http://img1.blogblog.com/img/blank.gif'/></author><thr:in-reply-to xmlns:thr='http://purl.org/syndication/thread/1.0' href='http://blog.dshr.org/2008/01/how-hard-is-petabyte-for-century.html' ref='tag:blogger.com,1999:blog-4503292949532760618.post-8370267522404940887' source='http://www.blogger.com/feeds/4503292949532760618/posts/default/8370267522404940887' type='text/html'/><gd:extendedProperty xmlns:gd='http://schemas.google.com/g/2005' name='blogger.itemClass' value='pid-1244104524'/></entry></feed>
