{"id":16512,"date":"2019-05-19T21:53:10","date_gmt":"2019-05-19T21:53:10","guid":{"rendered":"https:\/\/www.lifeandnews.com\/articles\/?p=16512"},"modified":"2019-05-19T22:18:58","modified_gmt":"2019-05-19T22:18:58","slug":"your-internet-data-is-rotting","status":"publish","type":"post","link":"https:\/\/www.lifeandnews.com\/articles\/your-internet-data-is-rotting\/","title":{"rendered":"Your internet data is rotting"},"content":{"rendered":"<p><span><a href=\"https:\/\/theconversation.com\/profiles\/paul-royster-724919\">Paul Royster<\/a>, <em><a href=\"http:\/\/theconversation.com\/institutions\/university-of-nebraska-lincoln-832\">University of Nebraska-Lincoln<\/a><\/em><\/span><\/p>\n<p>Many MySpace users were dismayed to discover earlier this year that the social media platform <a href=\"https:\/\/www.npr.org\/2019\/03\/18\/704458168\/myspace-says-it-lost-years-of-user-uploaded-music\">lost 50 million files uploaded between 2003 and 2015<\/a>. <\/p>\n<p>The failure of MySpace to care for and preserve its users\u2019 content should serve as a reminder that relying on free third-party services can be risky.<\/p>\n<p>MySpace has probably preserved the users\u2019 data; it just lost their content. The data was valuable to MySpace; the users\u2019 content less so.<\/p>\n<h2>What happened to MySpace<\/h2>\n<p>MySpace is a social networking media site where performers could upload music or other content for access and distribution to its user community. It has always been a free site, with revenues coming from ads and programming that targets users for specific products.<\/p>\n<p>Formed in 2003 in imitation of the social gaming site Friendster, <a href=\"https:\/\/en.wikipedia.org\/wiki\/Myspace\">MySpace<\/a> grew rapidly and was purchased by Rupert Murdoch\u2019s News Corporation in 2005. By 2008, MySpace was the leading social networking site, <a href=\"https:\/\/www.theguardian.com\/technology\/2011\/jun\/30\/myspace-sold-35-million-news\">valued at one time at US$12 billion<\/a> But it declined in popularity \u2013 thanks to an overprevalence of ads, concerns about exposure of minors to sexual content and other issues. In 2011, News Corporation <a href=\"https:\/\/www.fastcompany.com\/40562668\/myspace-which-still-exists-may-be-up-for-sale-again\">sold MySpace<\/a> to Specific Media, who sold it again in 2016 to Time Inc., which was in turn bought by the Meredith Corporation in 2018.<\/p>\n<p>So the company went through three changes in ownership over a 12-year period, and saw revenues and membership drop precipitously over that time. One sale might be fine, but three sales over short term suggests to me a troubled business that was not in a good position to watch over others\u2019 intellectual property.<\/p>\n<p>Anyone using MySpace as a storage service who did not have alternate backup is simply out of luck. You left your intellectual property sitting beside the information superhighway, and when you came back 10 years later it was gone. <\/p>\n<p>MySpace is not alone in encountering problems. Amazon cloud services, for example, also experienced a <a href=\"https:\/\/www.govtech.com\/policy-management\/Amazon-EC2-Outage.html\">a substantial outage in 2011<\/a> and <a href=\"https:\/\/www.geekwire.com\/2017\/amazon-explains-massive-aws-outage-says-employee-error-took-servers-offline-promises-changes\/\">another in 2017<\/a>. Though temporary, and without actual loss of data, these outages left users without access to precious and important files for some time.<\/p>\n<figure class=\"align-center zoomable\">\n            <a href=\"https:\/\/images.theconversation.com\/files\/273674\/original\/file-20190509-183093-eez38g.jpg?ixlib=rb-1.1.0&amp;q=45&amp;auto=format&amp;w=1000&amp;fit=clip\"><img alt=\"\" src=\"https:\/\/images.theconversation.com\/files\/273674\/original\/file-20190509-183093-eez38g.jpg?ixlib=rb-1.1.0&amp;q=45&amp;auto=format&amp;w=754&amp;fit=clip\" srcset=\"https:\/\/images.theconversation.com\/files\/273674\/original\/file-20190509-183093-eez38g.jpg?ixlib=rb-1.1.0&amp;q=45&amp;auto=format&amp;w=600&amp;h=400&amp;fit=crop&amp;dpr=1 600w, https:\/\/images.theconversation.com\/files\/273674\/original\/file-20190509-183093-eez38g.jpg?ixlib=rb-1.1.0&amp;q=30&amp;auto=format&amp;w=600&amp;h=400&amp;fit=crop&amp;dpr=2 1200w, https:\/\/images.theconversation.com\/files\/273674\/original\/file-20190509-183093-eez38g.jpg?ixlib=rb-1.1.0&amp;q=15&amp;auto=format&amp;w=600&amp;h=400&amp;fit=crop&amp;dpr=3 1800w, https:\/\/images.theconversation.com\/files\/273674\/original\/file-20190509-183093-eez38g.jpg?ixlib=rb-1.1.0&amp;q=45&amp;auto=format&amp;w=754&amp;h=503&amp;fit=crop&amp;dpr=1 754w, https:\/\/images.theconversation.com\/files\/273674\/original\/file-20190509-183093-eez38g.jpg?ixlib=rb-1.1.0&amp;q=30&amp;auto=format&amp;w=754&amp;h=503&amp;fit=crop&amp;dpr=2 1508w, https:\/\/images.theconversation.com\/files\/273674\/original\/file-20190509-183093-eez38g.jpg?ixlib=rb-1.1.0&amp;q=15&amp;auto=format&amp;w=754&amp;h=503&amp;fit=crop&amp;dpr=3 2262w\" sizes=\"(min-width: 1466px) 754px, (max-width: 599px) 100vw, (min-width: 600px) 600px, 237px\"><\/a><figcaption>\n              <span class=\"caption\">In a statement, Myspace said, \u2018We apologize for the inconvenience.\u2019<\/span><br \/>\n              <span class=\"attribution\"><a class=\"source\" href=\"https:\/\/www.shutterstock.com\/image-photo\/london-uk-april-20th-2017-homepage-625284209?src=UaCSO8O5ckQApQ-UerSS1g-1-6\">chrisdorney\/shutterstock.com<\/a><\/span><br \/>\n            <\/figcaption><\/figure>\n<h2>A much bigger problem<\/h2>\n<p>Preserving content or intellectual property on the internet presents a conundrum. If it\u2019s accessible, then it isn\u2019t safe; if it\u2019s safe, then it isn\u2019t accessible. <\/p>\n<p>Accessible content is subject to tampering, theft or other sorts of bad actions. Only content that is inaccessible can be locked and protected from hacking.<\/p>\n<p>The internet currently accesses about 15 zettabytes of data, and is growing at a rate of <a href=\"https:\/\/www.live-counter.com\/how-big-is-the-internet\/\">70 terabytes per second<\/a>. It is an admittedly leaky vessel, and content is constantly going offline to wind up lost forever. <\/p>\n<p><iframe loading=\"lazy\" id=\"ETU1N\" class=\"tc-infographic-datawrapper\" src=\"https:\/\/datawrapper.dwcdn.net\/ETU1N\/3\/\" height=\"400px\" width=\"100%\" style=\"border: none\" frameborder=\"0\"><\/iframe><\/p>\n<p>Massive and desperate efforts are underway to preserve whatever is worth preserving, but even sorting out what is and what is not is itself a formidable undertaking. What will be of value in 10 years \u2013 or 50 years? And how to preserve it? <\/p>\n<p><a href=\"https:\/\/www.niso.org\/publications\/z3948-1992-r2009-permanence-paper\">Acid-free paper<\/a> can last 500 years; stone inscriptions even longer. But magnetic media like hard drives have a much shorter life, lasting only three to five years. They also need to be copied and verified on a very short life cycle to avoid data degradation at <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/empirical-measurements-of-disk-failure-rates-and-error-rates\/\">observed failure rates<\/a> between 3% and 8% annually. <\/p>\n<p>Then there is also a problem of software preservation: How can people today or in the future interpret those WordPerfect or WordStar files from the 1980s, when the original software companies have stopped supporting them or gone out of business?<\/p>\n<p>A nonprofit startup called <a href=\"https:\/\/archive.org\/\">The Internet Archive<\/a> is preserving snapshots of the web on an ongoing basis, but mostly this is for top-level public HTML webpages such as <a href=\"http:\/\/web.archive.org\/web\/*\/nyt.com\">The New York Times website<\/a> and <a href=\"http:\/\/web.archive.org\/web\/2017*\/www.facebook.com\">Facebook<\/a>, not for underlying content files. As of last fall, its <a href=\"https:\/\/en.wikipedia.org\/wiki\/Wayback_Machine\">Wayback Machine<\/a> held over 450 billion pages in 25 petabytes of data. This would represent .0003% of the total internet.<\/p>\n<p>Universities, governments and scientific societies are struggling to preserve scientific data in a hodgepodge of archives, such as the U.K.\u2018s <a href=\"https:\/\/www.dpconline.org\/\">Digital Preservation Coalition<\/a>, <a href=\"https:\/\/metaarchive.org\/\">MetaArchive<\/a>, or the <a href=\"https:\/\/scholarlykitchen.sspnet.org\/2018\/12\/13\/digital-preservation-network-disband\/\">now-disbanded<\/a> collaborative Digital Preservation Network.<br \/>\nPreservation is hard and expensive in time, money and equipment. To be most useful, it not only has to be stored, but hosted in a form that is accessible and available for future reuse. <\/p>\n<p>Actual storage costs less than $0.05 per gigabyte, but <a href=\"https:\/\/doi.org\/10.1007%2Fs00799-012-0092-1\">storage is only a small percentage<\/a> of the costs of preservation. Acquisition, networking, maintenance and administration all require substantial and costly human labor. <\/p>\n<p>Budgeting models suggest a 10-year preservation expense of around $2.50 per gigabyte, or $2,500 per terabyte, or $625,000 for the files MySpace failed to preserve.<\/p>\n<figure class=\"align-center zoomable\">\n            <a href=\"https:\/\/images.theconversation.com\/files\/273676\/original\/file-20190509-183086-1ccfq8n.jpg?ixlib=rb-1.1.0&amp;q=45&amp;auto=format&amp;w=1000&amp;fit=clip\"><img alt=\"\" src=\"https:\/\/images.theconversation.com\/files\/273676\/original\/file-20190509-183086-1ccfq8n.jpg?ixlib=rb-1.1.0&amp;q=45&amp;auto=format&amp;w=754&amp;fit=clip\" srcset=\"https:\/\/images.theconversation.com\/files\/273676\/original\/file-20190509-183086-1ccfq8n.jpg?ixlib=rb-1.1.0&amp;q=45&amp;auto=format&amp;w=600&amp;h=450&amp;fit=crop&amp;dpr=1 600w, https:\/\/images.theconversation.com\/files\/273676\/original\/file-20190509-183086-1ccfq8n.jpg?ixlib=rb-1.1.0&amp;q=30&amp;auto=format&amp;w=600&amp;h=450&amp;fit=crop&amp;dpr=2 1200w, https:\/\/images.theconversation.com\/files\/273676\/original\/file-20190509-183086-1ccfq8n.jpg?ixlib=rb-1.1.0&amp;q=15&amp;auto=format&amp;w=600&amp;h=450&amp;fit=crop&amp;dpr=3 1800w, https:\/\/images.theconversation.com\/files\/273676\/original\/file-20190509-183086-1ccfq8n.jpg?ixlib=rb-1.1.0&amp;q=45&amp;auto=format&amp;w=754&amp;h=566&amp;fit=crop&amp;dpr=1 754w, https:\/\/images.theconversation.com\/files\/273676\/original\/file-20190509-183086-1ccfq8n.jpg?ixlib=rb-1.1.0&amp;q=30&amp;auto=format&amp;w=754&amp;h=566&amp;fit=crop&amp;dpr=2 1508w, https:\/\/images.theconversation.com\/files\/273676\/original\/file-20190509-183086-1ccfq8n.jpg?ixlib=rb-1.1.0&amp;q=15&amp;auto=format&amp;w=754&amp;h=566&amp;fit=crop&amp;dpr=3 2262w\" sizes=\"(min-width: 1466px) 754px, (max-width: 599px) 100vw, (min-width: 600px) 600px, 237px\"><\/a><figcaption>\n              <span class=\"caption\">Huge amounts of new content are uploaded to the internet every day.<\/span><br \/>\n              <span class=\"attribution\"><a class=\"source\" href=\"https:\/\/www.shutterstock.com\/image-photo\/large-group-utp-cables-ethernet-rack-1081131248?src=Yn6oBBWSpzU8MYxq2Im2zA-1-6\">Fingon ss\/shutterstock.com<\/a><\/span><br \/>\n            <\/figcaption><\/figure>\n<h2>Considering your own data<\/h2>\n<p>So yes, the internet is rotting, but archivists and digital librarians like myself knew it was rotten already, as did anyone who ever got a \u201c404 File Not Found\u201d error. <\/p>\n<p>Where there is economic incentive to keep and use data \u2013 such as user information, profiles or browsing history \u2013 it may exist for quite a long time. It has been said by many that <a href=\"https:\/\/medium.com\/project-2030\/data-is-the-new-oil-a-ludicrous-proposition-1d91bba4f294\">data is the new oil<\/a>, and corporations are anxious to drill and exploit this resource. <\/p>\n<p>However, where content is less valuable to whomever owns the servers, there is less incentive to invest in preserving it. A survey of 10 million hits from 25 random sites in 2004 suggests that <a href=\"http:\/\/answers.google.com\/answers\/threadview\/id\/332705.html\">404 errors occur at close to 3% of targeted URLs<\/a>. The internet is growing much faster than it is rotting, but both things are happening at once. No giant internet company has your interests closer to its heart than its own. <\/p>\n<p><iframe loading=\"lazy\" id=\"TqNta\" class=\"tc-infographic-datawrapper\" src=\"https:\/\/datawrapper.dwcdn.net\/TqNta\/1\/\" height=\"400px\" width=\"100%\" style=\"border: none\" frameborder=\"0\"><\/iframe><\/p>\n<p>One preservation network is known under the acronym <a href=\"https:\/\/www.lockss.org\/\">LOCKSS<\/a> \u2013 Lots of Copies Keeps Stuff Safe \u2013 and that\u2019s a good rule of thumb. Always have a backup, and always have multiple backups. Guard your privacy and guard your content, at least that content you may wish to have preserved, like photos, email, that screenplay or novel, or video and music files. Copyright rules do not prohibit storing content you may have purchased, as long as you don\u2019t put it out for public sharing. <\/p>\n<p>Free storage is a great offer, but sometimes you only get what you pay for. The internet is neither secure nor permanent. It never promised to be, and users should not assume that it will become so. Parts are rotting and corroding and collapsing as I type this. Just hope and plan to not be resting on that platform when it falls.<!-- Below is The Conversation's page counter tag. Please DO NOT REMOVE. --><img loading=\"lazy\" src=\"https:\/\/counter.theconversation.com\/content\/115891\/count.gif?distributor=republish-lightbox-basic\" alt=\"The Conversation\" width=\"1\" height=\"1\" style=\"border: none !important; box-shadow: none !important; margin: 0 !important; max-height: 1px !important; max-width: 1px !important; min-height: 1px !important; min-width: 1px !important; opacity: 0 !important; outline: none !important; padding: 0 !important; text-shadow: none !important\" \/><!-- End of code. If you don't see any code above, please get new code from the Advanced tab after you click the republish button. The page counter does not collect any personal data. More info: http:\/\/theconversation.com\/republishing-guidelines --><\/p>\n<p><span><a href=\"https:\/\/theconversation.com\/profiles\/paul-royster-724919\">Paul Royster<\/a>, Coordinator of Scholarly Communications, <em><a href=\"http:\/\/theconversation.com\/institutions\/university-of-nebraska-lincoln-832\">University of Nebraska-Lincoln<\/a><\/em><\/span><\/p>\n<p>This article is republished from <a href=\"http:\/\/theconversation.com\">The Conversation<\/a> under a Creative Commons license. Read the <a href=\"https:\/\/theconversation.com\/your-internet-data-is-rotting-115891\">original article<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Paul Royster, University of Nebraska-Lincoln Many MySpace users were dismayed to discover earlier this year that the social media platform lost 50 million files uploaded between 2003 and 2015. The failure of MySpace to care for and preserve its users\u2019 content should serve as a reminder that relying on free third-party services can be risky. [&hellip;]<\/p>\n","protected":false},"author":44,"featured_media":16511,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[3410],"tags":[2944,1870,5262,549,6338,6339,702],"_links":{"self":[{"href":"https:\/\/www.lifeandnews.com\/articles\/wp-json\/wp\/v2\/posts\/16512"}],"collection":[{"href":"https:\/\/www.lifeandnews.com\/articles\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.lifeandnews.com\/articles\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.lifeandnews.com\/articles\/wp-json\/wp\/v2\/users\/44"}],"replies":[{"embeddable":true,"href":"https:\/\/www.lifeandnews.com\/articles\/wp-json\/wp\/v2\/comments?post=16512"}],"version-history":[{"count":1,"href":"https:\/\/www.lifeandnews.com\/articles\/wp-json\/wp\/v2\/posts\/16512\/revisions"}],"predecessor-version":[{"id":16513,"href":"https:\/\/www.lifeandnews.com\/articles\/wp-json\/wp\/v2\/posts\/16512\/revisions\/16513"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.lifeandnews.com\/articles\/wp-json\/wp\/v2\/media\/16511"}],"wp:attachment":[{"href":"https:\/\/www.lifeandnews.com\/articles\/wp-json\/wp\/v2\/media?parent=16512"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.lifeandnews.com\/articles\/wp-json\/wp\/v2\/categories?post=16512"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.lifeandnews.com\/articles\/wp-json\/wp\/v2\/tags?post=16512"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}