A Research Library Based on the Historical Collections of the Internet Archive
I have always considered the Internet Archive a very interesting space on the web, having first encountered it in the late 90s as an intrepid, web-exploring youth. This effort at Cornell, which adapts the Archive's collections into a backed-up and usable tool, intrigued me because it is a prime example of what I think is a main concern with digital content. Digital content is easily produced: anyone at a computer terminal can create it in a matter of minutes. With such a low barrier to creation, the volume of content grows at an enormous rate. Essentially, the problem becomes one of control: content amasses so rapidly that there is scant time to provide proper curation, both because of the sheer volume and because of the variety of types, contexts, and topics it spans. An effort such as this one at Cornell builds on the aggregation and archiving that the Internet Archive and its affiliates have established over nearly two decades of operation. I see this as a natural partnership for large-scale cataloging of publicly created material: one entity collects it in an organized manner, and another refines it into a usable resource that can be put to beneficial use. Without any order, the Internet Archive is essentially a snapshot with no context, a piece of data with no metadata, which renders it almost worthless as a tool for study or research.