RECMGMT-L Archives

Records Management

RECMGMT-L@LISTSERV.IGGURU.US

Reply To: Records Management Program <[log in to unmask]>
Date: Tue, 17 Nov 2015 20:52:46 -0600

The Internet Archive turns 20 years old next year, having archived nearly
two decades and 23 petabytes of the evolution of the World Wide Web. Yet,
surprisingly little is known about what exactly is in the Archive’s vaunted
Wayback Machine. Beyond saying it has archived more than 445 billion
webpages, the Archive has never published an inventory of the websites it
archives or the algorithms it uses to determine what to capture and when.
Given the Archive’s recent announcements of new efforts to make its web
archive accessible to scholarly research, it is critically important to
understand what precisely makes up this 445-billion-page archive and how
that composition might affect the kinds of research scholars can perform
with it.


http://onforb.es/1OPEKbM
http://onforb.es/1OPEKbM+




-- 
Peterk
Dallas, Tx
[log in to unmask]
Save our in-boxes! http://emailcharter.org
"The problems of our economy have occurred not as an outgrowth of
laissez-faire, unbridled competition.
They have occurred under the guidance of federal agencies, and under the
umbrella of federal regulations."
Senator Ted Kennedy, in defending trucking deregulation in 1978.

List archives at http://lists.ufl.edu/archives/recmgmt-l.html
Contact [log in to unmask] for assistance
To unsubscribe from this list, click the below link. If not already present, place UNSUBSCRIBE RECMGMT-L or UNSUB RECMGMT-L in the body of the message.
mailto:[log in to unmask]
