RECMGMT-L Archives

Records Management

RECMGMT-L@LISTSERV.IGGURU.US

Subject:
From:
Reply To: Records Management Program <[log in to unmask]>
Date: Tue, 19 Jan 2016 19:49:54 -0600
To most of the web-surfing public, the Internet Archive’s Wayback Machine
is the face of the Archive’s web archiving activities. Via a simple
interface, anyone can type in a URL and see how it has changed over the
last 20 years. Yet, behind that simple search box lies an exquisitely
complex assemblage of datasets and partners that make possible the
Archive’s vast repository of the web. How does the Archive really work,
what does its crawl workflow look like, how does it handle issues like
robots.txt, and what can all of this teach us about the future of web
archiving?


http://onforb.es/1ZzZUei

-- 
Peterk
Dallas, Tx
[log in to unmask]
Save our in-boxes! http://emailcharter.org
"The problems of our economy have occurred not as an outgrowth of
laissez-faire, unbridled competition.
They have occurred under the guidance of federal agencies, and under the
umbrella of federal regulations."
Senator Ted Kennedy, in defending trucking deregulation in 1978.

List archives at http://lists.ufl.edu/archives/recmgmt-l.html
