The Internet Archive discovers and captures web pages through many different web crawls.
At any given time several distinct crawls are running, some for months, and some every day or longer.
View the web archive through the Wayback Machine.
Web wide crawl with initial seedlist and crawler configuration from October 2010
TIMESTAMPS
The Wayback Machine - https://web.archive.org/web/20110106193921/http://www.searchengineoptimising.com/glossary/glossary-of-computer-and-internet-terms/protocol
In order for computers to successfully communicate with each other they need to use a standard set of instructions and rules. And this is known as a protocol.