The Internet Archive discovers and captures web pages through many different web crawls.
At any given time several distinct crawls are running, some for months, and some every day or longer.
View the web archive through the Wayback Machine.
Web wide crawl with initial seedlist and crawler configuration from October 2010
TIMESTAMPS
The Wayback Machine - https://web.archive.org/web/20110106213014/http://www.searchengineoptimising.com/glossary/glossary-of-computer-and-internet-terms/file
A File is an accumulation of data that is stored together under one name. Files can contain text files, images, audio and video clips and even applications. Applications can also be files, such as iTunes and Microsoft Internet Explorer.