Wayback Machine

Download the Web Archive

Access our massive collection of preserved websites, historical records, and digital documents. Available formats for researchers, educators, and historians.

15 Petabytes of Data

450 million archived web pages from 200,000+ domains

200,000+ Domains

Historical websites from organizations, governments, and individuals

Multiple Formats

HTML, images, video, audio, and metadata archives

Choose Your Archive Access Method

We offer multiple ways depending on your technical needs and usage requirements

Programmatic API Access

For developers and machine learning researchers

  • Unlimited access to query the archive
  • Access 15+ PB of data using standard search formats
  • Authentication and rate limiting for enterprise access

Bulk Download

15 petabyte data set via secure links

  • 15 PB of web content in multiple formats
  • Academic and non-profit distribution
  • Requires signed data usage agreement

Academic and Research Use

The Internet Archive welcomes researchers and educational institutions to use our collections for scholarly purposes. Access requires a formal research proposal and affiliation verification for bulk downloads.

Research Access Program