Download the Web Archive
Access our massive collection of preserved websites, historical records, and digital documents. Available in multiple formats for researchers, educators, and historians.
15 Petabytes of Data
450 million archived web pages from 200,000+ domains
200,000+ Domains
Historical websites from organizations, governments, and individuals
Multiple Formats
HTML, images, video, audio, and metadata archives
Choose Your Archive Access Method
We offer multiple access methods to suit your technical needs and usage requirements
Programmatic API Access
For developers and machine learning researchers
- Unlimited access to query the archive
- Access 15+ PB of data using standard search formats
- Authentication and rate limiting for enterprise access
Bulk Download
15-petabyte dataset via secure links
- 15 PB of web content in multiple formats
- Academic and non-profit distribution
- Requires signed data usage agreement
Academic and Research Use
The Internet Archive welcomes researchers and educational institutions to use our collections for scholarly purposes. Bulk downloads require a formal research proposal and affiliation verification.
Research Access Program