ArchiveBox

Self-hosted _wayback machine_ that creates HTML & screenshot archives of sites from your bookmarks, browsing history, RSS feeds, or other sources.


Product Overview

ArchiveBox is an open-source, self-hosted tool that lets organizations and individuals archive both public and private web content while retaining control over their data. Typical uses include saving copies of bookmarks, preserving evidence for legal cases, and backing up photos from social media platforms.

Main Features

ArchiveBox offers a range of features that make it an ideal solution for web archiving:

  • Multi-source input: Feed ArchiveBox URLs one at a time or schedule regular imports from your bookmarks, history, social media feeds, RSS feeds, link-saving services like Pocket/Pinboard, and more.
  • Redundant format saving: Save snapshots of the URLs you feed it in several redundant formats, so an archived page remains usable even if one format fails to capture or render it.
  • Content extraction: Detect content featured inside pages and extract it into a folder, making the archived material easy to access and reuse.
  • Flexible usage: Use ArchiveBox as a CLI tool, self-hosted Web App, Python library, or one-off command.
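
As a rough illustration of the multi-source input and CLI usage described above, the following Python sketch merges URLs from an RSS feed and a bookmark list, drops duplicates, and builds an `archivebox add` command line. The helper names (`urls_from_rss`, `merge_sources`, `build_add_command`) and the sample data are hypothetical and not part of ArchiveBox. The sketch only prints the command; actually running it assumes ArchiveBox is installed and that you are inside an initialized data directory.

```python
import xml.etree.ElementTree as ET


def urls_from_rss(rss_text: str) -> list[str]:
    """Return the <link> of every <item> in an RSS 2.0 document."""
    root = ET.fromstring(rss_text)
    return [
        link.text.strip()
        for link in root.iterfind("./channel/item/link")
        if link.text
    ]


def merge_sources(*sources: list[str]) -> list[str]:
    """Concatenate URL lists, dropping duplicates but keeping first-seen order."""
    return list(dict.fromkeys(url for source in sources for url in source))


def build_add_command(urls: list[str]) -> list[str]:
    """Build an `archivebox add` invocation with the URLs as arguments (hypothetical helper)."""
    return ["archivebox", "add", *urls]


# Made-up sample data standing in for a real feed and a real bookmark export.
SAMPLE_RSS = """<rss version="2.0"><channel><title>Example feed</title>
<item><title>A</title><link>https://example.com/a</link></item>
<item><title>B</title><link>https://example.com/b</link></item>
</channel></rss>"""

bookmarks = ["https://example.com/b", "https://example.org/notes"]

urls = merge_sources(urls_from_rss(SAMPLE_RSS), bookmarks)
command = build_add_command(urls)

# Print the command instead of executing it, so the sketch has no side effects.
print(" ".join(command))
```

Deduplicating before the hand-off keeps the command short when the same link appears in several sources. For recurring imports, the scheduling feature mentioned above is likely a better fit than a custom script.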

With ArchiveBox, you keep your own copies of the pages that matter to you, so the content stays available even if the original sites change or disappear.

Related

ArchivesSpace
LinkWarden
Wayback
Piler
Plik
Pinepods
Salut à Toi