Skip to content

Latest commit

 

History

History
185 lines (120 loc) · 12.8 KB

README.md

File metadata and controls

185 lines (120 loc) · 12.8 KB

What is Hoardy-Web Web Extension?

Hoardy-Web is a Web Extension browser add-on that passively captures and collects dumps of HTTP requests and responses (see WRR file format) as you browse the web, and then archives them to your Downloads directory, your own private archiving server (like hoardy-web-sas), or your browser's local storage.

Practically speaking, you install this and just browse the web normally while Hoardy-Web passively, in background, captures and archives web pages you visit for later offline viewing, mirroring, and/or indexing. Hoardy-Web has a lot of configuration options to help you tweak what should or should not be archived and a very low memory footprint, keeping you browsing experience snappy even on ancient hardware (unless explicitly configured otherwise to, e.g., minimize writes to disk instead).

In other words, this extension implements an in-browser half of your own personal private passive Wayback Machine that archives everything you see, including HTTP POST requests and responses (e.g. answer pages of web search engines), as well as most other HTTP-level data (AJAX/JSON RPC/etc).

By default, this extension will save all captured data into browser's local storage, so it can be used standalone. To view your archived data, however, you will need to install at the very least the accompanying hoardy-web CLI tool (also there).

If you do not care about archival, you can also use this extension to log and later inspect HTTP traffic generated by various sleazy websites, even when they generate said web traffic on events that can not normally be inspected with browser's own Network Monitor (e.g. when a page generates HTTP requests when its window closes).

Hoardy-Web is most similar to archiveweb.page and DownloadNet projects, but it works both on Firefox- and Chromium-based browsers, and it follows its own design philosophy, which makes the experience of using it is very different.

Hoardy-Web was previously known as "Personal Private Passive Web Archive" aka pWebArc.

Screenshots

See the gallery.

What does it do exactly? I have questions.

Installation

On Firefox-based browsers (Firefox, Tor Browser, LibreWolf, Fenix aka Firefox for Android, Fennec, Mull, etc)

Install as an unsigned XPI

  • Make sure your browser supports installation of unsigned add-ons (Firefox ESR, Nightly, Developer Edition, and Tor Browser do, vanilla Firefox and its mobile versions do not).
  • Go to about:config, set xpinstall.signatures.required to false.
  • Go to about:addons, click the gear button, select Install Add-on from File, and select the XPI file in ./extension/dist directory (or do File > Open File from the menu and then select the XPI file, or drag-and-drop the XPI file into the browser window).
  • Then press Extensions toolbar button and pin Hoardy-Web.

Install as a temporary add-on

If you are building from source, this is a nice way to do development, since to reload the add-on after making a new build will require a single click.

  • In the browser, go to about:debugging#/runtime/this-firefox, click Load Temporary Add-on button, and select ./extension/dist/Hoardy-Web-firefox-*/manifest.json.
  • Then you might need to go into about:addons and enable Run in Private Windows for Hoardy-Web if your Firefox is running in Private-Windows-only mode.

On Chromium-based browsers (Chromium, Google Chrome, Ungoogled Chromium, Brave, etc)

Hoardy-Web isn't on Chrome Web Store because Hoardy-Web appears to violate its "Terms of Use". Specifically, the "enables the unauthorized download of streaming content or media" clause. See higher-level README for more info and discussion.

So, installation on Chromium-based browsers (aka "Chromium forks") requires a little bit of work. There are several ways you can do it. I recommend you read all of the following subsections first, and then decide what you want to actually try doing. Do not rush.

Install as an unpacked extension

This is the simplest method that will work on all Chromium forks, but Hoardy-Web will not get automatic updates. I.e., you will have to check the Releases page periodically, download, and install new releases manually to update.

  • Download the latest Hoardy-Web-chromium-*.zip from Releases.
  • Unpack it. It's packed with a single directory named Hoardy-Web-chromium-* inside for convenience.
  • Go to Extensions > Manage Extensions in the menu of your browser and enable Developer mode toggle.
  • On the same page, press Load Unpacked and select the directory the unpacking of the .zip file produced. It should have manifest.json file in it, just navigate to that directory select it and then press the Open button (or navigate into that directory and then press the Open button, that will work too).
  • Then press Extensions toolbar button and pin Hoardy-Web.

Apparently, if you are using Google Chrome, you will get annoying warnings about side-loaded extensions. But you can also whitelist your extensions to prevent it, see the second answer of this stackoverflow question. For a Hoardy-Web downloaded from Releases here, its Chromium extension ID is amklkmnfmgjmofkjifjgngdnajpbpefp.

Install as an unpacked extension built from source

Alternatively, you can build it from source and then follow those same instructions above, except use ./dist/Hoardy-Web-chromium-* directory after pressing Load Unpacked button in the browser's UI.

Similarly, there will be no automatic updates.

In case you need to whitelist your build, the Chromium extension ID of your build will be written to ./dist/manifest-chromium-id.txt. That ID is derived from a public key, which is derived from a private key, which is generated by the gen-chromium-key.sh script called by build.sh script when you build the extension the very first time. The result then gets stored in ./private/chromium.key.pem and reused between builds.

Install a CRX directly

If your Chromium fork supports installation of third-party CRX files (not fetched from Chrome Web Store), you can do this:

  • Go to chrome://flags.
  • Search for the #extension-mime-request-handling flag and set it to Always prompt for install.
  • (If you did not find such a thing there, then you Chromium fork does not support installations of third-party CRX files.)
  • Then, download the latest Hoardy-Web-chromium-*.crx from Releases.
  • The browser should prompt you if you want to install Hoardy-Web.
  • Confirm the install.

There may or may not be automatic updates for Hoardy-Web, depending of what your Chromium fork comes with. If it supports updates for third-party extensions, you will get updates, if it does not, you will not. The vanilla mainline Chromium comes without any such support. See below for how to fix it.

Install a CRX via drag-and-drop

If your Chromium fork supports installation of third-party CRX files fetched manually:

  • Download the latest Hoardy-Web-chromium-*.crx from Releases.
  • Drag-and-drop the resulting CRX file from your Downloads folder into your browser's window.
  • The browser should either prompt you if you want to install Hoardy-Web or just install it silently.
  • (If it does not, then you browser does not support that too.)
  • Confirm the install or check you extensions list to confirm it's there.

The updates situation will be exactly the same as above.

How to make Hoardy-Web (and other indie extensions) automatically update on Chromium forks that do not support auto-updates for third-party extensions

  • Install Hoardy-Web using one of the above methods.

  • Install chromium-web-store extension using one of the above methods.

    It exists to help you to install extensions from Chrome Web Store and other similar Web Extensions repositories on Chromium forks that do not come with builtin support for Web Extension stores. More importantly, however, it can periodically check all your extensions that have an update_url field set in their manifest.json for updates and notify you about them. (Hoardy-Web, of course, comes with update_url set.)

    So, the simplest way to do this on a most limited Chromium fork is to run

    git clone https://github.com/NeverDecaf/chromium-web-store

    and then simply Load Unpacked the ./chromium-web-store/src directory in your Chromium fork.

Then you can:

  • Go to Extensions > Manage Extensions in the menu of your browser.
  • Press the Details button on Chromium Web Store extension.
  • Press the Extension options (should be near the bottom) there.
  • And edit those options to your liking.

E.g., you might want to:

  • change hourly update checks to daily by setting the interval to 1440,
  • ask it to ignore some of your extensions you don't want to ever update,
  • disable Enable Chrome Web Store Integration,
  • etc, see there for more info.

Congratulations, from now on Hoardy-Web --- or any other extension that has update_url field set in its manifest.json, regardless of its availability at Chrome Web Store --- will get checked for updates periodically.

  • If you installed Hoardy-Web (or another extension which has update_url field set) via a CRX, then chromium-web-store can even automatically update it for you.
  • If you installed Hoardy-Web (or another extension) via Load Unpacked, you will have to manually re-install it from a new release on updates, but you will at least get notified about it updating without you needing to check manually.

See chromium-web-store's README for more info and instructions, especially if you get CRX_REQUIRED_PROOF_MISSING or Apps, extensions and user scripts cannot be added from this website errors.

Recommended next steps

  • Open Hoardy-Web's popup and press the Help button there. Read the contents of that page.

  • Eventually, install and see the docs of the hoardy-web CLI tool.

Development

Build it from source

  • git clone this repository.
  • cd extension.
  • Optionally: run nix-shell ./default.nix to get the exact build environment I use.
  • Build by running ./build.sh clean firefox-mv2 chromium-mv2 from this directory.
  • All outputs can then be found in the dist directory.

Debugging

On Firefox-based browsers

  • To get the debugger console go to about:debugging#/runtime/this-firefox and press extension's Inspect button.

On Chromium-based browsers

  • To get the debugger console go to Extensions > Manage Extensions and press Inspect views link after the extension's ID.