winget install --id=Y2Z.Monolith -e
A data hoarder’s dream come true: bundle any web page into a single HTML file. You can finally replace that gazillion of open tabs with a gazillion of .html files stored somewhere on your precious little drive. Unlike the conventional “Save page as”, monolith not only saves the target document, it embeds CSS, image, and JavaScript assets all at once, producing a single HTML5 document that is a joy to store and share. If compared to saving websites with wget -mpk, this tool embeds all assets as data URLs and therefore lets browsers render the saved page exactly the way it was on the Internet, even when no network connection is available.
Monolith: A Command-Line Tool for Efficient Web Page Archiving
Primary Purpose:
Monolith is a command-line tool designed to save complete web pages as single HTML files, ensuring that all associated assets are embedded within the file.
Key Features:
Audience & Benefits:
Ideal for users who want to efficiently save web pages and access them offline. Monolith is perfect for those seeking to reduce digital clutter from multiple tabs or ensure consistent content availability without relying on internet connectivity.
This tool offers a professional solution for archiving web content, providing tangible benefits in data management and accessibility.
_____ _____________ __________ ___________________ ___
| \ / \ | | | | | |
| \/ __ \| __ | | ___ ___ |__| |
| | | | | | | | | | | |
| |\ /| |__| |__| |___| | | | | __ |
| | \__/ | |\ | | | | | | |
|___| |__________| \___________________| |___| |___| |___|
A data hoarder’s dream come true: bundle any web page into a single HTML file. You can finally replace that gazillion of open tabs with a gazillion of .html files stored somewhere on your precious little drive.
Unlike the conventional “Save page as”, monolith
not only saves the target document, it embeds CSS, image, and JavaScript assets all at once, producing a single HTML5 document that is a joy to store and share.
If compared to saving websites with wget -mpk
, this tool embeds all assets as data URLs and therefore lets browsers render the saved page exactly the way it was on the Internet, even when no network connection is available.
cargo install monolith
brew install monolith
choco install monolith
scoop install main/monolith
winget install --id=Y2Z.Monolith -e
sudo port install monolith
snap install monolith
guix install monolith
nix-env -iA nixpkgs.monolith
flox install monolith
pacman -S monolith
apk add monolith
xbps-install -S monolith
pkg install monolith
cd /usr/ports/www/monolith/
make install clean
cd /usr/pkgsrc/www/monolith
make install clean
docker build -t y2z/monolith .
sudo install -b dist/run-in-container.sh /usr/local/bin/monolith
Dependencies: libssl
, cargo
Install cargo (GNU/Linux) Check if cargo is installed
cargo -v
If cargo is not already installed, install and add it to your existing $PATH
(paraphrasing the official installation instructions):
curl https://sh.rustup.rs -sSf | sh
. "$HOME/.cargo/env"
Proceed with installing from source:
git clone https://github.com/Y2Z/monolith.git
cd monolith
make install
Every release contains pre-built binaries for Windows, GNU/Linux, as well as platforms with non-standard CPU architecture.
monolith https://lyrics.github.io/db/P/Portishead/Dummy/Roads/ -o %title%.%timestamp%.html
cat some-site-page.html | monolith -aIiFfcMv -b https://some.site/ - > some-site-page-with-assets.html
-a
: Exclude audio sources-b
: Use custom base URL
-B
: Forbid retrieving assets from specified domain(s)-c
: Exclude CSS-C
: Read cookies from file
-d
: Allow retrieving assets only from specified domain(s)
-e
: Ignore network errors-E
: Save document using custom encoding
-f
: Omit frames-F
: Exclude web fonts-h
: Print help information-i
: Remove images-I
: Isolate the document-j
: Exclude JavaScript-k
: Accept invalid X.509 (TLS) certificates-m
: Output in MHTML format instead of HTML-M
: Don't add timestamp and URL information-n
: Extract contents of NOSCRIPT elements-o
: Write output to file
(use “-” for STDOUT)-q
: Be quiet-t
: Adjust network request timeout
-u
: Provide custom User-Agent
-v
: Exclude videos-V
: Print version numberOptions -d
and -B
provide control over what domains can be used to retrieve assets from, e.g.:
monolith -I -d example.com -d www.example.com https://example.com -o example-only.html
monolith -I -B -d .googleusercontent.com -d googleanalytics.com -d .google.com https://example.com -o example-no-ads.html
Monolith doesn't feature a JavaScript engine, hence websites that retrieve and display data after initial load may require usage of additional tools.
For example, Chromium (Chrome) can be used to act as a pre-processor for such pages:
chromium --headless --window-size=1920,1080 --run-all-compositor-stages-before-draw --virtual-time-budget=9000 --incognito --dump-dom https://github.com | monolith - -I -b https://github.com -o github.html
monolith https://username:password@example.com -o example-basic-auth.html
Please set https_proxy
, http_proxy
, and no_proxy
environment variables.
You can run Monolith in the cloud without installation using the Monolith Actor on Apify free of charge.
echo '{"urls": ["https://news.ycombinator.com/"]}' | apify call -so snshn/monolith
[{
"url": "https://news.ycombinator.com/",
"status": "0",
"kvsUrl": "https://api.apify.com/v2/key-value-stores/of9xNgvpon4elPLbc/records/https___news.ycombinator.com_"
}]
Read more about the Monolith Actor, including how to use it via the Apify UI, API and CLI without installation.
Please open an issue if something is wrong, that helps make this project better.
To the extent possible under law, the author(s) have dedicated all copyright related and neighboring rights to this software to the public domain worldwide. This software is distributed without any warranty.