📄 HTML to PDF Generator (Ultimate Edition)

A professional tool that converts local HTML files into optimized PDFs, solving lazy-loading issues and greatly reducing final file size.

HTML to PDF Generator is a desktop Python tool that uses Playwright (Chromium) for faithful CSS/JS rendering, then applies PyMuPDF post-processing to recompress images without visible quality loss.

✨ Main Features (v5)

Local Temporary Server: Prevents browser CORS issues with local files by serving resources and fonts correctly.
Auto-Scroll for Lazy-Loading: Injects a script that scrolls the page to force lazy-loaded images to load before capture.
Hybrid Compression Engine: Keeps images with transparency as PNG and re-encodes opaque images as JPEG at a configurable quality.
Metadata Cleanup: Removes EXIF and PDF metadata (Author, Creation Date, Software) for privacy and smaller files.
Smart Multi-language Naming: Detects -en or _en suffixes in the source file name and names the PDF accordingly (e.g. CV_David_Pinto_ENG.pdf).
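
The first two features can be sketched roughly as follows. This is a minimal illustration, not the script's actual code: it assumes the standard-library http.server for the temporary server and Playwright's sync API for rendering, and the names serve_directory, html_to_pdf and AUTO_SCROLL_JS are hypothetical.

```python
import contextlib
import functools
import threading
from http.server import SimpleHTTPRequestHandler, ThreadingHTTPServer
from pathlib import Path
from urllib.parse import quote

# Hypothetical scroll script: step down one viewport at a time so that
# lazy-loaded images enter the viewport, then return to the top.
AUTO_SCROLL_JS = """
async () => {
  const pause = (ms) => new Promise((resolve) => setTimeout(resolve, ms));
  for (let y = 0; y < document.body.scrollHeight; y += window.innerHeight) {
    window.scrollTo(0, y);
    await pause(100);
  }
  window.scrollTo(0, 0);
}
"""


class QuietHandler(SimpleHTTPRequestHandler):
    """Static file handler that does not print one log line per request."""

    def log_message(self, format, *args):
        pass


@contextlib.contextmanager
def serve_directory(directory):
    """Serve `directory` over HTTP on a free local port; yield the base URL."""
    handler = functools.partial(QuietHandler, directory=str(directory))
    server = ThreadingHTTPServer(("127.0.0.1", 0), handler)  # port 0 = any free port
    thread = threading.Thread(target=server.serve_forever, daemon=True)
    thread.start()
    try:
        yield f"http://127.0.0.1:{server.server_address[1]}"
    finally:
        server.shutdown()
        server.server_close()


def html_to_pdf(html_path, pdf_path):
    """Render a local HTML file to PDF through the temporary server."""
    # Imported here so the server helper works even without Playwright installed.
    from playwright.sync_api import sync_playwright

    html_path = Path(html_path).resolve()
    with serve_directory(html_path.parent) as base_url, sync_playwright() as p:
        browser = p.chromium.launch()
        page = browser.new_page()
        page.goto(f"{base_url}/{quote(html_path.name)}", wait_until="networkidle")
        page.evaluate(AUTO_SCROLL_JS)            # trigger lazy-loaded images
        page.wait_for_load_state("networkidle")  # let those images finish loading
        page.pdf(path=str(pdf_path), print_background=True)
        browser.close()
```

Serving over http://127.0.0.1 instead of opening a file:// URL lets relative fonts, stylesheets and images load under the browser's normal same-origin rules.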

⚙️ Requirements & Installation

The script includes an auto-installer that, on first run, installs the required Python packages and downloads Chromium for Playwright.
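
A first-run check of this kind might look like the sketch below. It is an assumption about the approach, not the script's actual installer; REQUIRED, missing_packages and ensure_dependencies are hypothetical names.

```python
import importlib.util
import subprocess
import sys

# pip package name -> module name used to check whether it is importable
REQUIRED = {"playwright": "playwright", "pymupdf": "fitz", "pillow": "PIL"}


def missing_packages(required=REQUIRED):
    """Return the pip names of packages whose module cannot be found."""
    return [
        package
        for package, module in required.items()
        if importlib.util.find_spec(module) is None
    ]


def ensure_dependencies():
    """Install missing packages, then make sure Chromium is available."""
    missing = missing_packages()
    if missing:
        # Use the running interpreter so packages land in the same environment.
        subprocess.check_call([sys.executable, "-m", "pip", "install", *missing])
    # Playwright's own CLI downloads the browser build it expects.
    subprocess.check_call([sys.executable, "-m", "playwright", "install", "chromium"])
```

Calling ensure_dependencies() once at startup, before importing playwright, fitz or PIL, keeps later runs fast because nothing is reinstalled when every module is already importable.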

Requirements:

Python 3.8 or newer.

Dependencies (installed automatically):

playwright (rendering)
pymupdf / fitz (PDF processing)
pillow (image processing)

Run:

python 5_HTML-2-PDF-Python.py

Note: the first run may take a couple of minutes while Playwright downloads Chromium.

Quick Usage

Add HTML files using the ➕ button.
Set JPEG quality (recommended ~75%).
Optionally remove metadata for privacy.
Choose output folder and press Generate.
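
The JPEG quality chosen in the steps above is the value the Hybrid Compression Engine applies to opaque images. As a rough illustration only, assuming Pillow and using hypothetical helper names (has_transparency, recompress) rather than the script's own functions, the PNG/JPEG decision could be made like this:

```python
import io

from PIL import Image


def has_transparency(img):
    """True if the image has at least one pixel that is not fully opaque."""
    if img.mode == "P" and "transparency" in img.info:
        img = img.convert("RGBA")  # expose palette transparency as an alpha channel
    if img.mode in ("RGBA", "LA"):
        lowest_alpha, _ = img.getchannel("A").getextrema()
        return lowest_alpha < 255
    return False


def recompress(img, jpeg_quality=75):
    """Return (format, bytes): PNG when transparency is used, JPEG otherwise."""
    buffer = io.BytesIO()
    if has_transparency(img):
        img.save(buffer, format="PNG", optimize=True)
        return "PNG", buffer.getvalue()
    # JPEG has no alpha channel, so drop it before encoding.
    img.convert("RGB").save(buffer, format="JPEG", quality=jpeg_quality, optimize=True)
    return "JPEG", buffer.getvalue()
```

An image that carries an alpha channel but is fully opaque is treated as opaque here, so it still gets the smaller JPEG encoding.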

Changelog Overview

v0: Initial version with Playwright rendering, local server, and auto-scroll.
v1: Optimized scaling for very large images.
v2: Privacy mode (no metadata).
v3-v4: Hybrid PNG/JPEG engine improvements.
v5: Smart renaming and removal of trailing blank pages.
