Why Is My EPUB So Large? How to Reduce EPUB File Size

A typical text-only EPUB should be under 1 MB. If yours is 20, 50, or 100 MB, something is wrong: usually unoptimized images, large embedded fonts, or a PDF conversion that stored pages as images instead of text. Here's how to diagnose and fix it.

Why EPUBs Become Large

How to Check What's Inside Your EPUB

An EPUB is a ZIP file. Copy it with a .zip extension and extract it. The book's content usually sits in an OEBPS/ or EPUB/ folder, and the largest files there are your problem items, typically in images/ or fonts/.

# On Mac/Linux:
cp book.epub book.zip
unzip book.zip -d book-contents
# If your book uses OEBPS/ instead of EPUB/, adjust the paths below
du -sh book-contents/EPUB/images/*  # see which images are large
du -sh book-contents/EPUB/fonts/*   # see font sizes
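
To see at a glance whether images, fonts, or text dominate, the extracted folder can be summarized by category. The following is a minimal sketch: the function name epub_size_by_category and the extension-to-category mapping are assumptions chosen for illustration, and it searches the whole tree so it works with either OEBPS/ or EPUB/.

```shell
#!/bin/sh
# Sum file sizes (in bytes) per category for an extracted EPUB directory.
# The extension lists below are an illustrative assumption, not a standard.
epub_size_by_category() {
  dir=$1
  # Emit "bytes<TAB>path" for every regular file under the directory.
  find "$dir" -type f -exec sh -c '
    for f; do
      printf "%s\t%s\n" "$(wc -c < "$f")" "$f"
    done' sh {} + |
  awk -F'\t' '
    {
      name = tolower($2)
      kind = "other"
      if (name ~ /\.(jpg|jpeg|png|gif|svg|webp)$/)   kind = "images"
      else if (name ~ /\.(ttf|otf|woff|woff2)$/)     kind = "fonts"
      else if (name ~ /\.(xhtml|html|htm)$/)         kind = "text"
      bytes[kind] += $1
    }
    END { for (k in bytes) printf "%d\t%s\n", bytes[k], k }' |
  sort -rn   # largest category first
}

# Usage: epub_size_by_category book-contents
```

If the "images" line dwarfs the "text" line, start with image compression; if "fonts" is the largest, look at the embedded fonts first.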

Method 1: Compress Images with Calibre

  1. Open Calibre, add your .epub.
  2. Right-click → Convert books → Convert individually.
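  3. Under Comic Input, make sure "No processing" is unchecked. This panel only affects comic archives (CBZ/CBR), so it changes nothing for an ordinary EPUB.
  4. Under Look & Feel → Layout, set image compression to your preferred level, if your Calibre version offers that setting.
  5. Alternatively, use the Polish books feature (right-click → Polish books) and check "Losslessly compress images". Polishing recompresses images and can remove unused CSS rules without changing the book's content.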
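
The Polish step can also be run from a terminal with Calibre's ebook-polish command-line tool. This is a sketch that assumes Calibre's command-line tools are installed and on your PATH; the wrapper name polish_epub is hypothetical.

```shell
#!/bin/sh
# Losslessly recompress the images in an EPUB with Calibre's ebook-polish.
# Assumption: Calibre's command-line tools are installed and on PATH.
polish_epub() {
  input=$1
  output=$2
  if ! command -v ebook-polish >/dev/null 2>&1; then
    echo "ebook-polish not found; install Calibre or add it to PATH" >&2
    return 127
  fi
  # --compress-images recompresses images without changing how they look.
  ebook-polish --compress-images "$input" "$output"
}

# Usage: polish_epub book.epub book-polished.epub
```

The original file is left untouched because the result is written to a separate output path, which makes it easy to compare sizes afterwards.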

Method 2: Compress Images Manually

Extract the EPUB (rename to .zip, unzip). Run images through an optimizer:

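# Install: brew install jpegoptim optipng (Mac) or sudo apt-get install jpegoptim optipng (Linux)
# Lossy for JPEGs above quality 85; files are overwritten in place
jpegoptim --max=85 book-contents/EPUB/images/*.jpg
# Lossless PNG optimization
optipng -o5 book-contents/EPUB/images/*.png

# Repack to EPUB: mimetype must be the first entry and stored uncompressed (-0)
cd book-contents
zip -X0 ../book-optimized.epub mimetype
zip -Xrg9 ../book-optimized.epub . -x mimetype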
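
After repacking, it is worth confirming that the archive still starts with an uncompressed mimetype entry, since reading apps may reject the file otherwise. The sketch below relies on the ZIP layout: a 30-byte local file header, then the 8-byte file name, then (when no extra field is written, as with zip -X) the 20-byte content. The function name check_epub_mimetype is hypothetical.

```shell
#!/bin/sh
# Check that an EPUB begins with a stored (uncompressed) "mimetype" entry.
# Bytes 30-57 of a well-formed EPUB read "mimetype" + "application/epub+zip".
check_epub_mimetype() {
  expected='mimetypeapplication/epub+zip'
  actual=$(dd if="$1" bs=1 skip=30 count=28 2>/dev/null)
  if [ "$actual" = "$expected" ]; then
    echo "ok: mimetype is first and stored uncompressed"
  else
    echo "bad: mimetype is missing, compressed, or not the first entry" >&2
    return 1
  fi
}

# Usage: check_epub_mimetype book-optimized.epub
```

This only inspects the start of the archive. A full validator such as EPUBCheck covers the rest of the package.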

Method 3: Remove Embedded Fonts

In Calibre's Convert dialog, go to Look & Feel and check Remove all embedded fonts. The reading app then falls back to its own fonts instead of the EPUB's. This is appropriate when the custom fonts aren't essential to the reading experience (e.g., body text fonts in a technical document).

What If the EPUB Contains Rasterized Pages?

If your PDF-to-EPUB conversion produced rasterized page images (common with Calibre on complex PDFs), the fix is to reconvert using a better tool. toolkit.bot performs proper text extraction with layout analysis — the output contains real text, not page images. File sizes are typically 95–99% smaller than rasterized conversions.

Go to toolkit.bot/pdf2epub, upload your PDF, and download a properly-converted EPUB with real text.
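
To measure the saving for your own book rather than rely on typical figures, compare the two files directly. This is a small sketch; the function name size_reduction_percent and the file names in the usage line are placeholders.

```shell
#!/bin/sh
# Print how much smaller the second file is than the first, as a whole percentage.
size_reduction_percent() {
  # $(( ... )) also strips the padding some wc implementations add.
  before=$(( $(wc -c < "$1") ))
  after=$(( $(wc -c < "$2") ))
  if [ "$before" -eq 0 ]; then
    echo "original file is empty" >&2
    return 1
  fi
  # Integer arithmetic: the result is rounded toward zero.
  echo $(( (before - after) * 100 / before ))
}

# Usage: size_reduction_percent book-rasterized.epub book-reconverted.epub
```

For example, going from 50 MB to 1 MB is a 98% reduction. The same helper works for checking the result of the image and font methods above.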

Target File Sizes

Getting a large, image-heavy EPUB from your PDF? toolkit.bot extracts real text for a clean, lightweight EPUB.
