How to Convert Research Papers (PDF) to EPUB for E-Readers
Academic PDFs are notoriously bad on e-readers. Two-column layouts, inline citations, footnotes, and small-font figures all break when you try to zoom or reflow. Here's how to get a clean, readable EPUB from any journal article, preprint, or working paper — in under a minute, free.
Why academic PDFs fail on e-readers
- Two-column layout — PDF viewers pan horizontally; e-readers can't reflow two columns correctly
- Fixed font size — zoom doesn't help because the layout re-renders at the same scale
- Footnotes cut at page breaks — often split across pages, losing context
- Figures and captions — placed by the typesetter, not logical reading order
- Hyphenation artifacts — PDF text extraction produces "hy-phenated" words mid-sentence
How toolkit.bot handles these problems
- Two-column detection — automatically identifies multi-column regions and reflows as single-column text
- De-hyphenation — merges hyphenated word splits from PDF extraction back into whole words
- Footnote grouping — footnotes are collected per section and rendered at the end of that section
- Figure placement — figures are anchored near their reference in the text
- OCR fallback — if a page has no extractable text (scanned section), OCR is applied automatically
Step-by-step: convert an arXiv paper
- Download the PDF from arXiv (click Download PDF on the abstract page)
- Go to toolkit.bot/pdf2epub
- Upload the PDF — conversion takes 20–60 seconds for typical papers
- Download the EPUB and load it onto your e-reader
arXiv papers are the most common use case — two-column LaTeX, equations as images, figure captions. Our converter handles these reliably. Full arXiv guide →
Journal articles (publisher PDFs)
Publisher PDFs from Springer, Elsevier, and IEEE often have more complex layouts — sidebars, pull quotes, watermarks. Watermarks are removed during extraction; sidebars are treated as separate text blocks. Results are generally good for text-heavy papers; figures-heavy layout articles may have positioning artifacts.
Why Calibre fails on academic PDFs
Calibre's PDF engine does not detect two-column layouts — it reads columns left-to-right across the full page width, producing garbled text where column 1 line 1 is followed by column 2 line 1. For any paper with two columns, use toolkit.bot. Calibre vs toolkit.bot comparison →
Batch converting a reading list
If you have a folder of papers to convert, use the shell script approach:
for f in papers/*.pdf; do
curl -sX POST https://toolkit.bot/convert -F "file=@$f" -o "${f%.pdf}.epub"
sleep 2
done
See the full batch conversion guide → for Python and Windows variants.
Loading papers onto your e-reader
Kindle
Email to your @kindle.com address (no file size limit for recent Kindle models). Kindle guide →
Kobo
Drag to the Kobo drive via USB, or use Kobo's Send to Kobo email. Kobo guide →
iPad
Open in Safari → share to Apple Books. iPad guide →
Convert any research paper PDF to readable EPUB — free, no account.
Convert PDF to EPUB →