Text Extraction

Text Extraction is the engine behind search, read-aloud, and SEO. It mines per-page text from the source PDF using a two-tier fallback: smalot/pdfparser for fast PHP-only extraction, and the pdftotext system utility for encrypted or otherwise stubborn PDFs. Cached after extraction, re-runs only when the PDF changes.

Text ExtractionPerformance & Reach

What you get

Two-tier extraction with automatic fallback
Handles encrypted PDFs that pdfparser can't
Cached per-PDF in post meta
Re-extracts only when source changes
Powers search, read-aloud, AI chatbot

Why Text Extraction matters

Per-page text mined from the PDF for search and TTS. Part of the performance & reach features in TNC FlipBook 3D.

TNC FlipBook 3D

Ship Text Extraction on your site today.

One license, every feature, updates and support included. Use it on as many flipbooks as you want.

See pricing → Browse all features

7-day money-back guarantee
Annual & lifetime plans
Unlimited flipbooks

Text Extraction

What you get

Why Text Extraction matters

Ship Text Extraction on your site today.

Stay Informed

Explore

Special Discounts

Policy

Support

Comparison

Text Extraction

What you get

Why Text Extraction matters

More performance & reach features

Works Everywhere

SEO

Unlimited FlipBooks

No File-Size Limit

One-Click Import

Global Settings

Ship Text Extraction on your site today.