PDF Text Extractor
Output Format
Plain Text (.txt)
Chunking (Optional)
Strategy:
None
Token
Word
Sentence
Paragraph
Chunk Size:
Download Chunks As:
JSON
Text
Text Separator:
Structured JSON (.json)
(Includes page number, approximate line/block structure)
Show Advanced Options
Advanced Extraction Settings
Normalize Whitespace (pdf.js default)
Combine Text Items (pdf.js default)
Separator between text items (Internal pdf.js):
Select PDF Files
or drag & drop files here
No files selected.
Change options above before uploading.
Extracted Text / Chunks
(c) 2025 Neeraj Sharma