← Back to Skills
PDF Processing
Extract, transform, merge, split, and generate PDFs with structured outputs.
name:
pdf
description:
Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.
PDF Processing Guide
Overview
This guide covers essential PDF processing operations using Python libraries and command-line tools. For advanced features, JavaScript libraries, and detailed examples, see reference.md. If you need to fill out a PDF form, read forms.md and follow its instructions.
Quick Start
example.py
Loading...
Python Libraries
pypdf - Basic Operations
Merge PDFs
example.py
Loading...
Split PDF
example.py
Loading...
Extract Metadata
example.py
Loading...
Rotate Pages
example.py
Loading...
pdfplumber - Text and Table Extraction
Extract Text with Layout
example.py
Loading...
Extract Tables
example.py
Loading...
Advanced Table Extraction
example.py
Loading...
reportlab - Create PDFs
Basic PDF Creation
example.py
Loading...
Create PDF with Multiple Pages
example.py
Loading...
Command-Line Tools
pdftotext (poppler-utils)
terminal
Loading...
qpdf
terminal
Loading...
pdftk (if available)
terminal
Loading...
Common Tasks
Extract Text from Scanned PDFs
example.py
Loading...
Add Watermark
example.py
Loading...
Extract Images
terminal
Loading...
Password Protection
example.py
Loading...
Quick Reference
| Task | Best Tool | Command/Code |
|---|---|---|
| Merge PDFs | pypdf | writer.add_page(page) |
| Split PDFs | pypdf | One page per file |
| Extract text | pdfplumber | page.extract_text() |
| Extract tables | pdfplumber | page.extract_tables() |
| Create PDFs | reportlab | Canvas or Platypus |
| Command line merge | qpdf | qpdf --empty --pages ... |
| OCR scanned PDFs | pytesseract | Convert to image first |
| Fill PDF forms | pdf-lib or pypdf (see forms.md) | See forms.md |
Next Steps
- For advanced pypdfium2 usage, see reference.md
- For JavaScript libraries (pdf-lib), see reference.md
- If you need to fill out a PDF form, follow the instructions in forms.md
- For troubleshooting guides, see reference.md