PDF Processing

Extract, transform, merge, split, and generate PDFs with structured outputs.

Category: documents
Provider: computer agents
Code Files: 11

name: pdf
description: Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging and splitting documents, and handling forms. Use when Claude needs to fill in a PDF form or to programmatically process, generate, or analyze PDF documents at scale.

PDF Processing Guide

Overview

This guide covers essential PDF processing operations using Python libraries and command-line tools. For advanced features, JavaScript libraries, and detailed examples, see reference.md. If you need to fill out a PDF form, read forms.md and follow its instructions.

Quick Start

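A minimal sketch of a quick-start flow, assuming pypdf is installed: open a PDF, count its pages, and pull out the text. The helper name `read_pdf` is illustrative and not part of the original guide.

```python
def read_pdf(path):
    """Return (page_count, text) for the PDF at `path`."""
    # Imported here so the sketch loads even where pypdf is not installed.
    from pypdf import PdfReader  # third-party: pip install pypdf

    reader = PdfReader(path)
    # extract_text() yields an empty string for pages without a text layer.
    text = "\n".join(page.extract_text() or "" for page in reader.pages)
    return len(reader.pages), text
```

Scanned pages usually come back with empty text from this call and need OCR instead.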

Python Libraries

pypdf - Basic Operations

Merge PDFs

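One way to merge files with pypdf is to copy every page into a single writer with `writer.add_page(page)`. The sketch below assumes pypdf is installed; `merge_pdfs` is a hypothetical helper name.

```python
def merge_pdfs(input_paths, output_path):
    """Concatenate the pages of `input_paths`, in order, into `output_path`."""
    from pypdf import PdfReader, PdfWriter  # third-party: pip install pypdf

    writer = PdfWriter()
    for path in input_paths:
        for page in PdfReader(path).pages:
            writer.add_page(page)
    with open(output_path, "wb") as out:
        writer.write(out)
```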

Split PDF

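Splitting can be sketched as the reverse of merging: one writer per page, one output file per writer. This assumes pypdf is installed; `split_pdf` and the `page_N.pdf` naming scheme are illustrative choices.

```python
import os


def split_pdf(input_path, output_dir):
    """Write each page to its own file and return the paths created."""
    from pypdf import PdfReader, PdfWriter  # third-party: pip install pypdf

    os.makedirs(output_dir, exist_ok=True)
    paths = []
    for number, page in enumerate(PdfReader(input_path).pages, start=1):
        writer = PdfWriter()
        writer.add_page(page)
        path = os.path.join(output_dir, f"page_{number}.pdf")
        with open(path, "wb") as out:
            writer.write(out)
        paths.append(path)
    return paths
```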

Extract Metadata

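A short sketch of reading document metadata through pypdf's `reader.metadata`, assuming pypdf is installed. The helper name `read_metadata` and the choice of fields returned are assumptions made for illustration.

```python
def read_metadata(path):
    """Return a dict of common metadata fields (values may be None)."""
    from pypdf import PdfReader  # third-party: pip install pypdf

    meta = PdfReader(path).metadata
    if meta is None:  # the file has no document information dictionary
        return {}
    return {
        "title": meta.title,
        "author": meta.author,
        "subject": meta.subject,
        "creator": meta.creator,
    }
```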

Rotate Pages

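Rotation in pypdf is applied per page with `page.rotate()`. The sketch below, which assumes pypdf is installed, rotates only the first page and copies the rest unchanged; `rotate_first_page` is a hypothetical helper.

```python
def rotate_first_page(input_path, output_path, degrees=90):
    """Rotate page 1 clockwise by `degrees` (a multiple of 90) and save."""
    from pypdf import PdfReader, PdfWriter  # third-party: pip install pypdf

    writer = PdfWriter()
    for index, page in enumerate(PdfReader(input_path).pages):
        if index == 0:
            page.rotate(degrees)  # clockwise rotation
        writer.add_page(page)
    with open(output_path, "wb") as out:
        writer.write(out)
```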

pdfplumber - Text and Table Extraction

Extract Text with Layout

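A minimal sketch of layout-aware extraction, assuming a pdfplumber version whose `extract_text()` accepts `layout=True`. The helper name `extract_text_with_layout` is illustrative.

```python
def extract_text_with_layout(path):
    """Return one string per page, padded to approximate the page layout."""
    import pdfplumber  # third-party: pip install pdfplumber

    with pdfplumber.open(path) as pdf:
        # layout=True inserts spaces and blank lines to mimic positions on the page.
        return [page.extract_text(layout=True) or "" for page in pdf.pages]
```

Dropping `layout=True` gives more compact text, which is often easier to search but loses column alignment.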

Extract Tables

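Table extraction with pdfplumber can be sketched with `page.extract_tables()`, which returns each table as a list of rows. The helper name `extract_tables` and the `(page_number, table)` return shape are assumptions for illustration.

```python
def extract_tables(path):
    """Return a list of (page_number, table) pairs; a table is a list of rows."""
    import pdfplumber  # third-party: pip install pdfplumber

    found = []
    with pdfplumber.open(path) as pdf:
        for page_number, page in enumerate(pdf.pages, start=1):
            # Cells are strings, or None where pdfplumber found no text.
            for table in page.extract_tables():
                found.append((page_number, table))
    return found
```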

Advanced Table Extraction

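For tables without ruling lines, pdfplumber accepts a `table_settings` dictionary. The sketch below uses the text-alignment strategies as one possible starting point; the constant `TEXT_ALIGNED_SETTINGS`, the helper name, and the tolerance values are assumptions to tune per document.

```python
# Infer cell boundaries from text alignment instead of drawn lines.
TEXT_ALIGNED_SETTINGS = {
    "vertical_strategy": "text",
    "horizontal_strategy": "text",
    "snap_tolerance": 3,
    "intersection_tolerance": 3,
}


def extract_tables_custom(path, table_settings=None):
    """Return, for each page, the tables found with the given settings."""
    import pdfplumber  # third-party: pip install pdfplumber

    settings = dict(TEXT_ALIGNED_SETTINGS if table_settings is None else table_settings)
    with pdfplumber.open(path) as pdf:
        return [page.extract_tables(table_settings=settings) for page in pdf.pages]
```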

reportlab - Create PDFs

Basic PDF Creation

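A minimal sketch of drawing directly on a reportlab `Canvas`, assuming reportlab is installed. The helper `create_simple_pdf`, the fonts, and the margins are illustrative choices.

```python
def create_simple_pdf(path, title, lines):
    """Write a one-page PDF with a bold title and a few lines of text."""
    from reportlab.lib.pagesizes import letter  # third-party: pip install reportlab
    from reportlab.pdfgen import canvas

    width, height = letter  # points; the origin is the bottom-left corner
    c = canvas.Canvas(path, pagesize=letter)
    c.setFont("Helvetica-Bold", 16)
    c.drawString(72, height - 72, title)
    c.setFont("Helvetica", 11)
    y = height - 100
    for line in lines:
        c.drawString(72, y, line)
        y -= 16  # move down one line
    c.save()
```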

Create PDF with Multiple Pages

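For multi-page documents, reportlab's Platypus layer handles flow and page breaks. The sketch below assumes reportlab is installed and puts each section on its own page; `create_report` and the `(heading, body)` input shape are hypothetical.

```python
def create_report(path, sections):
    """Build a PDF with one page per (heading, body) pair in `sections`."""
    from reportlab.lib.pagesizes import letter  # third-party: pip install reportlab
    from reportlab.lib.styles import getSampleStyleSheet
    from reportlab.platypus import PageBreak, Paragraph, SimpleDocTemplate, Spacer

    styles = getSampleStyleSheet()
    story = []
    for index, (heading, body) in enumerate(sections):
        if index > 0:
            story.append(PageBreak())
        # Paragraph parses XML-like markup, so escape "<" and "&" in raw text.
        story.append(Paragraph(heading, styles["Heading1"]))
        story.append(Spacer(1, 12))
        story.append(Paragraph(body, styles["Normal"]))
    SimpleDocTemplate(path, pagesize=letter).build(story)
```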

Command-Line Tools

pdftotext (poppler-utils)

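The common `pdftotext` invocations can be sketched as a small Python wrapper so they sit alongside the other examples. It assumes the `pdftotext` binary from poppler-utils is on the PATH; the function names are illustrative.

```python
import subprocess


def pdftotext_command(pdf_path, txt_path, layout=False, first_page=None, last_page=None):
    """Build the argument list for a pdftotext call."""
    cmd = ["pdftotext"]
    if layout:
        cmd.append("-layout")  # keep the physical layout of the page
    if first_page is not None:
        cmd += ["-f", str(first_page)]
    if last_page is not None:
        cmd += ["-l", str(last_page)]
    return cmd + [pdf_path, txt_path]


def run_pdftotext(pdf_path, txt_path, **options):
    """Run pdftotext and raise CalledProcessError if it fails."""
    subprocess.run(pdftotext_command(pdf_path, txt_path, **options), check=True)
```

For example, `pdftotext_command("input.pdf", "output.txt", layout=True, first_page=1, last_page=5)` corresponds to `pdftotext -layout -f 1 -l 5 input.pdf output.txt` in a shell.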

qpdf

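Typical `qpdf` operations (merge, page extraction, rotation) can be sketched as argument builders plus a runner. This assumes `qpdf` is installed; the helper names are hypothetical.

```python
import subprocess


def qpdf_merge_command(inputs, output):
    """Equivalent to: qpdf --empty --pages a.pdf b.pdf -- merged.pdf"""
    return ["qpdf", "--empty", "--pages", *inputs, "--", output]


def qpdf_extract_command(source, page_range, output):
    """Equivalent to: qpdf input.pdf --pages . 1-5 -- output.pdf"""
    # "." tells qpdf to take the pages from the primary input file.
    return ["qpdf", source, "--pages", ".", page_range, "--", output]


def qpdf_rotate_command(source, output, degrees=90, page_range="1"):
    """Equivalent to: qpdf input.pdf output.pdf --rotate=+90:1"""
    return ["qpdf", source, output, f"--rotate=+{degrees}:{page_range}"]


def run(cmd):
    """Run a command list and raise CalledProcessError on failure."""
    subprocess.run(cmd, check=True)
```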

pdftk (if available)

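Because pdftk is not always installed, a sketch of its use can check for the binary first. The helpers below are illustrative; the command shapes follow pdftk's `cat`, `burst`, and `rotate` operations.

```python
import shutil
import subprocess


def pdftk_available():
    """True if a pdftk executable is on the PATH."""
    return shutil.which("pdftk") is not None


def pdftk_merge_command(inputs, output):
    """Equivalent to: pdftk a.pdf b.pdf cat output merged.pdf"""
    return ["pdftk", *inputs, "cat", "output", output]


def pdftk_burst_command(source):
    """Equivalent to: pdftk input.pdf burst (one file per page)."""
    return ["pdftk", source, "burst"]


def pdftk_rotate_command(source, output, page_spec="1east"):
    """Equivalent to: pdftk input.pdf rotate 1east output rotated.pdf"""
    return ["pdftk", source, "rotate", page_spec, "output", output]


def run_if_available(cmd):
    """Run the command only when pdftk exists; return whether it ran."""
    if not pdftk_available():
        return False
    subprocess.run(cmd, check=True)
    return True
```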

Common Tasks

Extract Text from Scanned PDFs

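Scanned PDFs have no text layer, so one approach is to render each page to an image and run OCR on it. The sketch assumes pytesseract (with the Tesseract binary) and pdf2image (with poppler) are installed; pdf2image is an assumption not named in this guide, and the helper names are illustrative.

```python
def format_pages(texts):
    """Join per-page OCR results under simple page headers."""
    return "\n\n".join(
        f"--- Page {number} ---\n{text.strip()}"
        for number, text in enumerate(texts, start=1)
    )


def ocr_pdf(path, dpi=300):
    """Render each page to an image, OCR it, and return the combined text."""
    import pytesseract  # third-party; also needs the tesseract executable
    from pdf2image import convert_from_path  # third-party; needs poppler

    images = convert_from_path(path, dpi=dpi)
    return format_pages(pytesseract.image_to_string(image) for image in images)
```

Higher `dpi` values tend to improve recognition at the cost of speed and memory.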

Add Watermark

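A watermark can be sketched as a one-page PDF merged onto every page with pypdf's `page.merge_page()`. This assumes pypdf is installed and that the watermark file already exists (for example, created with reportlab); `add_watermark` is a hypothetical helper.

```python
def add_watermark(input_path, watermark_path, output_path):
    """Overlay the first page of `watermark_path` on every page of the input."""
    from pypdf import PdfReader, PdfWriter  # third-party: pip install pypdf

    watermark = PdfReader(watermark_path).pages[0]
    writer = PdfWriter()
    for page in PdfReader(input_path).pages:
        page.merge_page(watermark)  # draws the watermark over the page content
        writer.add_page(page)
    with open(output_path, "wb") as out:
        writer.write(out)
```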

Extract Images

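Embedded images can be pulled out with `pdfimages` from poppler-utils. The wrapper below is a sketch that assumes the binary is on the PATH; the function names and the `image_format` keys are illustrative.

```python
import subprocess

# Map a friendly name to the pdfimages output flag.
_FORMAT_FLAGS = {"jpeg": "-j", "png": "-png", "all": "-all"}


def pdfimages_command(pdf_path, output_prefix, image_format="jpeg"):
    """Build the argument list; files are written as <output_prefix>-NNN.<ext>."""
    return ["pdfimages", _FORMAT_FLAGS[image_format], pdf_path, output_prefix]


def extract_images(pdf_path, output_prefix, image_format="jpeg"):
    """Run pdfimages and raise CalledProcessError if it fails."""
    subprocess.run(pdfimages_command(pdf_path, output_prefix, image_format), check=True)
```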

Password Protection

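Password protection with pypdf can be sketched with `writer.encrypt()` for writing and `reader.decrypt()` for reading. This assumes pypdf is installed and uses its default encryption algorithm; the helper names are hypothetical.

```python
def encrypt_pdf(input_path, output_path, user_password, owner_password=None):
    """Copy the input PDF and protect the copy with a password."""
    from pypdf import PdfReader, PdfWriter  # third-party: pip install pypdf

    writer = PdfWriter()
    for page in PdfReader(input_path).pages:
        writer.add_page(page)
    writer.encrypt(user_password, owner_password=owner_password)
    with open(output_path, "wb") as out:
        writer.write(out)


def open_encrypted(path, password):
    """Return a reader for `path`, decrypting it first when needed."""
    from pypdf import PdfReader

    reader = PdfReader(path)
    # decrypt() returns a falsy value when the password does not match.
    if reader.is_encrypted and not reader.decrypt(password):
        raise ValueError("incorrect password")
    return reader
```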

Quick Reference

| Task | Best Tool | Command/Code |
|------|-----------|--------------|
| Merge PDFs | pypdf | writer.add_page(page) |
| Split PDFs | pypdf | One page per file |
| Extract text | pdfplumber | page.extract_text() |
| Extract tables | pdfplumber | page.extract_tables() |
| Create PDFs | reportlab | Canvas or Platypus |
| Command line merge | qpdf | qpdf --empty --pages ... |
| OCR scanned PDFs | pytesseract | Convert to image first |
| Fill PDF forms | pdf-lib or pypdf (see forms.md) | See forms.md |

Next Steps

  • For advanced pypdfium2 usage, see reference.md
  • For JavaScript libraries (pdf-lib), see reference.md
  • If you need to fill out a PDF form, follow the instructions in forms.md
  • For troubleshooting guides, see reference.md