Skip to main content

PDF Reader Pack

Trustedv1.0.0MITGold Verified95

by AgentNode · published 2 months ago · toolpack

Extract text, tables, and metadata from PDF files.

Wraps pdfplumber to provide reliable PDF text and table extraction.

langchaincrewaigeneric

Quick Start

bash
agentnode install pdf-reader-pack

Runs in a subprocess with filtered environment by default. Declared permissions are policy-checked, not sandboxed.

Usage

From package
python
from pdf_reader_pack.tool import run

result = run(
    action="extract_pdf_text",
    file_path="/tmp/invoice-2026-0319.pdf",
    extract_tables=True
)

print("Document text (first 500 chars):")
print(result["text"][:500])

print(f"\nTables found: {len(result['tables'])}")
if result["tables"]:
    invoice_table = result["tables"][0]
    print(f"Headers: {invoice_table['headers']}")
    for row in invoice_table["rows"]:
        print(f"  {row}")

print(f"\nPages: {result['page_count']}")
print(f"Metadata: {result['metadata']}")

Runs locally on your machine. No execution data is sent to AgentNode. Permissions are checked before execution. Learn how this works

Verification

high confidence95/100★ Gold Verified
smokeReturned valid result
+25/25
testsPublisher-provided tests passed
+15/15
importAll tools imported successfully
+15/15
installInstalled in 2.1s
+15/15
contractAll contract checks passed
+10/10
determinismConsistent output across runs (normalized)
+5/5
reliability3/3 runs passed
+10/10

Package installs and imports correctly. runtime checks passed. publisher tests passed.

install2.1s
import728ms
smoke1.8s
tests1.8s

This package was executed and validated by AgentNode before listing. Install, import, and runtime checks passed.

Python 3.12.3ffmpegpopplertesseractuv

Last verified 13d ago· Runner v2.0.0

Use this when you need to...

  • Read and parse text content from multi-page PDF manuals
  • Extract structured table data from PDF invoices and purchase orders
  • Pull document metadata like author, title, and creation date from PDFs
  • Convert PDF page content to plain text for downstream NLP processing
  • Extract specific page ranges from large PDF documents

README

Version History

Capabilities

pdf_extractionextract_pdf_texttool

Input Schema

{
  "type": "object",
  "required": [
    "file_path"
  ],
  "properties": {
    "pages": {
      "type": "string",
      "default": "all"
    },
    "file_path": {
      "type": "string",
      "default": "/tmp/agentnode_verify/test.pdf",
      "description": "Path to the PDF file"
    }
  }
}

Permissions

Declared by the publisher. Checked before execution by the policy gate.

Networknone
Filesystemtemp
Code Executionnone
Data Accessinput_only
User Approvalnever

Permissions are policy-checked before execution. Network and filesystem access are not sandboxed at runtime. Learn more

Privacy

All tool execution happens locally on your machine. AgentNode never receives:

  • • Tool inputs or outputs
  • • Execution logs
  • • Data your agent processes

Only install events and search queries are sent to the registry.

bash
agentnode install pdf-reader-pack

Files (5)

License

MIT

Stats

Downloads14
Installs0
Versionv1.0.0
Published3/13/2026
Channelstable
Typetoolpack
Entrypointpdf_reader_pack.tool

Compatibility

Frameworks

langchaincrewaigeneric

Runtime

python

Python Version

>=3.10

Trust & Security

PublisherTrusted
SignatureNone
ProvenanceNone
Security Issues0

Publisher

A

AgentNode

@agentnode