PDF Reader Pack

Name: PDF Reader Pack
Author: AgentNode

★Trusted◇Sandbox optionalv1.0.0MIT✔Verified80

by AgentNode · published 3 months ago · toolpack

Extract text, tables, and metadata from PDF files.

Wraps pdfplumber to provide reliable PDF text and table extraction.

langchaincrewaigeneric

Quick Start

bash

agentnode install pdf-reader-pack

Runs in a subprocess with filtered environment by default. Declared permissions are policy-checked, not sandboxed.

Usage

From package

python

from pdf_reader_pack.tool import run

result = run(
    action="extract_pdf_text",
    file_path="/tmp/invoice-2026-0319.pdf",
    extract_tables=True
)

print("Document text (first 500 chars):")
print(result["text"][:500])

print(f"\nTables found: {len(result['tables'])}")
if result["tables"]:
    invoice_table = result["tables"][0]
    print(f"Headers: {invoice_table['headers']}")
    for row in invoice_table["rows"]:
        print(f"  {row}")

print(f"\nPages: {result['page_count']}")
print(f"Metadata: {result['metadata']}")

Runs locally on your machine. No execution data is sent to AgentNode. Permissions are checked before execution. Learn how this works

Verification

high confidence80/100✔ Verified

smokeReturned valid result

+25/25

testsTests failed

0/15

importAll tools imported successfully

+15/15

installInstalled in 2.7s

+15/15

contractAll contract checks passed

+10/10

determinismConsistent output across runs (normalized)

+5/5

reliability3/3 runs passed

+10/10

Package installs and imports correctly. runtime checks passed.

✔install2.7s

✔import75ms

✔smoke588ms

✖tests1.3s

This package was executed and validated by AgentNode before listing. Install, import, and runtime checks passed.

Verified in real_auto mode

Python 3.12.3ffmpegpopplertesseractuv

Last verified 17h ago· Runner v2.0.0

Use this when you need to...

›Read and parse text content from multi-page PDF manuals
›Extract structured table data from PDF invoices and purchase orders
›Pull document metadata like author, title, and creation date from PDFs
›Convert PDF page content to plain text for downstream NLP processing
›Extract specific page ranges from large PDF documents

README

Version History

v1.0.0latestverified

3/13/2026

Capabilities

pdf_extractionextract_pdf_texttool

Input Schema

{
  "type": "object",
  "required": [
    "file_path"
  ],
  "properties": {
    "pages": {
      "type": "string",
      "default": "all"
    },
    "file_path": {
      "type": "string",
      "default": "/tmp/agentnode_verify/test.pdf",
      "description": "Path to the PDF file"
    }
  }
}

Permissions

◇Sandbox optionalFrom a trusted publisher — runs on the host by default. You can require isolation with sandbox.host_trust_policy.

Declared by the publisher. Checked before execution by the policy gate.

Networknone

Filesystemtemp

Code Executionnone

Data Accessinput_only

User Approvalnever

Permissions are policy-checked before execution. For trusted and curated packages that run on the host, network and filesystem access are policy-checked but not OS-sandboxed. When runtime isolation is required for untrusted/community code, AgentNode uses sandbox-or-fail-closed if the required container runtime and pinned image are available. Learn more

Privacy

All tool execution happens locally on your machine. AgentNode never receives:

• Tool inputs or outputs
• Execution logs
• Data your agent processes

Only install events and search queries are sent to the registry.

bash

agentnode install pdf-reader-pack

Files (5)

License

MIT

Stats

Downloads14

Installs0

Versionv1.0.0

Published3/13/2026

Channelstable

Typetoolpack

Entrypointpdf_reader_pack.tool

Compatibility

Frameworks

langchaincrewaigeneric

Runtime

python

Python Version

>=3.10

Trust & Security

Publisher★Trusted

SignatureNone

ProvenanceNone

Security Issues0

Publisher

AgentNode

@agentnode

Report an issue with this package