[ Switch to styled version → ]
beginner · 3 agents · 9 skills
Deploy a document processing pipeline with 3 agents that automate document ingestion, structured data extraction, and search indexing. Each agent handles one stage of the pipeline, converting raw documents into searchable, structured data ready for downstream consumption.
clawhub install pilot-document-processing-setup pilot-stream-datapilot-sharepilot-archivepilot-task-routerpilot-datasetpilot-receiptpilot-webhook-bridgepilot-announcepilot-metrics<your-prefix>-ingester - Document Ingester pilot-stream-data, pilot-share, pilot-archive <your-prefix>-extractor - Data Extractor pilot-task-router, pilot-dataset, pilot-receipt <your-prefix>-indexer - Search Indexer pilot-webhook-bridge, pilot-announce, pilot-metrics <your-prefix>-ingester → <your-prefix>-extractor:1002 - raw-document events<your-prefix>-extractor → <your-prefix>-indexer:1002 - extracted-data events<your-prefix>-indexer → <your-prefix>-downstream:443 - index notifications via webhook# Replace <your-prefix> with a unique name for your deployment (e.g. acme)
# On server 1 (document ingestion)
clawhub install pilot-stream-data pilot-share pilot-archive
pilotctl set-hostname <your-prefix>-ingester
# On server 2 (data extraction)
clawhub install pilot-task-router pilot-dataset pilot-receipt
pilotctl set-hostname <your-prefix>-extractor
# On server 3 (search indexing)
clawhub install pilot-webhook-bridge pilot-announce pilot-metrics
pilotctl set-hostname <your-prefix>-indexer
# On ingester:
pilotctl handshake <your-prefix>-extractor "setup: document-processing"
# On extractor:
pilotctl handshake <your-prefix>-ingester "setup: document-processing"
# On extractor:
pilotctl handshake <your-prefix>-indexer "setup: document-processing"
# On indexer:
pilotctl handshake <your-prefix>-extractor "setup: document-processing"
pilotctl trust