[ Switch to styled version → ]
advanced · 4 agents · 11 skills
Deploy a distributed data labeling pipeline with 4 agents that ingests raw data, applies ML-based labels, reviews quality, and exports training-ready datasets. The system handles images, text, and audio across formats like COCO, VOC, and JSONL, with inter-annotator agreement checks and automated quality gating.
clawhub install pilot-data-labeling-pipeline-setup pilot-s3-bridgepilot-stream-datapilot-task-parallelpilot-task-routerpilot-datasetpilot-metricspilot-reviewpilot-event-filterpilot-alertpilot-sharepilot-webhook-bridge<your-prefix>-ingester - Data Ingester pilot-s3-bridge, pilot-stream-data, pilot-task-parallel <your-prefix>-labeler - Auto Labeler pilot-task-router, pilot-dataset, pilot-metrics <your-prefix>-reviewer - Quality Reviewer pilot-review, pilot-event-filter, pilot-alert <your-prefix>-exporter - Dataset Exporter pilot-dataset, pilot-share, pilot-webhook-bridge <your-prefix>-ingester → <your-prefix>-labeler:1002 - work-item events<your-prefix>-labeler → <your-prefix>-reviewer:1002 - labeled-item events<your-prefix>-reviewer → <your-prefix>-labeler:1002 - review-feedback events<your-prefix>-reviewer → <your-prefix>-exporter:1002 - approved-label events<your-prefix>-exporter → external:443 - dataset-published notifications# Replace <your-prefix> with a unique name for your deployment (e.g. acme)
# On server 1 (data ingestion)
clawhub install pilot-s3-bridge pilot-stream-data pilot-task-parallel
pilotctl set-hostname <your-prefix>-ingester
# On server 2 (auto labeling)
clawhub install pilot-task-router pilot-dataset pilot-metrics
pilotctl set-hostname <your-prefix>-labeler
# On server 3 (quality review)
clawhub install pilot-review pilot-event-filter pilot-alert
pilotctl set-hostname <your-prefix>-reviewer
# On server 4 (dataset export)
clawhub install pilot-dataset pilot-share pilot-webhook-bridge
pilotctl set-hostname <your-prefix>-exporter
# On ingester:
pilotctl handshake <your-prefix>-labeler "setup: data-labeling-pipeline"
# On labeler:
pilotctl handshake <your-prefix>-ingester "setup: data-labeling-pipeline"
# On labeler:
pilotctl handshake <your-prefix>-reviewer "setup: data-labeling-pipeline"
# On reviewer:
pilotctl handshake <your-prefix>-labeler "setup: data-labeling-pipeline"
# On reviewer:
pilotctl handshake <your-prefix>-exporter "setup: data-labeling-pipeline"
# On exporter:
pilotctl handshake <your-prefix>-reviewer "setup: data-labeling-pipeline"
pilotctl trust