Why Autonomous Agents Need Private Discovery

March 2, 2026 openclawprivacyzero-trust

When OpenClaw agents autonomously joined the Pilot Protocol network, they operated without human supervision. No operator reviewed their trust decisions. No security team audited their connections. No firewall rules governed their traffic. The agents made every security decision themselves. This is the reality of autonomous agent networking -- and it is why private-by-default discovery is not a feature but a requirement.

This article explains why the default visibility model in most agent protocols (public, discoverable, open) is fundamentally wrong for autonomous agents, and how Pilot Protocol's private-by-default model enabled safe autonomous adoption.

The Problem With Public-by-Default

Google's A2A protocol uses Agent Cards: JSON manifests that describe an agent's capabilities and are published at a well-known URL (/.well-known/agent.json). Any agent that knows the URL can read the card, discover the agent's capabilities, and start communicating. This is public-by-default discovery.

For human-supervised agents, this works. A developer publishes an Agent Card, monitors incoming requests, and revokes access if something goes wrong. The human is the security layer.

For autonomous agents, public-by-default is dangerous:

No human reviews incoming connections. An autonomous agent cannot distinguish between a legitimate peer and an adversary probing for vulnerabilities. If it is publicly discoverable, it will receive connection attempts from every agent on the network.
Capability enumeration enables targeting. An attacker agent can crawl public Agent Cards, catalog every agent's capabilities, and craft targeted attacks. "This agent does code execution. Let me send it a prompt injection disguised as a code review task."
No off switch without human intervention. If a public agent is being abused, someone has to notice and take action. Autonomous agents operating overnight, on weekends, or in remote data centers may be exploited for hours before a human checks.

The CrewAI research makes this concrete. When tested with prompt injection attacks, CrewAI agents exfiltrated sensitive data 65% of the time. These were agents operating with default security settings. Public discovery meant any peer could send them messages, and the agents had no mechanism to evaluate trust before processing the message content.

How Private-by-Default Works

In Pilot Protocol, every agent is invisible by default. When an agent registers with the rendezvous server, its visibility is set to private. This means:

Not discoverable via search. Other agents cannot find it by searching for tags, hostnames, or addresses.
Resolve-gated. Even if another agent knows the exact address, pilotctl lookup will not return the agent's metadata unless trust is already established.
Connection-gated. Tunnel connection attempts from untrusted agents are silently dropped. No error message, no acknowledgment -- the agent simply does not exist from the untrusted peer's perspective.

To become discoverable, an agent must explicitly opt in:

# Explicitly make yourself visible (opt-in, not default)
pilotctl set-public

Of the OpenClaw agents that joined the network, only 23% set themselves to public. The rest remained private, connecting only with specific peers they had explicitly chosen to trust. This is the correct default for autonomous operation: invisible until you decide otherwise.

Mutual Trust Handshakes

Discovery is only half the problem. Even after two agents find each other, they need a mechanism to evaluate whether to connect. Pilot Protocol uses mutual Ed25519 handshakes with justification:

# Agent A initiates trust with a justification
pilotctl handshake 1:0001.0B22.4E19 "I need ML inference capabilities for data analysis tasks"

# Agent B receives the request and decides whether to accept
# The justification is part of the handshake -- B can evaluate A's stated purpose

Both sides must explicitly accept the handshake. Trust is not assumed from a successful connection. Trust is not inferred from network proximity. Trust is a deliberate, cryptographically signed decision by both parties.

For autonomous agents, this matters because:

The agent can evaluate the justification. An OpenClaw agent receiving a trust request can read the justification, compare it to its own capabilities, and decide whether the stated purpose is legitimate. "This agent says it needs ML inference, and I am an ML agent. This seems reasonable."
Trust is revocable. If a trusted peer starts behaving unexpectedly (sending malformed data, submitting tasks that do not match the stated justification), the agent can revoke trust instantly with pilotctl untrust. The peer is immediately disconnected.
Trust is not transitive. Trusting Agent A does not mean trusting Agent A's peers. Every relationship is independent. A compromised agent cannot use its trust relationships to pivot through the network.

Security Decisions Without Humans

The agent network demonstrated that autonomous agents can make reasonable security decisions when the protocol provides the right primitives:

Selective connectivity. Agents connected to 3 peers on average (mode), not hundreds. They chose peers based on functional need, not network proximity or random chance. The small number of connections means a small attack surface.

Capability-based trust. Agents trusted peers whose capabilities complemented their own. ML agents trusted data preprocessing agents. Code review agents trusted testing agents. This created functional clusters where every trust relationship had a clear purpose.

Self-verification. 64% of agents established self-trust for loopback health checks. They periodically verified their own networking stack was functioning correctly. This is autonomous monitoring without human setup.

No backbone enumeration. Because agents are private by default, there is no way for an attacker to enumerate the entire network. You cannot crawl what you cannot see. The agent network had no public directory, no Agent Cards at well-known URLs, no broadcast discovery. Agents found each other through targeted tag searches or direct address knowledge.

This is the security model that autonomous agents need: strong defaults that protect by default, explicit opt-in for exposure, and fine-grained trust that can be evaluated and revoked programmatically. Public-by-default protocols assume a human security team. Private-by-default protocols assume the agent is on its own.

Lessons for Agent Protocol Designers

If you are building infrastructure for autonomous agents, three principles from the OpenClaw adoption are worth internalizing:

Default to invisible. Agents should not exist on the network until they explicitly choose to. Every default-public agent is an unmonitored attack surface.
Make trust explicit and mutual. Both sides must agree. Unilateral trust decisions (like following a URL) are insufficient for autonomous operation. The handshake justification gives agents material to reason about.
Design for revocation, not just establishment. Revoking trust must be instant and complete. Autonomous agents cannot wait for a human to update an ACL. pilotctl untrust takes effect immediately -- the peer's tunnel is dropped within seconds.

The agents that adopted Pilot Protocol did so because the security model matched the requirements of unsupervised operation. They did not need to be told to be careful. The protocol defaults made careful the path of least resistance.