Agent.email's OTP Handoff: How Curl-First Signup Bridges Machine and Human Identity

Agent.email solves a problem most agent infrastructure ignores: what happens when a machine initiates an account but a human needs to authorize it? The YC S25 company, built by Haakam, Michael, and Adi, created a curl-first signup flow where agents create restricted email inboxes, then request OTP codes from their human operators to unlock full access. It exposes the identity handoff boundary that every autonomous agent will hit when trying to open accounts that require human verification.

The core tension is simple. Agents need programmatic account creation. Many services need human verification for compliance and fraud prevention. Agent.email inverts the typical web-first signup by allowing agents to initiate via curl and humans to authorize via OTP.

Agent.email treats agents as first-class users. The signup endpoint returns Markdown by default, HTML only when a browser user-agent is detected. This is the opposite of every SaaS product built in the last 20 years.

Here’s the state machine:

State	Agent Capabilities	Human Role	Transition Trigger
Initiated	Can hit `/signup` endpoint with human email parameter	None	Agent receives restricted credentials
Restricted	Can only email registered human; 10 emails/day limit	Receives OTP request from agent, replies with code	Agent submits valid OTP
Claimed	Full inbox access; can email any address	Account remains tied to human email (1:1 mapping)	Restrictions lifted permanently

The restricted state is the critical piece. Until claimed, the agent inbox has a 10-email-per-day limit and can only communicate with its registered human. This prevents abuse while allowing the agent to complete the handoff autonomously.

The announcement notes “Right now it’s a 1:1 mapping between agent and human. The next step is many-to-one, because one person running several agents in parallel is already very common.” This constraint means each agent inbox is currently tied to a single human email during signup. The many-to-one evolution will require a more sophisticated state machine where one human can authorize multiple agent inboxes.

State Transitions and Race Conditions

The handoff protocol needs to handle several edge cases. The announcement describes a 1:1 mapping during signup but does not specify the race condition prevention mechanism. The constraint suggests a simple approach: the first agent to register with a given human email succeeds, subsequent attempts fail until that agent is claimed or expires.

OTP expiration and retry logic: The agent must handle email parsing to extract the OTP from the human’s reply. This parsing step is fragile because agents need to interpret natural language responses rather than structured API payloads.

IP-based rate limiting: The signup endpoint is “rate-limited hard by IP” to prevent bulk account creation. This is a blunt instrument but necessary when you’re allowing programmatic signup without CAPTCHA. An agent that exhausts its rate limit can’t create new inboxes until the window resets. The team does not specify the rate limit threshold or reset window, but the hard limit by IP suggests they’re prioritizing abuse prevention over convenience.

Implementation Details

The system makes several design choices that reveal the plumbing:

Markdown-first responses: Agents receive structured text instead of HTML. This reduces hallucination risk when parsing instructions. The team found that consistent formatting matters: the CLI outputs in a single column with uniform delimiters because mixed formatting is easy for humans to scan but harder for agents to parse reliably.

MessageID hallucination problem: The team “shortened messageIDs after agents started hallucinating completions on longer ones.” This is a concrete example of agent-first design constraints. Long identifiers cause LLMs to generate plausible-looking but incorrect completions. The fix is simple: shorter IDs reduce the token space and hallucination surface area. This is the kind of failure mode you only discover by running agents in production.

Email-based OTP delivery: Instead of SMS or authenticator apps, the OTP comes via email reply. This keeps the entire handoff in the agent’s communication channel. The agent doesn’t need to integrate with a separate verification system. The trade-off is that email parsing is less reliable than structured API responses.

CLI output formatting: The team redesigned their CLI to use single-column output with consistent delimiters. This is optimization for machine parsing, not human aesthetics. Mixed delimiters are easy for humans to scan visually but harder for agents to parse reliably without hallucinating structure.

Security and Attack Vectors

The restricted state is the primary security control. An unclaimed agent inbox can’t spam arbitrary addresses or participate in phishing campaigns. But several attack vectors remain:

Email spoofing: An attacker could spoof the human’s email address to send a fake OTP. The system likely validates the sender domain, but email authentication (SPF, DKIM, DMARC) is not foolproof.

OTP interception: If the human’s email account is compromised, an attacker can intercept the OTP and claim the agent inbox. This is the same risk as any email-based verification flow.

Agent credential theft: Once claimed, the agent has full inbox access. If the agent’s credential store is compromised, the attacker has a working email account.

The 10-email-per-day limit in restricted mode provides some blast radius containment. Even if an attacker claims an agent inbox, they can’t immediately use it for large-scale spam campaigns.

Observability Gaps

The announcement does not cover monitoring, logging, or audit trails. For production use, you need to track:

Agents stuck in unclaimed state beyond expected handoff latency
OTP request-to-claim duration (if this exceeds 5 minutes, investigate)
Failed OTP submission attempts (possible attack or parsing error)
Rate limit exhaustion events by IP

Without these metrics, you can’t detect agents failing to complete handoff or attackers probing the OTP mechanism.

Deployment Constraints

Agent.email is a hosted service, not self-hosted infrastructure. This creates a dependency: your agent’s identity is tied to a third-party service. If Agent.email goes down, your agent can’t receive email or complete handoffs.

The announcement does not specify data residency, retention policies, or SLA guarantees. For agents handling sensitive operations, these details matter. Some use cases have geographic restrictions on where email data can be stored.

Failure Modes

Several things can break the handoff:

Human doesn’t respond: The agent is stuck in restricted mode. It can retry the OTP request up to the 10-email-per-day limit, but there’s no escalation path if the human is unavailable.

Email delivery failure: If the OTP email doesn’t reach the human (spam filter, server issue), the agent can’t proceed. The system needs retry logic with exponential backoff.

Agent parsing error: If the agent misreads the OTP from the human’s reply, it fails to claim the account. Parsing natural language replies is fragile. The agent might need multiple attempts, consuming the 10-email daily budget.

Rate limit exhaustion: If an agent hits the IP rate limit, it can’t create new inboxes. This is a problem for agents that need to spin up multiple identities quickly.

Technical Verdict

Use Agent.email’s pattern if:

Your agent can reliably parse email replies to extract OTP codes
OTP claim latency under 5 minutes is acceptable for your workflow
You’re prototyping agent identity flows and need a working reference implementation
Your threat model allows email-based verification (no high-value transactions)

Avoid this pattern if:

You need sub-second account creation without human intervention
Your compliance requirements mandate self-hosted identity infrastructure with full audit trails
You’re handling operations where email spoofing is an unacceptable risk
You need to spin up multiple agent identities rapidly (IP rate limiting will block you)

The real contribution is the restricted-state pattern: agents can initiate signup but cannot send external emails until human OTP claim. Most agent infrastructure assumes either full autonomy or full human control. Agent.email shows what the middle ground looks like: machines initiate, humans authorize, machines continue.

The messageID hallucination discovery is particularly valuable. It’s a concrete example of how LLM behavior shapes infrastructure design. Shorter identifiers reduce hallucination risk, but this constraint propagates through your entire system. Every identifier your agent handles needs to be short enough to avoid completion errors.

For production deployment, extend this pattern with stronger authentication, comprehensive logging, and failover mechanisms. But as a proof of concept, Agent.email demonstrates agent-first design: curl instead of browsers, Markdown instead of HTML, email-based OTP instead of web forms.

Source Links

Show HN Discussion

The Curl-First Signup Flow