Your at-least-once pipeline is probably at-most-once

Ask an engineer about their outbound event pipeline and you'll usually hear "at-least-once." It's the answer that justifies idempotency keys, retries, dead letter queues, the whole apparatus. At-least-once is the floor: messages might be delivered more than once, but they won't be lost.

The actual semantics are usually weaker. Most pipelines self-described as at-least-once are at-most-once in practice, and the gap is invisible until something is already gone. The difference is rarely a missing component. It's an ack-ordering decision inside a queue handler, made years ago, that nobody noticed was load-bearing.

This post is about how to tell which pipeline you actually have.

The semantic ladder

Three delivery guarantees, in order of strength:

At-most-once. A message is delivered zero or one times. Loss is possible; duplication is not. This is what naive code gives you — send the message, hope for the best. Acceptable for low-stakes telemetry; catastrophic for transactional events.

At-least-once. A message is delivered one or more times. Duplication is possible; loss is not. This is the standard guarantee from production message systems. It requires that the message stays in the queue (or log) until the worker confirms successful processing — and "successful" has to mean the work is durably committed, not the worker has it in memory.

Exactly-once. Each message has exactly one effect on downstream state. Achievable end-to-end only through idempotency on top of at-least-once delivery. As a transport guarantee, exactly-once doesn't exist in distributed systems without strong assumptions about coordination that most pipelines can't make.

The ladder matters because each rung requires different machinery, and the difference between rungs isn't a feature you add. It's a property of how acks interact with work.

The ack-ordering trap

This is the single decision that silently downgrades at-least-once to at-most-once. A worker pulls a message from a queue and has two operations to perform: acknowledge the message back to the queue (so it doesn't get redelivered), and do the work the message describes (deliver to HubSpot, fire the webhook, update the database).

The order matters absolutely.

Ack-after-success is at-least-once. The worker pulls the message, does the work, then acks. If the worker crashes between pulling and finishing, the queue redelivers — possibly to another worker — and the work runs again. Idempotency handles the duplicate.

Ack-before-success is at-most-once. The worker pulls the message, acks immediately, then does the work. If the worker crashes between the ack and the work completing, the message is gone. The queue has already forgotten about it.

The seductive thing about ack-before-success is that it usually works. The crash window between ack and work-complete is small — milliseconds at most for a fast handler. In normal operation, it's invisible. Workers process millions of messages and nobody notices the occasional lost one because the loss rate is below the noise floor of every other failure mode.

Your at-least-once pipeline is probably at-most-once

Most teams say their event delivery is at-least-once. The actual semantics are usually weaker — and the gap is one ack-ordering decision in a queue handler. A walk through the patterns that silently downgrade delivery guarantees, and how to tell which one you have.

The semantic ladder

The ack-ordering trap

Diagnostic checklist

1. Ack-before-work

2. Visibility timeout shorter than work duration

3. In-memory queues without persistence

4. Success metrics that count the wrong thing

5. Queue purge on deploy

6. Log truncation that loses pending retries

How to tell

Fixing the gap

Closing