How to Prevent Infinite Loops in Bidirectional API Syncs: A Developer's Cookbook
Bidirectional API syncs easily spiral into infinite loops. Learn how to architect echo filtering, state hashing, and delta syncs to stop vampire records.
Bidirectional sync between your application and a third-party API like Salesforce or HubSpot sounds simple on a whiteboard. System A writes to System B. System B writes back to System A. Both stay in sync. In production, this is where your weekends go to die. The moment both systems can write to the same record, you open the door to infinite loops - phantom updates bouncing between systems, draining API quotas, polluting audit logs, and silently corrupting customer data.
These are what integration engineering teams call "vampire records": data entries that bounce back and forth indefinitely, feeding on your API rate limits without ever dying. The financial impact of this kind of bad data is well documented. Gartner estimates that poor data quality costs organizations an average of $12.9 million every year in wasted resources and lost opportunities. IBM research puts the macroeconomic cost of bad data to U.S. businesses at approximately $3.1 trillion annually. On the ground level, research shows sales reps waste roughly 27% of their time dealing with inaccurate CRM records. For inside sales teams, this translates to roughly 546 hours per representative annually.
When your bidirectional sync is the source of that bad data, the damage is immediate. If you are architecting a two-way sync and experiencing infinite loops, you need structural patterns to stop them at the edge.
This cookbook breaks down exactly why these loops occur, why naive workarounds fail, and provides four concrete, code-level patterns to build a loop-free architecture.
The Anatomy of a Vampire Record
To understand how to stop an infinite loop, you have to understand how it starts.
Consider a standard integration between your B2B SaaS application and a CRM. You want to keep the Contact record in sync.
- A user updates a contact's phone number in your application.
- Your background worker pushes the update to the CRM API.
- The CRM successfully updates the record.
- Because the record was updated, the CRM fires an AccountUpdated webhook back to your application.
- Your webhook receiver parses the payload, sees a "new" timestamp, and updates your local database to match the CRM.
- Because your local database was updated, your background worker detects the change and pushes the "update" back to the CRM.
- The cycle repeats indefinitely.
sequenceDiagram
participant App as Your Application
participant API as Third-Party API
App->>API: 1. PATCH /contacts/123 (Update phone)<br>Source: User
API-->>App: 200 OK
API->>App: 2. POST /webhook (Contact Updated)<br>Triggered by Step 1
App->>App: 3. Process Webhook & Update Local DB
App->>API: 4. PATCH /contacts/123 (Sync Local DB)<br>Triggered by Step 3
API-->>App: 200 OK
API->>App: 5. POST /webhook (Contact Updated)<br>Triggered by Step 4
Note over App,API: Infinite Loop Initiated

To stop this, you have to break the chain. You need a mechanism to identify which system originated the change and drop "echoes" before they trigger a recursive write. The patterns below are ordered from simplest to most comprehensive. In practice, you want to layer them.
Pattern 1: The Dedicated Integration User (Actor Filtering)
The most common and simplest pattern for breaking infinite loops is Actor Filtering. You require the customer to create a dedicated service account or "Integration User" inside the third-party platform.
When your application writes to the API, it authenticates as this specific user. When the third-party system fires a webhook back to your application, the payload usually contains an actor_id, updated_by, or author field. If that ID matches your Integration User, you know the event is just an echo of your own write, and you drop it immediately.
The Code
Here is how you might implement this in a standard Node.js/TypeScript webhook receiver:
// webhook-handler.ts
app.post('/webhook/crm', async (req, res) => {
const payload = req.body;
const accountId = req.query.account_id;
// 1. Acknowledge receipt immediately to prevent third-party retries
res.status(200).send('OK');
// 2. Fetch the stored configuration for this specific customer account
const integrationConfig = await db.getIntegrationConfig(accountId);
const myIntegrationUserId = integrationConfig.dedicated_user_id;
// 3. Extract the actor from the webhook payload.
// Field name varies by provider: "userId", "modifiedBy", "actor", etc.
const actorId = payload.event?.actor?.id
|| payload.event?.modified_by?.id
|| payload.event?.user_id;
// 4. Drop the event if we caused it
if (actorId === myIntegrationUserId) {
console.log(`[Webhook] Echo detected: event authored by integration user. Dropping.`);
return;
}
// 5. Otherwise, process the legitimate third-party update
await syncQueue.add('process-contact-update', payload);
});

The Trade-offs
While technically simple, Actor Filtering introduces significant business friction and fails in two common scenarios:
- Licensing Costs and Governance: Platforms like Salesforce, Jira, and Zendesk charge per seat. Forcing a customer to burn a paid license just to authorize your integration is a tough sell, especially for SMBs. Furthermore, as noted by Ambientia in their Jira-to-Jira sync architecture, using an integration user requires strict governance so human users do not log in with the service account and generate untracked manual changes.
- Skinny Webhook Payloads: Some platforms send "skinny" payloads with only a record ID and event type - no author info. For example, Smartsheet's callback payload indicates the changed objects and the event type, but it doesn't contain substantial object data. You have to make a secondary fetch to get actor info, which costs an extra API call per event.
- Shared Service Accounts: If your customer has three different tools writing through one shared Salesforce integration user, you cannot distinguish which tool made the change. You must mandate one dedicated user per integration.
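When the payload is skinny, the actor check becomes a two-step operation: use the inline actor if present, otherwise pay the extra API call. A minimal sketch, assuming a hypothetical injected `fetchEventDetails` helper that performs the secondary lookup against the provider's events endpoint:

```typescript
// Illustrative payload shape: some providers omit actor info entirely.
type SkinnyPayload = { recordId: string; eventType: string; actorId?: string };

async function resolveActorId(
  payload: SkinnyPayload,
  fetchEventDetails: (recordId: string) => Promise<{ actorId: string }>
): Promise<string> {
  // Use the inline actor if the provider included one...
  if (payload.actorId) return payload.actorId;
  // ...otherwise make the secondary fetch (costs one extra API call per event).
  const details = await fetchEventDetails(payload.recordId);
  return details.actorId;
}

async function isEcho(
  payload: SkinnyPayload,
  integrationUserId: string,
  fetchEventDetails: (recordId: string) => Promise<{ actorId: string }>
): Promise<boolean> {
  const actorId = await resolveActorId(payload, fetchEventDetails);
  return actorId === integrationUserId;
}
```

Injecting the fetcher keeps the echo check testable without network access; in production it would wrap the provider's actual API client.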
Pattern 2: Echo Filtering via Custom Headers and Metadata
If you cannot force the customer to provision a dedicated integration user, you need a way to tag your outbound API requests so the resulting webhooks carry a recognizable fingerprint. This is more reliable than actor filtering because the tag travels with the event, not with the user.
Some modern APIs support custom headers or metadata injection specifically for this purpose. For example, Smartsheet allows developers to pass a Smartsheet-Change-Agent header during an API call. If a webhook fires as a result, the header is included in the payload. Other platforms, like Stripe, allow you to pass arbitrary key-value pairs in a metadata object during mutations.
The Code
const MY_CHANGE_AGENT = 'my-saas-sync-engine';
// 1. Making the outbound API call with a tracking header/metadata
async function updateThirdPartyRecord(recordId: string, data: any) {
await fetch(`https://api.example.com/v1/records/${recordId}`, {
method: 'PATCH',
headers: {
'Authorization': `Bearer ${token}`,
'Content-Type': 'application/json',
// Injecting our custom correlation ID for APIs that support it
'X-Change-Agent': MY_CHANGE_AGENT,
'Smartsheet-Change-Agent': MY_CHANGE_AGENT
},
body: JSON.stringify({
...data,
// Fallback: Injecting into metadata if headers aren't supported
metadata: {
source_system: MY_CHANGE_AGENT
}
})
});
}
// 2. Filtering at the webhook receiver
app.post('/webhook/records', async (req, res) => {
const payload = req.body;
res.status(200).send('OK');
// Check for our specific fingerprint in the payload
const agent = payload.meta?.change_agent
|| payload.changeAgent
|| payload.data?.metadata?.source_system;
if (agent?.includes(MY_CHANGE_AGENT)) {
console.log('Echo detected via metadata/header. Dropping event.');
return;
}
await processExternalUpdate(payload);
});

Vendor-Specific Equivalents
| Platform | Mechanism | How It Works |
|---|---|---|
| Smartsheet | Smartsheet-Change-Agent header | Echoed back in webhook callback events |
| Salesforce | Custom field (e.g., Last_Synced_By__c) | Write a marker on the record; filter on it in triggers |
| HubSpot | hs_lastmodifieddate + origin comparison | No native header support; requires timestamp-based filtering |
| Jira | changelog.author.accountId | Compare against integration user's Atlassian account ID |
The Trade-offs
The glaring issue with this pattern is inconsistent API support. For every API that gracefully passes context markers through to its webhooks, there are ten legacy APIs that strip all custom headers and overwrite metadata objects. When the third-party API doesn't offer a native header mechanism, you can simulate it by writing a sentinel field on the record (e.g., _last_sync_source: "my-app"). Check this field in your webhook handler before processing. This costs an extra field on every synced record, but it works universally.
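That sentinel-field fallback can be sketched in a few lines. The `_last_sync_source` field name and the record shape are illustrative; real platforms would need a custom field provisioned for this.

```typescript
// Assumed marker value identifying our sync engine.
const SYNC_SOURCE = 'my-app';

interface SyncedRecord {
  id: string;
  _last_sync_source?: string;
  [key: string]: unknown;
}

// Stamp the sentinel onto every outbound write.
function tagOutbound(record: SyncedRecord): SyncedRecord {
  return { ...record, _last_sync_source: SYNC_SOURCE };
}

// In the webhook handler: drop inbound events whose record still carries our sentinel.
function isOwnEcho(record: SyncedRecord): boolean {
  return record._last_sync_source === SYNC_SOURCE;
}
```

One caveat: a human edit made after your write won't clear the sentinel on platforms that don't let you detect partial updates, so pair this with a timestamp or hash check rather than relying on it alone.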
Pattern 3: Fingerprinting and State Hashing
Actor filtering and header tagging are fast, but they depend on the third party providing useful metadata. Fingerprinting, sometimes called deterministic state comparison, works regardless of what the webhook payload contains.
The logic is straightforward: you hash the data payload of the incoming webhook and compare it to the hash of the record currently sitting in your database. If the hashes match, the incoming webhook contains no net-new information. It is either an echo of your own write or a redundant update. Either way, you drop it.
The Code
import { createHash } from 'crypto';
import stringify from 'json-stable-stringify';
// In-memory or Redis-backed state store
const stateStore = new Map<string, string>();
function generateRecordHash(recordData: any, trackedFields: string[]) {
// 1. Isolate only the business-relevant fields your sync actually modifies
const relevantData = trackedFields.reduce((acc, field) => {
acc[field] = recordData[field];
return acc;
}, {} as Record<string, unknown>);
// 2. Use a deterministic stringifier (ignores key order)
const stableString = stringify(relevantData);
// 3. Generate a SHA-256 hash
return createHash('sha256').update(stableString).digest('hex');
}
app.post('/webhook', async (req, res) => {
const incomingRecord = req.body.record;
res.status(200).send('OK');
const trackedFields = ['first_name', 'last_name', 'email', 'phone', 'status'];
const incomingHash = generateRecordHash(incomingRecord, trackedFields);
const storedHash = stateStore.get(incomingRecord.id);
if (incomingHash === storedHash) {
console.log('State hash match. Payload hasn\'t actually changed. Dropping redundant event.');
return;
}
// Real change detected. Update state and proceed.
stateStore.set(incomingRecord.id, incomingHash);
await applyUpdate(incomingRecord);
});

Critical Detail: Choose Your Hash Fields Carefully
State hashing requires strict normalization. Third-party APIs frequently inject read-only fields, computed properties, or system timestamps into their webhook payloads. If you hash the entire raw payload, fields like updated_at or system_modstamp will cause the hash to change every single time, rendering the fingerprint useless.
The OpenCTI project learned this the hard way. In their bidirectional sync, their ingestion code path defaulted a metadata version to now() when no version was provided. Because the sync operation generated a new version timestamp on every pass, it defeated the version check and created an unbounded A → B → A → B file replace loop.
You must maintain a rigid schema mapping that hashes only the business-relevant fields your sync actually modifies, stripping out volatile fields before hashing. This adds maintenance overhead, as you have to update your hashing logic when schemas evolve.
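Denylisting volatile fields is the mirror image of the allowlist approach shown in the Pattern 3 code. A sketch, with an illustrative field list you would tune per provider:

```typescript
import { createHash } from 'crypto';

// Illustrative: system-managed fields that change on every write and must
// never participate in the fingerprint.
const VOLATILE_FIELDS = ['updated_at', 'system_modstamp', 'etag', 'revision'];

function normalizeForHash(record: Record<string, unknown>): string {
  const stable = Object.keys(record)
    .filter((key) => !VOLATILE_FIELDS.includes(key))
    .sort() // deterministic key order without a stable-stringify dependency
    .map((key) => `${key}=${JSON.stringify(record[key])}`)
    .join('|');
  return createHash('sha256').update(stable).digest('hex');
}
```

With this in place, two payloads that differ only in `updated_at` produce identical fingerprints, which is exactly the property the OpenCTI loop above was missing.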
Pattern 4: Watermark-Based Delta Syncs
Webhooks are the fast path for change detection, but they are erratic. Most integration architectures need a polling fallback - a scheduled, watermark-based delta sync that catches events webhooks missed (dropped deliveries, late arrivals, endpoint downtime).
To prevent loops in a delta sync, you manage the time window. You use a last_successful_run timestamp per resource per account. On each polling cycle, you only fetch records modified after the watermark. Your own writes, which happened before the watermark was advanced, naturally fall outside the query window.
The Code
interface SyncState {
resourceType: string;
accountId: string;
lastSuccessfulRun: string; // ISO 8601 timestamp
}
async function pollForChanges(state: SyncState, apiClient: ApiClient) {
const modifiedSince = state.lastSuccessfulRun;
const now = new Date().toISOString();
// 1. Fetch only records modified after our last successful sync.
// Our own writes from the previous cycle fall before the watermark.
const records = await apiClient.listRecords(state.resourceType, {
modified_after: modifiedSince,
sort: 'modified_at:asc',
});
// 2. Further filter out records we very recently updated ourselves
// to account for clock skew between servers.
const myRecentWrites = await db.getRecentOutboundWrites(state.accountId, state.resourceType);
const validUpdates = records.filter(record => {
const writeEvent = myRecentWrites.find(w => w.external_id === record.id);
if (writeEvent) {
const timeDiff = Math.abs(new Date(record.updated_at).getTime() - new Date(writeEvent.timestamp).getTime());
if (timeDiff < 5000) return false; // Assume echo if within 5 seconds
}
return true;
});
for (const record of validUpdates) {
await processRecord(record);
}
// 3. Advance the watermark only after successful processing.
state.lastSuccessfulRun = now;
await saveSyncState(state);
}

Why This Prevents Loops
Consider the timeline:
- T=0: Your sync polls the CRM. Watermark is T=0.
- T=1: Your sync writes record X to the CRM. CRM sets modifiedAt: T=1.
- T=2: Sync completes successfully. Watermark advances to T=2.
- T=3: Next poll fetches records where modifiedAt > T=2. Record X (modified at T=1) is excluded.
The echo never enters your pipeline. The catch: watermark sync has inherent latency. It only catches changes on the next polling interval. Furthermore, you have to account for clock skew between your servers and the third-party API, which requires adding a time buffer and handling duplicates gracefully. For near-real-time requirements, pair it with webhook-based echo filtering (Patterns 1-3) as the primary path and watermark polling as the correctness backstop.
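The buffer-and-dedupe step can be sketched as follows, assuming a fixed 60-second overlap window (size it to your worst observed skew) and an idempotency key of record ID plus modification timestamp:

```typescript
// Assumed overlap: rewind the watermark so skewed writes aren't missed.
const SKEW_BUFFER_MS = 60_000;

function effectiveWatermark(lastRunIso: string): string {
  return new Date(new Date(lastRunIso).getTime() - SKEW_BUFFER_MS).toISOString();
}

// Because the window overlaps, the same record version can be fetched twice;
// skip (id, modified_at) pairs we've already processed.
function dedupe<T extends { id: string; modified_at: string }>(
  records: T[],
  seen: Set<string>
): T[] {
  return records.filter((record) => {
    const key = `${record.id}:${record.modified_at}`;
    if (seen.has(key)) return false;
    seen.add(key);
    return true;
  });
}
```

The overlap trades a few redundant fetches for the guarantee that a write landing "before" your watermark due to clock skew still gets picked up exactly once.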
Handling the Fallout: Rate Limits and Circuit Breakers
No matter how perfectly you architect your loop prevention, edge cases will eventually trigger a runaway sync. A customer might map two fields to each other in a circular reference, or a third-party API might deploy a bug that breaks your hashing logic.
When a loop does start, the first symptom is usually a flood of HTTP 429 Too Many Requests responses. Your API quota evaporates in minutes. How your system handles these 429s determines whether the loop quietly pauses or takes down your entire background worker fleet.
Do not blindly retry 429 errors. If two systems are caught in a bidirectional loop and both implement aggressive exponential backoff, they will simply hammer the API the moment the rate limit window resets, instantly exhausting the quota again.
You must implement a circuit breaker that respects standardized rate limit headers. The IETF has drafted a standard set of HTTP header fields for rate limiting (ratelimit-limit, ratelimit-remaining, ratelimit-reset).
interface RateLimitInfo {
limit: number;
remaining: number;
resetAt: number; // Unix epoch seconds
}
function parseRateLimitHeaders(headers: Headers): RateLimitInfo | null {
// Try standardized IETF headers first, then common vendor variants
const limit = parseInt(headers.get('ratelimit-limit') || headers.get('x-ratelimit-limit') || '0');
const remaining = parseInt(headers.get('ratelimit-remaining') || headers.get('x-ratelimit-remaining') || '0');
const reset = parseInt(headers.get('ratelimit-reset') || headers.get('x-ratelimit-reset') || '0');
if (!limit) return null;
return { limit, remaining, resetAt: reset };
}
function shouldCircuitBreak(info: RateLimitInfo): boolean {
// If we've burned through 90%+ of our quota, something is wrong.
// Likely a loop. Pause and investigate.
const usagePercent = ((info.limit - info.remaining) / info.limit) * 100;
return usagePercent > 90;
}

A sudden, unexpected drop in ratelimit-remaining for a single account is one of the strongest signals that a loop is active. Instrument your integration layer to emit metrics on quota consumption per account. When an alert fires, the circuit breaker should halt all outbound syncs for that specific customer account until the reset time passes, rather than allowing a single vampire record to consume the account's entire API quota. For a deeper look at managing these limits, refer to our guide on How Mid-Market SaaS Teams Handle API Rate Limits and Webhooks at Scale.
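A per-account breaker built on top of those parsed headers might look like the following sketch; the 90% threshold, state shape, and injectable clock are illustrative choices:

```typescript
interface QuotaInfo {
  limit: number;
  remaining: number;
  resetAt: number; // Unix epoch seconds, from ratelimit-reset
}

interface BreakerState {
  openUntil: number; // breaker stays open until this epoch second
}

const breakers = new Map<string, BreakerState>();

// Call after every API response for the account; trips the breaker when
// quota consumption crosses the threshold.
function recordQuota(accountId: string, info: QuotaInfo): void {
  const usedPct = ((info.limit - info.remaining) / info.limit) * 100;
  if (usedPct > 90) breakers.set(accountId, { openUntil: info.resetAt });
}

// Gate every outbound sync for the account on this check.
function canSync(accountId: string, nowEpochSec: number): boolean {
  const state = breakers.get(accountId);
  return !state || nowEpochSec >= state.openUntil;
}
```

Keying the breaker on account ID is the important part: one customer's runaway loop pauses only that customer's sync, not your whole worker fleet.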
Putting It All Together: The Layered Defense
No single pattern stops every loop. APIs differ too much. Some send rich webhook payloads with actor metadata. Some send skinny payloads with just a record ID. Some don't support webhooks at all. The architecture that survives production is a layered one.
flowchart TD
A["Inbound Webhook Event"] --> B{"Actor = integration user?"}
B -->|Yes| Z["Drop event"]
B -->|No / Unknown| C{"Change-Agent header<br>or origin tag present?"}
C -->|Yes, it's ours| Z
C -->|No / Not supported| D{"Payload hash matches<br>stored state?"}
D -->|Match| Z
D -->|Different| E["Process event"]
E --> F["Write to target system"]
F --> G["Update stored hash"]
H["Polling Fallback"] --> I{"Record modified after<br>watermark?"}
I -->|No| Z
I -->|Yes| D

- Layer 1 (fastest): Actor/header filtering at ingestion. Catches 80-90% of echoes with zero latency.
- Layer 2 (reliable): Payload fingerprinting. Catches echoes that slip past actor filtering - including multi-hop loops. API-agnostic.
- Layer 3 (correctness backstop): Watermark delta sync. Catches everything the webhook path misses. Adds latency but guarantees convergence.
- Cross-cutting: Rate limit monitoring as a circuit breaker. Your last line of defense before the quota is gone.
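Composed into a single ingestion gate, the webhook-path layers reduce to an ordered set of checks; the event shape and identifiers below are illustrative:

```typescript
interface InboundEvent {
  recordId: string;
  actorId?: string;      // from actor filtering (Pattern 1)
  changeAgent?: string;  // from header/metadata tagging (Pattern 2)
  payloadHash: string;   // from state fingerprinting (Pattern 3)
}

// Returns true when the event is an echo and should be dropped.
function shouldDrop(
  event: InboundEvent,
  integrationUserId: string,
  changeAgent: string,
  storedHashes: Map<string, string>
): boolean {
  if (event.actorId === integrationUserId) return true;                     // Layer 1
  if (event.changeAgent === changeAgent) return true;                       // Layer 2
  if (storedHashes.get(event.recordId) === event.payloadHash) return true;  // Layer 3
  return false;
}
```

The ordering matters: the cheap identity checks run first, and the hash comparison, which requires a state-store read, only runs for events the first two layers can't classify.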
How Truto Simplifies Bidirectional Sync Architecture
Building these loop-prevention mechanisms from scratch for one API is difficult. Building them for fifty different APIs - each with different webhook structures, rate limit behaviors, and authentication models - requires a dedicated integrations team.
Truto addresses these pain points through a declarative, data-driven approach:
Webhook normalization via JSONata. Truto allows you to handle webhook normalization and loop prevention entirely through configuration data. Each integration's webhook mapping is a declarative JSONata expression that extracts event type, actor, and payload data into a common schema. You can write a single expression to evaluate incoming webhooks, check actor IDs, or compare state hashes, and filter out echo events at the mapping layer before they ever reach your application logic.
Standardized rate limit headers. When an upstream API returns a 429, Truto passes that error directly to the caller with the standardized headers attached (ratelimit-limit, ratelimit-remaining, ratelimit-reset). Your system gets clear, uniform backpressure signals, allowing you to implement a single circuit breaker that works across every integration, whether the underlying API is Salesforce, HubSpot, or Zendesk.
Per-customer override hierarchy. Different customers have different CRM configurations. One customer's Salesforce instance might use a custom Integration_Source__c field for origin tagging, while another relies purely on actor filtering. Truto provides an environment-level and account-level override hierarchy. You can apply customer-specific loop-prevention logic directly to their account without deploying a single line of new code. For more details on architecting these systems without hardcoded logic, read Zero Integration-Specific Code: How to Ship API Connectors as Data-Only Operations or explore The Architect's Guide to Bi-Directional API Sync (Without Infinite Loops).
What to Do Next
Bidirectional syncs do not have to be a liability. If you're building or debugging a bidirectional sync right now:
- Audit your webhook handlers. Are you checking actor/author on every inbound event? If not, you have an open loop vector.
- Add fingerprint tracking. Even if your actor filtering works today, one API change to the webhook payload format can silently break it. Hashing is your safety net.
- Instrument rate limit consumption. Set up per-account alerts for anomalous quota depletion. This is your early warning system.
- Test with a synthetic loop. In staging, intentionally create a bidirectional write and watch what happens. If your filters don't catch it, you know what to fix before production teaches you the hard way.
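A toy harness for that synthetic-loop test, entirely illustrative: it models two systems echoing each other's writes with a hash-based filter toggled on or off, and counts hops until the loop dies or hits a safety cap.

```typescript
// Simulates a bidirectional write where the far side echoes every update
// back as a webhook. Returns the number of sync hops taken.
function simulateSync(filterEchoes: boolean, maxHops = 10): number {
  let hops = 0;
  const seenHashes = new Set<string>();
  let pending: string | null = 'phone=555-0100'; // the initial user edit

  while (pending && hops < maxHops) {
    hops++;
    if (filterEchoes && seenHashes.has(pending)) {
      pending = null; // echo detected: drop it and break the chain
    } else {
      seenHashes.add(pending);
      // the other system "echoes" the identical state back as a webhook,
      // so `pending` stays set for the next iteration
    }
  }
  return hops;
}
```

With filtering off, the loop runs until the hop cap; with filtering on, it dies as soon as the echo comes back. Run the same shape of test against your staging environment before production teaches you the hard way.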
FAQ
- What is a vampire record in API integrations?
- A vampire record is a data entry caught in an infinite loop between two bidirectionally synced systems. It continuously triggers updates, bouncing back and forth indefinitely, draining API quotas and corrupting data without ever resolving.
- How do you filter webhook echoes to prevent infinite loops?
- You can filter echoes by checking the actor ID in the webhook payload against a dedicated integration user, or by passing custom metadata headers (like Smartsheet's Change-Agent) during the outbound API call that are echoed back in the webhook.
- Why is payload state hashing useful for loop prevention?
- Hashing allows you to compare the incoming webhook data payload against your local database state. If you hash only the business-relevant fields and the hashes match, the update contains no new information and can be dropped as an echo.
- How should systems handle API rate limits during an infinite loop?
- Systems should implement circuit breakers that read standard IETF rate limit headers (ratelimit-reset) to pause syncs on a per-account basis. Blindly retrying with exponential backoff will simply exhaust the quota the moment the rate limit window resets.
- What is the best pattern for bidirectional sync loop prevention?
- No single pattern is foolproof. The most resilient architecture uses a layered defense: actor/header filtering at ingestion for speed, state fingerprinting as a reliable secondary gate, watermark-based delta sync as a polling fallback, and rate limit circuit breakers as the final safety net.