Add documentation for request coalescing in Kafka consumer

2026-05-11 21:10:23 +09:00 · 2026-05-11 21:10:23 +09:00 · f3487a4a0f
commit f3487a4a0f
parent c6ca4d23fc
1 changed files with 139 additions and 0 deletions
--- a/content/posts/request-coalescing-kafka-consumer.md
+++ b/content/posts/request-coalescing-kafka-consumer.md
@ -0,0 +1,139 @@
+---
+title: "Request Coalescing in a Kafka Consumer"
+date: 2026-05-09
+draft: false
+description: "Collapsing redundant upstream API calls in a Kafka consumer with Go's singleflight, applied to concurrent user-change events against a shared rate-limited API."
+tags: [go, kafka, singleflight, spanner, google-cloud, event-driven]
+github: ""
+url: ""
+---
+
+## Overview
+
+Discord's post on [how they store trillions of messages](https://discord.com/blog/how-discord-stores-trillions-of-messages)
+covers request coalescing as a way to collapse duplicate reads under concurrent
+load. The same problem appeared in a Kafka consumer I built for processing
+user-change events.
+
+Each event — email, username, address changes — is produced to a Kafka topic
+keyed by `accountId`. All events for the same account land on the same
+partition; each pod owns one or more partitions. For every message, the consumer
+calls an internal API to fetch the current account state and writes the result
+to Spanner.
+
+The constraint: the internal API has a **1500 TPS ceiling shared across the
+entire company**. Without coalescing, events for the same account arriving 2–3
+seconds apart each trigger a separate API call — for data that hasn't changed
+between calls. At scale, unnecessary duplicate calls erode the shared budget for
+every other team consuming the same API.
+
+## Architecture
+
+```
+User change
+     │
+     ▼
+┌──────────────────────────────────────────────┐
+│ Upstream pipeline                            │
+│ A (intake) → B (announce) → C (business      │
+│ logic + DB write)            ~20s to settle  │
+└───────────────────────┬──────────────────────┘
+                        │ Internal API
+                        │ (source of truth, 1500 TPS shared)
+                        │
+        Kafka topic (key = accountId)
+                        │
+                        ▼
+          ┌─────────────────────────┐
+          │      Consumer pod       │
+          │     (1+ partitions)     │
+          │                         │
+          │  msg: account X ────────┤
+          │  msg: account X ───wait─┤──► singleflight("X")
+          │  msg: account X ───wait─┤       │
+          │                         │    sleep 30s   ← upstream settles
+          │  msg: account Y ────────┤       │
+          │  msg: account Y ───wait─┤──►    API call (once per account)
+          │                         │       │
+          └─────────────────────────┘    Spanner write
+                                            │
+                                     Kafka commit
+                                   (all coalesced msgs)
+```
+
+## Stack
+
+- **Go** — consumer service
+- `golang.org/x/sync/singleflight` — per-`accountId` call deduplication
+- **Apache Kafka** — event stream, partitioned by `accountId`
+- **Cloud Spanner** — write target; chosen for strong consistency, managed
+  replication, and no operational overhead for a small team
+
+## Highlights
+
+**Kafka partitioning is load-bearing, not incidental.**
+Routing by `accountId` as the message key guarantees all events for one account
+hit the same partition and are processed by one consumer instance. Without this,
+two pods could independently fire API calls for the same account at the same
+time, and singleflight — which is in-process — would not help.
+
+**The 30-second sleep is deliberate, not a workaround.**
+The upstream pipeline takes roughly 20 seconds to apply business logic and
+commit state. The consumer sleeps 30 seconds before calling the API, giving the
+upstream time to settle. Messages for the same `accountId` that arrive during
+the sleep join the existing singleflight group and block — they never reach the
+API independently. When the call completes, all of them share the result.
+
+**Coalesced messages commit to Kafka together.**
+At the end of a singleflight group's lifetime, all messages that waited on that
+call have their Kafka offsets committed in a single batch. This is a natural
+consequence of the blocking behaviour: no message in the group is acknowledged
+until the group resolves.
+
+**Coalescing protects a shared rate limit, not just throughput.**
+The internal API enforces a 1500 TPS ceiling across the whole company. Each
+unnecessary duplicate call consumes capacity that other teams depend on.
+Singleflight makes the consumer a cooperative user of that shared resource
+rather than a potential source of contention. After deployment, API call volume
+dropped 36% — measured via consumer metrics sent to Datadog.
+
+**Different `accountId`s run in parallel.**
+Singleflight groups are keyed by `accountId`. An in-flight call for account X
+does not block processing for account Y — each account gets its own group,
+its own 30-second window, and its own API call.
+
+## Code
+
+> Illustrative only — this is not the production implementation.
+
+```go
+var group singleflight.Group
+
+func processMessage(ctx context.Context, accountID string) error {
+    result, err, _ := group.Do(accountID, func() (interface{}, error) {
+        // All messages for the same accountID block here.
+        // Only the first goroutine executes this function.
+
+        // Wait for the upstream pipeline to settle before fetching.
+        time.Sleep(30 * time.Second)
+
+        return fetchFromInternalAPI(ctx, accountID)
+    })
+    if err != nil {
+        return err
+    }
+
+    if err := writeToSpanner(ctx, result.(*AccountData)); err != nil {
+        return err
+    }
+
+    // Caller commits its Kafka offset after this returns.
+    // All goroutines that waited on the same group reach this point
+    // with the same result and commit their offsets together.
+    return nil
+}
+```
+
+## Status
+
+Running in production. API call reduction is tracked as an ongoing metric in Datadog.