22. May 2026 15 min read

Medium

Kai Ole Hartwig

CVE-2026-5950 — unbounded resend loop in the BIND 9 resolver: why DNS resilience is more than just a patch in the update wave

Q: How do I check whether my distribution package is already patched?

named -v shows the version number. Expected: 9.18.49, 9.20.23, or 9.21.22. For Debian/Ubuntu, additionally check the package changelog: apt changelog bind9 | head -50 shows whether the CVE fix is in the backport.

Q: We use CoreDNS in the Kubernetes cluster — are we affected?

CoreDNS itself is not. But if CoreDNS forwards via the forward plugin to an external BIND recursor, that forward target is affected. The question is: which resolver stands behind your CoreDNS? If that is a BIND in the affected version range, the patch recommendation applies.

21 May 2026. Yesterday, ISC published the advisory for CVE-2026-5950 — an unbounded resend loop in the BIND 9 resolver state machine that, under the “bad server” handling path, can run into an infinite repetition loop until only the query timeout breaks it. Patches are available in 9.18.49, 9.20.23, and 9.21.22; authoritative services are not affected per ISC, recursive resolvers are. For platform operators with their own internal DNS resolver in container clusters, in edge-caching setups, or in classic office networks, today is a small, orderly update wave — and perhaps the occasion to take an honest look at the DNS resilience architecture.

Offener Walnuss-und-Messing-Schlüsselkasten auf mattem dunklem Schiefer mit zwei identischen Schlüsselloch-Slots. Ein gebürsteter Messing-Hauptschlüssel mit oxblutrotem Band hängt am Walnuss-Pendelarm exakt zwischen beiden Slots, ohne in einem zu greifen. Unten links Messing-Zählwerk und geschlossenes Walnuss-Logbuch, oben rechts eine Messing-Uhrwerk-Hemmung. Metapher für die Endlos-Wiederholungs-Schleife im Resolver-State-Machine, die zwischen zwei „bad server"-Zuständen pendelt, ohne abzuschliessen. — AI-generated · gpt-image 2.0

TL;DR — the 90-second summary

A classic resolver resource exhaustion with a clear patch path — and a good occasion to look at the DNS layer in the platform architecture the same way we look at web servers and container hosts.

Question	Answer
Affected?	BIND 9 as a recursive resolver in the version lines `9.18.36` through `9.18.48`, `9.20.8` through `9.20.22`, `9.21.7` through `9.21.21` plus subscription lines `9.18.36-S1` through `9.18.48-S1` and `9.20.9-S1` through `9.20.22-S1`. Authoritative services are not affected per ISC.
Risk?	Severe resource exhaustion on the resolver via targeted query sequences that trigger specific retry conditions in the “bad server” handler. A remote, unauthenticated attacker can hold the resolver in an infinite loop until the query timeout — not RCE, but a DoS on an infrastructure layer that applications, updates, TLS verification, and service discovery depend on.
Immediate action?	Distribution update to `9.18.49`, `9.20.23`, or `9.21.22`; for self-builds, pin the BIND build accordingly. No configuration-knob workaround known — the fix sits in the resolver state-machine code. Containerised BIND setups: image rebuild against the updated distribution / Wolfi package.
Recommendation?	Mittelstand: regular maintenance rollout in the weekly window; not rushed, but not delayed. Enterprise: additionally evaluate resolver telemetry for query-retry trajectories per upstream NS and add anomalies as a new detection track. For everyone: take this advisory as an occasion to tidy up the DNS layer in the architecture — single resolver, no failover, no health check is no longer a default position in 2026.
Criticality?	See hero badge — `medium`. Resource exhaustion on the resolver is a substantial platform impact, but no RCE/data leak and no active exploitation in circulation.

What is the problem? — the “bad server” path in the resolver state machine

A recursive resolver has to deal with unreliable upstream name servers — that is normal operation. CVE-2026-5950 sits in the logic that handles exactly this reality.

CVE	CVE-2026-5950
Vendor advisory	ISC Knowledge Base — CVE-2026-5950
Component	BIND 9, recursive resolver state machine, “bad server” handling path
Vulnerability type	Unbounded loop / resource exhaustion (CWE-835, Loop with Unreachable Exit Condition)
Affected versions	`9.18.36` – `9.18.48`, `9.20.8` – `9.20.22`, `9.21.7` – `9.21.21`; subscription: `9.18.36-S1` – `9.18.48-S1`, `9.20.9-S1` – `9.20.22-S1`
Not affected	Authoritative-only configurations; all BIND versions before `9.18.36` (bug introduced in the lines above)
Fixed in	BIND `9.18.49`, `9.20.23`, `9.21.22` (public release by ISC)
Severity	ISC: medium. CVSS assignment pending, ISC-typical ~7.x for resource exhaustion on the resolver
Exploitation status	No active exploits per ISC, no known workarounds

The bug in resolver logic

A recursive DNS resolver has to deal with the reality that upstream name servers respond badly — delayed, with incomplete answers, with truncation flags, with server-side errors. BIND has a “bad server” handler for this that detects when a particular upstream NS is unreliable and adapts the strategy (choose another NS from the NS set, retry with another transport, etc.).

The bug: in a certain constellation of “bad server” handling, the resolver state machine enters a retry loop with no built-in exit criterion — the only exit path is the global query timeout (typically 10 seconds, in defensive setups up to 30 seconds). During that time the loop holds a resolver worker slot, writes log entries, sends further queries to the upstream NS marked “bad”, and processes no new client request.

An attacker who supplies the right sequence of prepared responses (or controlled authoritative NS responses under their own domain) can trigger this path on purpose. With enough parallel queries against the same resolver, the worker slots get exhausted, and the resolver is effectively DoS'd.

Who is affected?

Most important distinction: recursive vs. authoritative configuration. The second is not affected, the first is. The rest is version inventory.

Setup	Status	Condition
BIND 9 recursive resolver in office / Mittelstand network	Affected	If version sits in the lines above; distribution packages are usually in the 9.18 line (Debian 13: `9.18.x`; Ubuntu 24.04: `9.18.x`)
BIND 9 as authoritative NS server	Not affected	ISC assessment explicitly
BIND 9 in mixed mode (recursive + authoritative)	Affected	The recursive part is
Unbound, PowerDNS Recursor, dnsmasq	Not affected	Independent implementations; check their own CVE trackers
Kubernetes CoreDNS cluster resolver	Not directly	CoreDNS is Go-implemented, not BIND. But: when forwarder is configured to an external BIND resolver, indirectly affected if that forward target runs BIND
Public DNS resolvers (1.1.1.1, 8.8.8.8, 9.9.9.9)	Operator-dependent	Cloudflare uses its own resolver; Google and Quad9 mix implementations. The resolver operator's own risk situation, not directly a Mittelstand topic
Container images with BIND as recursive resolver (e.g. PowerDNS-Auth + BIND-Recursor sidecar)	Indirectly	Check image pinning, check Wolfi/Alpine package against patch wave

Impact

DNS outage affects more than just “the website does not load”. Three consequences that belong in the situation assessment.

DNS outage as a platform domino. A non-responding DNS resolver has consequences far beyond “the website does not load”: TLS certificate refresh (Let's Encrypt ACME path), software updates (apt/dnf/Wolfi-apk), container image pulls (registry hostname resolution), service discovery in microservice architectures, mail delivery (SPF/DKIM/DMARC lookup), backend calls to external APIs. Anyone running only a single BIND recursor in the internal office network has, in the worst case, a partial outage of platform operations.

Is the query timeout window enough? Yes — and no. Yes in the sense that the loop does not run forever (query timeout breaks it); no in the sense that with enough parallel queries the loop consumes worker slots and slot exhaustion occurs faster than the timeouts take effect. For a typical Mittelstand recursor with 50–200 parallel slots, a few hundred trigger queries per second are enough to effectively cripple the resolver.

Logging avalanche. While the loop runs, BIND usually writes log entries — which under an exploited bug can lead to disk-IO avalanches. Anyone without log rate limit or log rotation sanity check gets a secondary effect on the logging backend.

Mitigation / immediate measures

Three tracks: distribution update, container rebuild, defence in depth without a configuration knob.

Path 1 — distribution package update

# Debian 13 / Ubuntu 24.04:
apt update && apt install --only-upgrade bind9 bind9-utils
named -v   # 9.18.49 expected once backport is in the repo

# RHEL 9 / Rocky / Alma:
dnf upgrade bind bind-utils
named -v

# Fedora 43+:
dnf upgrade bind
named -v

# Wolfi:
apk update && apk upgrade bind

Distribution backports usually take a few days after the ISC release. As of 21 May 2026: ISC release was 20 May, Debian/Ubuntu/RHEL packages should be in the standard repo between 22 and 26 May. For faster turnaround, fall back to the ISC source tarball or activate distribution-specific proposed-updates repos.

Path 2 — containerised BIND setups

# Image rebuild against the updated Wolfi / Alpine / Debian package
# Example Dockerfile (Wolfi base):
docker build --pull -t my-recursor:bind-9.18.49 -<<'EOF'
FROM cgr.dev/chainguard/wolfi-base:latest
RUN apk add --no-cache bind
COPY named.conf /etc/bind/named.conf
EXPOSE 53/udp 53/tcp
ENTRYPOINT ["/usr/sbin/named","-g","-c","/etc/bind/named.conf"]
EOF

# Then a rolling restart via Kubernetes:
kubectl -n dns rollout restart deployment/bind-recursor

Path 3 — no workaround, but defence in depth

There is no configuration knob that blocks the resend loop — the fix sits in the state-machine code. What you can do actively in the meantime:

Run multiple resolvers redundantly: if you only have one internal BIND recursor, that is a bad architecture anyway. A second resolver instance (preferably with a different implementation — Unbound or PowerDNS Recursor as failover) is the robust answer to any resolver DoS situation, not just this one.
Query volume telemetry: if your resolver's outgoing query rate against a single upstream NS spikes, that is an indicator. The Prometheus exporter for BIND (bind_exporter) provides the metrics.
Resolver cache TTL hygiene: a well-warmed cache buffers temporary resolver outages. Anyone running the cache aggressively short is faster in the hole in a DoS event.

Detection / verification

Version inventory, retry telemetry, log patterns — in that order.

Version inventory

# On every host with BIND:
named -v
# Expected: BIND 9.18.49 / 9.20.23 / 9.21.22 (or higher)

# Through inventory:
ansible all -m shell -a 'named -v 2>&1' | grep -E '9\.(18|20|21)\.'

Resolver telemetry on retry bursts

# BIND own statistics interface:
rndc stats
cat /var/cache/bind/named.stats | grep -E 'resolver|query'

# Prometheus bind_exporter — key metrics:
#   bind_resolver_queries_total{type=...}
#   bind_resolver_retries_total{type=...}
#   bind_resolver_query_duration_seconds{quantile="0.99"}

Suspicion pattern: a strong rise in bind_resolver_retries_total while bind_resolver_queries_total stays nearly unchanged (internal retry traffic without matching client load). That is exactly the effect the unbounded loop produces.

Log anomalies

# named.log for repeat patterns against a single upstream NS:
grep -E 'lame server|FORMERR|SERVFAIL' /var/log/named/named.log \
  | awk '{print $NF}' | sort | uniq -c | sort -nr | head -20

Operator recommendation

Operational decision block

Mitigate immediately if: you operate a publicly reachable recursive BIND resolver (mail provider, ISP-like role, hosting provider with customer DNS) — the attack surface is open
A maintenance window is fine if: you run an internal BIND recursor in an office / datacenter network that is not directly reachable — standard update rollout in the weekly window
Backlog is enough if: BIND runs authoritative-only (no recursion path configured) — authoritative services are explicitly not affected

Mittelstand

Wait for the distribution backport (expected in the next 3–5 days), regular patch run in next week's maintenance window. In parallel, look at the DNS resilience question: who still runs exactly one recursor with a single implementation today? A second recursor in a different implementation (BIND + Unbound, or BIND + PowerDNS Recursor) is a small increase in complexity for a significant resilience gain. We recommend this for new setups, and this advisory is a good occasion to raise the question in existing environments.

Enterprise

Evaluate resolver telemetry on retry trajectories per upstream NS and hook it in as anomaly detection. For critical DNS paths, enforce anycast DNS and multi-implementation as default architecture. Run the update wave through the usual pre-patch validation pipeline — BIND is a component where “just update” works without friction in most cases, but subscription lines (BIND 9 Supported) often require their own test steps.

Kubernetes

If CoreDNS forwards to an external BIND recursor (forward . 10.x.x.x in the Corefile): patch the backend recursor. CoreDNS itself is not affected. For cluster-internal recursors as sidecar / DaemonSet, pin the image tag and update against the patch wave.

Declarative stacks

Plan a NixOS/Talos/Flatcar module update against the new BIND versions; for NixOS flakes typically within days in the unstable channel, in the stable channel in the next backport run.

What we do

A preliminary note on our own situation: BIND is not the central DNS layer in our own platform — service resolution in the Kubernetes clusters and application containers sits on CoreDNS, which forwards external lookups to an upstream resolver service (provider resolver or our own recursor instance, depending on the setup). Where BIND runs as a recursive resolver, this CVE situation applies directly; where another implementation (Unbound, PowerDNS Recursor) serves the path, it does not.

How we approach the situation operationally — as a working pattern, not as a report of completed measures:

Version inventory: reconcile hosts with bind9 / bind in the package index against the affected branch ranges (9.18.36–9.18.48, 9.20.8–9.20.22, 9.21.7–9.21.21). Distribution stable packages are the default line in most existing environments and therefore within the affected range; in mixed-mode setups (authoritative + recursive) the recursive part is affected.
Arm the patch pipeline: as soon as the distribution backports are available (expected between 22 and 26 May), the update runs through at platforms under our care in the respective maintenance window — like any other distribution CVE update. Subscription lines (BIND 9 Supported) often need their own test steps; that is planned in.
Resolver telemetry: where bind_exporter metrics are missing or incomplete in the Prometheus view, this is the occasion to complete the view — not as a mandatory update for this CVE, but because a resolver DoS without telemetry only becomes visible as a follow-on damage, not as a cause.
Architecture question: where we see only a single internal BIND resolver today, the CVE is a good occasion for an architecture note recommending running a second resolver in an alternative implementation (Unbound or PowerDNS Recursor) in parallel. That is not a consequence of this specific CVE, but a general resilience recommendation.
Logging sanity check: check BIND log rotation against disk-IO avalanches, before an exploited resend loop drags the logging backend along as a secondary effect. logrotate.d/named with a short interval plus compress is the robust default configuration.

A substantive observation: DNS resilience is a layer that in the Mittelstand is often booked as “it just runs, it is just there” — until it no longer runs. CVE-2026-5950 is not spectacular, but it is a good occasion to look at the DNS layer the way we look at web servers, container hosts, and CMS stacks: with versioning, telemetry, redundancy, recovery path.

Frequently asked questions about CVE-2026-5950

Are our authoritative BIND NS servers affected?+

No. ISC explicitly excludes authoritative services from the affected set. The bug sits in the recursive resolver state machine, which is not active at all in an authoritative-only configuration. However, if your BIND runs in mixed mode (authoritative + recursion enabled), the recursive part is affected.

Is recursion no; enough as a workaround?+

Yes, if you can deliberately disable recursion. For many Mittelstand setups, however, the recursive path is exactly the reason to deploy BIND — there “turn off recursion” is not an option, but: install the patch.

How do I check whether my distribution package is already patched?+

named -v shows the version number. Expected: 9.18.49, 9.20.23, or 9.21.22. For Debian/Ubuntu, additionally check the package changelog: apt changelog bind9 | head -50 shows whether the CVE fix is in the backport.

Is Unbound affected?+

No. Unbound is an independent resolver implementation (NLnet Labs), not the BIND code. If you are worried about DNS resolver security and only have one implementation in use: a second resolver in a different implementation is the right response — not just to this CVE, but in general.

If the bug is remote-unauthenticated triggerable — does that mean anyone on the internet can take down my resolver?+

Only if your resolver is reachable for external clients. Open-recursor setups (recursive answers to arbitrary source IPs) are a bad idea anyway (amplification vector, abuse risk), and most Mittelstand BIND resolvers restrict allow-recursion to the internal network. If you have configured this properly, you are not at internet risk but only at insider risk.

We use CoreDNS in the Kubernetes cluster — are we affected?+

CoreDNS itself is not. But if CoreDNS forwards via the forward plugin to an external BIND recursor, that forward target is affected. The question is: which resolver stands behind your CoreDNS? If that is a BIND in the affected version range, the patch recommendation applies.

Bottom line

CVE-2026-5950 is not the most spectacular DNS advisory of the year, and that is fine — no RCE, no data leak, no active exploitation in circulation. It is a classic resource-exhaustion bug in a code path BIND has maintained stably for years, and ISC has cleanly delivered patches in three version lines. Anyone running a BIND recursor runs the update wave in the normal maintenance window over the next few days and is done.

What we should take away from the CVE beyond merely patching: DNS is an infrastructure layer with platform-wide domino effects. A non-responding resolver cripples more than just “one website not loading”. The question whether the DNS layer in your platform architecture enjoys the same resilience discipline as the web server or the container host — redundancy, multi-implementation, telemetry, recovery path — is not triggered by this advisory but merely surfaced by it. The answer can vary considerably depending on platform size and risk profile. What matters: that the question is asked before the next advisory lands in a path that is more critical for you.

Before the next DNS situation turns into a platform question — let's talk about redundancy and implementation mix.

We check, harden, and validate the DNS layer of your platform — not only when a BIND advisory drops.

Today we are running through the BIND version states in customer platforms and pulling in distribution backports as soon as they are available. In parallel, we are writing architecture notes for setups where only a single resolver runs today. If you are not sure where your recursive DNS resolvers stand, which version line they run, or how quickly a second resolver in a different implementation could stand next to it: we examine that with you and run the hardening sprint.

Standard service in this situation: DevSecOps platform operations with continuous infrastructure hardening and CVE continuous patching, plus architecture review for platforms that want to set up their DNS layer for resilience.

Book a meeting directly →

About the author

Kai Ole Hartwig

Founder · Moselwal Digitalagentur · OnlyOle

Programming since 2002 – self-taught, set up my own business with KO-Web in 2012, now Moselwal. Over 100 projects, with a focus on security, performance, automation and quality.

LinkedIn · GitHub · Blog · OnlyOle

A walnut-coloured architectural scale model of a narrow late-19th-century townhouse, opened like a doll's house, with a thin gap between a cross-beam and its socket at a load-bearing junction; a brass-handled screwdriver with an oxblood marking line stands wedged in the gap, beside it an old architectural reference book with the year 1908 on the spine and a cream-coloured blueprint — in cool north light on a matte concrete surface.