bind9

mirror of https://github.com/isc-projects/bind9.git synced 2026-05-23 10:37:43 -04:00

Author	SHA1	Message	Date
Ondřej Surý	3a339cfca4	Clean up frdataset in resume_dslookup() on shutdown When resume_dslookup() receives ISC_R_SHUTTINGDOWN or ISC_R_CANCELED, frdataset (fctx->nsrrset) was not disassociated. While fctx__destroy() eventually cleans it up, leaving it associated keeps the underlying DB node referenced longer than necessary.	2026-03-30 20:41:13 +02:00
Evan Hunt	cd4a7a2d72	Fix fetchlimit test failure When a referral lookup is triggered by a QMIN query, it should be exempt from the fetches-per-zone limit just as the QMIN query itself is. Also restart the test server between the fetches-per-server and fetches-per-zone tests so that leftover statistics from the former do not pollute the latter. Another fix is because zone spills and general query drops are no longer in a strict >= relation (on a parent-centric resolver), so check that both counters are non-zero instead.	2026-03-30 20:41:13 +02:00
Colin Vidal	f2f9a97526	Do not cache NS from referral in negative responses Stop storing the NS referral into the main cache when processing a negative response. These records are already cached in the delegation database and are not needed elsewhere. Update dnssec tests that relied on parent-side NS RRsets being returned in recursive query responses.	2026-03-30 20:41:13 +02:00
Colin Vidal	bc8f0b3a79	Cleans up `mark_related()` Cleans up mark_related(): since the FCTX_ATTR_GLUING flag is never set anymore, the code that handled it has been removed.	2026-03-30 20:41:13 +02:00
Colin Vidal	883478bc6a	Use delegdb for lookup in query_delegation_recurse() When `query.c` finds a zonecut in the main cache (e.g. from stale NS records), it must still use the correct delegation for recursion. Look up the delegation DB via `dns_view_bestzonecut()` first; fall back to `dns_deleg_fromrdataset()` only if no delegation is found. This might also be done inside `query_lookup()` instead, with the `qctx` holding a delegset property, but that approach needs further work to avoid breakage and it is not clear so far if there would be other use case of it. Current approach is simpler for now.	2026-03-30 20:41:13 +02:00
Colin Vidal	6ed7a8a723	Resolver is parent-centric The resolver now uses glue addresses from `dns_deleg_t` objects stored in the delegation database. The main cache is still used for ADB A/AAAA lookups when no glue is available for a nameserver name. The resolver's `fctx_getaddresses()` is refactored to, for each delegation of the delegation set, try to get the address-based finds, then nameserver name lookups. (Later, the logic to handle DELEG `include-delegparm=` will be hooked there too.)	2026-03-30 20:41:13 +02:00
Colin Vidal	cfac5f3974	Add dns_adb_createaddrinfosfind() for address-based lookups Add a new ADB API function that creates a find from a list of addresses rather than by looking up nameserver names. This enables the resolver to handle address-based delegations (NS-based with glues or DELEG with addresses) and name-based delegations uniformly (i.e. the list of finds from ADB is handled the same way no matter the type of the delegation).	2026-03-30 20:41:13 +02:00
Evan Hunt	a9883483ef	Remove dns_db_findzonecut() This function is no longer used and has been removed, along with its implementation in qpcache.	2026-03-30 20:41:13 +02:00
Evan Hunt	3704cf42eb	Don't use dns_db_findzonecut() in query_addbestns() Previously, when answering from the cache, and when minimal-responses was not set, we added the best known zone cut to the authority section of the response message, using dns_db_findzonecut() to look it up in the DNS cache. Since the DNS cache will no longer be used to store parent-side NS RRsets, it will now be possible for an ancestor node to be used as the zone cut, leading to the wrong NS record being included. There are various ways we could correct this: 1. Use dns_deleg_lookup() instead of dns_db_findzonecut() to find the zone cut. But currently, the deleg database stores only the server addresses for the delegation, not the full NS RRset; this would need to be changed. 2. Look up <name>/NS whenever we cache a referral; that way we'll get the child-side NS RRset and cache that, and we can retrieve it when building the response. But the solution chosen here is simply not to look up the NS record when answering from the cache, effectively making "minimal-responses yes;" mandatory for queries answered from the cache. System tests have been updated as needed, so they no longer expect NS RRsets in the authority section of recursive responses.	2026-03-30 20:41:13 +02:00
Colin Vidal	de8bc44dc8	Use delegation DB for bestzonecut lookups Function `dns_view_bestzonecut()` now uses the delegation DB instead of the main cache when looking up at the cache. As a result, replace `dns_rdataset_t` (representing an NS RRset) with `dns_delegset_t` in `dns_view_bestzonecut()` and `dns_resolver_createfetch()` APIs. The resolver and query processing now use the delegation DB instead of the cache for zonecut lookups. In the case of the delegation lives in the local database, the locally found `rdataset` is internally converted into a `dns_delegset_t` object. From caller POV, it doesn't change anything: a delegation set is a read-only object which can be used as long as needed and must be detached one it's done with it.	2026-03-30 20:41:13 +02:00
Colin Vidal	c7b75f448f	Populate the delegation DB from referrals answers The resolver now caches NS records and their A/AAAA glues from referral answers into the delegation database. A new `cache_delegns()` function extracts NS names and associated glue addresses from the authority/additional sections of a referral answer and use those informations to build a delegation set, which is then inserted into the delegation database. The created delegation set contains a delegation per NS RR. If the NS RR has matching A/AAAA RR, the delegation only store the addresses and not the name. (Note this is technically possible to group all NS RR which doesn't have glues into a single delegation, and the implementation can be changed in that way in the future). Each view has its own instance of the delegation database (they are never shared between views), but a server restart/reload preserve the delegation database state.	2026-03-30 20:41:13 +02:00
Colin Vidal	1b5f757084	Introduce a delegation database Add `dns_delegdb_t`, a qpmulti-based database enabling to lookup a delegation set (`dns_delegset_t`) from a zonecut name (`dns_name_t`). A delegation set object essentially contains an expiration time and a list of delegation (`dns_deleg_t`). Finally, a delegation can be either: - A list of IP addresses (`isc_netaddrlist_t`), for NS-based delegation providing glues or DELEG-based delegation using `server-ipv4=` or `server-ipv6=`; - Or a list of nameserver names, for NS-based delegation without glues, or DELEG-based delegation using `server-name=`; - Or a list of nameserver names, for DELEG-based delegation using `include-delegparam=`. The delegation database API provides lookup by closest zonecut, delegation and delegation set builders as well as insertion of those newly built delegation set, dumping to a `FILE *`, conversion from an NS rdataset to a delegation set, deletion of a specific zonecut or all the sub-tree of a given zonecut. A memory context is internally used inside the delegation database and can be constraint to a maximum size. Once it gets close to its maximum size and a new delegation set is inserted into the database, a reclamation flow is run internally removing the least recently used entries. The delegation set and delegation objects are, once they been inserted into the database, read-only object. Thus, the caller can use them without concurrency or locking concerns, and must detached them once its done with it.	2026-03-30 20:41:13 +02:00
Colin Vidal	b4abc63dfa	Add ISC_LIST support for isc_netaddr_t Add an `isc_netaddrlink_t` type wrapping an `isc_netaddr_t` and an `ISC_LINK`. This enable to build list of `isc_netaddr_t` without increasing the memory footprint of existing usages of `isc_netaddr_t` (which doesn't require to be linked).	2026-03-30 20:41:13 +02:00
Aram Sargsyan	357331f886	Revert NTA flush on expire Flushing the name when NTA expires causes problems for the ongoing resolving process. Do not flush the name from the cache. Instead, the resolver should do the flushing (this is planned to be merged next).	2026-03-30 18:27:35 +00:00
Ondřej Surý	6ba57a1f0f	Count temporal problems with DNSSEC validation as attempts After KeyTrap, the temporal DNSSEC were originally hard errors that caused validation failures even if the records had another valid signature. This has been changed and the RRSIGs outside of the inception and expiration time are not counted as hard errors. However, these errors are not even counted as validation attempts, so excessive number of expired RRSIGs would cause some non-cryptograhic extra work for the validator. This has been fixed and the temporal errors are correctly counted as validation attempts.	2026-03-30 11:16:13 +02:00
Ondřej Surý	bac40394d5	Fix update-policy per-type max quota bypass via counter desynchronization The prescan and main update loops in DNS UPDATE processing both used the same counter to index the maxbytype[] quota array. The prescan loop always incremented the counter, but the main loop had 14 continue paths that skipped the increment. This allowed an authenticated DDNS client to craft an UPDATE message with padding records (e.g. CNAME+A pairs that trigger CNAME-conflict skips) to shift the counter and read wrong quota entries, bypassing per-type record limits entirely. Fix by incrementing the counter unconditionally at the start of each iteration in the main loop.	2026-03-28 10:07:49 +01:00
Mark Andrews	f2fd54f4b2	Allow the dns_rdata_in_apl structure to be walked twice The offset value should be set prior to calculating the length.	2026-03-27 12:00:22 +00:00
Aram Sargsyan	35b8af229e	Allow empty APL records Allow empty APL records because RFC 3123 (Section 4) says "zero or more items". This fixes processing of a catalog zone ACL (which is based on APL records) when the zone contains an empty APL record or when a zone update arrives which creates an empty APL record.	2026-03-27 12:36:50 +11:00
Alessio Podda	70b65648ac	Move ns_highwater_recursclients to highwater stats Since it is impossible to increase an isc_statsmulti counter and retrieve the new counter atomically, and we need the output of recursclients in order to compute ns_highwater_recursive, we change the recursclients counter to an isc_stats one.	2026-03-26 10:19:25 +01:00
Alessio Podda	80be99d3ac	Convert isc_statsmulti to use ISC_REFCOUNT_IMPL Instead of using hand-rolled attach and detach function, this commit declares the same functions through the ISC_REFCOUNT_IMPL macro.	2026-03-26 10:19:25 +01:00
Alessio Podda	ed0ecb62e4	Add low contention stats counter In the current statistics counter implementation, the statistics are backed by an array of counters, which are updated via atomic operations. This leads to contention, especially on high core count machines. This commit introduces a new isc_statsmulti_t counter that keeps a separate array per thread. These counters are then aggregated only when statistics are queried, shifting work off the critical path. These changes lead to a ~2% improvement in perflab.	2026-03-26 10:19:25 +01:00
Mark Andrews	ed15b6cb26	Add switch to disable cookie checking in delv This adds the switch +[no]cookie to delv to control the sending of DNS COOKIE options when sending requests. The default is to send DNS COOKIE options.	2026-03-26 11:18:26 +11:00
Michał Kępień	b0fc0e31c5	Merge tag 'v9.21.20'	2026-03-25 14:23:41 +00:00
Evan Hunt	ae67c1851d	rpz_rrset_find() now recurses on ISC_R_NOTFOUND previously, rpz_rrset_find() behaved differently depending on whether a cache lookup returned DNS_R_DELEGATION or ISC_R_NOTFOUND. the former indicates the presence of a cached NS rrset, and the latter indicates that the cache is cold or that all NS rrsets above the query name have expired. both results indicate that the caller should recurse, but rpz_rrset_find() only recursed in the case of DNS_R_DELEGATION. the nsip-wait-recurse and nsdname-wait-recurse test cases in the rpzrecurse system test were dependent on this misbehavior. the test server was configured with a lame delegation, so that recursion always failed, but once the lame delegation was expired due to a zero TTL, the cache returned ISC_R_NOTFOUND, which caused the recursion not to be attempted. the test seemed to be observing a delay before recursion succeeded, but it was actually observing a delay before recursion was skipped. fixing this bug caused the test to fail. the test server has now been reconfigured so that recursion succeeds after a delay, instead of failing. now we're able to test that we're waiting for the successful completion of recursion.	2026-03-23 12:30:16 -07:00
Ondřej Surý	c503b6eee8	Add regression test for TOCTOU race in DNS UPDATE SSU handling Race rndc reconfig (toggling between allow-update and update-policy) against a stream of DNS UPDATEs for 5 seconds and verify that named does not crash. Before the fix, the race between send_update() and update_action() reading the SSU table independently could trigger an assertion failure (INSIST) when the zone's update policy changed between the two reads.	2026-03-23 11:10:48 +01:00
Ondřej Surý	c172416559	Fix TOCTOU race in DNS UPDATE SSU table handling Pass the SSU table through the update event struct from send_update() to update_action() instead of reading it from the zone twice. If rndc reconfig changed the zone's update policy between the two reads (e.g., from allow-update to update-policy), send_update() would skip the maxbytype allocation but update_action() would see a non-NULL ssutable, triggering INSIST(ssutable == NULL \|\| maxbytype != NULL) and crashing named. The ssutable reference is now taken once in send_update() and transferred to update_action() via the event struct, ensuring both functions see the same value.	2026-03-23 11:10:48 +01:00
Ondřej Surý	4bea5871ad	Replace SAVE/RESTORE/INITANDSAVE macros with MOVE_OWNERSHIP() Replace the local SAVE(), RESTORE(), and INITANDSAVE() macros in query.c with the project-wide MOVE_OWNERSHIP() macro. The new form is clearer about the intent: ownership of a pointer is being transferred from source to destination, with the source set to NULL. SAVE and RESTORE were identical macros with different names used to indicate the direction of transfer, but this distinction was purely cosmetic. INITANDSAVE additionally set the destination to NULL first, which is unnecessary because the preceding memcpy already initialized all fields from the source struct.	2026-03-23 11:06:28 +01:00
Ondřej Surý	24951b703e	Move ISC_NONSTRING from util.h to attributes.h ISC_NONSTRING is a compiler attribute macro and belongs alongside the other attribute definitions in attributes.h, not in util.h.	2026-03-23 11:06:28 +01:00
Ondřej Surý	0f3be0beb8	Add MOVE_OWNERSHIP() macro for transferring pointer ownership A helper macro that returns the current value of a pointer and sets it to NULL in one expression, useful for transferring ownership in designated initializers.	2026-03-23 11:06:28 +01:00
Ondřej Surý	da8e1c956a	Fix cache flush ordering on NTA expiry dns_view_flushnode() was called in the delete_expired() async callback, which runs after the query that detected the NTA expiry. This created a race: the query would proceed with stale cached data from the NTA period before the flush had a chance to run, resulting in transient SERVFAIL with EDE 22 (No Reachable Authority). Move dns_view_flushnode() into dns_ntatable_covered() so the cache is flushed synchronously when the expiry is detected, before the query continues. Also simplify the expiry comparison in delete_expired() to a direct pointer comparison (nta == pval) instead of comparing expiry timestamps.	2026-03-20 14:35:11 +01:00
Ondřej Surý	4d15494b94	Fix non-atomic read-modify-write on entry->srtt in adjustsrtt() The SRTT update loaded the old value, computed a new one, and stored it back as separate operations. Two concurrent callers could each read the same old value and one update would be silently lost. Use a CAS loop for the read-modify-write on entry->srtt. For the aging path, also CAS on entry->lastage to prevent multiple threads from aging the same entry in the same second.	2026-03-20 02:06:21 +01:00
Ondřej Surý	a2bd833909	Fix data race on fctx->vresult in validated() Move the write to fctx->vresult after LOCK(&fctx->lock). The field was being set before acquiring the lock, but dns_resolver_logfetch() reads it under the same lock from another thread.	2026-03-20 00:56:19 +01:00
Ondřej Surý	9457f4f8c5	Fix data race in RCU pointer exchange operations The liburcu rcu_cmpxchg_pointer() uses CMM_RELAXED ordering on the CAS failure path. When a thread loses the CAS and gets another thread's pointer back, reading fields through that pointer is a data race on weakly-ordered architectures (ARM, POWER) because the failing load has no acquire semantics. Override rcu_cmpxchg_pointer() and rcu_xchg_pointer() to use standard __atomic builtins with __ATOMIC_ACQ_REL (success) and __ATOMIC_ACQUIRE (failure) ordering. This fixes the race on all architectures and is natively visible to ThreadSanitizer.	2026-03-19 08:10:22 +01:00
Ondřej Surý	44bb3cd2a7	Fix data race on nta->expiry Use CMM_LOAD_SHARED/CMM_STORE_SHARED for nta->expiry, which is written from the NTA's owning loop but read from any loop (validator, rndc status, rndc nta -dump). Also dispatch delete_expired to the NTA's owning loop rather than the caller's loop.	2026-03-19 01:44:37 +01:00
Ondřej Surý	fae6c6eead	Refactor NTA to use RCU instead of rwlock Replace the ntatable rwlock with RCU read-side critical sections. The QP multi trie already provides its own concurrency control for reads and writes, making the rwlock redundant. NTA fields like expiry are only accessed from the NTA's own event loop thread, so no additional synchronization is needed. The table shutdown is now deferred via call_rcu to ensure all read-side critical sections have completed before iterating and shutting down individual NTAs.	2026-03-19 01:44:37 +01:00
Aram Sargsyan	1899a3318c	Flush the node when NTA expires When NTA expires the name's node should be flushed from the view's cache as it's done when the NTA is manually removed using a rndc command.	2026-03-19 00:12:59 +01:00
Aram Sargsyan	48d7401f0d	Take 'env' reference before async calling perform_reopen() The 'env' pointer is passed to an async function without taking a reference first, which can potentially cause a use-after-free error. Take a reference, then detach in the async function.	2026-03-18 16:10:07 +00:00
Aram Sargsyan	4ac3a6520e	Convert dns_dtenv_t reference counting to standard macors Use standard reference counting macros for dns_dtenv_t instead of custom attach/detach functions.	2026-03-18 16:10:07 +00:00
Ondřej Surý	8e240bbb5f	Fix isc_buffer_init capacity mismatch in DoH data chunk callback isc_buffer_init() is given MAX_DNS_MESSAGE_SIZE (65535) as capacity but only h2->content_length bytes are allocated. This makes the buffer believe it has more space than actually allocated. A secondary bounds check (new_bufsize <= h2->content_length) prevents actual overflow, but the buffer invariant is violated. Pass h2->content_length as the capacity to match the allocation.	2026-03-18 11:39:01 +01:00
Ondřej Surý	7f8b972a3d	Remove NZF support, make LMDB required for new zone storage Drop the NZF (New Zone File) fallback for persisting runtime zone configurations, making LMDB (NZD) the only storage backend. This removes all #ifdef HAVE_LMDB conditionals, the meson 'lmdb' option, and the NZF-related functions. LMDB is now a mandatory build dependency. The named-nzd2nzf tool is now always built.	2026-03-18 11:02:33 +01:00
Ondřej Surý	5b1750f15f	Fix missing mutex destroy and ede invalidate on fctx_create() error paths The error cleanup in fctx_create() was missing isc_mutex_destroy() and dns_ede_invalidate() calls. When error paths (cleanup_nameservers, cleanup_fcount, cleanup_qmessage, cleanup_adb) were taken after the mutex and edectx were initialized, the fctx memory was freed without properly destroying these resources first.	2026-03-17 16:05:11 +01:00
Ondřej Surý	96a22451d7	Fix rwlock type mismatch in delete_ds() error path The lock is acquired for reading but the error path from dns_rdata_fromstruct() incorrectly unlocks it as a write lock.	2026-03-17 16:05:11 +01:00
Mark Andrews	d3ffa1f007	Clear errno before calling strtol The previous code was incorrectly clearing errno after calling strtol but before testing the result rather than clearing it and then calling strtol so that changes to errno can be correctly determined.	2026-03-17 10:51:37 +11:00
Matthijs Mekking	bc1d177cc2	Fast fail a validator deadlock We return DNS_R_NOVALIDSIG if we detected a deadlock. Then in 'validate_async_done()', this result value is used to check if we need to fall back to insecure. As part of that we create a new fetch but that fails because of the detected deadlock. This results in a loop of deadlock detected, fallback to insecure, deadlock detected, ... Add a new result value, ISC_R_DEADLOCK, and return this instead when we have detected a deadlock. This will be treated as a generic error, as there is no special handling for this result value.	2026-03-16 16:46:51 +00:00
Ondřej Surý	6e286beaa6	Cleanup weird syntax defining struct dns_ixfr The struct dns_ixfr was defined as part of struct dns_xfrin, probably because at some point it was an anonymous struct and then it was changed to named struct with typedef at the top. Move the definition from struct dns_xfrin into and fold into the typedef ... dns_ixfr_t.	2026-03-16 12:17:06 +01:00
Ondřej Surý	f4b4f030c4	Cleanup the duplicate logic and comments around add into NSEC tree After merging the NORMAL, NSEC and NSEC3 tree into single QP tree, there were some comments still speaking about auxiliary NSEC tree. These were cleaned up and the logic when we pass the qp tree (write transaction) to qpzone_addrdataset_inner() was changed to be more obvious that this is needed only when we are adding NSEC records.	2026-03-16 12:17:06 +01:00
Ondřej Surý	e57245ee81	Fix use-after-free in xfrin_recv_done Move the LIBDNS_XFRIN_RECV_DONE probe execution before dns_xfrin_detach in xfrin_recv_done. Previously, dns_xfrin_detach was called before the trace probe, which could free the xfr object. Because the accessed member xfr->info is an embedded array, the expression evaluates via pointer arithmetic rather than a direct memory dereference. Although this prevents a reliable crash in practice, it technically remains a use-after-free issue. Reorder the statements to ensure the transfer context is fully valid when the probe executes.	2026-03-16 11:06:06 +01:00
Aram Sargsyan	336c523b79	OpenSSL 4 compatibility fix Starting from OpenSSL 4 the the X509_get_subject_name() function returns a 'const' pointer to a name instead of a regular pointer. Duplicate the name before operating on it, then free it.	2026-03-16 10:01:18 +00:00
Ondřej Surý	df1993611b	Fix KASP key leaks on keystore lookup failure In both cfg_kasp_fromconfig() and cfg_kasp_builtinconfig(), the newly allocated KASP key was not destroyed when the keystore lookup failed.	2026-03-14 13:58:32 +01:00
Ondřej Surý	2ab3d7c075	Fix missing server socket detach in TLS accept error path When TLS creation fails in tlslisten_acceptcb(), tlssock->server was not detached before detaching tlssock itself.	2026-03-14 13:58:32 +01:00

1 2 3 4 5 ...

16516 commits