bind9

mirror of https://github.com/isc-projects/bind9.git synced 2026-05-23 18:47:40 -04:00

Author	SHA1	Message	Date
Evan Hunt	dc6202479f	remove find_deepest_zonecut() from qpcache because the cache no longer stores delegation (parent-side) NS rrsets, and authoritative (child-side) NS rrsets don't affect recursion, it no longer makes sense for qpcache_find() to look for NS rrsets and return DNS_R_DELEGATION. that code has been removed. the cache still does search for covering DNAME records. the check_zonecut() function has been renamed to check_dname() for clarity. related changes: - one test case has been removed from the mirror system test, because it tested the behavior of a cached delegation. - query_checkrrl() and rpz_rrset_find() have been updated so they no longer expect cache responses to have DNS_R_DELEGATION response codes.	2026-03-30 20:41:13 +02:00
Ondřej Surý	a1cb966944	Guard against NULL delegset in query_delegation_recurse() If both dns_view_bestzonecut() and dns_deleg_fromrdataset() fail, delegset stays NULL. Passing it to ns_query_recurse() would crash on the REQUIRE(DNS_DELEGSET_VALID(delegset)) in createfetch(). Return ISC_R_NOTFOUND instead, which lets the caller handle the failure gracefully.	2026-03-30 20:41:13 +02:00
Colin Vidal	883478bc6a	Use delegdb for lookup in query_delegation_recurse() When `query.c` finds a zonecut in the main cache (e.g. from stale NS records), it must still use the correct delegation for recursion. Look up the delegation DB via `dns_view_bestzonecut()` first; fall back to `dns_deleg_fromrdataset()` only if no delegation is found. This might also be done inside `query_lookup()` instead, with the `qctx` holding a delegset property, but that approach needs further work to avoid breakage, and it is not yet clear whether there would be other use cases for it. The current approach is simpler for now.	2026-03-30 20:41:13 +02:00
Evan Hunt	3704cf42eb	Don't use dns_db_findzonecut() in query_addbestns() Previously, when answering from the cache, and when minimal-responses was not set, we added the best known zone cut to the authority section of the response message, using dns_db_findzonecut() to look it up in the DNS cache. Since the DNS cache will no longer be used to store parent-side NS RRsets, it will now be possible for an ancestor node to be used as the zone cut, leading to the wrong NS record being included. There are various ways we could correct this: 1. Use dns_deleg_lookup() instead of dns_db_findzonecut() to find the zone cut. But currently, the deleg database stores only the server addresses for the delegation, not the full NS RRset; this would need to be changed. 2. Look up <name>/NS whenever we cache a referral; that way we'll get the child-side NS RRset and cache that, and we can retrieve it when building the response. But the solution chosen here is simply not to look up the NS record when answering from the cache, effectively making "minimal-responses yes;" mandatory for queries answered from the cache. System tests have been updated as needed, so they no longer expect NS RRsets in the authority section of recursive responses.	2026-03-30 20:41:13 +02:00
Colin Vidal	de8bc44dc8	Use delegation DB for bestzonecut lookups Function `dns_view_bestzonecut()` now uses the delegation DB instead of the main cache for cache lookups. As a result, replace `dns_rdataset_t` (representing an NS RRset) with `dns_delegset_t` in the `dns_view_bestzonecut()` and `dns_resolver_createfetch()` APIs. The resolver and query processing now use the delegation DB instead of the cache for zonecut lookups. When the delegation lives in the local database, the locally found `rdataset` is internally converted into a `dns_delegset_t` object. From the caller's point of view, nothing changes: a delegation set is a read-only object which can be used as long as needed and must be detached once the caller is done with it.	2026-03-30 20:41:13 +02:00
Alessio Podda	70b65648ac	Move ns_highwater_recursclients to highwater stats Since it is impossible to increase an isc_statsmulti counter and retrieve the new counter atomically, and we need the output of recursclients in order to compute ns_highwater_recursive, we change the recursclients counter to an isc_stats one.	2026-03-26 10:19:25 +01:00
Alessio Podda	ed0ecb62e4	Add low contention stats counter In the current statistics counter implementation, the statistics are backed by an array of counters, which are updated via atomic operations. This leads to contention, especially on high core count machines. This commit introduces a new isc_statsmulti_t counter that keeps a separate array per thread. These counters are then aggregated only when statistics are queried, shifting work off the critical path. These changes lead to a ~2% improvement in perflab.	2026-03-26 10:19:25 +01:00
Evan Hunt	ae67c1851d	rpz_rrset_find() now recurses on ISC_R_NOTFOUND previously, rpz_rrset_find() behaved differently depending on whether a cache lookup returned DNS_R_DELEGATION or ISC_R_NOTFOUND. the former indicates the presence of a cached NS rrset, and the latter indicates that the cache is cold or that all NS rrsets above the query name have expired. both results indicate that the caller should recurse, but rpz_rrset_find() only recursed in the case of DNS_R_DELEGATION. the nsip-wait-recurse and nsdname-wait-recurse test cases in the rpzrecurse system test were dependent on this misbehavior. the test server was configured with a lame delegation, so that recursion always failed, but once the lame delegation was expired due to a zero TTL, the cache returned ISC_R_NOTFOUND, which caused the recursion not to be attempted. the test seemed to be observing a delay before recursion succeeded, but it was actually observing a delay before recursion was skipped. fixing this bug caused the test to fail. the test server has now been reconfigured so that recursion succeeds after a delay, instead of failing. now we're able to test that we're waiting for the successful completion of recursion.	2026-03-23 12:30:16 -07:00
Ondřej Surý	4bea5871ad	Replace SAVE/RESTORE/INITANDSAVE macros with MOVE_OWNERSHIP() Replace the local SAVE(), RESTORE(), and INITANDSAVE() macros in query.c with the project-wide MOVE_OWNERSHIP() macro. The new form is clearer about the intent: ownership of a pointer is being transferred from source to destination, with the source set to NULL. SAVE and RESTORE were identical macros with different names used to indicate the direction of transfer, but this distinction was purely cosmetic. INITANDSAVE additionally set the destination to NULL first, which is unnecessary because the preceding memcpy already initialized all fields from the source struct.	2026-03-23 11:06:28 +01:00
Aram Sargsyan	393f932dbf	Keep client->inner.tnow and client->inner.now in sync The incoming queries RTT statistics are going to need correct time information for calculations.	2026-02-26 14:00:10 +00:00
Ondřej Surý	d46277b398	Clear serve-stale flags when following the CNAME chains A stale answer or SERVFAIL could have been served in case of multiple upstream failures when following the CNAME chains. This has been fixed.	2026-02-23 08:07:12 +01:00
Colin Vidal	e62cafd3c7	rename fetch response `db` field to `cache` As the `dns_fetchresponse_t` `db` field can only be attached to the resolver cache database, rename it into `cache` to avoid ambiguities.	2026-02-10 08:50:16 +01:00
Alessio Podda	78588981df	Remove rrset-order cyclic from the default config, with shim Currently we add an rrset-order cyclic statement to the default config. Since the rrset-order allows matching a subset of all names, it must be implemented with a string comparison against a wildcard, and since the statement applies per rrset, this can result in millions of comparisons per second on a busy authoritative server. This commit removes rrset-order from the default config, but adds back a code shim in query_setorder to preserve the previous behaviour.	2026-01-08 14:43:04 +01:00
Ondřej Surý	bd074ff0ea	Cleanup the extra dns_rdataset_disassociate() code Manually go through the code using dns_rdataset_isassociated() and use dns_rdataset_cleanup() where appropriate in places that a simple semantic patch is not able to find automatically.	2025-12-17 15:19:55 +01:00
Ondřej Surý	8320faf64b	Apply the dns_rdataset_cleanup patch through the codebase Add a semantic patch to turn the conditional rdataset disassociate into dns_rdataset_cleanup() call and run it.	2025-12-17 15:19:55 +01:00
Colin Vidal	430c0ce76a	support EDE 13 (Cached Error) Extended DNS Error 13 (Cached Error) is now returned when the server answers a message from a cached SERVFAIL. See RFC 8914 section 4.14.	2025-12-05 23:28:29 +01:00
Mark Andrews	0e230c86d2	Rename isc_result_t ret; to isc_result_t result; Standardize result variable naming by using 'result' in most places.	2025-12-03 13:45:43 -08:00
Evan Hunt	6b33b7fc77	switch to RETERR where it wasn't being used replace all instances of the pattern: result = <statement> if (result != ISC_R_SUCCESS) { return result; } with: RETERR(<statement>);	2025-12-03 13:45:43 -08:00
Evan Hunt	38e94cc7da	switch to CHECK where it wasn't being used replace all instances of the pattern: result = <statement> if (result != ISC_R_SUCCESS) { goto cleanup; } with: CHECK(<statement>);	2025-12-03 13:45:42 -08:00
Colin Vidal	3048b2a578	add RRSIG if required as soon as they are found When the EDNS DO flag (`dig +dnssec`) is set, an rdataset is allocated to hold the RRSIG of an RR, if present in the DB. However, this allocation is not done if the zone DB is not considered secure (`dns_db_issecure() == false`). Change this behaviour by allocating the rdataset anyway, so the RRSIG can be associated in the answer section of the response as soon as it is found in the DB.	2025-12-03 15:49:47 +01:00
Ondřej Surý	4d307ac67a	Detect resolution loops between fetches Maintain the relationship between the parent and child fetch and, when creating a new child fetch, properly check for resolution loops in which the new fetch would join one of the parent's fetch contexts.	2025-11-27 17:34:25 +01:00
Ondřej Surý	e94a31a666	Split qctx_destroy() into qctx_deinit() and qctx_destroy() The qctx_destroy() only needs to be called on allocated memory and qctx_deinit() needs to be called always. Also remove .allocated member from the query_ctx_t structure.	2025-11-27 10:37:58 +01:00
Colin Vidal	0b93d5725b	add query ID to the query trace message Add the query ID to the query trace message. The log now looks like the following (the id is at the end): query client=0x7f75c5017000 thread=0x7f75c6dfe680(foo.fr/A): \ client attr:0x22300, query attr:0x700, restarts:0, \ origqname:foo.fr, timer:0, authdb:0, referral:0, id:21338 This should help with debugging tests, in particular to quickly find a specific query in the logs.	2025-11-06 15:11:45 +01:00
Matthijs Mekking	77418fedce	Fix comment in lib/ns/query.c While renaming exit_check() to dns__zone_free_check() in lib/dns/zone.c, a dead reference to exit_check() in the comments was found in lib/ns/query.c.	2025-11-06 10:54:55 +01:00
Colin Vidal	59f116fbc9	support EDE 24 (Invalid Data) Extended DNS Error 24 (Invalid Data) is returned when the server cannot answer data for a zone it is configured for. This typically occurs when an authoritative server has not loaded the DB of a configured zone, or a secondary server's zone has expired. See RFC 8914 section 4.25.	2025-11-03 17:34:25 +01:00
Colin Vidal	90b4f256a7	query_getzonedb can return DNS_R_EXPIRED If `query_getzonedb()` finds a zone but the zone is expired, it immediately returns `DNS_R_EXPIRED` and doesn't attempt to get the zone DB (which would be NULL in this case). This gives the caller a more precise reason why getting the DB has failed.	2025-11-03 17:34:25 +01:00
Colin Vidal	cecb03d6db	fix hookasyncctx renaming The field `ns_hookasync_t` was initially named `hook_actx` and wrongly renamed `hook_aclctx` during a mass-renaming of various names for the config acl context into a consistent `aclctx` name (see !11003). Of course this is wrong as `ns_hookasync_t` has nothing to do with ACL but about _async_ context. This commit fixes the mistake by renaming this field `hookasyncctx`	2025-09-28 22:41:32 +02:00
Mark Andrews	a0945f6337	Use signer name when disabling DNSSEC algorithms When disabling algorithms, use the signer name to determine if the algorithm is disabled or not. This allows for algorithms to be cleanly disabled on a zone level basis. Previously, just using the record's owner name, "disable-algorithms" could impact resolution of names that were not disabled. This does now mean that "disable-algorithms" can not be used to disable part of a zone anymore.	2025-09-25 11:14:27 +10:00
Colin Vidal	36a05c81b4	rename cfg_aclconfctx_t variables to aclctx ACL configuration context variables are inconsistently named as `actx`, `ac`, or `aclconfctx`, which caused confusion during code reviews. This commit renames all `cfg_aclconfctx_t` variables to `aclctx`, which is short, consistent, and unambiguous.	2025-09-24 20:14:49 +02:00
Alessio Podda	6e7aec2cb7	Use unique names for probes.d files Enabling LTO in the subsequent commit requires the file names to be unique and having same probes.d in each of the libraries breaks this requirement. Rename probes.d to probes-{isc,dns,ns}.d files and adjust the includes.	2025-09-24 13:18:13 +02:00
Evan Hunt	0cdcc8a8f4	rename NS_QUERY_RESET to NS_QUERY_CLEANUP query_reset() is called during query initialization, but the only time the NS_QUERY_SETUP hook runs is when it's called from query_cleanup(). it makes more sense to move the hook point to there and rename it to NS_QUERY_CLEANUP. this change caused a crash in the unit tests due to the view being unnecessarily detached before ns__client_reset_cb() was called. this has also been fixed.	2025-09-10 17:46:53 -07:00
Colin Vidal	b6a292b03f	don't call hooks when a query hasn't started guard the call to the NS_QUERY_RESET hook so it's called only if the view has been set. If the view is NULL, it means the client has been reset _before_ the query even started, and no other hook could have been called, so it doesn't make sense to call this one. this also enables us to avoid a NULL-check on the qctx->view in the CALL_HOOK macros.	2025-09-10 14:14:36 -07:00
Evan Hunt	637e8d01d2	minimize calls to dns_zone_gethooktable per qctx add a 'zhooks' member to the query_ctx structure, so that we only need to look up the hook table for the zone once when initializing a qctx, and not once for every hook point.	2025-09-10 14:05:42 -07:00
Evan Hunt	0194a265fe	check target pointer validity in qctx_save Make sure the target pointer address (getting the allocated instance of qctx) is valid and the pointer is NULL.	2025-09-10 12:43:05 +02:00
Colin Vidal	d676ce8085	remove query_ctx_t detach_client property Since the removal of NS_QUERY_QCTX_DESTROYED hook, there is no need for the `qctx->detach_client` object anymore, as this was designed to tell the plugin whether the client object is about to be, or is already, freed from memory. This is not needed anymore, as NS_QUERY_RESET is called _always_ when the client object is about to be freed from memory. Remove `detach_client` and tidy up the code a bit by including the freeing of the qctx object (when allocated) inside the qctx_destroy function instead of requiring extra calls.	2025-09-09 10:02:32 +02:00
Colin Vidal	95c71c2739	replace QCTX_INIT/_DESTROY hooks with QUERY_SETUP/_RESET The hook NS_QUERY_QCTX_DESTROY is problematic with zone plugins because it can be called in some contexts where `qctx->client` is invalid (the pointer is dangling), which would lead to a use-after-free (spotted by TSAN build) as `qctx->client` is used to get the zone hooktable, to find out whether there is an authoritative zone which would have NS_QUERY_QCTX_DESTROY registered. This can't easily be fixed, because there is no easy way to know from query.c code if `client` is still a valid object: `client->reqhandle`, representing the request from a client, is refcounted, and the `client` object is freed from memory once its refcounter gets to 0. While `reqhandle` is attached from query.c code, it can be attached more than once from asynchronous code and there is no clear path where detaching it would lead to a client free. Hence, there is no way to know for sure when to set `qctx->client = NULL` (this is why the pointer remains dangling). Back to the original problem; this is why the NS_QUERY_QCTX_DESTROY hook is incompatible with zone plugins. `qctx->detach_client`, which is used to tell a plugin that the `client` object is either freed or about to be freed, can't be used either, because in some cases the client is still there, and should be used. Code issue aside, the `qctx` object is really just an aggregate of various data to pass easily in the various functions and callbacks, initially stored on the stack, but allocated in some cases (for some asynchronous flow, when recursion is needed), so the point it gets created/"destroyed" is really just an implementation "detail", and providing a higher level hook for the plugin would be beneficial. Hence, the QCTX_INIT and QCTX_DESTROY hooks are removed, and instead, the existing NS_QUERY_SETUP can be used as well as the newly introduced NS_QUERY_RESET (which replaces NS_QUERY_QCTX_DESTROY). 
The advantage is that NS_QUERY_RESET is called _only_, and _always_, when the client object is about to be freed, which removes the need for the plugin to consult the extra `qctx->detach_client`. The way NS_QUERY_RESET works is that when the `client` is freed, a callback (from `query.c`) is called. This callback creates a transient qctx object on the stack with a pointer to the view, and uses that to call the hook.	2025-09-09 09:42:34 +02:00
Colin Vidal	1566634fae	add NS_QUERY_AUTHZONE_ATTACHED hook Add a new query hook called `NS_QUERY_AUTHZONE_ATTACHED`. This hook is called whenever an authoritative zone is found and attached during a query answer. From code level, this hook is called when `qctx->client->query->authzone` is attached during a query. This enables zone-specific plugins to initialize specific states whenever a local zone is found that can answer a query.	2025-09-09 09:42:34 +02:00
Colin Vidal	5893770cd9	add zone-specific plugin instance The zone object now has its own hooktable and plugins, which are initialized during zone initialization.	2025-09-09 09:42:34 +02:00
Aram Sargsyan	1962857ac4	Log the servfail-until-ready message not faster than once per second Since the log level has been raised, busy servers can "explode" from the amount of log messages. Use the usual practice of logging "every once in a while".	2025-09-03 13:23:12 +00:00
Aram Sargsyan	49356ce944	Change the "RPZ not ready yet" message and its log level The "RPZ not ready yet" message is logged at debug 3 level. Use the info level instead for better visibility. After raising the log level, the rpz_log_fail_helper() function starts appending " failed: " to the message. Change the log message so it makes more sense.	2025-09-03 13:23:12 +00:00
Aram Sargsyan	d9b5f6c502	RPZ 'servfail-until-ready': skip updating SERVFAIL cache In order to not pollute the SERVFAIL cache with the configured SERVFAIL answers while RPZ is loading, set the NS_CLIENTATTR_NOSETFC attribute for the client.	2025-09-03 13:23:12 +00:00
Alessio Podda	20a1583661	Lazily allocate fetch counter The counter in ns_client_t is used to track the maximum number of recursions in the resolver, but it is created unconditionally when starting the client and deallocated when resetting it. This commit defers the allocation of the counter till recursion needs to actually happen, speeding up authoritative workloads in perflab by 1.5~2%.	2025-09-02 11:22:28 +02:00
Aram Sargsyan	41387b8d30	Add a new 'servfail-until-ready' configuration option for RPZ By default, when named is started it may start answering queries before the response policy zones are completely loaded and processed. This new feature gives users an option to tell named that incoming requests should result in SERVFAIL answers until all the response policy zones are processed and ready.	2025-08-22 16:31:17 +00:00
Ondřej Surý	42496f3f4a	Use ControlStatementsExceptControlMacros for SpaceBeforeParens > Put a space before opening parentheses only after control statement > keywords (for/if/while...) except this option doesn’t apply to ForEach > and If macros. This is useful in projects where ForEach/If macros are > treated as function calls instead of control statements.	2025-08-19 07:58:33 +02:00
Ondřej Surý	3445362918	Add dns_rdatatype_isnsec() helper function Replace the checks for both NSEC and NSEC3 with a single helper function.	2025-08-15 07:22:52 +02:00
Ondřej Surý	59d1326175	Use dns_rdatatype_none more consistently Use dns_rdatatype_none instead of plain '0' for dns_rdatatype_t and dns_typepair_t manipulation. While plain '0' is technically ok, it doesn't carry the required semantic meaning, and using the named dns_rdatatype_none constant makes the code more readable.	2025-08-15 07:22:52 +02:00
Alessio Podda	ae6a34cbda	Decouple database and node lifetimes by adding node-specific vtables All databases in the codebase follow the same structure: a database is an associative container from DNS names to nodes, and each node is an associative container from RR types to RR data. Each database implementation (qpzone, qpcache, sdlz, builtin, dyndb) has its own corresponding node type (qpznode, qpcnode, etc). However, some code needs to work with nodes generically regardless of their specific type - for example, to acquire locks, manage references, or register/unregister slabs from the heap. Currently, these generic node operations are implemented as methods in the database vtable, which creates problematic coupling between database and node lifetimes. If a node outlives its parent database, the node destructor will destroy all RR data, and each RR data destructor will try to unregister from heaps by calling a virtual function from the database vtable. Since the database was already freed, this causes a crash. This commit breaks the coupling by standardizing the layout of all database nodes, adding a dedicated vtable for node operations, and moving node-specific methods from the database vtable to the node vtable.	2025-08-07 11:39:38 -07:00
Matthijs Mekking	2f70a0ef12	Add ede for zone with rpz cname override policy When the zone is configured with a CNAME override policy, also add the configured EDE code. When the zone contains a wildcard CNAME, also add the configured EDE code.	2025-08-05 08:35:51 +02:00
Matthijs Mekking	7774f16ed5	Special case refresh stale ncache data When refreshing stale ncache data, the qctx->rdataset is NULL and requires special processing.	2025-07-23 07:18:48 +00:00
Matthijs Mekking	a66b04c8d4	Make serve-stale refresh behave as prefetch A serve-stale refresh is similar to a prefetch, the only difference is when it triggers. Where a prefetch is done when an RRset is about to expire, a serve-stale refresh is done when the RRset is already stale. This means that the check for the stale-refresh window needs to move into query_stale_refresh(). We need to clear the DNS_DBFIND_STALEENABLED option at the same places as where we clear DNS_DBFIND_STALETIMEOUT. Now that serve-stale refresh acts the same as prefetch, there is no worry that the same rdataset is added to the message twice. This makes some code obsolete, specifically where we need to clear rdatasets from the message.	2025-07-23 07:18:48 +00:00
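
The RETERR() and CHECK() conversions described in commits 6b33b7fc77 and 38e94cc7da can be illustrated with a small sketch. Only the before/after patterns come from the commit messages; the macro bodies, the `isc_result_t` stand-in enum, and the `step()`, `run_steps()`, and `run_steps_with_buffer()` helpers below are assumptions made for illustration and are not BIND 9's actual definitions.

```c
#include <stdbool.h>
#include <stdlib.h>

/* Stand-in result type (assumption); BIND 9 defines isc_result_t itself. */
typedef enum { ISC_R_SUCCESS = 0, ISC_R_FAILURE = 1 } isc_result_t;

/* Assumed macro body: return the result immediately on failure. */
#define RETERR(x)                          \
	do {                               \
		isc_result_t _r = (x);     \
		if (_r != ISC_R_SUCCESS) { \
			return _r;         \
		}                          \
	} while (0)

/*
 * Assumed macro body: store into a local named 'result' and jump to a
 * local 'cleanup' label on failure.
 */
#define CHECK(x)                               \
	do {                                   \
		result = (x);                  \
		if (result != ISC_R_SUCCESS) { \
			goto cleanup;          \
		}                              \
	} while (0)

/* Hypothetical step that succeeds or fails on demand. */
static isc_result_t
step(bool ok) {
	return ok ? ISC_R_SUCCESS : ISC_R_FAILURE;
}

/* RETERR fits when nothing needs releasing before returning. */
static isc_result_t
run_steps(bool first_ok, bool second_ok) {
	RETERR(step(first_ok));
	RETERR(step(second_ok));
	return ISC_R_SUCCESS;
}

/* CHECK fits when a resource must be released on every exit path. */
static isc_result_t
run_steps_with_buffer(bool first_ok, bool second_ok) {
	isc_result_t result;
	char *buf = malloc(16);

	if (buf == NULL) {
		return ISC_R_FAILURE;
	}
	CHECK(step(first_ok));
	CHECK(step(second_ok));

cleanup:
	free(buf);
	return result;
}
```

In this sketch, `run_steps(false, true)` returns `ISC_R_FAILURE` without running the second step, while `run_steps_with_buffer()` frees its buffer whether or not a step fails.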

430 commits