bind9

mirror of https://github.com/isc-projects/bind9.git synced 2026-06-03 13:59:27 -04:00

Author	SHA1	Message	Date
Ondřej Surý	5f15df5c53	Fix memory leak in QPcache addnoqname/addclosest mechanism The attacker that controls DNSSEC-signed zone can trigger a memory leak in the addnoqname() and/or addclosest() by creating more than max-records-per-type RRSIG for any NSEC records. The memory leaks have been fixed. (cherry picked from commit `a854a5c83d`)	2026-03-13 13:22:23 +01:00
Matthijs Mekking	63262fd0f4	Implement dns_dbiterator_seek3 This is a new seek function for dbiterator that is meant to find an NSEC3 node in a zone database. The difference with dns_dbiterator_seek is that if the node does not exist, this seek function will point the iterator to the next NSEC3 name. (cherry picked from commit `41159e9062`)	2025-12-11 13:53:25 +01:00
Evan Hunt	25c9fb54da	standardize CHECK and RETERR macros previously, there were over 40 separate definitions of CHECK macros, of which most used "goto cleanup", and the rest "goto failure" or "goto out". there were another 10 definitions of RETERR, of which most were identical to CHECK, but some simply returned a result code instead of jumping to a cleanup label. this has now been standardized throughout the code base: RETERR is for returning an error code in the case of an error, and CHECK is for jumping to a cleanup tag, which is now always called "cleanup". both macros are defined in isc/util.h. (cherry picked from commit `52bba5cc34`)	2025-12-03 19:17:20 -08:00
Mark Andrews	f8cafb9756	Fix missing RRSIGs for "glue" lookups with CD=1 The code to test whether to store the RRSIGs on DNS_R_UNCHANGED with CD=1 was failing because the comparison methods of the two rdatatset instances were not compatible. Move the testing into dns_db_addrdataset(), and request it by setting the DNS_ADD_EQUALOK option. If the option is set and the old and new rrsets compare as equal, dns_db_addrdataset() returns ISC_R_SUCCESS instead of DNS_R_UNCHANGED. (cherry picked from commit `b954a1df43`)	2025-09-10 17:08:52 +10:00
Ondřej Surý	08328a9cce	Don't preserve cache entries if new TTL is smaller than existing Under certain circumstances, cache entries with equivalent rdataset might not get replaced. Previously such entry would get preserved regardless of the new TTL and expire time on the existing header would get updated when the expire time was less than the expire time on the existing header. Change the logic to preserve the existing header only if the new expire time is larger than the existing one and replace the existing cache entry when the new expire time is less than the existing one. Co-authored-by: Jinmei Tatuya <jtatuya@infoblox.com> (cherry picked from commit `9f7ba584cf`)	2025-08-26 21:13:25 +02:00
Ondřej Surý	06e3d996c1	Preserve ZEROTTL attribute when replacing NS RRset Previously, BIND 9 would drop the ZEROTTL attribute when updating previously cached NS entry with ZEROTTL attribute set. Co-authored-by: Jinmei Tatuya <jtatuya@infoblox.com> (cherry picked from commit `982ca161c2`)	2025-08-26 21:12:21 +02:00
alessio	d21f63884a	Adaptive memory allocation strategy for qp-tries qp-tries allocate their nodes (twigs) in chunks to reduce allocator pressure and improve memory locality. The choice of chunk size presents a tradeoff: larger chunks benefit qp-tries with many values (as seen in large zones and resolvers) but waste memory in smaller use cases. Previously, our fixed chunk size of 2^10 twigs meant that even an empty qp-trie would consume 12KB of memory, while reducing this size would negatively impact resolver performance. This commit implements an adaptive chunking strategy that: - Tracks the size of the most recently allocated chunk. - Doubles the chunk size for each new allocation until reaching a predefined maximum. This approach effectively balances memory efficiency for small tries while maintaining the performance benefits of larger chunk sizes for bigger data structures. This commit also splits the callback freeing qpmultis into two phases, one that frees the underlying qptree, and one that reclaims the qpmulti memory. In order to prevent races between the qpmulti destructor and chunk garbage collection jobs, the second phase is protected by reference counting. (cherry picked from commit `70b1777d8a`)	2025-08-05 12:48:19 +02:00
Mark Andrews	e4d64a0c33	Fix find_coveringnsec in qpcache.c dns_qp_lookup was returning ISC_R_NOTFOUND rather than DNS_R_PARTIALMATCH when there wasn't a parent with a NSEC record in the cache. This was causing find_coveringnsec to fail rather than returing the covering NSEC. (cherry picked from commit `7de4207cb6`)	2025-07-21 17:46:00 +02:00
Mark Andrews	53738b0e5e	Use clang-format-20 to update formatting (cherry picked from commit `422b9118e8`)	2025-06-25 13:32:08 +10:00
Ondřej Surý	817a0a8e8e	Fix invalid cache-line padding for qpcache buckets The isc_queue_t was missing in the calculation of the required padding size inside the qpcache bucket structure. (cherry picked from commit `3ef9b09620`)	2025-03-25 09:59:02 +00:00
Ondřej Surý	614f8c1ef1	Acquire the database reference before possibly last node release Acquire the database refernce in the detachnode() to prevent the last reference to be release while the NODE_LOCK being locked. The NODE_LOCK is locked/unlocked inside the RCU critical section, thus it is most probably this should not pose a problem as the database uses call_rcu memory reclamation, but this it is still safer to acquire the reference before releasing the node. (cherry picked from commit `d1ef6a93c1`)	2025-03-06 10:39:17 +00:00
Ondřej Surý	ee6e64df21	Revert "fix: dev: Delete dead nodes when committing a new version" This reverts commit `67255da4b3`, reversing changes made to `74c9ff384e`. (cherry picked from commit `1e4695510a`)	2025-03-05 17:28:44 +00:00
Evan Hunt	e35e701c2c	when committing a new qpzone version, delete dead nodes if all data has been deleted from a node in the qpzone database, delete the node too. (cherry picked from commit `e58ce19cf2`)	2025-02-18 22:55:20 +00:00
Ondřej Surý	d4e8a92977	Rely on call_rcu() to destroy the qpzone outside of locks Reduce the number of qpzone_ref() and qpzone_unref() calls in qpzone_detachnode() by relying on the call_rcu to delay the destruction of the lock buckets. (cherry picked from commit `1fa5219fdf`)	2025-02-04 23:28:53 +01:00
Ondřej Surý	a9f4e3369a	Reduce false sharing in dns_qpcache Instead of having many node_lock_count * sizeof(<member>) arrays, pack all the members into a qpcache_bucket_t struct that is cacheline aligned and have a single array of those. Additionaly, make both the head and the tail of isc_queue_t padded, not just the head, to prevent false sharing of the lock-free structure with the lock that follows it. (cherry picked from commit `c602d76c1f`)	2025-02-04 23:27:28 +01:00
Ondřej Surý	8229d9cdfa	Print the expiration time of the stale records (not ancient) In #1870, the expiration time of ANCIENT records were printed, but actually the ancient records are very short lived, and the information carries a little value. Instead of printing the expiration of ANCIENT records, print the expiration time of STALE records. (cherry picked from commit `355fc48472`)	2025-02-04 18:07:59 +01:00
Ondřej Surý	302aca809d	Expand the usage of mark_ancient() helper functions When the mark_ancient() helper function was introduced, couple of places with duplicate (or almost duplicate) code was missed. Move the mark_ancient() function closer to the top of the file, and correctly use it in places that mark the header as ANCIENT. (cherry picked from commit `58179e6a19`)	2025-02-03 15:53:34 +01:00
Ondřej Surý	4b114838de	Add better ZEROTTL handling in bindrdataset() If we know that the header has ZEROTTL set, the server should never send stale records for it and the TTL should never be anything else than 0. The comment was already there, but the code was not matching the comment. (cherry picked from commit `cfee6aa565`)	2025-02-03 15:53:34 +01:00
Ondřej Surý	b32512a232	In cache, set rdataset TTL to 0 when the header is not active When the header has been marked as ANCIENT, but the ttl hasn't been reset (this happens in couple of places), the rdataset TTL would be set to the header timestamp instead to a reasonable TTL value. Since this header has been already expired (ANCIENT is set), set the rdataset TTL to 0 and don't reuse this field to print the expiration time when dumping the cache. Instead of printing the time, we now just print 'expired (awaiting cleanup'. (cherry picked from commit `1bbb57f81b`)	2025-02-03 15:53:34 +01:00
Evan Hunt	1e818d368f	fix the cache findzonecut implementation the search for the deepest known zone cut in the cache could improperly reject a node containing stale data, even if the NS rdataset wasn't the data that was stale. this change also improves the efficiency of the search by stopping it when both NS and RRSIG(NS) have been found. (cherry picked from commit `1f095b902c`)	2025-02-02 20:01:52 +01:00
Evan Hunt	5300eebc9e	Clarify reference counting in QP databases Change the names of the node reference counting functions and add comments to make the mechanism easier to understand: - newref() and decref() are now called qpcnode_acquire()/ qpznode_acquire() and qpcnode_release()/qpznode_release() respectively; this reflects the fact that they modify both the internal and external reference counters for a node. - qpcnode_newref() and qpznode_newref() are now called qpcnode_erefs_increment() and qpznode_erefs_increment(), and qpcnode_decref() and qpznode_decref() are now called qpcnode_erefs_decrement() and qpznode_erefs_decrement(), to reflect that they only increase and decrease the node's external reference counters, not internal. (cherry picked from commit `d4f791793e`)	2025-01-31 05:52:13 +01:00
Ondřej Surý	7dab6cdfbc	Remove db_nodelock_t in favor of reference counted qpdb This removes the db_nodelock_t structure and changes the node_locks array to be composed only of isc_rwlock_t pointers. The .reference member has been moved to qpdb->references in addition to common.references that's external to dns_db API users. The .exiting members has been completely removed as it has no use when the reference counting is used correctly. (cherry picked from commit `431513d8b3`)	2025-01-31 05:49:36 +01:00
Ondřej Surý	082a54cc5d	Remove origin_node from qpcache The origin_node in qpcache was always NULL, so we can remove the getoriginode() function and origin_node pointer as the dns_db_getoriginnode() correctly returns ISC_R_NOTFOUND when the function is not implemented. (cherry picked from commit `36a26bfa1a`)	2025-01-31 05:49:23 +01:00
Ondřej Surý	d1d444d2ab	Refactor decref() in both qpcache.c and qpzone.c Cleanup the pattern in the decref() functions in both qpcache.c and qpzone.c, so it follows the similar patter as we already have in newref() function. (cherry picked from commit `814b87da64`)	2025-01-31 05:49:12 +01:00
JINMEI Tatuya	da0453b1d5	Optimize database decref by avoiding locking with refs > 1 Previously, this function always acquires a node write lock if it might need node cleanup in case the reference decrements to 0. In fact, the lock is unnecessary if the reference is larger than 1 and it can be optimized as an "easy" case. This optimization could even be "necessary". In some extreme cases, many worker threads could repeat acquring and releasing the reference on the same node, resulting in severe lock contention for nothing (as the ref wouldn't decrement to 0 in most cases). This change would prevent noticeable performance drop like query timeout for such cases. Co-authored-by: JINMEI Tatuya <jtatuya@infoblox.com> Co-authored-by: Ondřej Surý <ondrej@isc.org> (cherry picked from commit `7f4471594d`)	2025-01-22 14:29:30 +01:00
Alessio Podda	1edf405add	Optimize memory layout of core structs Reduce memory footprint by: - Reordering struct fields to minimize padding. - Using exact-sized atomic types instead of _least/_fast variants - Downsizing integer fields where possible Affected structs: - dns_name_t - dns_slabheader_t - dns_rdata_t - qpcnode_t - qpznode_t (cherry picked from commit `32c7060bd2`)	2024-12-09 09:04:28 +01:00
JINMEI Tatuya	08122316a7	emit more helpful log for exceeding max-records-per-type The new log message is emitted when adding or updating an RRset fails due to exceeding the max-records-per-type limit. The log includes the owner name and type, corresponding zone name, and the limit value. It will be emitted on loading a zone file, inbound zone transfer (both AXFR and IXFR), handling a DDNS update, or updating a cache DB. It's especially helpful in the case of zone transfer, since the secondary side doesn't have direct access to the offending zone data. It could also be used for max-types-per-name, but this change doesn't implement it yet as it's much less likely to happen in practice. (cherry picked from commit `4156995431`)	2024-11-27 11:17:34 +11:00
Ondřej Surý	58a15d38c2	Remove redundant parentheses from the return statement (cherry picked from commit `0258850f20`)	2024-11-19 14:26:52 +01:00
Evan Hunt	c1b94dc622	rename 'rbtiterator' and similar names in qpcache when the QP cache was adapted from the RBT database, some names weren't changed. this could be confusing, so let's change them now. also, we no longer need to include rbt.h. (cherry picked from commit `5a444838db`)	2024-09-19 15:02:23 -07:00
Ondřej Surý	57cd34441a	Be smarter about refusing to add many RR types to the database Instead of outright refusing to add new RR types to the cache, be a bit smarter: 1. If the new header type is in our priority list, we always add either positive or negative entry at the beginning of the list. 2. If the new header type is negative entry, and we are over the limit, we mark it as ancient immediately, so it gets evicted from the cache as soon as possible. 3. Otherwise add the new header after the priority headers (or at the head of the list). 4. If we are over the limit, evict the last entry on the normal header list.	2024-07-01 12:48:51 +02:00
Ondřej Surý	b27c6bcce8	Expand the list of the priority types and move it to db_p.h Add HTTPS, SVCB, SRV, PTR, NAPTR, DNSKEY and TXT records to the list of the priority types that are put at the beginning of the slabheader list for faster access and to avoid eviction when there are more types than the max-types-per-name limit.	2024-07-01 12:47:30 +02:00
Ondřej Surý	52b3d86ef0	Add a limit to the number of RR types for single name Previously, the number of RR types for a single owner name was limited only by the maximum number of the types (64k). As the data structure that holds the RR types for the database node is just a linked list, and there are places where we just walk through the whole list (again and again), adding a large number of RR types for a single owner named with would slow down processing of such name (database node). Add a configurable limit to cap the number of the RR types for a single owner. This is enforced at the database (rbtdb, qpzone, qpcache) level and configured with new max-types-per-name configuration option that can be configured globally, per-view and per-zone.	2024-06-10 16:55:09 +02:00
Ondřej Surý	32af7299eb	Add a limit to the number of RRs in RRSets Previously, the number of RRs in the RRSets were internally unlimited. As the data structure that holds the RRs is just a linked list, and there are places where we just walk through all of the RRs, adding an RRSet with huge number of RRs inside would slow down processing of said RRSets. Add a configurable limit to cap the number of the RRs in a single RRSet. This is enforced at the database (rbtdb, qpzone, qpcache) level and configured with new max-records-per-type configuration option that can be configured globally, per-view and per-zone.	2024-06-10 16:55:07 +02:00
Ondřej Surý	086b63f56d	Use isc_queue to implement wait-free deadnodes queue Replace the ISC_LIST based deadnodes implementation with isc_queue which is wait-free and we don't have to acquire neither the tree nor node lock to append nodes to the queue and the cleaning process can also copy (splice) the list into a local copy without acquiring the list. Currently, there's little benefit to this as we need to hold those locks anyway, but in the future as we move to RCU based implementation, this will be ready. To align the cleaning with our event loop based model, remove the hardcoded count for the node locks and use the number of the event loops instead. This way, each event loop can have its own cleaning as part of the process. Use uniform random numbers to spread the nodes evenly between the buckets (instead of hashing the domain name).	2024-06-05 09:19:56 +02:00
Aram Sargsyan	8052848d50	Fix a bug in expireheader() call arguments order The expireheader() call in the expire_ttl_headers() function is erroneous as it passes the 'nlocktypep' and 'tlocktypep' arguments in a wrong order, which then causes an assertion failure. Fix the order of the arguments so it corresponds to the function's prototype.	2024-05-02 08:38:35 +00:00
Evan Hunt	4b02246130	fix more ambiguous struct names there were some structure names used in qpcache.c and qpzone.c that were too similar to each other and could be confusing when debugging. they have been changed as follows: in qcache.c: - changed_t was unused, and has been removed - search_t -> qpc_search_t - qpdb_rdatasetiter_t -> qpc_rditer_t - qpdb_dbiterator_t -> qpc_dbiter_t in qpzone.c: - qpdb_changed_t -> qpz_changed_t - qpdb_changedlist_t -> qpz_changedlist_t - qpdb_version_t -> qpz_version_t - qpdb_versionlist_t -> qpz_versionlist_t - qpdb_search_t -> qpz_search_t - qpdb_load_t -> qpz_search_t	2024-04-30 12:50:01 -07:00
Evan Hunt	e300dfce46	use dns_qp_getname() where possible some calls to dns_qp_lookup() do not need partial matches, QP chains or QP iterators. in these cases it's more efficient to use dns_qp_getname().	2024-04-30 12:50:01 -07:00
Evan Hunt	2789e58473	get foundname from the node when calling dns_qp_lookup() from qpcache, instead of passing 'foundname' so that a name would be constructed from the QP key, we now just use the name field in the node data. this makes dns_qp_lookup() run faster. the same optimization has also been added to qpzone. the documentation for dns_qp_lookup() has been updated to discuss this performance consideration.	2024-04-30 12:50:01 -07:00
Evan Hunt	04d319afe4	include the nodenames when calculating memory to purge when the cache is over memory, we purge from the LRU list until we've freed the approximate amount of memory to be added. this approximation could fail because the memory allocated for nodenames wasn't being counted. add a dns_name_size() function so we can look up the size of nodenames, then add that to the purgesize calculation.	2024-04-30 12:50:01 -07:00
Evan Hunt	a8bda6ff1e	simplify qpcache iterators in a cache database, unlike zones, NSEC3 records are stored in the main tree. it is not necessary to maintain a separate 'nsec3' tree, nor to have code in the dbiterator implementation to traverse from one tree to another. (if we ever implement synth-from-dnssec using NSEC3 records, we'll need to revert this change. in the meantime, simpler code is better.)	2024-04-30 12:50:01 -07:00
Evan Hunt	7ff43befb7	clean up unnecessary dbiterator code related to origin the QP database doesn't support relative names as the RBTDB did, so there's no need for a 'new_origin' flag or to handle `DNS_R_NEWORIGIN` result codes.	2024-04-30 12:42:32 -07:00
Evan Hunt	85ab92b6e0	more cleanups in qpcache.c - remove unneeded struct members and misleading comments. - remove unused parameters for static functions. - rename 'find_callback' to 'delegating' for consistency with qpzone; the find callback mechanism is not used in QP databases.	2024-04-30 12:42:31 -07:00
Evan Hunt	3acab71d46	rename QPDB_HEADERNODE to HEADERNODE this makes the macro consistent between qpcache.c and qpzone.c. also removed a redundant definition of HEADERNODE in qpzone.c.	2024-04-30 12:42:31 -07:00
Evan Hunt	46d40b3dca	fix structure names in qpcache.c and qpzone.c - change dns_qpdata_t to qpcnode_t (QP cache node), and dns_qpdb_t to qpcache_t, as these types are only accessed locally. - also change qpdata_t in qpzone.c to qpznode_t (QP zone node), for consistency. - make the refcount declarations for qpcnode_t and qpznode_t static, using the new ISC_REFCOUNT_STATIC macros.	2024-04-30 12:42:07 -07:00
Evan Hunt	20d32512ca	clean up unnecessary requirements in qpcache.c qpcache can only support cache semantics now, so there's no longer any need to check for that internally.	2024-04-30 12:31:48 -07:00
Ondřej Surý	c13a1d8b01	Improve the reference counting checks in newref() In qpcache (and rbtdb), there are some functions that acquire neither the tree lock nor the node lock when calling newref(). In theory, this could lead to a race in which a new reference is added to a node that was about to be deleted. We now detect this condition by passing the current tree and node lock status to newref(). If the node was previously unreferenced and we don't hold at least one read lock, we will assert.	2024-04-30 08:41:56 +02:00
Evan Hunt	2c88946590	dns_name_dupwithoffsets() cannot fail this function now always returns success; change it to void and clean up its callers.	2024-04-10 22:51:07 -04:00
Evan Hunt	ea6659a5e9	update foundname when detecting a zonecut above qname an assertion could be triggered in the QPDB cache if a DNAME was found above a queried NS, because the 'foundname' value was not correctly updated to point to the zone cut. the same mistake existed in qpzone and has been fixed there as well.	2024-04-02 10:00:03 +02:00
Evan Hunt	8b67476249	reduce memory consumption of qpcache database as with qpzone, use a dynamically-allocated dns_name instead of a dns_fixedname object to store node names in the QP database.	2024-03-14 10:20:52 -07:00
Matthijs Mekking	659fa0cbc3	Fix Coverity CID 487884: Dead code in qpcache.c Adding a changed record is zonedb related and does not belong in the cache code. This is a leftover dead code and can be safely removed. /lib/dns/qpcache.c: 3459 in add() 3453 } 3454 newheader->next = topheader->next; 3455 newheader->down = topheader; 3456 topheader->next = newheader; 3457 qpnode->dirty = 1; 3458 if (changed != NULL) { >>> CID 487884: (DEADCODE) >>> Execution cannot reach this statement: "changed->dirty = true;". 3459 changed->dirty = true; 3460 } 3461 } else { 3462 /* 3463 * No rdatasets of the given type exist at the node. 3464 */ /lib/dns/qpcache.c: 3409 in add() 3403 } 3404 newheader->next = topheader->next; 3405 newheader->down = topheader; 3406 topheader->next = newheader; 3407 qpnode->dirty = 1; 3408 if (changed != NULL) { >>> CID 487884: (DEADCODE) >>> Execution cannot reach this statement: "changed->dirty = true;". 3409 changed->dirty = true; 3410 } 3411 mark_ancient(header); 3412 if (sigheader != NULL) { 3413 mark_ancient(sigheader); 3414	2024-03-14 10:42:30 +00:00

1 2

51 commits