bind9

mirror of https://github.com/isc-projects/bind9.git synced 2026-04-28 01:28:05 -04:00

Author	SHA1	Message	Date
Evan Hunt	3460fe73e2	fix commit error in mutex_test when the branch implementing mutex_test was rebased and merged, a rebasing error was missed: the isc_threadresult and isc_threadarg types no longer exist.	2023-04-28 02:37:29 +01:00
Ondřej Surý	42c7694dfb	Add mutex unit test Add simple mutex unit test and mutex benchmark. The benchmark compares the pthread mutext with isc mutex implementation, so it's mainly useful when developing a new isc mutex implementation.	2023-04-27 13:15:50 +02:00
Tony Finch	e0248bf60f	Simplify isc_thread a little Remove the `isc_threadarg_t` and `isc_threadresult_t` typedefs which were unhelpful disguises for `void *`, and free the dummy jemalloc allocation sooner.	2023-04-27 12:38:53 +02:00
Tony Finch	06f534fa69	Avoid spurious compilation failures in liburcu headers When liburcu is not installed from a system package, its headers are not treated as system headers by the compiler, so BIND's -Werror and other warning options take effect. The liburcu headers have a lot of inline functions, some of which do not use all their arguments, which BIND's build treats as an error.	2023-04-27 12:38:53 +02:00
Ondřej Surý	b497e90179	Add isc_spinlock unit with shim pthread_spin implementation The spinlock is small (atomic_uint_fast32_t at most), lightweight synchronization primitive and should only be used for short-lived and most of the time a isc_mutex should be used. Add a isc_spinlock unit which is either (most of the time) a think wrapper around pthread_spin API or an efficient shim implementation of the simple spinlock.	2023-04-21 12:10:02 +02:00
Ondřej Surý	32a8773ab3	Always initialize the workers in the libtest The workers variable might be needed even to tests not using loopmgr. Split the workers initialization into setup_workers() function and always call it from the default main loop.	2023-04-21 09:04:24 +02:00
Ondřej Surý	3b10814569	Fix the streaming read callback shutdown logic When shutting down TCP sockets, the read callback calling logic was flawed, it would call either one less callback or one extra. Fix the logic in the way: 1. When isc_nm_read() has been called but isc_nm_read_stop() hasn't on the handle, the read callback will be called with ISC_R_CANCELED to cancel active reading from the socket/handle. 2. When isc_nm_read() has been called and isc_nm_read_stop() has been called on the on the handle, the read callback will be called with ISC_R_SHUTTINGDOWN to signal that the dormant (not-reading) socket is being shut down. 3. The .reading and .recv_read flags are little bit tricky. The .reading flag indicates if the outer layer is reading the data (that would be uv_tcp_t for TCP and isc_nmsocket_t (TCP) for TLSStream), the .recv_read flag indicates whether somebody is interested in the data read from the socket. Usually, you would expect that the .reading should be false when .recv_read is false, but it gets even more tricky with TLSStream as the TLS protocol might need to read from the socket even when sending data. Fix the usage of the .recv_read and .reading flags in the TLSStream to their true meaning - which mostly consist of using .recv_read everywhere and then wrapping isc_nm_read() and isc_nm_read_stop() with the .reading flag. 4. The TLS failed read helper has been modified to resemble the TCP code as much as possible, clearing and re-setting the .recv_read flag in the TCP timeout code has been fixed and .recv_read is now cleared when isc_nm_read_stop() has been called on the streaming socket. 5. The use of Network Manager in the named_controlconf, isccc_ccmsg, and isc_httpd units have been greatly simplified due to the improved design. 6. More unit tests for TCP and TLS testing the shutdown conditions have been added. Co-authored-by: Ondřej Surý <ondrej@isc.org> Co-authored-by: Artem Boldariev <artem@isc.org>	2023-04-20 12:58:32 +02:00
Tony Finch	3dcfad81d8	Check dns_name_countlabels() wrt DNS_NAME_MAXLABELS This test case was omitted from [GL !7803]	2023-04-18 13:32:09 +01:00
Tony Finch	80a153e159	Fix several typos in name_test `nane` -> `name`	2023-04-18 12:56:29 +01:00
Aram Sargsyan	87db9ea84c	unit tests: include an OpenSSL header before including cmocka.h OpenSSL 3.1.0 uses __attribute__(malloc), conflicting with a redefined malloc in cmocka.h. As a workaround, include an OpenSSL header file before including cmocka.h in the unit tests where OpenSSL is used.	2023-04-14 12:11:52 +00:00
Mark Andrews	21a3d4f762	Use SIGABRT rather than SIGKILL for long running unit test SIGABRT will produce a core dump which will allow for forensic analysis of the unit test	2023-04-14 15:40:02 +10:00
Ondřej Surý	c60ce13127	Revert "Kill unit tests that run more than 1200 seconds" This reverts commit `3d5c7cd46c` which added wrapper around all the unit tests that would run the unit test in the forked process. This makes any debugging of the unit tests too hard. Futures attempts to fix #3980 should add a custom automake test harness (log driver) that would kill the unit test after configured timeout.	2023-04-14 06:14:19 +02:00
Ondřej Surý	1715cad685	Refactor the isc_quota code and fix the quota in TCP accept code In `e185412872`, the TCP accept quota code became broken in a subtle way - the quota would get initialized on the first accept for the server socket and then deleted from the server socket, so it would never get applied again. Properly fixing this required a bigger refactoring of the isc_quota API code to make it much simpler. The new code decouples the ownership of the quota and acquiring/releasing the quota limit. After (during) the refactoring it became more clear that we need to use the callback from the child side of the accepted connection, and not the server side.	2023-04-12 14:10:37 +02:00
Tony Finch	3405b43fe9	Fix a division by zero bug in isc_histo This can occur when calculating the standard deviation of an empty histogram.	2023-04-05 23:29:21 +02:00
Tony Finch	e8ff0f0c08	Correct value of DNS_NAME_MAXLABELS It should be floor(DNS_NAME_MAXWIRE / 2) + 1 == 128 The mistake was introduced in `c6bf51492d` because: * I was refactoring an existing `DNS_MAX_LABELS` defined as 127 * There was a longstanding bug in `dns_name_isvalid()` which checked the number of labels against 127U instead of 128 * I mistakenly thought `dns_name_isvalid()` was correct and `dns_name_countlabels()` was incorrect, but the reverse was true. After this commit, occurrances of `DNS_NAME_MAXLABELS` with value 128 are consistent with the use of 127 or 128 before commit `c6bf51492d` except for the mistake in `dns_name_isvalid()`. This commit adds a test case that checks the MAXLABELS case in `dns_name_fromtext()` and `dns_name_isvalid()`.	2023-04-05 14:46:39 +00:00
Tony Finch	b171cacf4f	Use a qp-trie for the zone table This change makes the zone table lock-free for reads. Previously, the zone table used a red-black tree, which is not thread safe, so the hot read path acquired both the per-view mutex and the per-zonetable rwlock. (The double locking was to fix to cleanup races on shutdown.) One visible difference is that zones are not necessarily shut down promptly: it depends on when the qp-trie garbage collector cleans up the zone table. The `catz` system test checks several times that zones have been deleted; the test now checks for zones to be removed from the server configuration, instead of being fully shut down. The catz test does not churn through enough zones to trigger a gc, so the zones are not fully detached until the server exits. After this change, it is still possible to improve the way we handle changes to the zone table, for instance, batching changes, or better compaction heuristics.	2023-04-05 12:38:11 +01:00
Tony Finch	b3e35fd120	A few qp-trie cleanups Revert refcount debug tracing (commit `a8b29f0365`), there are better ways to do it. Use the dns_qpmethods_t typedef where appropriate. Some stylistic improvements.	2023-04-05 12:35:04 +01:00
Tony Finch	39f38754e2	Compact more in dns_qp_compact(DNS_QPGC_ALL) Commit `0858514ae8` enriched dns_qp_compact() to give callers more control over how thoroughly the trie should be compacted. In the DNS_QPGC_ALL case, if the trie is small it might be compacted to a new position in the same memory chunk. In this situation it will still be holding references to old leaf objects which have been removed from the trie but will not be completely detached until the chunk containing the references is freed. This change resets the qp-trie allocator to a fresh chunk before a DNS_QPGC_ALL compaction, so all the old memory chunks will be evacuated and old leaf objects can be detached sooner.	2023-04-05 12:35:04 +01:00
Tony Finch	fa1b57ee6e	Support for finding the longest parent domain in a qp-trie This is the first of the "fancy" searches that know how the DNS namespace maps on to the structure of a qp-trie. For example, it will find the closest enclosing zone in the zone tree.	2023-04-05 12:35:04 +01:00
Tony Finch	8a3a216f40	Support for iterating over the leaves in a qp-trie The iterator object records a path through the trie, in a similar manner to the existing dns_rbtnodechain.	2023-04-05 12:35:04 +01:00
Tony Finch	906d434aea	Fix Coverity complaints in the qp-trie tests The main problem was `qp_test_keytoname()` not using `qpkey_bit()` to do bounds checking.	2023-04-03 15:10:47 +00:00
Tony Finch	cd0e7f853a	Simplify histogram quantiles The `isc_histosummary_t` functions were written in the early days of `hg64` and carried over when I brought `hg64` into BIND. They were intended to be useful for graphing cumulative frequency distributions and the like, but in practice whatever draws charts is better off with a raw histogram export. Especially because of the poor performance of the old functions. The replacement `isc_histo_quantiles()` function is intended for providing a few quantile values in BIND's stats channel, when the user does not want the full histogram. Unlike the old functions, the caller provides all the query fractions up-front, so that the values can be found in a single scan instead of a scan per value. The scan is from larger values to smaller, since larger quantiles are usually more interesting, so the scan can bail out early.	2023-04-03 12:08:05 +01:00
Tony Finch	82213a48cf	Add isc_histo for histogram statistics This is an adaptation of my `hg64` experiments for use in BIND. As well as renaming everything according to ISC style, I have written some more extensive tests that ensure the edge cases are correct and the fenceposts are in the right places. I have added utility functions for working with precision in terms of decimal significant figures as well as this code's native binary.	2023-04-03 12:08:05 +01:00
Ondřej Surý	3a6a0fa867	Replace DE_CONST(k, v) with v = UNCONST(k) macro Replace the complicated DE_CONST macro that required union with much simple reference-dereference trick in the UNCONST() macro.	2023-04-03 10:25:56 +00:00
Michal Nowak	4d094f6b51	Disable failing MD5 unit tests in FIPS mode With FIPS mode enabled 'isc_hmac_init_test' and 'isc_hmac_md5_test' tests of hmac_test and 'isc_md_init_test' and 'isc_md_md5_test' test of md_test fail. This is due to leveraging MD5, which is disabled in FIPS mode.	2023-04-03 12:05:29 +10:00
Mark Andrews	3d5c7cd46c	Kill unit tests that run more than 1200 seconds The CI doesn't provide useful forensics when a system test locks up. Fork the process and kill it with ABRT if it is still running after 20 minutes. Pass the exit status to the caller.	2023-04-03 00:15:43 +00:00
Ondřej Surý	a5f5f68502	Refactor isc_time_now() to return time, and not result The isc_time_now() and isc_time_now_hires() were used inconsistently through the code - either with status check, or without status check, or via TIME_NOW() macro with RUNTIME_CHECK() on failure. Refactor the isc_time_now() and isc_time_now_hires() to always fail when getting current time has failed, and return the isc_time_t value as return value instead of passing the pointer to result in the argument.	2023-03-31 15:02:06 +02:00
Ondřej Surý	956155f613	Squash dns_name_fullhash() and dns_name_hash() The only place where dns_name_hash() was being used is the old hash table in the dns_badcache unit. Squash the dns_name_fullhash() and dns_name_hash() into single dns_name_hash() function that's always case-insensitive as it doesn't make to do case-sensitive hashing of the domain names and we were not using this anywhere.	2023-03-31 12:43:30 +00:00
Ondřej Surý	4bd6096d4b	Remove isc_stdtime_get() macro Now that isc_stdtime_get() macro is unused, remove it from the header file.	2023-03-31 13:33:16 +02:00
Ondřej Surý	46f06c1d6e	Apply the semantic patch to remove isc_stdtime_get() This is a simple replacement using the semantic patch from the previous commit and as added bonus, one removal of previously undetected unused variable in named/server.c.	2023-03-31 13:32:56 +02:00
Ondřej Surý	2c0a9575d7	Replace __attribute__((unused)) with ISC_ATTR_UNUSED attribute macro Instead of marking the unused entities with UNUSED(x) macro in the function body, use a `ISC_ATTR_UNUSED` attribute macro that expans to C23 [[maybe_unused]] or __attribute__((__unused__)) as fallback.	2023-03-30 23:29:25 +02:00
Ondřej Surý	f5fc224af3	Add isc_async_current() macro to run job on current loop Previously, isc_job_run() could have been used to run the job on the current loop and the isc_job_run() would take care of allocating and deallocating the job. After the change in this MR, the isc_job_run() is more complicated to use, so we introduce the isc_async_current() macro to suplement isc_async_run() when we need to run the job on the current loop.	2023-03-30 16:07:41 +02:00
Ondřej Surý	1844590ad9	Refactor isc_job_run to not-make any allocations Change the isc_job_run() to not-make any allocations. The caller must make sure that it allocates isc_job_t - usually as part of the argument passed to the callback. For simple jobs, using isc_async_run() is advised as it allocates its own separate isc_job_t.	2023-03-30 16:00:52 +02:00
Ondřej Surý	665f8bb78d	Fix isc_nm_httpconnect to check for shuttindown condition The isc_nm_httpconnect() would succeed even if the netmgr would be already shuttingdown. This has been fixed and the unit test has been updated to cope with fact that the handle would be NULL when isc_nm_httpconnect() returns with an error.	2023-03-29 05:49:57 +00:00
Mark Andrews	64c0065986	Build libtest even if CMOCKA is not available Be more selective about what is not built when CMOCKA is not available so that fuzz/dns_qp and fuzz/dns_qpkey_name can link against it.	2023-03-29 02:29:18 +00:00
Evan Hunt	d91097e0c7	change ns__client_request() to ns_client_request() in the future we'll want to call this function from outside named, so change the name to one suitable for external access.	2023-03-28 12:38:28 -07:00
Evan Hunt	4ad95e0567	add ns_interface_create() add a public function ns_interface_create() allowing the caller to set up a listening interface directly without having to set up listen-on and scan network interfaces.	2023-03-28 12:38:28 -07:00
Evan Hunt	e914c5e194	add basic test for TSIG key dump/restore functionality stop and restart the server in the 'tsiggss' test, in order to confirm that GSS negotiated TSIG keys are saved and restored when named loads. added logging to dns_tsigkey_createfromkey() to indicate whether a key has been statically configured, generated via GSS negotiation, or restored from a file.	2023-03-16 09:55:50 -07:00
Ondřej Surý	bd4576b3ce	Remove TKEY Mode 2 (Diffie-Hellman) Completely remove the TKEY Mode 2 (Diffie-Hellman Exchanged Keying) from BIND 9 (from named, named.conf and all the tools). The TKEY usage is fringe at best and in all known cases, GSSAPI is being used as it should. The draft-eastlake-dnsop-rfc2930bis-tkey specifies that: 4.2 Diffie-Hellman Exchanged Keying (Deprecated) The use of this mode (#2) is NOT RECOMMENDED for the following two reasons but the specification is still included in Appendix A in case an implementation is needed for compatibility with old TKEY implementations. See Section 4.6 on ECDH Exchanged Keying. The mixing function used does not meet current cryptographic standards because it uses MD5 [RFC6151]. RSA keys must be excessively long to achieve levels of security required by current standards. We might optionally implement Elliptic Curve Diffie-Hellman (ECDH) key exchange mode 6 if the draft ever reaches the RFC status. Meanwhile the insecure DH mode needs to be removed.	2023-03-08 08:36:25 +01:00
Ondřej Surý	cd632ad31d	Implement dns_db node tracing This implements node reference tracing that passes all the internal layers from dns_db API (and friends) to increment_reference() and decrement_reference(). It can be enabled by #defining DNS_DB_NODETRACE in <dns/trace.h> header. The output then looks like this: incr:node:check_address_records:rootns.c:409:0x7f67f5a55a40->references = 1 decr:node:check_address_records:rootns.c:449:0x7f67f5a55a40->references = 0 incr:nodelock:check_address_records:rootns.c:409:0x7f67f5a55a40:0x7f68304d7040->references = 1 decr:nodelock:check_address_records:rootns.c:449:0x7f67f5a55a40:0x7f68304d7040->references = 0 There's associated python script to find the missing detach located at: https://gitlab.isc.org/isc-projects/bind9/-/snippets/1038	2023-02-28 11:44:15 +01:00
Tony Finch	a8b29f0365	Improve qp-trie refcount debugging Add some qp-trie tracing macros which can be enabled by a developer. These print a message when a leaf is attached or detached, indicating which part of the qp-trie implementation did so. The refcount methods must now return the refcount value so it can be printed by the trace macros.	2023-02-27 13:47:57 +00:00
Tony Finch	4b5ec07bb7	Refactor qp-trie to use QSBR The first working multi-threaded qp-trie was stuck with an unpleasant trade-off: * Use `isc_rwlock`, which has acceptable write performance, but terrible read scalability because the qp-trie made all accesses through a single lock. * Use `liburcu`, which has great read scalability, but terrible write performance, because I was relying on `rcu_synchronize()` which is rather slow. And `liburcu` is LGPL. To get the best of both worlds, we need our own scalable read side, which we now have with `isc_qsbr`. And we need to modify the write side so that it is not blocked by readers. Better write performance requires an async cleanup function like `call_rcu()`, instead of the blocking `rcu_synchronize()`. (There is no blocking cleanup in `isc_qsbr`, because I have concluded that it would be an attractive nuisance.) Until now, all my multithreading qp-trie designs have been based around two versions, read-only and mutable. This is too few to work with asynchronous cleanup. The bare minimum (as in epoch based reclamation) is three, but it makes more sense to support an arbitrary number. Doing multi-version support "properly" makes fewer assumptions about how safe memory reclamation works, and it makes snapshots and rollbacks simpler. To avoid making the memory management even more complicated, I have introduced a new kind of "packed reader node" to anchor the root of a version of the trie. This is simpler because it re-uses the existing chunk lifetime logic - see the discussion under "packed reader nodes" in `qp_p.h`. I have also made the chunk lifetime logic simpler. The idea of a "generation" is gone; instead, chunks are either mutable or immutable. And the QSBR phase number is used to indicate when a chunk can be reclaimed. Instead of the `shared_base` flag (which was basically a one-bit reference count, with a two version limit) the base array now has a refcount, which replaces the confusing ad-hoc lifetime logic with something more familiar and systematic.	2023-02-27 13:47:55 +00:00
Tony Finch	549854f63b	Some minor qp-trie improvements Adjust the dns_qp_memusage() and dns_qp_compact() functions to be more informative and flexible about handling fragmentation. Avoid wasting space in runt chunks. Switch from twigs_mutable() to cells_immutable() because that is the sense we usually want. Drop the redundant evacuate() function and rename evacuate_twigs() to evacuate(). Move some chunk test functions closer to their point of use. Clarify compact_recursive(). Some small cleanups to comments. Use isc_time_monotonic() for qp-trie timing stats. Use #define constants to control debug logging. Set up DNS name label offsets in dns_qpkey_fromname() so it is easier to use in cases where the name is not fully hydrated.	2023-02-27 13:47:25 +00:00
Tony Finch	4b09c9a6ae	qp-trie naming improvements Adjust to typename_operation style s/VALID_QP/QP_VALID/g s/QP_VALIDMULTI/QPMULTI_VALID/g Improved greppability s/\bctx\b/uctx/g Less cluttered logging s/QP_TRACE/TRACE/g s/QP_LOG_STATS/LOG_STATS/g	2023-02-27 13:47:25 +00:00
Tony Finch	a9d57b91db	Benchmarks for the qp-trie The main benchmark is `qpmulti`, which exercizes the qp-trie transactional API with differing numbers of threads and differing data sizes, to get some idea of how its performance scales. The `load-names` benchmark compares the times to populate and query and the memory used by various BIND data structures: qp-trie, hash table (chained), hash map (closed), and red-black tree. The `qp-dump` program is a test utility rather than a benchmark. It populates a qp-trie and prints it out, either in an ad-hoc text format, or as input to the graphviz `dot` program.	2023-02-27 13:47:25 +00:00
Tony Finch	fbdb8b502a	Test the qp-trie transactional API Randomized testing with intensive consistency and correctness checks make it much easier to get good coverage and to shake out bugs than hand-written unit tests for specific cases. These tests only run in a single thread, but each test transaction uses both a write/update and a query/snapshot, to ensure that modifications are not visible to concurrent readers.	2023-02-27 13:47:25 +00:00
Tony Finch	c1c679b1a9	Test infrastructure for the qp-trie This change adds a number of support routines for the unit tests, and for benchmarks and fuzz tests to be added later. It isn't necessary to include the support routines in libdns, since they are not needed by BIND's installed programs. So `libtest` seems like the best place for them. The tests themselves verify that dns_qpkey_fromname() behaves as expected.	2023-02-27 13:47:25 +00:00
Evan Hunt	7975b785fd	Support for relative names in unit tests The dns_test_namefromstring() function can now generate relative names, and all the tests that used it before it have been updated to use FQDNs.	2023-02-27 13:47:25 +00:00
Tony Finch	330ff06d4a	Move irs_resconf into libdns and remove libirs `libirs` used to be a reference implementation of `getaddrinfo` and related modern resolver APIs. It was stripped down in BIND 9.18 leaving only the `irs_resconf` module, which parses `/etc/resolv.conf`. I have kept its include path and namespace prefix, so it remains a little fragment of libirs now embedded in libdns.	2023-02-24 09:38:59 +00:00
Evan Hunt	ae5ba54fbe	move dispatchmgr from resolver to view the 'dispatchmgr' member of the resolver object is used by both the dns_resolver and dns_request modules, and may in the future be used by others such as dns_xfrin. it doesn't make sense for it to live in the resolver object; this commit moves it into dns_view.	2023-02-24 08:30:33 +00:00

1 2 3

149 commits