bind9

mirror of https://github.com/isc-projects/bind9.git synced 2026-05-27 12:13:20 -04:00

Author	SHA1	Message	Date
Artem Boldariev	fff01fe7eb	Fix named failing to start on Solaris systems with hundreds of CPUs This commit fixes a startup issue on Solaris systems with many (reportedly > 510) CPUs by bumping RLIMIT_NOFILE. This appears to be a regression from 9.11.	2022-10-20 14:01:28 +02:00
Ondřej Surý	cd0e5c5784	Replace some raw nc usage in statschannel system test with curl For tests where the TCP connection might get interrupted abruptly, replace the nc with curl as the data sent from server to client might get lost because of abrupt TCP connection. This happens when the TCP connection gets closed during sending the large request to the server. As we already require curl for other system tests, replace the nc usage in the statschannel test with curl that actually understands the HTTP/1.1 protocol, so the same connection is reused for sending the consequtive requests, but without client-side "pipelining". For the record, the server doesn't support parallel processing of the pipelined request, so it's a bit misnomer here, because what we are actually testing is that we process all requests received in a single TCP read callback.	2022-10-20 12:23:34 +02:00
Evan Hunt	575a924b1a	add a test with CD=1 query for pending data this is a regression test for [GL #3247].	2022-10-19 11:36:11 -07:00
Ondřej Surý	0f56a53d66	Remove the time requirement for the statschannel truncated test The 5 seconds requirement to finish the 'pipelined with truncated stream' was causing spurious failures in the CI because the job runners might be very busy and sending 128k of data might simply take some time. Remove the time requirement altogether, there's actually no reason why the test SHOULD or even MUST finish under 5 seconds.	2022-10-19 14:08:24 +02:00
Tom Krizek	cbd0355328	Remove generated controls.conf file from system tests The controls.conf file shouldn't be used directly without templating it first. Remove this no longer used hard-coded file to avoid confusion.	2022-10-19 12:59:27 +02:00
Tom Krizek	cb0a2ae1dd	Revive dupsigs system test Correctly source conf.sh in dupsigs test scripts (fix issue introduced by `093af1c00a`). Update dupsigs test for dnssec-dnskey-kskonly default. Since v9.17.20, the dnssec-dnskey-kskonly is set to yes. Update the test to not expect the additional RRSIG with ZSK for DNSKEY. Speed up the test from 20 minutes to 2.5 minutes and make it part of the default test suite executed in CI. - decrease number of records to sign from 2000 to 500 - decrease the signing interval by a factor of 6 - shorten the final part of the test after last signing (since nothing new happens there) Finally, clarify misleading comments about (in)sufficient time for zone re-signing. The time used in the test is in fact sufficient for the re-signing to happen. If it wasn't, the previous ZSK would end up being deleted while its signatures would still be present, which is a situation where duplicate signatures can still happen.	2022-10-19 12:59:27 +02:00
Tom Krizek	7495deea3e	Revive the stress system test Ensure the port numbers are dynamically filled in with copy_setports. Clarify test fail condition. Make the stress test part of the default test suite since it doesn't seem to run too long or interfere with other tests any more (the original note claiming so is more than 20 years old). Related !6883	2022-10-19 12:59:27 +02:00
Tom Krizek	235ae5f344	Revive dialup system test Properly template the port number in config files with copy_setports. The test takes two minutes on my machine which doesn't seem like a proper justification to exclude it from the test suite, especially considering we run these tests in parallel nowadays. The resource usage doesn't seems significantly increased so it shouldn't interfere with other system tests. There also exists a precedent for longer running system tests that are already part of the default system test suite (e.g. serve-stale takes almost three minutes on the same machine).	2022-10-19 12:59:27 +02:00
Tom Krizek	1e7d832342	Make digdelv test work in different network envs When a target server is unreachable, the varying network conditions may cause different ICMP message (or no message). The host unreachable message was discovered when attempting to run the test locally while connected to a VPN network which handles all traffic. Extend the dig output check with "host unreachable" message to avoid a false negative test result in certain network environments.	2022-10-19 12:59:25 +02:00
Michał Kępień	604d8f0b96	Add tests for CVE-2022-2795 Add a test ensuring that the amount of work fctx_getaddresses() performs for any encountered delegation is limited: delegate example.net to a set of 1,000 name servers in the redirect.com zone, the names of which all resolve to IP addresses that nothing listens on, and query for a name in the example.net domain, checking the number of times the findname() function gets executed in the process; fail if that count is excessively large. Since the size of the referral response sent by ans3 is about 20 kB, it cannot be sent back over UDP (EMSGSIZE) on some operating systems in their default configuration (e.g. FreeBSD - see the net.inet.udp.maxdgram sysctl). To enable reliable reproduction of CVE-2022-2795 (retry patterns vary across BIND 9 versions) and avoid false positives at the same time (thread scheduling - and therefore the number of fetch context restarts - vary across operating systems and across test runs), extend bin/tests/system/resolver/ans3/ans.pl so that it also listens on TCP and make "ns1" in the "resolver" system test always use TCP when communicating with "ans3". Also add a test (foo.bar.sub.tld1/TXT) that ensures the new limitations imposed on the resolution process by the mitigation for CVE-2022-2795 do not prevent valid, glueless delegation chains from working properly.	2022-10-19 11:53:08 +02:00
Evan Hunt	3c11fafadf	test for growth of compressed pipelined responses add a test to compare the Content-Length of successive compressed messages on a single HTTP connection that should contain the same data; fail if the size grows by more than 100 bytes from one query to the next.	2022-10-18 17:16:00 +02:00
Petr Špaček	c3e7bed1ab	Fix cookie system test for builds without --enable-developer The "connecting via TCP" message comes from FCTXTRACE which is not available on some builds.	2022-10-18 13:54:45 +02:00
Petr Špaček	ddf46056ca	Allow system tests to run under root user when inside CI https://docs.gitlab.com/ee/ci/variables/predefined_variables.html says variable CI_SERVER="yes" is available in all versions of Gitlab.	2022-10-18 13:30:16 +02:00
Petr Špaček	c8a38d70f0	Document that nsupdate ignores server command in GSS-TSIG mode This behavior is present since introduction of GSS-TSIG support, commit `289ae548d5`.	2022-10-18 10:12:02 +02:00
Tony Finch	26ed03a61e	Include the function name when reporting unexpected errors I.e. print the name of the function in BIND that called the system function that returned an error. Since it was useful for pthreads code, it seems worthwhile doing so everywhere.	2022-10-17 13:43:59 +01:00
Tony Finch	ec50c58f52	De-duplicate __FILE__, __LINE__ Mostly generated automatically with the following semantic patch, except where coccinelle was confused by #ifdef in lib/isc/net.c @@ expression list args; @@ - UNEXPECTED_ERROR(__FILE__, __LINE__, args) + UNEXPECTED_ERROR(args) @@ expression list args; @@ - FATAL_ERROR(__FILE__, __LINE__, args) + FATAL_ERROR(args)	2022-10-17 11:58:26 +01:00
Michal Nowak	212c4de043	Replace fgrep and egrep with grep -F/-E GNU Grep 3.8 reports the following warnings: egrep: warning: egrep is obsolescent; using grep -E fgrep: warning: fgrep is obsolescent; using grep -F	2022-10-17 09:08:15 +02:00
Michal Nowak	65e91ef5e6	Remove stray backslashes GNU Grep 3.8 reports several instances of stray backslashes in matching patterns: grep: warning: stray \ before / grep: warning: stray \ before :	2022-10-17 09:08:15 +02:00
Tony Finch	45b2d8938b	Simplify and speed up DNS name compression All we need for compression is a very small hash set of compression offsets, because most of the information we need (the previously added names) can be found in the message using the compression offsets. This change combines dns_compress_find() and dns_compress_add() into one function dns_compress_name() that both finds any existing suffix, and adds any new prefix to the table. The old split led to performance problems caused by duplicate names in the compression context. Compression contexts are now either small or large, which the caller chooses depending on the expected size of the message. There is no dynamic resizing. There is a behaviour change: compression now acts on all the labels in each name, instead of just the last few. A small benchmark suggests this is about 2x faster.	2022-10-17 08:45:44 +02:00
Ondřej Surý	cedfc97974	Improve reporting for pthread_once errors Replace all uses of RUNTIME_CHECK() in lib/isc/include/isc/once.h with PTHEADS_RUNTIME_CHECK(), in order to improve error reporting for any once-related run-time failures (by augmenting error messages with file/line/caller information and the error string corresponding to errno).	2022-10-14 16:39:21 +02:00
Tom Krizek	05180154d9	Remove system test delzone There are multiple reasons to remove this test as obsolete: - The test may not possibly work for over 2.5 years, since `98b3b93791` removed the rndc.py python tool on which this test relies. - It isn't part of the test suite either in CI or locally unless it is explicitly enabled. As a result, there are many issues which prevent the test from being executed caused by various refactoring efforts accumulated over time. - Even if the test could be executed, it has no clear failure condition. If the python script(s) fail, the test still passes.	2022-10-14 16:35:20 +02:00
Ondřej Surý	cad2706cce	Replace the statschannel truncated tests with two new tests Now that the artificial limit on the recv buffer has been removed, the current system test always fails because it tests if the truncation has happened. Add test that sending more than 10 headers makes the connection to closed; and add test that sending huge HTTP request makes the connection to be closed.	2022-10-14 11:26:54 +02:00
Ondřej Surý	beecde7120	Rewrite isc_httpd using picohttpparser and isc_url_parse Rewrite the isc_httpd to be more robust. 1. Replace the hand-crafted HTTP request parser with picohttpparser for parsing the whole HTTP/1.0 and HTTP/1.1 requests. Limit the number of allowed headers to 10 (arbitrary number). 2. Replace the hand-crafted URL parser with isc_url_parse for parsing the URL from the HTTP request. 3. Increase the receive buffer to match the isc_netmgr buffers, so we can at least receive two full isc_nm_read()s. This makes the truncation processing much simpler. 4. Process the received buffer from single isc_nm_read() in a single loop and schedule the sends to be independent of each other. The first two changes makes the code simpler and rely on already existing libraries that we already had (isc_url based on nodejs) or are used elsewhere (picohttpparser). The second two changes remove the artificial "truncation" limit on parsing multiple request. Now only a request that has too many headers (currently 10) or is too big (so, the receive buffer fills up without reaching end of the request) will end the connection. We can be benevolent here with the limites, because the statschannel channel is by definition private and access must be allowed only to administrators of the server. There are no timers, no rate-limiting, no upper limit on the number of requests that can be served, etc.	2022-10-14 11:26:54 +02:00
Petr Špaček	53b3ceacd4	Replace #define DNS_NAMEATTR_ with struct of bools sizeof(dns_name_t) did not change but the boolean attributes are now separated as one-bit structure members. This allows debuggers to pretty-print dns_name_t attributes without any special hacks, plus we got rid of manual bit manipulation code.	2022-10-13 17:04:02 +02:00
Artem Boldariev	95a551de7b	doth system test: increase transfers-in/out limits Sometimes doth test could intermittently fail shortly after start due to inability to complete a zone transfer in time. As it turned out, it could happen due to transfers-in/out limits. Initially the defaults were fine, but over time, especially when adding Strict/Mutual TLS, we added more than 10 zones so it became possible to hit the limits. This commit takes care of that by bumping the limits.	2022-10-12 21:52:52 +03:00
Artem Boldariev	354494cd10	doth system test - decrease HTTP listener quota size This commit reduces the size of HTTP listener quota from 300 (default) to 100 so that it would make hitting any global limits in case of running multiple tests in parallel in multiple containers unlikely. This way the need in opening many file descriptors of different kinds (e.g. client side connections and pipes) gets significantly reduced while the required code paths are still verified.	2022-10-12 21:46:39 +03:00
Michał Kępień	18e20f95f6	Fix startup detection after restart in start.pl The bin/tests/system/start.pl script waits until a "running" message is logged by a given name server instance before attempting to send a version.bind/CH/TXT query to it. The idea behind this was to make the script wait until named loads all the zones it is configured to serve before telling the system test framework that a given server is ready to use; this prevents the need to add boilerplate code that waits for a specific zone to be loaded to each test expecting that. The problem is that when it looks for "running" messages, the bin/tests/system/start.pl script assumes that the existence of any such message in the named.run file indicates that a given named instance has already finished loading all zones. Meanwhile, some system tests restart all the named instances they use throughout their lifetime (some even do that a few times), for example to run Python-based tests. The bin/tests/system/start.pl script handles such a scenario incorrectly: as soon as it finds any "running" message in the named.run file it inspects and it gets a response to a version.bind/CH/TXT query, it tells the system test framework that a given server is ready to use, which might not be true - it is possible that only the "version.bind" zone is loaded at that point and the "running" message found was logged by a previously-shutdown named instance. This triggers intermittent failures for Python-based tests. Fix by improving the logic that the bin/tests/system/start.pl script uses to detect server startup: check how many "running" lines are present in a given named.run file before attempting to start a named instance and only proceed with version.bind/CH/TXT queries when the number of "running" lines found in that named.run file increases after the server is started.	2022-10-11 11:54:57 +02:00
Michał Kępień	9146b956ae	Do not truncate ns2 logs in the "rrsetorder" test In the "rrsetorder" system test, the ns2 named instance is restarted without passing the --restart option to bin/tests/system/start.pl. This causes the log file for that named instance to be needlessly truncated. Prevent this from happening by restarting the affected named instance in the same way as all the other named instances used in system tests.	2022-10-11 11:54:57 +02:00
Petr Špaček	058c1744ba	Clarify error message about missing inline-signing & dnssec-policy	2022-10-06 10:26:30 +02:00
Mark Andrews	491a8cfe96	Add sleeps to ixfr system test ensure that at least a second has passed since a zone was last loaded to prevent it accidentally being skipped as up to date.	2022-10-06 08:18:03 +11:00
Ondřej Surý	0dcbc6274b	Record the 'edns-udp-size' in the view, not in the resolver Getting the recorded value of 'edns-udp-size' from the resolver requires strong attach to the dns_view because we are accessing `view->resolver`. This is not the case in places (f.e. dns_zone unit) where `.udpsize` is accessed. By moving the .udpsize field from `struct dns_resolver` to `struct dns_view`, we can access the value directly even with weakly attached dns_view without the need to lock the view because `.udpsize` can be accessed after the dns_view object has been shut down.	2022-10-05 11:59:36 -07:00
Michal Nowak	f5d9fa6ea4	Drop flake8 ignore lists flake8 is not used in BIND 9 CI and inline ignore lists are not needed anymore.	2022-10-05 17:56:24 +02:00
Ondřej Surý	e18b6fb6a6	Use isc_mem_regetx() when appropriate While refactoring the isc_mem_getx(...) usage, couple places were identified where the memory was resized manually. Use the isc_mem_reget(...) that was introduced in [GL !5440] to resize the arrays via function rather than a custom code.	2022-10-05 16:44:05 +02:00
Ondřej Surý	c0598d404c	Use designated initializers instead of memset()/MEM_ZERO for structs In several places, the structures were cleaned with memset(...)) and thus the semantic patch converted the isc_mem_get(...) to isc_mem_getx(..., ISC_MEM_ZERO). Use the designated initializer to initialized the structures instead of zeroing the memory with ISC_MEM_ZERO flag as this better matches the intended purpose.	2022-10-05 16:44:05 +02:00
Ondřej Surý	c1d26b53eb	Add and use semantic patch to replace isc_mem_get/allocate+memset Add new semantic patch to replace the straightfoward uses of: ptr = isc_mem_{get,allocate}(..., size); memset(ptr, 0, size); with the new API call: ptr = isc_mem_{get,allocate}x(..., size, ISC_MEM_ZERO);	2022-10-05 16:44:05 +02:00
Mark Andrews	285351d4b2	Add additional forensics to zero system test	2022-10-05 07:46:01 +00:00
Matthijs Mekking	0681b15225	If refresh stale RRset times out, start stale-refresh-time The previous commit failed some tests because we expect that if a fetch fails and we have stale candidates in cache, the stale-refresh-time window is started. This means that if we hit a stale entry in cache and answering stale data is allowed, we don't bother resolving it again for as long we are within the stale-refresh-time window. This is useful for two reasons: - If we failed to fetch the RRset that we are looking for, we are not hammering the authoritative servers. - Successor clients don't need to wait for stale-answer-client-timeout to get their DNS response, only the first one to query will take the latency penalty. The latter is not useful when stale-answer-client-timeout is 0 though. So this exception code only to make sure we don't try to refresh the RRset again if it failed to do so recently.	2022-10-05 08:20:48 +02:00
Mark Andrews	6d561d3886	Add support for 'dohpath' to SVCB (and HTTPS) dohpath is specfied in draft-ietf-add-svcb-dns and has a value of 7. It must be a relative path (start with a /), be encoded as UTF8 and contain the variable dns ({?dns}).	2022-10-04 14:21:41 +11:00
Ondřej Surý	d971472321	Be more patient when stopping servers in the system tests When the TCP test is run on the busy server, the server might take a while to wind the server down because it might still be processing all that 300k invalid XFR requests. Increate the rncd wait time to 120 seconds, the SIGTERM time to 300 seconds, and reduce the time to wait for ans servers from 1200 second to just 120 seconds.	2022-09-30 17:12:44 +02:00
Ondřej Surý	477eb22c12	Refactor isc_ratelimiter API Because the dns_zonemgr_create() was run before the loopmgr was started, the isc_ratelimiter API was more complicated that it had to be. Move the dns_zonemgr_create() to run_server() task which is run on the main loop, and simplify the isc_ratelimiter API implementation. The isc_timer is now created in the isc_ratelimiter_create() and starting the timer is now separate async task as is destroying the timer in case it's not launched from the loop it was created on. The ratelimiter tick now doesn't have to create and destroy timer logic and just stops the timer when there's no more work to do. This should also solve all the races that were causing the isc_ratelimiter to be left dangling because the timer was stopped before the last reference would be detached.	2022-09-30 10:36:30 +02:00
Ondřej Surý	36cdeb7656	Remove debugging fprintf from run_server() In the loopmgr branch, we forgot the scissors^Hdebugging output in the patient^Hnamed, remove it.	2022-09-29 14:22:58 +02:00
Aram Sargsyan	ae4296729c	Test dynamic update forwarding when using a TLS-enabled primary Add several test cases in the 'upforwd' system test to make sure that different scenarios of Dynamic DNS update forwarding are tested, in particular when both the original and forwarded requests are over Do53, or DoT, or they use different transports.	2022-09-28 09:36:24 +00:00
Mark Andrews	432064f63c	Suffix may be used before it is assigned a value CID 350722 (#5 of 7): Bad use of null-like value (FORWARD_NULL) 12. invalid_operation: Invalid operation on null-like value suffix. 145 r.authority.append( 146 dns.rrset.from_text( 147 "icky.ptang.zoop.boing." + suffix, 148 1, 149 IN, 150 NS, 151 "a.bit.longer.ns.name." + suffix, 152 ) 153 )	2022-09-27 23:47:12 +00:00
Ondřej Surý	3b31f7f563	Add autoconf option to enable memory leak detection in libraries There's a known memory leak in the engine_pkcs11 at the time of writing this and it interferes with the named ability to check for memory leaks in the OpenSSL memory context by default. Add an autoconf option to explicitly enable the memory leak detection, and use it in the CI except for pkcs11 enabled builds. When this gets fixed in the engine_pkc11, the option can be enabled by default.	2022-09-27 17:53:04 +02:00
Ondřej Surý	d1cc847ab0	Check the libuv, OpenSSL and libxml2 memory context on exit As we can't check the deallocations done in the library memory contexts by default because it would always fail on non-clean exit (that happens on error or by calling exit() early), we just want to enable the checks to be done on normal exit.	2022-09-27 17:10:42 +02:00
Ondřej Surý	e537fea861	Use custom isc_mem based allocator for libxml2 The libxml2 library provides a way to replace the default allocator with user supplied allocator (malloc, realloc, strdup and free). Create a memory context specifically for libxml2 to allow tracking the memory usage that has originated from within libxml2. This will provide a separate memory context for libxml2 to track the allocations and when shutting down the application it will check that all libxml2 allocations were returned to the allocator. Additionally, move the xmlInitParser() and xmlCleanupParser() calls from bin/named/main.c to library constructor/destructor in libisc library.	2022-09-27 17:10:42 +02:00
Petr Špaček	c648e280e4	Document list of crypto algorithms in named -V output	2022-09-27 16:54:39 +02:00
Mark Andrews	d34ecdb366	Deduplicate string formating	2022-09-27 16:54:39 +02:00
Mark Andrews	3156d36495	silence scan-build false positive	2022-09-27 16:54:39 +02:00
Mark Andrews	cb1515e71f	Report algorithms supported by named at startup	2022-09-27 16:54:39 +02:00

1 2 3 4 5 ...

10973 commits