haproxy

mirror of https://github.com/haproxy/haproxy.git synced 2026-04-21 14:17:30 -04:00

Author	SHA1	Message	Date
William Lallemand	631fd5f99b	MEDIUM: acme: add dns-01 DNS propagation pre-check When using the dns-01 challenge type, TXT record propagation across DNS servers can take time. If the ACME server verifies the challenge before the record is visible, the challenge fails and it's not possible to trigger it again. This patch introduces an optional DNS pre-check mechanism controlled by two new configuration directives in the "acme" section: - "dns-check on\|off": enable DNS propagation verification before notifying the ACME server (default: off) - "dns-delay <time>": delay before querying DNS (default: 300s) When enabled, three new states are inserted in the state machine between AUTH and CHALLENGE: - ACME_RSLV_WAIT: waits dns-delay seconds before starting - ACME_RSLV_TRIGGER: starts an async TXT resolution for each pending authorization using HAProxy's resolver infrastructure - ACME_RSLV_READY: compares the resolved TXT record against the expected token; retries from ACME_RSLV_WAIT if any record is missing or does not match The "acme_rslv" structure is implemented in acme_resolvers.c, it holds the resolution for each domain. The "auth" structure which contains each challenge to resolve contains an "acme_rslv" structure. Once ACME_RSLV_TRIGGER leaves, the DNS tasks run on the same thread, and the last DNS task which finishes will wake up acme_process(). Note that the resolution goes through the configured resolvers, not through the authoritative name servers of the domain. The result may therefore still be affected by DNS caching at the resolver level.	2026-03-30 18:24:28 +02:00
William Lallemand	e418e828aa	MINOR: resolvers: basic TXT record implementation This patch adds support for TXT records. It allows to get the first string of a TXT-record which is limited to 255 characters. The rest of the record is ignored.	2026-03-30 18:24:28 +02:00
Christopher Faulet	25d6e65aae	BUG/MAJOR: resolvers: Properly lowered the names found in DNS response Names found in DNS responses are lowered to be compared. A name is composed of several labels, strings preceded by their length on one byte. For instance: 3www7haproxy3org There is a bug when labels are lowered. The label length is not skipped and the tolower() function is called on it. So for a label length in the range [65-90] (uppercase char), 32 is added to the label length due to the conversion of an uppercase char to lowercase. This bug can lead to an OOB read later in the resolvers code. The fix is quite obvious, the label length must be skipped when the label is lowered. Thank you to Kamil Frankowicz for having reported this. This patch must be backported to all stable versions.	2026-03-05 15:35:31 +01:00
Amaury Denoyelle	f3127df74d	MINOR: proxy: add refcount to proxies Implement refcount notion into proxy structure. The objective is to be able to increment refcount on proxy to prevent its deletion temporarily. This is similar to the server refcount : "del backend" is not blocked and will remove the targeted instance from the global proxies_list. However, the final free operation is delayed until the refcount is null. As stated above, the API is similar to servers. Proxies are initialized with a refcount of 1. Refcount can be incremented via proxy_take(). When no longer useful, refcount is decremented via proxy_drop() which replaces the older free_proxy(). Deinit is only performed once refcount is null. This commit also defines flag PR_FL_DELETED. It is set when a proxy instance has been removed via a "del backend" CLI command. This should serve as indication to modules which may still have a refcount on the target proxy so that they can release it as soon as possible. Note that this new refcount is completely ignored for a default proxy instance. For them, proxy_take() is pure noop. Free is immediately performed on first proxy_drop() invocation.	2026-03-02 10:44:59 +01:00
Willy Tarreau	de0eddf512	MINOR: counters: add EXTRA_COUNTERS_BASE() to retrieve extra_counters base storage The goal is to always retrieve the storage address of the first thread group for the given module. This will be used to iterate over all thread groups. For now it returns the same value as EXTRA_COUNTERS_GET().	2026-02-26 17:03:53 +01:00
Willy Tarreau	a60e1fcf7f	MEDIUM: counters: store the number of thread groups accessing extra_counters In order to be able to properly allocate all storage and retrieve data from there, we'll need to know how many thread groups are supposed to access it. Let's store the number of thread groups at init time. If the tgrp_step is zero, there's always only one tg though. Now EXTRA_COUNTERS_ALLOC() takes this number of thread groups in argument and stores it in the structure. It also allocates as many areas as needed, incrementing the datap pointer by the step for each of them. EXTRA_COUNTERS_FREE() uses this info to free all allocated areas. EXTRA_COUNTERS_INIT() initializes all allocated areas, this is used elsewhere to clear/preset counters, e.g. in proxy_stats_clear_counters(). It involves a memcpy() call for each array, which is normally preset to something empty but might also be used to preset certain non-scalar fields such as an instance name.	2026-02-26 17:03:53 +01:00
Willy Tarreau	7ac47910a2	MINOR: counters: store a tgroup step for extra_counters to access multiple tgroups We'll need to permit any user to update its own tgroup's extra counters instead of the global ones. For this we now store the per-tgroup step between two consecutive data storages, for when they're stored in a tgroup array. When shared (e.g. resolvers or listeners), we just store zero to indicate that it doesn't scale with tgroups. For now only the registration was handled, it's not used yet.	2026-02-26 17:03:53 +01:00
Willy Tarreau	04a9f86a85	MEDIUM: counters: add a dedicated storage for extra_counters in various structs Servers, proxies, listeners and resolvers all use extra_counters. We'll need to move the storage to per-tgroup for those where it matters. Now we're relying on an external storage, and the data member of the struct was replaced with a pointer to that pointer to data called datap. When the counters are registered, these datap are set to point to relevant locations. In the case of proxies and servers, it points to the first tgrp's storage. For listeners and resolvers, it points to a local storage. The rationale here is that listeners are limited to a single group anyway, and that resolvers have a low enough load so that we do not care about contention there. Nothing should change for the user at this point.	2026-02-26 17:03:47 +01:00
Willy Tarreau	8dd22a62a4	CLEANUP: counters: only retrieve zeroes for unallocated extra_counters Since version 2.4 with commit `7f8f6cb926` ("BUG/MEDIUM: stats: prevent crash if counters not alloc with dummy one") we can afford to always update extra_counters because we know they're always either allocated or linked to a dedicated trash. However, the ->fill_stats() callbacks continue to access such values, making it technically possible to retrieve random counters from this trash, which is not really clean. Let's implement an explicit test in the ->fill_stats() functions to only return 0 for the metric when not allocated like this. It's much cleaner because it guarantees that we're returning an empty counter in this case rather than random values. The situation currently happens for dummy servers like the ones used in Lua proxies as well as those used by rings (e.g. used for logging or traces). Normally, none of the objects retrieved via stats or Prometheus is concerned by this unallocated extra_counters situation, so this is more about a cleanup than a real fix.	2026-02-26 08:24:03 +01:00
Willy Tarreau	95a9f472d2	MEDIUM: counters: change the fill_stats() API to pass the module and extra_counters We'll soon need to iterate over thread groups in the fill_stats() functions, so let's first pass the extra_counters and stats_module pointers to the fill_stats functions. They now call EXTRA_COUNTERS_GET() themselves with these elements in order to retrieve the required pointer. Nothing else changed, and it's getting even a bit more transparent for callers. This doesn't change anything visible however.	2026-02-26 08:24:03 +01:00
Willy Tarreau	56fc12d6fa	CLEANUP: stats: drop stats.h / stats-t.h where not needed A number of C files include stats.h or stats-t.h, many of which were just to access the counters. Now those which really need counters rely on counters.h or counters-t.h, which already reduces the amount of preprocessed code to be built (~3000 lines or about 0.05%).	2026-02-26 08:24:03 +01:00
Damien Claisse	561dc127bd	BUG/MINOR: resolvers: ensure fair round robin iteration Previous fixes restored round robin iteration, but an imbalance remains when the response tree contains record types other than A or AAAA. Let's take the following example: the DNS answers two A records and a CNAME. The response "tree" (which is actually flat, more like a list) may look as follows, ordered by hash: - 1st item: first A record with IP 1 - 2nd item: second A record with IP 2 - 3rd item: CNAME record As a consequence, resolv_get_ip_from_response will iterate as follows, while the TTL is still valid: - 1st call: DNS request is done, response tree is created, iteration starts at the first item, IP 1 is returned. - 2nd call: cached response tree is used, iteration starts at the second item, IP 2 is returned. - 3rd call: cached response tree is used, iteration starts at the third item, but it's a CNAME, so we continue to the next item, which restarts iteration at the first item, and IP 1 is returned. - 4th call: cached response tree is used and iteration restarts at the beginning, returning IP 1 again. The 1-2-1-1-2-1-1-2 sequence will repeat, so IP 1 will be used twice as often as IP 2, creating a strong imbalance. Even with more IP addresses, the first one by hashing order in the tree will always receive twice the traffic of the others. To fix this, set the next iteration item to the one following the selected IP record, if any. This ensures we never use the same IP twice in a row. This commit should be backported where `3023e9819` ("BUG/MINOR: resolvers: Restore round-robin selection on records in DNS answers") is, so as far as 2.6.	2025-11-02 17:28:32 +01:00
Christopher Faulet	c84c15d393	BUG/MINOR: resolvers: Apply dns-accept-family setting on additional records dns-accept-family setting was only evaluated for responses to A / AAAA DNS queries. It was ignored when additional records in SRV responses were parsed. With this patch, when an SRV response is parsed, additional records not matching the dns-accept-family setting are ignored, as expected. This patch must be backported to 3.2.	2025-10-29 11:20:27 +01:00
Willy Tarreau	ced9784df4	BUG/MEDIUM: resolvers: break an infinite loop in resolv_get_ip_from_response() The fix in `3023e98199` ("BUG/MINOR: resolvers: Restore round-robin selection on records in DNS answers") still contained an issue not addressed by `f6dfbbe870` ("BUG/MEDIUM: resolvers: Test for empty tree when getting a record from DNS answer"). Indeed, if the next element is the same as the first one, then we can end up with an endless loop because the test at the end compares the next pointer (possibly null) with the end one (first). Let's move the null->first transition at the end. This must be backported where the patches above were backported (3.2 for now).	2025-10-03 09:08:10 +02:00
Willy Tarreau	fdf6fd5b45	MEDIUM: server: switch the host_dn member to cebis_tree This member is used to index the hostname_dn contents for DNS resolution. Let's replace it with a cebis_tree to save another 32 bytes (24 for the node + 8 by avoiding the duplication of the pointer). The struct server is now at 3904 bytes.	2025-09-16 09:23:46 +02:00
Valentine Krasnobaeva	f8acac653e	BUG/MINOR: resolvers: always normalize FQDN from response RFC1034 states the following: By convention, domain names can be stored with arbitrary case, but domain name comparisons for all present domain functions are done in a case-insensitive manner, assuming an ASCII character set, and a high order zero bit. This means that you are free to create a node with label "A" or a node with label "a", but not both as brothers; you could refer to either using "a" or "A". In practice, most DNS resolvers normalize domain labels (i.e., convert them to lowercase) before performing searches or comparisons to ensure this requirement is met. While HAProxy normalizes the domain name in the request, it currently does not do so for the response. Commit `75cc653` ("MEDIUM: resolvers: replace bogus resolv_hostname_cmp() with memcmp()") intentionally removed the `tolower()` conversion from `resolv_hostname_cmp()` for safety and performance reasons. This commit re-introduces the necessary normalization for FQDNs received in the response. The change is made in `resolv_read_name()`, where labels are processed as an unsigned char string, allowing `tolower()` to be applied safely. Since a typical FQDN has only 3-4 labels, replacing `memcpy()` with an explicit copy that also applies `tolower()` should not introduce a significant performance degradation. This patch addresses the rare edge case, as most resolvers perform this normalization themselves. This fixes the GitHub issue #3102. This fix may be backported to all stable versions since 2.5 included.	2025-09-15 18:02:16 +02:00
Christopher Faulet	157852ce99	BUG/MEDIUM: resolvers: Wake resolver task up whne unlinking a stream requester Another regression introduced with the commit `3023e9819` ("BUG/MINOR: resolvers: Restore round-robin selection on records in DNS answers"). Stream requesters are unlinked from any thread. So we must not try to queue the resolver's task here because it is not allowed to do so from another thread than the task thread. Instead, we can simply wake the resolver's task up. It is only performed when the last stream requester is unlinked from the resolution. This patch should fix the issue #3119. It must be backported with the commit above.	2025-09-15 07:57:29 +02:00
Christopher Faulet	e6a9192af6	BUG/MEDIUM: resolvers: Accept to create resolution without hostname A regression was introduced by commit `6cf2401ed` ("BUG/MEDIUM: resolvers: Make resolution owns its hostname_dn value"). In fact, it is possible (and allowed?!) to create a resolution without hostname (hostname_dn == NULL). It only happens on startup for a server relying on a resolver but defined with an IP address and not a hostname. Because of the patch above, an error is triggered during the configuration parsing when this happens, while it should be accepted. This patch must be backported with the commit above.	2025-09-12 11:52:06 +02:00
Christopher Faulet	6cf2401eda	BUG/MEDIUM: resolvers: Make resolution owns its hostname_dn value The commit `37abe56b1` ("BUG/MEDIUM: resolvers: Properly cache do-resolv resolution") introduced a regression. A resolution does not own its hostname_dn value, it is a pointer to the first request value. But since the commit above, it is possible to have an orphan resolution, with no requester. So it is important to make each resolution own its hostname_dn value by duplicating it when the resolution is created. This patch must be backported with the commit above.	2025-09-12 11:09:19 +02:00
Christopher Faulet	f6dfbbe870	BUG/MEDIUM: resolvers: Test for empty tree when getting a record from DNS answer In the previous fix 5d1d93fad ("BUG/MEDIUM: resolvers: Properly handle empty tree when getting a record from the DNS answer"), I missed the fact the answer tree can be empty. So, to avoid crashes, when the answer tree is empty, we immediately exit from resolv_get_ip_from_response() function with RSLV_UPD_NO_IP_FOUND. In addition, when a record is removed from the tree, we take care to reset the next node saved if necessary. This patch must be backported with the commit above.	2025-09-12 11:09:19 +02:00
Christopher Faulet	3023e98199	BUG/MINOR: resolvers: Restore round-robin selection on records in DNS answers Since the commit `dcb696cd3` ("MEDIUM: resolvers: hash the records before inserting them into the tree"), when several records are found in a DNS answer, the round robin selection over these records is no longer performed. Indeed, before, a list of records was used. To ensure each record was selected one after the other, at each selection, the first record of the list was moved to the end. When this list was replaced by a tree, the same mechanism was preserved. However, the record is indexed using its key, a hash of the record. So its position never changes. When it is removed and reinserted in the tree, its position remains the same. When we walk through the tree, starting from the root, the records are always evaluated in the same order. So, even if there are several records in a DNS answer, the same IP address is always selected. It is quite easy to trigger the issue with a do-resolv action. To fix the issue, the node to perform the next selection is now saved. So instead of restarting from the root each time, we can restart from the next node of the previous call. Thanks to Damien Claisse for the issue analysis and for the reproducer. This patch should fix the issue #3116. It must be backported as far as 2.6.	2025-09-11 15:46:45 +02:00
Christopher Faulet	37abe56b18	BUG/MEDIUM: resolvers: Properly cache do-resolv resolution As stated by the documentation, when a do-resolv resolution is performed, the result should be cached for <hold.valid> milliseconds. However, the only way to cache the result is to always have a requester. When the last requester is unlinked from the resolution, the resolution is released. So, for a do-resolv resolution, it means it could only work by chance if the same FQDN is requested enough to always have at least two streams waiting for the resolution. And because in that case, the cached result is used, it means the traffic must be quite high. In fact, a good approach to fix the issue is to keep orphan resolutions to be able to cache the result and only release them hold.valid milliseconds after the last real resolution. The resolver's task already releases orphan resolutions. So we only need to check the expiration date and take care to not release the resolution when the last stream is unlinked from it. This patch should be backported to all stable versions. We can start to backport it as far as 3.1 and then wait a bit.	2025-09-11 15:46:45 +02:00
Willy Tarreau	2ce5e0edcc	MEDIUM: resolvers: make the process_resolvers() task single-threaded This task is sometimes caught triggering the watchdog while waiting for the infamous resolvers lock, or the scheduler's wait queue lock in task_queue(). Both are caused by its multi-threaded capability. The task may indeed start on a thread that's different from the one that is currently receiving a response and that holds the resolvers lock, and when being queued back, it requires to lock the wait queue. Both problems disappear when sticking it to a single thread. But for configs running multiple resolvers sections, it would be suboptimal to run them all on the same thread. In order to avoid this, we implement a counter in the resolvers_finalize_config() section that rotates the thread for each resolvers section. This was sufficient to further improve the performance here, making the CPU usage drop to about 7% (from 11 previously or 38 initially) and not showing any resolvers lock contention anymore in perf top output. The change was kept fairly minimal to permit a backport once enough testing is conducted on it. It could address a significant part of the trouble reported by Felipe in GH issue #3101.	2025-09-10 16:51:14 +02:00
Willy Tarreau	c264ea1679	MEDIUM: tree-wide: replace most DECLARE_POOL with DECLARE_TYPED_POOL This will make the pools size and alignment automatically inherit the type declaration. It was done like this: sed -i -e 's:DECLARE_POOL(\([^,]*,[^,]*,\s*\)sizeof(\([^)]*\))):DECLARE_TYPED_POOL(\1\2):g' $(git grep -lw DECLARE_POOL src addons) sed -i -e 's:DECLARE_STATIC_POOL(\([^,]*,[^,]*,\s*\)sizeof(\([^)]*\))):DECLARE_STATIC_TYPED_POOL(\1\2):g' $(git grep -lw DECLARE_STATIC_POOL src addons) 81 replacements were made. The only remaining ones are those which set their own size without depending on a structure. The few ones with an extra size were manually handled. It also means that the requested alignments are now checked against the type's. Given that none is specified for now, no issue is reported. It was verified with "show pools detailed" that the definitions are exactly the same, and that the binaries are similar.	2025-08-11 19:55:30 +02:00
Willy Tarreau	96da670cd7	MINOR: resolvers: do not duplicate the hostname_dn field The hostdn.key field in the server contains a pure copy of the hostname_dn since commit `3406766d57` ("MEDIUM: resolvers: add a ref between servers and srv request or used SRV record") which wanted to lowercase it. Since it's not necessary, let's drop this useless copy. In addition, the return from strdup() was not tested, so it could theoretically crash the process under heavy memory contention.	2025-07-08 07:54:45 +02:00
Willy Tarreau	54d36f3e65	MEDIUM: resolvers: switch dns-accept-family to "auto" by default As notified in the 3.2 announce [1], dns-accept-family needed to switch to "auto" by default in 3.3. This is now done. [1] https://www.mail-archive.com/haproxy@formilux.org/msg45917.html	2025-07-08 07:54:45 +02:00
Willy Tarreau	8a96216847	MEDIUM: sock-inet: re-check IPv6 connectivity every 30s IPv6 connectivity might start off (e.g. network not fully up when haproxy starts), so for features like resolvers, it would be nice to periodically recheck. With this change, instead of having the resolvers code rely on a variable indicating connectivity, it will now call a function that will check for how long a connectivity check hasn't been run, and will perform a new one if needed. The age was set to 30s which seems reasonable considering that the DNS will cache results anyway. There's no saving in spacing it more since the syscall is very cheap (just a connect() without any packet being emitted). The variables remain exported so that we could present them in show info or anywhere else. This way, "dns-accept-family auto" will now stay up to date. Warning though, it does perform some caching so even with a refreshed IPv6 connectivity, an older record may be returned anyway.	2025-05-09 15:45:44 +02:00
Aurelien DARRAGON	bd48e26a74	CLEANUP: proxy: mention that px->conn_retries isn't relevant in some cases Since `91e785edc` ("MINOR: stream: Rely on a per-stream max connection retries value"), px->conn_retries may be ignored in the following cases: * proxy not part of a list which gets properly post-init (ie: main proxy list, log-forward list, sink list) * proxy lacking the CAP_FE capability Documenting such cases where the px->conn_retries is set but effectively ignored, so that we either remove ignored statements or fix them in the future if they are really needed. In fact all cases affected here are autonomous applets that already handle the retries themselves so the fact that `91e785edc` made ->conn_retries ineffective should not be a big deal anyway.	2025-04-29 21:21:19 +02:00
Willy Tarreau	40aceb7414	MINOR: resolvers: use the runtime IPv6 status instead of boot time one On systems where the network is not reachable at boot time (certain HA systems for example, or dynamically addressed test machines), we'll want to be able to periodically revalidate the IPv6 reachability status. The current code makes it complicated because it sets the config bits once for all at boot time. This commit changes this so that the config bits are not changed, but instead we rely on a static inline function that relies on sock_inet6_seems_reachable for every test (really cheap). This also removes the now unneeded resolvers late init code. This variable for now is still set at boot time but this will ease the transition later, as the resolvers code is now ready for this.	2025-04-25 09:32:05 +02:00
Willy Tarreau	69b051d1dc	MINOR: resolvers: add "dns-accept-family auto" to rely on detected IPv6 Instead of always having to force IPv4 or IPv6, let's now also offer "auto" which will only enable IPv6 if the system has a default gateway for it. This means that properly configured dual-stack systems will default to "ipv4,ipv6" while those lacking a gateway will only use "ipv4". Note that no real connectivity test is performed, so firewalled systems may still get it wrong and might prefer to rely on a manual "ipv4" assignment.	2025-04-24 17:52:28 +02:00
Willy Tarreau	2c46c2c042	MINOR: resolvers: add command-line argument -4 to force IPv4-only DNS In order to ease troubleshooting and testing, the new "-4" command line argument enforces queries and processing of "A" DNS records only, i.e. those representing IPv4 addresses. This can be useful when a host lacks end-to-end dual-stack connectivity. This overrides the global "dns-accept-family" directive and is equivalent to value "ipv4".	2025-04-24 17:52:28 +02:00
Willy Tarreau	940fa19ad8	MEDIUM: resolvers: add global "dns-accept-family" directive By default, DNS resolvers accept both IPv4 and IPv6 addresses. This can be influenced by the "resolve-prefer" keywords on server lines as well as the family argument to the "do-resolve" action, but that is only a preference, which does not block the other family from being used when it's alone. In some environments where dual-stack is not usable, stumbling on an unreachable IPv6-only DNS record can cause significant trouble as it will replace a previous IPv4 one which would possibly have continued to work till next request. The "dns-accept-family" global option permits to enforce usage of only one (or both) address families. The argument is a comma-delimited list of the following words: - "ipv4": query and accept IPv4 addresses ("A" records) - "ipv6": query and accept IPv6 addresses ("AAAA" records) When a single family is used, no request will be sent to resolvers for the other family, and any response for the other family will be ignored. The default value is "ipv4,ipv6", which effectively enables both families.	2025-04-24 17:52:28 +02:00
Aurelien DARRAGON	4194f756de	MEDIUM: tree-wide: avoid manually initializing proxies In this patch we try to use the proxy API init functions as much as possible to avoid code redundancy and prevent proxy initialization errors. As such, we prefer using alloc_new_proxy() and setup_new_proxy() instead of manually allocating the proxy pointer and performing the base init ourselves.	2025-04-10 22:10:31 +02:00
Christopher Faulet	892eb2bb2c	BUG/MEDIUM: promex/resolvers: Don't dump metrics if no nameserver is defined A 'resolvers' section may be defined without any nameserver. In that case, we must take care to not dump corresponding Prometheus metrics. However there is an issue that could lead to a crash or a strange infinite loop because we are looping on an empty list and, at some point, we are dereferencing an invalid pointer. There is an issue because the loop on the nameservers of a resolvers section is performed via callback functions and not the standard list_for_each_entry macro. So we must take care to properly detect end of the list and empty lists for nameservers. But the fix is not so simple because resolvers sections with and without nameservers may be mixed. To fix the issue, in rslv_promex_start_ts() and rslv_promex_next_ts(), when the next resolvers section must be evaluated, a loop is now used to properly skip empty sections. This patch is related to #2831. Not sure it fixes it. It must be backported as far as 3.0.	2025-01-06 09:08:38 +01:00
Ilia Shipitsin	193c94a539	BUG/MINOR: resolvers: handle a possible strdup() failure This defect was found by the coccinelle script "unchecked-strdup.cocci". It can be backported to all supported branches.	2024-12-10 08:05:50 +01:00
Willy Tarreau	9c6ccb8dbb	MEDIUM: config: warn on unitless timeouts < 100 ms From time to time we face a configuration with very small timeouts which look accidental because there could be expectations that they're expressed in seconds and not milliseconds. This commit adds a check for non-zero unitless values smaller than 100 and emits a warning suggesting to append an explicit unit if that was the intent. Only the common timeouts, the server check intervals and the resolvers hold and timeout values were covered for now. All the code needs to be manually reviewed to verify if it supports emitting warnings. This may break some configs using "zero-warning", but greps in existing configs indicate that these are extremely rare and solely intentionally done during tests. At least even if a user leaves that after a test, it will be more obvious when reading 10ms that something's probably not correct.	2024-11-19 10:33:20 +01:00
Christopher Faulet	8f28dbeea9	BUG/MEDIUM: resolvers: Insert a non-executed resulution in front of the wait list When a resolver is woken up to process DNS resolutions, it is possible to trigger an infinite loop on the resolver's wait list because delayed resolutions are always reinserted at the end of this list. This leads the watchdog to kill the process. By re-inserting them in front of the list, that fixes the bug. When a resolver tries to send the queries for the resolutions in its wait list, it may be unable to proceed for a resolution. This may happen because the resolution must be skipped (no hostname to resolv, a resolution already in-progress) or when an error occurred. In that case, the resolution is re-inserted in the resolver's wait list to be retry later, on a next wakeup. However, the resolution is inserted at the end of the wait list. So it is immediately reevaluated, in the same execution loop, instead of to be delayed. Most of time, it is not an issue because the resolution is considered as not expired on the second run. But it is an problem when the internal time wraps and is equal to 0. In that case, the resolution expiration date is badly computed and it is always considered as expired. If two or more resolutions are in that state, the resolver loops for ever on its wait list, until the process is killed by the watchdog. So we can argue that the way the resolution expiration date is computed must be fixed. And it would be true in a perfect world. However, the resolvers code is so crapy that it is hard to be sure to not introduce regressions. It is farly easier to re-insert delayed resolutions in front of the wait list. This fixes the issue and at worst, these resolutions will be evaluated one time too many on the next wakeup and only if now_ms was equal to 0 on the prior wakeup. This patch should be backported to all stable versions. On 2.2, LIST_ADD() must be used instead of LIST_INSERT()	2024-11-13 10:53:27 +01:00
William Lallemand	5184f3fb30	BUG/MINOR: resolvers/mworker: missing default resolvers in mworker mode Since commit `fe75c1e12d` ("MEDIUM: startup: remove MODE_MWORKER_WAIT") the MODE_MWORKER_WAIT constant disappeared. The initialization of the default resolvers section was conditionned by this constant. The section must be created in mworker mode, but only in the worker not in the master. It was currently completely disabled in both the master and the worker which could break configuration using it, as well as the httpclient. No backport needed.	2024-10-17 12:17:23 +02:00
Valentine Krasnobaeva	fe75c1e12d	MEDIUM: startup: remove MODE_MWORKER_WAIT MODE_MWORKER_WAIT became redundant with MODE_MWORKER, due to moving the master-worker fork into init(). This change allows the master to no longer perform a reexec just after forking in order to free additional memory. As we set the 'master' variable after the fork in the master process, we can now replace MODE_MWORKER_WAIT in some 'if' statements by a simple check of this 'master' variable. Let's also continue to get rid of the HAPROXY_MWORKER_WAIT_ONLY environment variable, as it's no longer needed either. In cfg_program_postparser(), which is used to check if cmdline is defined to launch a program, we completely remove the check of mode for now, because the master process does not parse the configuration for the moment. 'program' section parsing will be reintroduced in master later in the next commits.	2024-10-16 22:02:39 +02:00
Willy Tarreau	b325453c36	MINOR: proxy: use the global file names for conf->file Proxy file names are assigned a bit everywhere (resolvers, peers, cli, logs, proxy). All these elements were enumerated and now use copy_file_name(). The only ha_free() call was turned to drop_file_name(). As a bonus side effect, a 300k backend config saved 14 MB of RAM.	2024-09-19 15:38:19 +02:00
Aperence	a7b04e383a	MINOR: tools: extend str2sa_range to add an alt parameter Add a new parameter "alt" that will store whether this configuration uses an alternate protocol. This alt pointer will contain a value that can be transparently passed to protocol_lookup to obtain an appropriate protocol structure. This change is needed to allow, for example, the servers to know whether they need to use an alternate protocol or not.	2024-08-30 18:53:49 +02:00
Willy Tarreau	d592ebdbeb	MEDIUM: socket: always properly use the sock_domain for requested families Now we make sure to always look up the protocol's domain for an address family. Previously we would use it as-is, which prevented from properly using custom addresses (which is when they differ). This removes some hard-coded tests such as in log.c where UNIX vs UDP was explicitly checked for example. It requires a bit of care, however, so as to properly pass value 1 in the 3rd arg of the protocol_lookup() for DGRAM stuff. Maybe one day we'll change these for defines or enums to limit mistakes.	2024-08-21 17:36:58 +02:00
Aurelien DARRAGON	d3d35f0fc6	BUILD: tree-wide: cast arguments to tolower/toupper to unsigned char (2) Fix build warning on NetBSD by reapplying `f278eec37a` ("BUILD: tree-wide: cast arguments to tolower/toupper to unsigned char"). This should fix issue #2551.	2024-07-18 13:29:52 +02:00
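The cast mentioned in this commit matters because the `<ctype.h>` functions take an `int` whose value must be representable as `unsigned char` (or be `EOF`). Passing a plain `char` that happens to be negative is undefined behaviour, and some platforms warn about it at build time. A minimal sketch of the pattern, using a hypothetical helper name `str_to_lower()` that is not taken from the HAProxy tree:

```c
#include <assert.h>
#include <ctype.h>
#include <string.h>

/* Lower-case a NUL-terminated string in place.
 * The cast to unsigned char keeps bytes >= 0x80 from being passed to
 * tolower() as negative values on platforms where char is signed. */
void str_to_lower(char *s)
{
    assert(s != NULL);
    for (; *s; s++)
        *s = (char)tolower((unsigned char)*s);
}
```

The same cast applies to `toupper()` and to the `is*()` classification functions.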
Christopher Faulet	91fe085943	BUG/MINOR: promex: Skip resolvers metrics when there is no resolver section By default, there is always at least one resolver section, the default one, based on "/etc/resolv.conf" content. However, it is possible to have no resolver at all if the file is empty or if any error occurred. Errors are silently ignored at this stage. In that case, there was a bug in the Prometheus exporter leading to a crash because the resolver section list is empty. An invalid resolver entity was used. To fix the issue, we only need to take care not to dump resolvers metrics when there is no resolver. Thanks to Aurelien for having spotted the offending commit. This patch should fix issue #2604. It must be backported to 3.0.	2024-06-12 08:55:52 +02:00
Aurelien DARRAGON	c16eba8183	BUG/MEDIUM: server/dns: preserve server's port upon resolution timeout or error @boi4 reported in GH #2578 that since 3.0-dev1 for servers with address learned from A/AAAA records after a DNS flap server would be put out of maintenance with proper address but with invalid port (== 0), making it unusable and causing tcp checks to fail: [NOTICE] (1) : Loading success. [WARNING] (8) : Server mybackend/myserver1 is going DOWN for maintenance (DNS refused status). 0 active and 0 backup servers left. 0 sessions active, 0 requeued, 0 remaining in queue. [ALERT] (8) : backend 'mybackend' has no server available! [WARNING] (8) : mybackend/myserver1: IP changed from '(none)' to '127.0.0.1' by 'myresolver/ns1'. [WARNING] (8) : Server mybackend/myserver1 ('myhost') is UP/READY (resolves again). [WARNING] (8) : Server mybackend/myserver1 administratively READY thanks to valid DNS answer. [WARNING] (8) : Server mybackend/myserver1 is DOWN, reason: Layer4 connection problem, info: "Connection refused", check duration: 0ms. 0 active and 0 backup servers left. 0 sessions active, 0 requeued, 0 remaining in queue. @boi4 also mentioned that this used to work fine before. Willy suggested that this regression may have been introduced by `64c9c8e` ("BUG/MINOR: server/dns: use server_set_inetaddr() to unset srv addr from DNS") Turns out he was right! Indeed, in `64c9c8e` we systematically memset the whole server_inetaddr struct (which contains both the requested server's addr and port planned for atomic update) instead of only memsetting the addr part of the structure: except when SRV records are involved (SRV records provide both the address and the port unlike A or AAAA records), we must not reset the server's port upon DNS errors because the port may have been provided at config time and we don't want to lose its value. Big thanks to @boi4 for his well-documented issue that really helped us to pinpoint the bug right on time for the dev-13 release. 
No backport needed (unless `64c9c8e` gets backported).	2024-05-24 15:29:48 +02:00
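The difference between clearing the whole address structure and clearing only its address part can be shown with a small stand-alone sketch. Everything here is hypothetical: `struct inetaddr_sketch`, `unset_addr_from_dns()` and `port_after_dns_error()` are illustrative names and do not reflect the layout of HAProxy's `server_inetaddr` or the signature of `server_set_inetaddr()`.

```c
#include <assert.h>
#include <string.h>

/* Hypothetical stand-in for an "address + port" pair updated together. */
struct inetaddr_sketch {
    int family;              /* 0 means "no address" in this sketch */
    unsigned char addr[16];
    unsigned int port;
};

/* Forget the address after a DNS error or timeout.
 * The port is only reset when it was learned from an SRV record;
 * otherwise it may come from the configuration and must be kept. */
void unset_addr_from_dns(struct inetaddr_sketch *ia, int from_srv_record)
{
    assert(ia != NULL);
    ia->family = 0;
    memset(ia->addr, 0, sizeof(ia->addr));
    if (from_srv_record)
        ia->port = 0;
}

/* Helper for experimenting: returns the port left after a DNS error. */
unsigned int port_after_dns_error(unsigned int configured_port, int from_srv_record)
{
    struct inetaddr_sketch ia;

    memset(&ia, 0, sizeof(ia));
    ia.family = 2;                      /* arbitrary non-zero family */
    ia.addr[0] = 127;
    ia.addr[3] = 1;
    ia.port = configured_port;
    unset_addr_from_dns(&ia, from_srv_record);
    return ia.port;
}
```

A blanket `memset(&ia, 0, sizeof(ia))` in `unset_addr_from_dns()` would return port 0 in both cases, which corresponds to the symptom reported in the issue.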
Amaury Denoyelle	634cc2a5d8	MINOR: counters: move last_change into counters struct last_change was a member present in both proxy and server struct. It is used as an age statistics to report the last update of the object. Move last_change into fe_counters/be_counters. This is necessary to be able to manipulate it through generic stat column and report it into stats-file. Note that there is a change for proxy structure with now 2 different last_change values, on frontend and backend side. Special care was taken to ensure that the value is initialized only on the proxy side. The other value is set to 0 unless a listen proxy is instantiated. For the moment, only backend counter is reported in stats. However, with now two distinct values, stats could be extended to report it on both side.	2024-05-02 10:55:25 +02:00
Amaury Denoyelle	65624876f2	MINOR: stats: introduce a more expressive stat definition method Previously, statistics were simply defined as a list of name_desc, as for example "stat_cols_px" for proxy stats. No notion of type was fixed for each stat definition. This correspondence was done individually inside stats_fill_*_line() functions. This makes the process of defining new statistics tedious. Implement a more expressive stat definition method via a new API. A new type "struct stat_col" for stat column is defined to replace name_desc usage. It contains a field to store the stat nature and format. A <cap> field is also defined to be able to define a proxy stat only for certain types of objects. This new type is also further extended to include counter offsets. This allows to define a method to automatically generate a stat value field from a "struct stat_col". This will be the subject of a future commit. The new type "struct stat_col" is fully compatible with name_desc. This allows to gradually convert stats definitions. The focus will first be on proxy counters, to implement statistics preservation on reload.	2024-04-26 10:20:57 +02:00
Tim Duesterhus	cd5d62249f	CLEANUP: Reapply ist.cocci (3) This reapplies ist.cocci across the whole src/ tree.	2024-04-02 07:27:33 +02:00
Willy Tarreau	ad31e53287	REORG: dns/ring: split the ring between the generic one and the DNS one A ring is used for the DNS code but slightly differently from the generic one, which prevents some important changes from being made to the generic code without breaking DNS. As the use cases differ, it's better to just split them apart for now and have the DNS code use its own ring that we rename dns_ring and let the generic code continue to live on its own. The unused parts such as CLI registration were dropped, resizing and allocation from a mapped area were dropped. dns_ring_detach_appctx() was kept despite not being used, so as to stay consistent with the comments that say it must be called, despite the DNS code explicitly mentioning that it skips it for now (i.e. this may change in the future). Hopefully after the generic rings are converted the DNS code can migrate back to them, though this is really not necessary.	2024-03-25 17:34:19 +00:00
Aurelien DARRAGON	59f08f65fd	CLEANUP: tree-wide: use proper ERR_* return values for PRE_CHECK fcts httpclient_precheck(), ssl_ocsp_update_precheck(), and resolvers_create_default() functions are registered through REGISTER_PRE_CHECK() macro to be called by haproxy during init from the pre_check_list list. When calling functions registered in pre_check_list, haproxy expects ERR_* return values. However those 3 functions currently use raw return values, so we better use explicit ERR_* macros to prevent breakage in the future if ERR_* values mapping were to change.	2024-03-07 11:48:08 +01:00
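The motivation for returning named error flags instead of raw integers can be sketched as follows. The `SK_ERR_*` constants, the `precheck_fct` type and the functions below are hypothetical stand-ins with illustrative values; they are not HAProxy's `ERR_*` definitions nor its `REGISTER_PRE_CHECK()` machinery.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical error flags; the values are illustrative only. */
enum {
    SK_ERR_NONE  = 0x00,
    SK_ERR_FATAL = 0x02,
    SK_ERR_WARN  = 0x08,
    SK_ERR_ALERT = 0x10
};

typedef int (*precheck_fct)(void);

/* Each pre-check reports its outcome with named flags, so its meaning
 * does not change if the numeric mapping is ever modified. */
static int precheck_ok(void)   { return SK_ERR_NONE; }
static int precheck_warn(void) { return SK_ERR_WARN; }
static int precheck_fail(void) { return SK_ERR_ALERT | SK_ERR_FATAL; }

/* Run the registered pre-checks in order, accumulating flags and
 * stopping at the first fatal one. */
int run_pre_checks(const precheck_fct *fcts, size_t nb)
{
    int err = SK_ERR_NONE;
    size_t i;

    assert(fcts != NULL || nb == 0);
    for (i = 0; i < nb; i++) {
        err |= fcts[i]();
        if (err & SK_ERR_FATAL)
            break;
    }
    return err;
}

/* Helper for experimenting: optionally append a failing pre-check. */
int demo_pre_checks(int include_failure)
{
    precheck_fct fcts[3] = { precheck_ok, precheck_warn, precheck_fail };

    return run_pre_checks(fcts, include_failure ? 3 : 2);
}
```

A pre-check that returned a raw `1` on failure would only be treated as fatal here by coincidence of the numeric mapping, which is the fragility the commit removes.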


191 commits