haproxy

mirror of https://github.com/haproxy/haproxy.git synced 2026-04-15 21:59:41 -04:00

Author	SHA1	Message	Date
Aurelien DARRAGON	27236f2218	BUG/MINOR: dns: add tempo between 2 connection attempts for dns servers As reported by Lukas Tribus on the mailing list [1], trying to connect to a nameserver with invalid network settings causes haproxy to retry a new connection attempt immediately which eventually causes unexpected CPU usage on the thread responsible for the applet (namely 100% on one CPU will be observed). This can be reproduced with the test config below: resolvers default nameserver ns1 tcp4@8.8.8.8:53 source 192.168.99.99 listen listen mode http bind :8080 server s1 www.google.com resolvers default init-addr none To fix the issue, we add a delay of one second before a new connection attempt is retried. We do this in dns_session_create() when we know that the applet was created in the release callback (when previous query attempt was unsuccessful), which means initial connection is not affected. [1]: https://www.mail-archive.com/haproxy@formilux.org/msg45665.html This should fix GH #2909 and may be backported to all stable versions. This patch depends on ("MINOR: applet: add appctx_schedule() macro")	2025-04-29 21:20:11 +02:00
William Lallemand	c11ab983bf	BUG/MINOR: acme: remove references to virt@acme "virt@acme" was the default map used during development, now this must be configured in the acme section or it won't try to use any map. This patch removes the references to virt@acme in the comments and the code.	2025-04-29 16:35:35 +02:00
William Lallemand	5555926fdd	MEDIUM: acme: use a map to store tokens and thumbprints The stateless mode which was documented previously in the ACME example is not convenient for all use cases. First, when HAProxy generates the account key itself, you wouldn't be able to put the thumbprint in the configuration, so you will have to get the thumbprint and then reload. Second, in the case you are using multiple account keys, there are multiple thumbprints, and it's not easy to know which one you want to use when responding to the challenger. This patch allows configuring a map in the acme section, which will be filled by the acme task with the token corresponding to the challenge, as the key, and the thumbprint as the value. This way it's easy to reply with the right thumbprint. Example: http-request return status 200 content-type text/plain lf-string "%[path,field(-1,/)].%[path,field(-1,/),map(virt@acme)]\n" if { path_beg '/.well-known/acme-challenge/' }	2025-04-29 16:15:55 +02:00
Amaury Denoyelle	0f9b3daf98	MEDIUM: quic: limit global Tx memory Define a new setting, tune.quic.frontend.max-tot-window. It takes a size argument which can be used to set a limit on the sum of the congestion windows of all QUIC connections. This is applied both on quic_cc_path_set() and quic_cc_path_inc(). Note that this limitation cannot reduce a congestion window below the minimal limit, which is set to 2 datagrams.	2025-04-29 15:19:32 +02:00
Amaury Denoyelle	e841164a44	MINOR: quic: account for global congestion window Use the newly defined cshared type to account for the sum of the congestion windows of all QUIC connections. This value is stored in the global counter quic_mem_global defined in the proto_quic module.	2025-04-29 15:19:32 +02:00
Amaury Denoyelle	7bad88c35c	BUG/MINOR: quic: ensure cwnd limits are always enforced The congestion window is limited by minimum and maximum values which can never be exceeded. The min value is hardcoded to 2 datagrams as recommended by the specification. The max value is specified via haproxy configuration. These values must be respected each time the congestion window size is adjusted. However, on some rare occasions, limits were not always enforced. Fix this by implementing wrappers to set or increment the congestion window. These functions ensure limits are always applied after the operation. Additionally, the wrappers also ensure that if the window reaches a new maximum value, it is saved in the <cwnd_last_max> field. This should be backported up to 2.6, after a brief period of observation.	2025-04-29 15:10:06 +02:00
Amaury Denoyelle	c01d455288	MINOR: quic: refactor BBR API Write minor adjustments to QUIC BBR functions. The objective is to centralize every modification of path cwnd field. No functional change. This patch will be useful to simplify implementation of global QUIC Tx memory usage limitation.	2025-04-29 15:10:06 +02:00
Amaury Denoyelle	2eb1b0cd96	MINOR: quic: rename min/max fields for congestion window algo There was some possible confusion between fields related to congestion window size min and max limits which cannot be exceeded, and the maximum value previously reached by the window. Fix this by adopting a new naming scheme. Enforced limits are now renamed <limit_max>/<limit_min>, while the previously reached max value is renamed <cwnd_last_max>. This should be backported up to 3.1.	2025-04-29 15:10:06 +02:00
William Lallemand	62dfe1fc87	BUG/MINOR: acme: creating an account should not end the task The account creation was mistakenly ending the task instead of letting it be woken up for the NewOrder state. This prevented the creation of the certificate, although the account was correctly created. To fix this, only the jump to the end label needs to be removed; the standard exit path of the function allows the task to be woken up. No backport needed.	2025-04-29 14:18:05 +02:00
Willy Tarreau	2cdb3cb91e	MINOR: tcp: add support for setting TCP_NOTSENT_LOWAT on both sides TCP_NOTSENT_LOWAT is very convenient as it indicates when to report EAGAIN on the sending side. It takes a margin on top of the estimated window, meaning that it's no longer needed to store too much data in socket buffers. Instead there's just enough to fill the send window and a little bit of margin to cover the scheduling time to restart sending. Experiments on a 100ms network have shown a 10-fold reduction in the memory used by socket buffers by just setting this value to tune.bufsize, without noticing any performance degradation. Theoretically the responsiveness on multiplexed protocols such as H2 should also be improved.	2025-04-29 12:13:42 +02:00
Willy Tarreau	989f609b1a	BUG/MINOR: mux-h2: fix the offset of the pattern for the ping frame The ping frame's pattern must be written at offset 9 (frame header length), not 8. This was added in 3.2 with commit `4dcfe098a6` ("MINOR: mux-h2: prepare to support PING emission"), so no backport is needed.	2025-04-29 12:13:41 +02:00
William Lallemand	2f7f65e159	BUG/MINOR: acme: does not try to unlock after a failed trylock Return after a failed trylock in acme_update_certificate() instead of jumping to the error label which does an unlock.	2025-04-29 11:29:52 +02:00
William Lallemand	582614e1b2	CLEANUP: acme: remove old TODO for account key Remove old TODO comments about the account key.	2025-04-29 09:59:32 +02:00
Willy Tarreau	dc06495b71	MEDIUM: mcli: replicate the current mode when enterin the worker process While humans can find it convenient to enter the worker process in prompt mode, for external tools it will not be convenient to have to systematically disable it. A better approach is to replicate the master socket's mode there, since it has already been configured to suit the user: interactive, prompt and timed modes are automatically passed to the worker process. This makes using the worker commands more natural from the master process, without having to systematically adapt it for each new connection.	2025-04-28 20:21:06 +02:00
Willy Tarreau	c347cb73fa	MEDIUM: mcli: make the prompt mode configurable between i/p Support the same syntax in master mode as in worker mode in order to configure the prompt. The only thing is that for now the master doesn't have a non-interactive mode and it doesn't seem necessary to implement it, so we only support the interactive and prompt modes. However the code was written in a way that makes it easy to change this later if desired.	2025-04-28 20:21:06 +02:00
Willy Tarreau	e5c255c4e5	MEDIUM: cli: make the prompt mode configurable between n/i/p Now the prompt mode can more finely be configured between non-interactive (default), interactive without prompt, and interactive with prompt. This will ease the usage from automated tools which are not necessarily interested in having to consume '> ' after each command nor displaying "+" on payload lines. This can also be convenient when coming from the master CLI to keep the same output format.	2025-04-28 20:21:06 +02:00
Willy Tarreau	f25b4abc9b	MINOR: cli: split APPCTX_CLI_ST1_PROMPT into two distinct flags The CLI's "prompt" command toggles two distinct things: - displaying or hiding the prompt at the beginning of the line - single-command vs interactive mode These are two independent concepts and the prompt mode doesn't always cope well with tools that would like to upload data without having to read the prompt on return. Also, the master command line works in interactive mode by default with no prompt, which is not consistent (and not convenient for tools). So let's start by splitting the bit in two, and have a new APPCTX_CLI_ST1_INTER flag dedicated to the interactive mode. For now the "prompt" command alone continues to toggle the two at once.	2025-04-28 20:21:06 +02:00
William Lallemand	32b2b782e2	MEDIUM: acme: use 'crt-base' to load the account key Prefix the filename with the 'crt-base' before loading the account key, in order to work like every other keypair in haproxy.	2025-04-28 18:20:21 +02:00
William Lallemand	856b6042d3	MEDIUM: acme: generate the account file when not found Generate the private key in the account file when the file does not exist. This generates a private key of the type and parameters configured in the acme section.	2025-04-28 18:20:21 +02:00
William Lallemand	b2dd6dd72b	MINOR: acme: failure when no directory is specified The "directory" parameter of the acme section is mandatory. This patch exits with an alert when this parameter is not found.	2025-04-28 18:20:21 +02:00
William Lallemand	420de91d26	MINOR: acme: separate the code generating private keys acme_EVP_PKEY_gen() generates private keys of specified <keytype>, <curves> and <bits>. Only RSA and EC are supported for now.	2025-04-28 18:20:21 +02:00
William Lallemand	0897175d73	BUG/MINOR: ssl/acme: free EVP_PKEY upon error Free the EVP_PKEY upon error when the X509_REQ generation failed. No backport needed.	2025-04-28 18:20:21 +02:00
Willy Tarreau	d9a659ed96	MINOR: threads/cli: display the lock history on "show threads" This will display the lock labels and modes for each non-empty step at the end of "show threads" when these are defined. This allows emitting up to the last 8 locking operations for each thread on 64-bit machines.	2025-04-28 16:50:34 +02:00
Willy Tarreau	23371b3e7c	MINOR: threads: turn the full lock debugging to DEBUG_THREAD=2 At level 1 it now does nothing. This is reserved for some subsequent patches which will implement lighter debugging.	2025-04-28 16:50:34 +02:00
Willy Tarreau	903a6b14ef	MINOR: threads: prepare DEBUG_THREAD to receive more values We now default the value to zero and make sure all tests properly take care of values above zero. This is in preparation for supporting several degrees of debugging.	2025-04-28 16:50:34 +02:00
Willy Tarreau	aa49965d4e	BUILD: leastconn: fix build warning when building without threads on old machines Machines lacking CAS8B/DWCAS emit a warning in lb_fwlc.c without threads due to declaration ordering. Let's just move the variable declaration into the block that uses it as a last variable. No backport is needed.	2025-04-28 16:50:34 +02:00
Willy Tarreau	589d916efa	BUILD: acme: use my_strndup() instead of strndup() Not all systems have strndup(), that's why we have our "my_strndup()", so let's make use of it here. This fixes the build on Solaris 10. No backport is needed.	2025-04-28 16:37:54 +02:00
Aurelien DARRAGON	dc95a3ed61	MINOR: promex: expose ST_I_PX_RATE (current_session_rate) It has been requested to have the current_session_rate exposed at the frontend level. For now only the per-process value was exposed (ST_I_INF_SESS_RATE). Thanks to the work done lately to merge promex and stat_cols_px[] array, let's simply define an .alt_name for the ST_I_PX_RATE metric in order to have promex exposing it as current_session_rate for relevant contexts.	2025-04-28 12:23:20 +02:00
William Lallemand	83975f34e4	MINOR: ssl/cli: add a '-t' option to 'show ssl sni' Add a -t option to 'show ssl sni', which adds an offset to the current date in order to check which certificates will have expired after a certain period of time.	2025-04-28 11:35:11 +02:00
Willy Tarreau	f1064c7382	BUG/MAJOR: listeners: transfer connection accounting when switching listeners Since we made it possible for a bind_conf to listen to multiple thread groups with shards in 2.8 with commit `9d360604bd` ("MEDIUM: listener: rework thread assignment to consider all groups"), the per-listener connection count was not properly transferred to the target listener with the connection when switching to another thread group. This results in one listener possibly reaching high values and another one possibly reaching negative values. Usually it's not visible, unless a maxconn is set on the bind_conf, in which case comparisons will quickly put an end to the willingness to accept new connections. This problem only happens when thread groups are enabled, and it seems very hard to trigger it normally, it only impacts sockets having a single shard, hence currently the CLI (or any conf with "bind ... shards 1"), where it can be reproduced with a config having a very low "maxconn" on the stats socket directive (here, 4), and issuing a few tens of socat <<< "show activity" in parallel, or sending HTTP connections to a single-shard listener. Very quickly, haproxy stops accepting connections and eats CPU in the poller which tries to get its connections accepted. A BUG_ON(l->nbconn<0) after HA_ATOMIC_DEC() in listener_release() also helps spotting them better. Many thanks to Christian Ruppert who once again provided a very accurate report in GH #2951 with the required data permitting this analysis. This fix must be backported to 2.8.	2025-04-25 18:47:11 +02:00
Olivier Houchard	9240cd4a27	BUG/MAJOR: tasklets: Make sure he tasklet can't run twice tasklets were originally designed to always run on only one thread, so it was not possible to have it run on 2 threads concurrently. The API has been extended so that another thread may wake the tasklet, the idea was still that we wanted to have it run on one thread only. However, the way it's been done meant that unless a tasklet was bound to a specific tid with tasklet_set_tid(), or we explicitly used tasklet_wakeup_on() to specify the thread for the target to run on, it would be scheduled to run on the current thread. This is in fact a desirable feature. There is however a race condition in which the tasklet would be scheduled on a thread, while it is running on another. This could lead to the same tasklet running on multiple threads, which we do not want. To fix this, just do what we already do for regular tasks, set the "TASK_RUNNING" flag, and when it's time to execute the tasklet, wait until that flag is gone. Only one case has been found in the current code, where the tasklet could run on different threads depending on who wakes it up, in the leastconn load balancer, since commit `627280e15f`. It should not be a problem in practice, as the function called can be called concurrently. If a bug is eventually found in relation to this problem, and this patch should be backported, the following patches should be backported too : MEDIUM: quic: Make sure we return the tasklet from quic_accept_run MEDIUM: quic: Make sure we return NULL in quic_conn_app_io_cb if needed MEDIUM: quic: Make sure we return the tasklet from qcc_io_cb MEDIUM: mux_fcgi: Make sure we return the tasklet from fcgi_deferred_shut MEDIUM: listener: Make sure w ereturn the tasklet from accept_queue_process MEDIUM: checks: Make sure we return the tasklet from srv_chk_io_cb	2025-04-25 16:14:26 +02:00
Olivier Houchard	09f5501bb9	MEDIUM: quic: Make sure we return the tasklet from quic_accept_run In quic_accept_run, return the tasklet to tell the scheduler the tasklet is still alive, it is not yet needed, but will be soon.	2025-04-25 16:14:26 +02:00
Olivier Houchard	5838786fa0	MEDIUM: quic: Make sure we return NULL in quic_conn_app_io_cb if needed In quic_conn_app_io_cb, make sure we return NULL if the tasklet has been destroyed, so that the scheduler knows. It is not yet needed, but will be soon.	2025-04-25 16:14:26 +02:00
Olivier Houchard	15c5846db8	MEDIUM: quic: Make sure we return the tasklet from qcc_io_cb In qcc_io_cb, return the tasklet to tell the scheduler the tasklet is still alive, it is not yet needed, but will be soon.	2025-04-25 16:14:26 +02:00
Olivier Houchard	8f70f9c04b	MEDIUM: mux_fcgi: Make sure we return the tasklet from fcgi_deferred_shut In fcgi_deferred_shut, return the tasklet to tell the scheduler the tasklet is still alive, it is not yet needed, but will be soon.	2025-04-25 16:14:26 +02:00
Olivier Houchard	7d190e7df6	MEDIUM: listener: Make sure w ereturn the tasklet from accept_queue_process In accept_queue_process, return the tasklet to tell the scheduler the tasklet is still alive, it is not yet needed, but will be soon.	2025-04-25 16:14:26 +02:00
Olivier Houchard	81dc3e67cf	MEDIUM: checks: Make sure we return the tasklet from srv_chk_io_cb In srv_chk_io_cb, return the tasklet to tell the scheduler the tasklet is still alive, it is not yet needed, but will be soon.	2025-04-25 16:14:26 +02:00
Willy Tarreau	40aceb7414	MINOR: resolvers: use the runtime IPv6 status instead of boot time one On systems where the network is not reachable at boot time (certain HA systems for example, or dynamically addressed test machines), we'll want to be able to periodically revalidate the IPv6 reachability status. The current code makes it complicated because it sets the config bits once for all at boot time. This commit changes this so that the config bits are not changed, but instead we rely on a static inline function that relies on sock_inet6_seems_reachable for every test (really cheap). This also removes the now unneeded resolvers late init code. This variable for now is still set at boot time but this will ease the transition later, as the resolvers code is now ready for this.	2025-04-25 09:32:05 +02:00
Willy Tarreau	7a79f54c98	BUG/MINOR: master/cli: only parse the '@@' prefix on complete lines The new adhoc parser for the '@@' prefix forgot to require the presence of the LF character marking the end of the line. This is the reason why entering incomplete commands would display garbage, because the line was expected to have its LF character replaced with a zero. The problem is well illustrated by using socat in raw mode: socat /tmp/master.sock STDIO,raw,echo=0 then entering "@@1 show info" one character at a time would error just after the second "@". The command must take care to report an incomplete line and wait for more data in such a case.	2025-04-25 09:05:00 +02:00
Willy Tarreau	931d932b3e	Revert "BUG/MINOR: master/cli: properly trim the '@@' process name in error messages" This reverts commit `0e94339eaf`. This patch was in fact fixing the symptom, not the cause. The root cause of the problem is that the parser was processing an incomplete line when looking for '@@'. When the LF is present, this problem does not exist as it's properly replaced with a zero. This can be verified using socat in raw mode: socat /tmp/master.sock STDIO,raw,echo=0 Then entering "@@1 show info" one character at a time will immediately fail on "@@" without going further. A subsequent patch will fix this. No backport is needed.	2025-04-25 09:05:00 +02:00
Christopher Faulet	101cc4f334	BUG/MEDIUM: cli: Handle applet shutdown when waiting for a command line When the CLI applet was refactored in commit `20ec1de21` ("MAJOR: cli: Refacor parsing and execution of pipelined commands"), a regression was introduced. The applet shutdown was no longer handled when the applet was waiting for the next command line. It is especially visible when a client timeout occurs because the client connection is no longer closed. To fix the issue, the test on the SE_FL_SHW flag was reintroduced in the CLI_ST_PARSE_CMDLINE state, but only if there is no pending input data. It is a 3.2-specific issue. No backport needed.	2025-04-25 08:47:05 +02:00
William Lallemand	27b732a661	MEDIUM: acme: better error/retry management of the challenge checks When the ACME task is checking for the status of the challenge, it would only succeed or retry upon failure. However, that's not the best way to do it: ACME objects contain a "status" field which can hold either a final status or an in-progress status, so we need to be able to retry. This patch adds an acme_ret enum which contains OK, RETRY and FAIL. In the case of CHKCHALLENGE, the ACME server could return a "pending" or a "processing" status, which basically needs to be rechecked later with RETRY. However, an "invalid" or "valid" status is final and will return either a FAIL or an OK. So instead of retrying in any case, the "invalid" status will end the task with an error.	2025-04-24 20:14:47 +02:00
William Lallemand	0909832e74	MEDIUM: acme: reset the remaining retries When a request succeeds, reset the remaining retries to the default ACME_RETRY value (3 by default).	2025-04-24 20:14:47 +02:00
William Lallemand	bb768b3e26	MEDIUM: acme: use Retry-After value for retries Parse the Retry-After header in the response and store it in order to use the value as the delay for the next retry; fall back to 3s if the value couldn't be parsed or does not exist.	2025-04-24 20:14:47 +02:00
Willy Tarreau	69b051d1dc	MINOR: resolvers: add "dns-accept-family auto" to rely on detected IPv6 Instead of always having to force IPv4 or IPv6, let's now also offer "auto" which will only enable IPv6 if the system has a default gateway for it. This means that properly configured dual-stack systems will default to "ipv4,ipv6" while those lacking a gateway will only use "ipv4". Note that no real connectivity test is performed, so firewalled systems may still get it wrong and might prefer to rely on a manual "ipv4" assignment.	2025-04-24 17:52:28 +02:00
Willy Tarreau	5d41d476f3	MINOR: sock-inet: detect apparent IPv6 connectivity In order to ease dual-stack deployments, we could at least try to check if ipv6 seems to be reachable. For this we're adding a test based on a UDP connect (no traffic) on port 53 to the base of public addresses (2001::) and see if the connect() is permitted, indicating that the routing table knows how to reach it, or fails. Based on this result we're setting a global variable that other subsystems might use to preset their defaults.	2025-04-24 17:52:28 +02:00
Willy Tarreau	2c46c2c042	MINOR: resolvers: add command-line argument -4 to force IPv4-only DNS In order to ease troubleshooting and testing, the new "-4" command line argument enforces queries and processing of "A" DNS records only, i.e. those representing IPv4 addresses. This can be useful when a host lacks end-to-end dual-stack connectivity. This overrides the global "dns-accept-family" directive and is equivalent to value "ipv4".	2025-04-24 17:52:28 +02:00
Willy Tarreau	940fa19ad8	MEDIUM: resolvers: add global "dns-accept-family" directive By default, DNS resolvers accept both IPv4 and IPv6 addresses. This can be influenced by the "resolve-prefer" keywords on server lines as well as the family argument to the "do-resolve" action, but that is only a preference, which does not block the other family from being used when it's alone. In some environments where dual-stack is not usable, stumbling on an unreachable IPv6-only DNS record can cause significant trouble as it will replace a previous IPv4 one which would possibly have continued to work till next request. The "dns-accept-family" global option permits to enforce usage of only one (or both) address families. The argument is a comma-delimited list of the following words: - "ipv4": query and accept IPv4 addresses ("A" records) - "ipv6": query and accept IPv6 addresses ("AAAA" records) When a single family is used, no request will be sent to resolvers for the other family, and any response for the other family will be ignored. The default value is "ipv4,ipv6", which effectively enables both families.	2025-04-24 17:52:28 +02:00
Willy Tarreau	0e94339eaf	BUG/MINOR: master/cli: properly trim the '@@' process name in error messages When '@@' alone is sent on the master CLI (no trailing LF), we get an error that displays anything past these two characters in the buffer since there's no room for a \0. Let's make sure to limit the length of the process name in this case. No backport is needed since this was added with `00c967fac4` ("MINOR: master/cli: support bidirectional communications with workers").	2025-04-24 17:52:28 +02:00
Christopher Faulet	568ed6484a	MINOR: applet: Save the "use-service" rule in the stream to init a service applet When a service is initialized, the "use-service" rule that was executed is now saved in the stream, using "current_rule" field, instead of saving it into the applet context. It is safe to do so because this field is unused at this stage. To avoid any issue, it is reset after the service initialization. Doing so, it is no longer necessary to save it in the applet context. It was the last usage of the rule pointer in the applet context. The init functions for TCP and HTTP lua services were updated accordingly.	2025-04-24 16:22:24 +02:00
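
Commit 7bad88c35c above describes wrappers that set or increment the QUIC congestion window and always re-apply the min/max limits afterwards, recording any new maximum in <cwnd_last_max>. The following is only a minimal sketch of that idea, not the code from the patch: the struct `cc_path_sketch` and the functions `sketch_cwnd_set()` and `sketch_cwnd_inc()` are hypothetical names invented for illustration, and only the field names limit_min, limit_max and cwnd_last_max are taken from the commit messages. The saturating addition is also an assumption of this sketch.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical, simplified stand-in for per-path congestion state.
 * Not haproxy's real structure; only the field names follow the
 * naming scheme mentioned in the commit messages. */
struct cc_path_sketch {
    uint64_t cwnd;          /* current congestion window, in bytes */
    uint64_t limit_min;     /* enforced lower bound */
    uint64_t limit_max;     /* enforced upper bound */
    uint64_t cwnd_last_max; /* highest value reached by the window so far */
};

/* Clamp cwnd into [limit_min, limit_max], then record a new maximum. */
static inline void sketch_cwnd_apply_limits(struct cc_path_sketch *p)
{
    assert(p->limit_min <= p->limit_max);
    if (p->cwnd < p->limit_min)
        p->cwnd = p->limit_min;
    if (p->cwnd > p->limit_max)
        p->cwnd = p->limit_max;
    if (p->cwnd > p->cwnd_last_max)
        p->cwnd_last_max = p->cwnd;
}

/* Set the window to an absolute value; limits are applied afterwards. */
static inline void sketch_cwnd_set(struct cc_path_sketch *p, uint64_t val)
{
    p->cwnd = val;
    sketch_cwnd_apply_limits(p);
}

/* Grow the window by <val>; limits are applied afterwards. */
static inline void sketch_cwnd_inc(struct cc_path_sketch *p, uint64_t val)
{
    /* Assumption of this sketch: saturate instead of wrapping around. */
    if (val > UINT64_MAX - p->cwnd)
        p->cwnd = UINT64_MAX;
    else
        p->cwnd += val;
    sketch_cwnd_apply_limits(p);
}
```

Routing every modification through two such helpers is what makes the invariant easy to audit: no caller touches the window field directly, so the clamp cannot be forgotten.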


19234 commits