haproxy

mirror of https://github.com/haproxy/haproxy.git synced 2026-04-15 21:59:41 -04:00

Author	SHA1	Message	Date
William Lallemand	1274c21a42	BUG/MINOR: ssl: error with ssl-f-use when no "crt" Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details ssl-f-use lines tries to load a crt file, but the "crt" keyword is not mandatory. That could lead to crtlist_load_crt() being called with a NULL path, and trying to do a stat. In this particular case we don't need to try anything and it's better to leave with an actual error. Must be backported as far as 3.2.	2026-02-16 18:41:40 +01:00
William Lallemand	0016d45a9c	BUG/MINOR: ssl: clarify ssl-f-use errors in post-section parsing crtlist_load_crt() in post_section_frontend_crt_init() won't give details about the line being parsed, this should be done by the caller. Modify post_section_frontend_crt_init() to ouput the right error format. Must be backported to 3.2.	2026-02-16 18:41:08 +01:00
William Lallemand	e0d1cdff6a	BUG/MINOR: ssl: fix leak in ssl-f-use parser upon error cfg_crt_node->filename is leaked on the error path in the ssl-f-use configuration parser. Could be backported as far as 3.2	2026-02-16 16:04:35 +01:00
William Lallemand	86df0e206e	BUG/MINOR: ssl: double-free on error path w/ ssl-f-use parser In post_section_frontend_crt_init(), the crt_entry is populated by the ssl_conf fromt the cfg_crt_node. On error path, the crt_list is completely freed, including the ssl_conf structure. But the ssl_conf structure was already freed when freeing the cfg_crt_node. Fix the issue by doing a crtlist_dup_ssl_conf(n->ssl_conf) in the crtlist_entry instead of an assignation. Fix issue #3268. Need to be backported as far as 3.2. The previous patch which adds the crtlist_dup_ssl_conf() declaration is needed.	2026-02-16 16:04:35 +01:00
Aurelien DARRAGON	d71e2e73ea	MEDIUM: filters: use per-channel filter list when relevant Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details In the historical implementation, all filter related information where stored at the stream level (using struct strm_flt * context), and filters iteration was performed at the stream level also. We identified that this was not ideal and would make the implementation of future filters more complex since filters ordering should be handled in a different order during request and response handling for decompression for instance. To make such thing possible, in this commit we migrate some channel specific filter contexts in the channel directly (request or response), and we implement 2 additional filter lists, one on the request channel and another on the response channel. The historical stream filter list is kept as-is because in some contexts only the stream is available and we have to iterate on all filters. But for functions where we only are interested in request side or response side filters, we now use dedicated channel filters list instead. The only overhead is that the "struct filter" was expanded by two "struct list". For now, no change of behavior is expected.	2026-02-13 12:24:13 +01:00
Aurelien DARRAGON	bb6cfbe754	MINOR: filters: rework filter iteration for channel related callback functions Multiple channel related functions have the same construction: they use list_for_each_entry() to work on a given filter from the stream+channel combination. In future commits we will try to use filter list from dedicated channel list instead of the stream one, thus in this patch we need as a prerequisite to implement and use the flt_list_{start,next} API to iterate over filter list, giving the API the responsibility to iterate over the correct list depending on the context, while the calling function remains free to use the iteration construction it needs. This way we will be able to easily change the way we iterate over filter list without duplicating the code for requests and responses.	2026-02-13 12:24:07 +01:00
Aurelien DARRAGON	e88b219331	MINOR: filters: rework RESUME_FILTER_* macros as inline functions There is no need to have those helpers defined as macro, and since it is not mandatory, code maintenance is much easier using functions, thus let's switch to function definitions. Also, we change the way we iterate over the list so that the calling function now has a pseudo API to get and iterate over filter pointers while keeping control on how they implement the iterating logic. One benefit of this is that we will also be able to switch between lists depending on the channel type, which is a prerequisite for upcoming rework that split the filter list over request and response channels (commit will follow) No change of behavior is expected.	2026-02-13 12:24:00 +01:00
William Lallemand	d13164e105	MINOR: startup: show the list of detected features at runtime with haproxy -vv Features prefixed by "HAVE_WORKING_" in the haproxy -vv feature list, are features that are detected during runtime. This patch splits these features on another line in haproxy -vv. This line is named "Detected feature list".	2026-02-12 18:02:19 +01:00
William Lallemand	b90b312a50	MINOR: startup: sort the feature list in haproxy -vv The feature list in haproxy -vv is partly generated from the Makefile using the USE_* keywords, but it's also possible to add keywords in the feature list using hap_register_feature(), which adds the keyword at the end of list. When doing so, the list is not correctly sorted anymore. This patch fixes the problem by splitting the string using an array of ist and applying a qsort() on it.	2026-02-12 18:02:19 +01:00
William Lallemand	1592ed9854	MINOR: startup: Add HAVE_WORKING_TCP_MD5SIG in haproxy -vv the TCP_MD5SIG ifdef is not enough to check if the feature is usable. The code might compile but the OS could prevent to use it. This patch tries to use the TCP_MD5SIG setsockopt before adding HAVE_WORKING_TCP_MD5SIG in the feature list. so it would prevent to start reg-tests if the OS can't run it.	2026-02-12 18:02:19 +01:00
Remi Tricot-Le Breton	aad212954f	MINOR: jwt: Add new jwt_decrypt_jwk converter This converter takes a private key in the JWK format (RFC7517) that can be provided as a string of via a variable. The only keys managed for now are of type 'RSA' or 'oct'.	2026-02-12 16:31:27 +01:00
Remi Tricot-Le Breton	b26f0cc45a	MINOR: jwt: Convert an RSA JWK into an EVP_PKEY Add helper functions that take a JWK (JSON representation of an RSA private key) into an EVP_PKEY (containing the private key). Those functions are not used yet, they will be used in the upcoming 'jwt_decrypt_jwk' converter.	2026-02-12 16:31:12 +01:00
Remi Tricot-Le Breton	b3a44158fb	MINOR: ssl: Missing '\n' in error message Fix missing '\n' in error message raised when trying to load a password protected private key.	2026-02-12 16:29:01 +01:00
Amaury Denoyelle	8e16fd2cf1	BUG/MAJOR: quic: fix parsing frame type QUIC frame type is encoded as a varint. Initially, haproxy parsed it as a single byte, which was enough to cover frames defined in RFC9000. The code has been extended recently to support multi-bytes encoded value, in anticipation of QUIC frames extension support. However, there was no check on the varint format. This is interpreted erroneously as a PADDING frame as this serves as the initial value. Thus the rest of the packet is incorrectly handled, with various resulting effects, including infinite loops and/or crashes. This patch fixes this by checking the return value of quic_dec_int(). If varint cannot be parsed, the connection is immediately closed. This issue is assigned to CVE-2026-26080 report. This must be backported up to 3.2. Reported-by: Asim Viladi Oglu Manizada <manizada@pm.me>	2026-02-12 09:09:44 +01:00
Amaury Denoyelle	4aa974f949	BUG/MAJOR: quic: reject invalid token Token parsing code on INITIAL packet for the NEW_TOKEN format is not robust enough and may even crash on some rare malformed packets. This patch fixes this by adding a check on the expected length of the received token. The packet is now rejected if the token does not match QUIC_TOKEN_LEN. This check is legitimate as haproxy should only parse tokens emitted by itself. This issue has been introduced with the implementation of NEW_TOKEN tokens parsing required for 0-RTT support. This issue is assigned to CVE-2026-26081 report. This must be backported up to 3.0. Reported-by: Asim Viladi Oglu Manizada <manizada@pm.me>	2026-02-12 09:09:44 +01:00
Amaury Denoyelle	d80f0143c9	BUG/MINOR: quic: ensure handshake speed up is only run once per conn Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details When a duplicated CRYPTO frame is received during handshake, a server may consider that there was a packet loss and immediately retransmit its pending CRYPTO data without having to wait for PTO expiration. However, RFC 9002 indicates that this should only be performed at most once per connection to avoid excessive packet transmission. QUIC connection is flagged with QUIC_FL_CONN_HANDSHAKE_SPEED_UP to mark that a fast retransmit has been performed. However, during the refactoring on CRYPTO handling with the storage conversion from ncbuf to ncbmbuf, the check on the flag was accidentely removed. The faulty patch is the following one : commit `f50425c021` MINOR: quic: remove received CRYPTO temporary tree storage This patch adds again the check on QUIC_FL_CONN_HANDSHAKE_SPEED_UP before initiating fast retransmit. This ensures this is only performed once per connection. This must be backported up to 3.3.	2026-02-12 09:09:44 +01:00
Olivier Houchard	b65df062be	MINOR: servers: Call process_srv_queue() without lock when possible Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details In server_warmup(), call process_srv_queue() only once we released the server lock, as we don't need it.	2026-02-12 02:19:38 +01:00
Olivier Houchard	a8f50cff7e	MINOR: queues: Check minconn first in srv_dynamic_maxconn() In srv_dynamic_maxconn(), we'll decide that the max number of connection is the server's maxconn if 1) the proxy's number of connection is over fullconn, or if minconn was not set. Check if minconn is not set first, as it will be true most of the time, and as the proxy's "beconn" variable is in a busy cache line, it can be costly to access it, while minconn/maxconn is in a cache line that should very rarely change.	2026-02-12 02:18:59 +01:00
William Lallemand	ea92b0ef01	BUG/MINOR: ssl: SSL_CERT_DIR environment variable doesn't affect haproxy Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details The documentation of @system-ca specifies that one can overwrite the value provided by the SSL Library using SSL_CERT_DIR. However it seems like X509_get_default_cert_dir() is not affected by this environment variable, and X509_get_default_cert_dir_env() need to be used in order to get the variable name, and get the value manually. This could be backported in every stable branches. Note that older branches don't have the memprintf in ssl_sock.c.	2026-02-10 21:34:45 +01:00
William Lallemand	2ac0d12790	MINOR: startup: Add the SSL lib verify directory in haproxy -vv SSL libraries built manually might lack the right X509_get_default_cert_dir() value. The common way to fix the problem is to build openssl with ./configure --openssldir=/etc/ssl/ In order to verify this setting, output it with haproxy -vv.	2026-02-10 21:06:38 +01:00
Willy Tarreau	c724693b95	MINOR: activity: allow to switch per-task lock/memory profiling at runtime Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Given that we already have "set profiling task", it's easy to permit to enable/disable the lock and/or memory profiling at run time. However, the change will only be applied next time the task profiling will be switched from off/auto to on. The patch is very minor and is best viewed with git show -b because it indents a whole block that moves in a "if" clause. This can be backported to 3.3 along with the two previous patches.	2026-02-10 17:53:01 +01:00
Willy Tarreau	e2631ee5f7	MEDIUM: activity: apply and use new finegrained task profiling settings In continuity of previous patch, this one makes use of the new profiling flags. For this, based on the global "profiling" setting, when switching profiling on, we set or clear two flags on the thread context, TH_FL_TASK_PROFILING_L and TH_FL_TASK_PROFILING_M to indicate whether lock profiling and/or malloc profiling are desired when profiling is enabled. These flags are checked along with TH_FL_TASK_PROFILING to decide when to collect time around a lock or a malloc. And by default we're back to the behavior of 3.2 in that neither lock nor malloc times are collected anymore. This is sufficient to see the CPU usage spent in the VDSO to significantly drop from 22% to 2.2% on a highly loaded system. This should be backported to 3.3 along with the previous patch.	2026-02-10 17:52:59 +01:00
Willy Tarreau	a7b2353cb3	MINOR: activity: support setting/clearing lock/memory watching for task profiling Damien Claisse reported in issue #3257 a performance regression between 3.2 and 3.3 when task profiling is enabled, more precisely in relation with the following patches were merged: `98cc815e3e` ("MINOR: activity: collect time spent with a lock held for each task") `503084643f` ("MINOR: activity: collect time spent waiting on a lock for each task") `9d8c2a888b` ("MINOR: activity: collect CPU time spent on memory allocations for each task") The issue mostly comes from the first patches. What happens is that the local time is taken when entering and leaving each lock, which costs a lot on a contended system. The problem here is the lack of finegrained settings for lock and malloc profiling. This patch introduces a better approach. The task profiler goes back to its default behavior in on/auto modes, but the configuration now accepts new extra options "lock", "no-lock", "memory", "no-memory" to precisely indicate other timers to watch for each task when profiling turns on. This is achieved by setting two new flags HA_PROF_TASKS_LOCK and HA_PROF_TASKS_MEM in the global "profiling" variable. This patch only parses the new values and assigns them to the global variable from the config file for now. The doc was updated.	2026-02-10 17:47:02 +01:00
Willy Tarreau	3b45beb465	CLEANUP: haproxy: fix bad line wrapping in run_poll_loop() Commit `3674afe8a0` ("BUG/MEDIUM: threads: Atomically set TH_FL_SLEEPING and clr FL_NOTIFIED") accidentally left a strange-looking line wrapping making one think of an editing mistake, let's fix it and keep it on a single line given that even indented wrapping is almost as large. This can be backported with the fix above till 2.8 to keep the patch context consistent between versions.	2026-02-10 14:11:42 +01:00
Willy Tarreau	64c5d45a26	BUG/MEDIUM: lb-chash: always properly initialize lb_nodes with dynamic servers Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details An issue was introduced in 3.0 with commit `faa8c3e024` ("MEDIUM: lb-chash: Deterministic node hashes based on server address"): the new server_key field and lb_nodes entries initialization were not updated for servers added at run time with "add server": server_key remains zero and the key used in lb_node remains the one depending only on the server's ID. This will cause trouble when adding new servers with consistent hashing, because the hash-key will be ignored until the server's weight changes and the key difference is detected, leading to its recalculation. This is essentially caused by the poorly placed lb_nodes initialization that is specific to lb-chash and had to be replicated in the code dealing with server addition. This commit solves the problem by adding a new ->server_init() function in the lbprm proxy struct, that is called by the server addition code. This also allows to abandon the complex check for LB algos that was placed there for that purpose. For now only lb-chash provides such a function, and calls it as well during initial setup. This way newly added servers always use the correct key now. While it should also theoretically have had an impact on servers added with the "random" algorithm, it's unlikely that the difference between proper server keys and those based on their ID could have had any visible effect. This patch should be backported as far as 3.0. The backport may be eased by a preliminary backport of previous commit "CLEANUP: lb-chash: free lb_nodes from chash's deinit(), not global", though this is not strictly necessary if context is manually adjusted.	2026-02-10 07:22:54 +01:00
Willy Tarreau	62239539bf	CLEANUP: lb-chash: free lb_nodes from chash's deinit(), not global There's an ambuity on the ownership of lb_nodes in chash, it's allocated by chash but freed by the server code in srv_free_params() from srv_drop() upon deinit. Let's move this free() call to a chash-specific function which will own the responsibility for doing this instead. Note that the .server_deinit() callback is properly called both on proxy being taken down and on server deletion.	2026-02-10 07:20:50 +01:00
Amaury Denoyelle	91a5b67b25	BUG/MINOR: proxy: fix default ALPN bind settings Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details For "add backend" implementation, postparsing code in check_config_validity() from cfgparse.c has been extracted in a new dedicated function named proxy_finalize() into proxy.c. This has caused unexpected compilation issue as in the latter file TLSEXT_TYPE_application_layer_protocol_negotiation macro may be undefined, in particular when building without QUIC support. Thus, code related to default ALPN on binds is discarded after the preprocessing stage. Fix this by including openssl-compat header file into proxy source file. This should be sufficient to ensure SSL related defines are properly included. This should fix recent issues on SSL regtests. No need to backport.	2026-02-09 13:52:25 +01:00
Willy Tarreau	ecffaa6d5a	MINOR: net_helper: extend the ip.fp output with an option presence mask Emeric suggested that it's sometimes convenient to instantly know if a client has advertised support for window scaling or timestamps for example. While the info is present in the TCP options output, it's hard to extract since it respects the options order. So here we're extending the 56-bit fingerprint with 8 extra bits that indicate the presence of options 2..8, and any option above 9 for the last bit. In practice this is sufficient since higher options are not commonly used. Also TCP option 5 is normally not sent on the SYN (SACK, only SACK_perm is sent), and echo options 6 & 7 are no longer used (replaced with timestamps). These fields might be repurposed in the future if some more meaningful options are to be mapped (e.g. MPTCP, TFO cookie, auth).	2026-02-09 09:18:04 +01:00
Amaury Denoyelle	a1db464c3e	BUG/MINOR: proxy: fix null dereference in "add backend" handler When a backend is created at runtime, the new proxy instance is inserted at the end of proxies_list. This operation is buggy if this list is empty : the code causes a null dereference which will lead to a crash. This causes the following compilation error : CC src/proxy.o src/proxy.c: In function 'cli_parse_add_backend': src/proxy.c:4933:36: warning: null pointer dereference [-Wnull-dereference] 4933 \| proxies_list->next = px; \| ~~~~~~~~~~~~~~~~~~~^~~~ This patch fixes this issue. Note that in reality it cannot occur at this moment as proxies_list cannot be empty (haproxy requires at least one frontend to start, and the list also always contains internal proxies). No need to backport.	2026-02-06 21:35:12 +01:00
Amaury Denoyelle	5dff6e439d	BUG/MINOR: proxy: fix clang build error on "add backend" handler This patch fixes the following compilation error : src/proxy.c:4954:12: error: format string is not a string literal (potentially insecure) [-Werror,-Wformat-security] 4954 \| ha_notice(msg); \| ^~~ No need to backport.	2026-02-06 21:17:18 +01:00
Amaury Denoyelle	5753c14e84	MINOR: proxy: assign dynamic proxy ID Implement proxy ID generation for dynamic backends. This is performed through the already function existing proxy_get_next_id(). As an optimization, lookup will performed starting from a global variable <dynpx_next_id>. It is initialized to the greatest ID assigned after parsing, and updated each time a backend instance is created. When backend deletion will be implemented, it could be lowered to the newly available slot.	2026-02-06 17:28:27 +01:00
Amaury Denoyelle	3115eb82a6	MEDIUM: proxy: implement dynamic backend creation Implement the required operations for "add backend" handler. This requires a new proxy allocation, settings copy from the specified default instance and proxy config finalization. All handlers registered via REGISTER_POST_PROXY_CHECK() are also called on the newly created instance. If no error were encountered, the newly created proxy is finally attached in the proxies list.	2026-02-06 17:28:27 +01:00
Amaury Denoyelle	07195a1af4	MINOR: proxy: check default proxy compatibility on "add backend" This commits completes "add backend" handler with some checks performed on the specified default proxy instance. These are additional checks outside of the already existing inheritance rules, specific to dynamic backends. For now, a default proxy is considered not compatible if it is not in mode TCP/HTTP. Also, a default proxy is rejected if it references HTTP errors. This limitation may be lifted in the future, when HTTP errors are partiallay reworked.	2026-02-06 17:28:26 +01:00
Amaury Denoyelle	a603811aac	MINOR: proxy: parse guid on dynamic backend creation Defines an extra optional GUID argument for "add backend" command. This can be useful as it is not possible to define it via a default proxy instance.	2026-02-06 17:28:04 +01:00
Amaury Denoyelle	e152913327	MINOR: proxy: parse mode on dynamic backend creation Add an optional "mode" argument to "add backend" CLI command. This argument allows to specify if the backend is in TCP or HTTP mode. By default, it is mandatory, unless the inherited default proxy already explicitely specifies the mode. To differentiate if TCP mode is implicit or explicit, a new proxy flag PR_FL_DEF_EXPLICIT_MODE is defined. It is set for every defaults instances which explicitely defined their mode.	2026-02-06 17:27:50 +01:00
Amaury Denoyelle	7ac5088c50	MINOR: proxy: define "add backend" handler Define a basic CLI handler for "add backend". For now, this handler only performs a parsing of the name argument and return an error if a duplicate already exists. It runs under thread isolation, to guarantee thread safety during the proxy creation. This feature is considered in development. CLI command requires to set experimental-mode.	2026-02-06 17:26:55 +01:00
Amaury Denoyelle	817003aa31	MINOR: backend: add function to check support for dynamic servers Move backend compatibility checks performed during 'add server' in a dedicated function be_supports_dynamic_srv(). This should simplify addition of future restriction. This function will be reused when implementing backend creation at runtime.	2026-02-06 14:35:19 +01:00
Amaury Denoyelle	dc6cf224dd	MINOR: proxy: refactor mode parsing Define a new utility function str_to_proxy_mode() which is able to convert a string into the corresponding proxy mode if possible. This new function is used for the parsing of "mode" configuration proxy keyword. This patch will be reused for dynamic backend implementation, in order to parse a similar "mode" argument via a CLI handler.	2026-02-06 14:35:18 +01:00
Amaury Denoyelle	87ea407cce	MINOR: proxy: refactor proxy inheritance of a defaults section If a proxy is referencing a defaults instance, some checks must be performed to ensure that inheritance will be compatible. Refcount of the defaults instance may also be incremented if some settings cannot be copied. This operation is performed when parsing a new proxy of defaults section which references a defaults, either implicitely or explicitely. This patch extracts this code into a dedicated function named proxy_ref_defaults(). This in turn may call defaults_px_ref() (previously called proxy_ref_defaults()) to increment its refcount. The objective of this patch is to be able to reuse defaults inheritance validation for dynamic backends created at runtime, outside of the parsing code.	2026-02-06 14:35:18 +01:00
Amaury Denoyelle	a8bc83bea5	MINOR: cfgparse: move proxy post-init in a dedicated function A lot of proxies initialization code is delayed on post-parsing stage, as it depends on the configuration fully parsed. This is performed via a loop on proxies_list. Extract this code in a dedicated function proxy_finalize(). This patch will be useful for dynamic backends creation. Note that for the moment the code has been extracted as-is. With each new features, some init code was added there. This has become a giant loop with no real ordering. A future patch may provide some cleanup in order to reorganize this.	2026-02-06 14:35:18 +01:00
Amaury Denoyelle	2c8ad11b73	MINOR: cfgparse: validate defaults proxies separately Default proxies validation occurs during post-parsing. The objective is to report any tcp/http-rules which could not behave as expected. Previously, this was performed while looping over standard proxies list, when such proxy is referencing a default instance. This was enough as only named referenced proxies were kept after parsing. However, this is not the case anymore in the context of dynamic backends creation at runtime. As such, this patch now performs validation on every named defaults outside of the standard proxies list loop. This should not cause any behavior difference, as defaults are validated without using the proxy which relies on it. Along with this change, PR_FL_READY proxy flag is now removed. Its usage was only really needed for defaults, to avoid validating a same instance multiple times. With the validation of defaults in their own loop, it is now redundant.	2026-02-06 14:35:18 +01:00
Egor Shestakov	2a07dc9c24	BUG/MINOR: startup: handle a possible strdup() failure Fix unhandled strdup() failure when initializing global.log_tag. Bug was introduced with the fix UAF for global progname pointer from `351ae5dbe`. So it must be backported as far as 3.1.	2026-02-06 10:50:31 +01:00
Egor Shestakov	9dd7cf769e	BUG/MINOR: startup: fix allocation error message of progname string Initially when init_early was introduced the progname string was a local used for temporary storage of log_tag. Now it's global and detached from log_tag enough. Thus, in the past we could inform that log_tag allocation has been failed but not now. Must be backported since the progname string became global, that is v3.1-dev9-96-g49772c55e	2026-02-06 10:50:31 +01:00
Olivier Houchard	bf7a2808fc	BUG/MEDIUM: threads: Differ checking the max threads per group number Differ checking the max threads per group number until we're done parsing the configuration file, as it may be set after a "thread-group- directive. Otherwise the default value of 64 will be used, even if there is a max-threads-per-group directive. This should be backported to 3.3.	2026-02-06 03:01:50 +01:00
Olivier Houchard	9766211cf0	BUG/MINOR: threads: Initialize maxthrpertgroup earlier. Give global.maxthrpertgroup its default value at global creation, instead of later when we're trying to detect the thread count. It is used when verifying the configuration file validity, and if it was not set in the config file, in a few corner cases, the value of 0 would be used, which would then reject perfectly fine configuration files. This should be backported to 3.3.	2026-02-06 03:01:36 +01:00
Aperence	143f5a5c0d	BUG/MINOR: config: Fix setting of alt_proto This patch fixes the bug presented in issue #3254 (https://github.com/haproxy/haproxy/issues/3254), which occured on FreeBSD when using a stream socket for in nameserver section. This bug occured due to an incorrect reset of the alt_proto for a stream socket when the default socket is created as a datagram socket. This patch fixes this bug by doing a late assignment to alt_proto when a datagram socket is requested, leaving only the modification of alt_proto done by mptcp. Additional documentation for the use of alt_proto has also been added to clarify the use of the alt_proto variable.	2026-02-04 14:54:20 +01:00
Willy Tarreau	b6bdb2553b	MEDIUM: backend: make "balance random" consider req rate when loads are equal As reported by Damien Claisse and C�dric Paillet, the "random" LB algorithm can become particularly unfair with large numbers of servers having few connections. It's indeed fairly common to see many servers with zero connection in a thousand-server large farm, and in this case the P2C algo consisting in checking the servers' loads doesn't help at all and is basically similar to random(1). In this case, we only rely on the distribution of server IDs in the random space to pick the best server, but it's possible to observe huge discrepancies. An attempt to model the problem clearly shows that with 1600 servers with weight 10, for 1 million requests, the lowest loaded ones will take 300 req while the most loaded ones will get 780, with most of the values between 520 and 700. In addition, only the first 28 lower bits of server IDs are used for the key calculation, which means that node keys are more determinist. Setting random keys in the lowest 28 bits only better packs values with min around 530 and max around 710, with values mostly between 550 and 680. This can only be compensated by increasing weights and draws without being a perfect fix either. At 4 draws, the min is around 560 and the max around 670, with most values bteween 590 and 650. This patch takes another approach to this problem: when servers are on tie regarding their loads, instead of arbitrarily taking the second one, we now compare their current request rates, which is updated all the time and smoothed over one second, and we pick the server with the lowest request rate. Now with 2 draws, the curve is mostly flat, with the min at 580 and the max at 628, and almost all values between 611 and 625. And 4 draws exclusively gives values from 614 to 624. Other points will need to be addressed separately (bits of server ID, maybe refine the hash algorithm), but these ones would affect how caches are selected, and cannot be changed without an extra option. For random however we can perform a change without impacting anyone. This should be backported, probably only to 3.3 since it's where the "random" algo became the default.	2026-02-04 14:54:16 +01:00
Willy Tarreau	cddeea58cd	BUG/MINOR: cpu-topo: count cores not cpus to distinguish core types The per-cpu capacity of a cluster was taken into account since 3.2 with commit `6c88e27cf4` ("MEDIUM: cpu-topo: change "performance" to consider per-core capacity"). In cpu_policy_performance() and cpu_policy_efficiency(), we're trying to figure which cores have more capacity than others by comparing their cluster's average capacity. However, contrary to what the comment says, we're not averaging per core but per cpu, which makes a difference for CPUs mixing SMT with non-SMT cores on the same SoC, such as intel's 14th gen CPUs. Indeed, on a machine where cpufreq is not enabled, all CPUs can be reported with a capacity of 1024, resulting in a big cluster of 161024, and 4 small clusters of 41024 each, giving an average of 1024 per CPU, making it impossible to distinguish one from the other. In this situation, both "cpu-policy performance" and "cpu-policy efficiency" enable all cores. But this is wrong, what needs to be taken into account in the divide is the number of cores, not cpus, that allows to distinguish big from little clusters. This was not noticeable on the ARM machines the commit above aimed at fixing because there, the number of CPUs equals the number of cores. And on an x86 machine with cpu_freq enabled, the frequencies continue to help spotting which ones are big/little. By using nb_cores instead of nb_cpus in the comparison and in the avg_capa compare function, it properly works again on x86 without affecting other machines with 1 CPU per core. This can be backported to 3.2.	2026-02-04 08:49:18 +01:00
Olivier Houchard	3674afe8a0	BUG/MEDIUM: threads: Atomically set TH_FL_SLEEPING and clr FL_NOTIFIED When we're about to enter polling, atomically set TH_FL_SLEEPING and remove TH_FL_NOTIFIED, instead of doing it in sequence. Otherwise, another thread may sett that both the TH_FL_SLEEPING and the TH_FL_NOTIFIED bits are set, and don't wake up the thread then it should be doing that. This prevents a bug where a thread is sleeping while it should be handling a new connection, which can happen if there are very few incoming connection. This is easy to reproduce when using only two threads, and injecting with only one connection, the connection may then never be handled. This should be backported up to 2.8.	2026-02-04 07:13:06 +01:00
Hyeonggeun Oh	2527d9dcd1	MEDIUM: tcpcheck: add post-80 option for mysql-check to support MySQL 8.x Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details This patch adds a new 'post-80' option that sets the CLIENT_PLUGIN_AUTH (0x00080000) capability flag and explicitly specifies mysql_native_password as the authentication plugin in the handshake response. This patch also addes documentation content for post-80 option support in MySQL 8.x version. Which handles new default auth plugin caching_sha2_password. MySQL 8.0 changed the default authentication plugin from mysql_native_password to caching_sha2_password. The current mysql-check implementation only supports pre-41 and post-41 client auth protocols, which lack the CLIENT_PLUGIN_AUTH capability flag. When HAProxy sends a post-41 authentication packet to a MySQL 8.x server, the server responds with error 1251: "Client does not support authentication protocol requested by server". The new client capabilities for post-80 are: - CLIENT_PROTOCOL_41 (0x00000200) - CLIENT_SECURE_CONNECTION (0x00008000) - CLIENT_PLUGIN_AUTH (0x00080000) Usage example: backend mysql_servers option mysql-check user haproxy post-80 server db1 192.168.1.10:3306 check The health check user must be created with mysql_native_password: CREATE USER 'haproxy'@'%' IDENTIFIED WITH mysql_native_password BY ''; This addresses https://github.com/haproxy/haproxy/issues/2934.	2026-02-03 07:36:53 +01:00
Olivier Houchard	f26562bcb7	MINOR: quic: Fix build with USE_QUIC_OPENSSL_COMPAT Commit `fa094d0b61` changed the msg callback args, but forgot to fix quic_tls_msg_callback() accordingly, so do that, and remove the unused struct connection paramter.	2026-02-03 04:05:34 +01:00
Christopher Faulet	abc1947e19	BUG/MEDIUM: applet: Fix test on shut flags for legacy applets Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details A regression was introduced in the commit `0ea601127` ("BUG/MAJOR: applet: Don't call I/O handler if the applet was shut"). The test on shut flags for legacy applets is inverted. It should be harmeless on 3.4 and 3.3 because all applets were converted. But this fix is mandatory for 3.2 and older. The patch must be backported as far as 3.0 with the commit above.	2026-01-30 09:55:18 +01:00
William Lallemand	23e8ed6ea6	MEDIUM: ssl: porting to X509_STORE_get1_objects() for OpenSSL 4.0 OpenSSL 4.0 is deprecating X509_STORE_get0_objects(). Every occurence of X509_STORE_get0_objects() was first replaced by X509_STORE_get1_objects(). This changes the ref count of the STACK_OF(X509_OBJECT) everywhere, and need it to be sk_X509_OBJECT_pop_free(objs, X509_OBJECT_free) each time. X509_STORE_get1_objects() is not available in AWS-LC, OpenSSL < 3.2, LibreSSL and WolfSSL, so we need to still be compatible with get0. To achieve this, 2 macros were added X509_STORE_getX_objects() and sk_X509_OBJECT_popX_free(), these macros will use either the get0 or the get1 macro depending on their availability. In the case of get0, sk_X509_OBJECT_popX_free() will just do nothing instead of trying to free. Don't backport that unless really needed if we want to be compatible with OpenSSL 4.0. It changes all the refcounts.	2026-01-29 17:08:41 +01:00
Amaury Denoyelle	fa094d0b61	MEDIUM: ssl: remove connection from msg callback args Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details SSL msg callbacks are used for notification about sent/received SSL messages. Such callbacks are registered via ssl_sock_register_msg_callback(). Prior to this patch, connection was passed as first argument of these callbacks. However, most of them do not use it. Worst, this may lead to confusion as connection can be NULL in QUIC context. This patch cleans this by removing connection argument. As an alternative, connection can be retrieved in callbacks if needed using ssl_sock_get_conn() but the code must be ready to deal with potential NULL instances. As an example, heartbeat parsing callback has been adjusted in this manner.	2026-01-29 11:14:09 +01:00
Amaury Denoyelle	869a997a68	BUG/MEDIUM: ssl: fix msg callbacks on QUIC connections With QUIC backend implementation, SSL code has been adjusted in several place when accessing connection instance. Indeed, with QUIC usage, SSL context is tied up to quic_conn, and code may be executed prior/after connection instantiation. For example, on frontend side, connection is only created after QUIC handshake completion. The following patch tried to fix unsafe accesses to connection. In particular, msg callbacks are not called anymore if connection is NULL. `fab7da0fd0` BUG/MEDIUM: quic-be/ssl_sock: TLS callback called without connection However, most msg callbacks do not need to use the connection instance. The only occurence where it is accessed is for heartbeat message parsing, which is the only case of crash solved. The above fix is too restrictive as it completely prevents execution of these callbacks when connection is unset. This breaks several features with QUIC, such as SSL key logging or samples based on ClientHello capture. The current patch reverts the above one. Thus, this restores invokation of msg callbacks for QUIC during the whole low-level connection lifetime. This requires a small adjustment in heartbeat parsing callback to prevent access on a NULL connection. The issue on ClientHello capture was mentionned in github issue #2495. This must be backported up to 3.3.	2026-01-29 11:14:09 +01:00
Willy Tarreau	48d9c90ff2	BUG/MINOR: config/ssl: fix spelling of "expose-experimental-directives" The help message for "ktls" mentions "expose-experimental-directive" without the final 's', which is particularly annoying when copy-pasting the directive from the error message directly into the config. This should be backported to 3.3.	2026-01-29 11:07:55 +01:00
Willy Tarreau	35d63cc3c7	MEDIUM: h1: strictly verify quoting in chunk extensions Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details As reported by Ben Kallus in the following thread: https://www.mail-archive.com/haproxy@formilux.org/msg46471.html there exist some agents which mistakenly accept CRLF inside quoted chunk extensions, making it possible to fool them by injecting one extra chunk they won't see for example, or making them miss the end of the body depending on how it's done. Haproxy, like most other agents nowadays, doesn't care at all about chunk extensions and just drops them, in agreement with the spec. However, as discussed, since chunk extensions are basically never used except for attacks, and that the cost of just matching quote pairs and checking backslashed quotes is escape consistency remains relatively low, it can make sense to add such a check to abort the message parsing when this situation is encountered. Note that it has to be done at two places, because there is a fast path and a slow path for chunk parsing. Also note that it will cause transfers using improperly formatted chunk extensions to fail, but since these are really not used, and that the likelihood of them being used but improperly quoted certainly is much lower than the risk of crossing a broken parser on the client's request path or on the server's response path, we consider the risk as acceptable. The test is not subject to the configurable parser exceptions and it's very unlikely that it will ever be needed. Since this is done in 3.4 which will be LTS, this patch will have to be backported to 3.3 so that any unlikely trouble gets a chance to be detected before users upgrade to 3.4. Thanks to Ben for the discussion, and to Rajat Raghav for sparking it in the first place even though the original report was mistaken. Cc: Ben Kallus <benjamin.p.kallus.gr@dartmouth.edu> Cc: Rajat Raghav <xclow3n@gmail.com> Cc: Christopher Faulet <cfaulet@haproxy.com>	2026-01-28 18:54:23 +01:00
Willy Tarreau	a79a67b52f	OPTIM: server: get rid of the last use of _ha_barrier_full() The code in srv_add_to_idle_list() has its roots in 2.0 with commit `9ea5d361ae` ("MEDIUM: servers: Reorganize the way idle connections are cleaned."). At this era we didn't yet have the current set of atomic load/store operations and we used to perform loads using volatile casts after a barrier. It turns out that this function has kept this schema over the years, resulting in a big mfence stalling all the pipeline in the function: \| static __inline void \| __ha_barrier_full(void) \| { \| __asm __volatile("mfence" ::: "memory"); 27.08 \| mfence \| if ((volatile void *)srv->idle_node.node.leaf_p == NULL) { 0.84 \| cmpq $0x0,0x158(%r15) 0.74 \| je 35f \| return 1; Switching these for a pair of atomic loads got rid of this and brought 0.5 to 3% extra performance depending on the tests due to variations elsewhere, but it has never been below 0.5%. Note that the second load doesn't need to be atomic since it's protected by the lock, but it's cleaner from an API and code review perspective. That's also why it's relaxed. This was the last user of _ha_barrier_full(), let's try not to reintroduce it now!	2026-01-28 16:07:27 +00:00
William Lallemand	bbab0ac4d0	BUG/MINOR: ssl: fix error message of tune.ssl.certificate-compression Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details tune.ssl-certificate-compression expects 'auto' but not 'on'. Could be backported if the previous patch is backported.	2026-01-27 16:25:11 +01:00
William Lallemand	6995fe60c3	MINOR: ssl: allow to disable certificate compression This option allows to disable the certificate compression (RFC 8879) using OpenSSL >= 3.2.0. This feature is known to permit some denial of services by causing extra memory allocations of approximately 22MiB and extra CPU work per connection with OpenSSL versions affected by CVE-2025-66199. ( https://openssl-library.org/news/vulnerabilities/index.html#CVE-2025-66199 ) Setting this to "off" permits to mitigate the problem. Must be backported to every stable branches.	2026-01-27 16:10:41 +01:00
Christopher Faulet	0ea601127e	BUG/MAJOR: applet: Don't call I/O handler if the applet was shut In 3.0, it was stated an applet could not be woken up after it was shutdown. So the corresponding test in the applets I/O handler was removed. However, it seems it may happen, especially when outgoing data are blocked on the opposite side. But it is really unexpected because the "release" callback function was already called and the appctx context was most probably released. Strangely, it was never detected by any applet till now. But the Prometheus exporter was never updated and was still testing the shutdown. But when it was refactored to use the new applet API in 3.3, the test was removed. And this introduced a regression leading a crash because a server object could be corrupted. Conditions to hit the bug are not really clear however. So, now, to avoid any issue with all other applets, the test is performed in task_process_applet(). The I/O handler is no longer called if the applet is already shut. The same is performed for applets still relying on the old API. An amazing thanks to @idl0r for his invaluable help on this issue ! This patch should fix the issue #3244. It should first be backported to 3.3 and then slowly as far as 3.0.	2026-01-27 16:00:23 +01:00
William Lallemand	0ebef67132	MINOR: ssl: display libssl errors on private key loading Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details Display a more precise error message from the libssl when a private key can't be loaded correctly.	2026-01-26 14:19:19 +01:00
Remi Tricot-Le Breton	9b1faee4c9	BUG/MINOR: ssl: Encrypted keys could not be loaded when given alongside certificate The SSL passphrase callback function was only called when loading private keys from a dedicated file (separate from the corresponding certificate) but not when both the certificate and the key were in the same file. We can now load them properly, regardless of how they are provided. A flas had to be added in the 'passphrase_cb_data' structure because in the 'ssl_sock_load_pem_into_ckch' function, when calling 'PEM_read_bio_PrivateKey' there might be no private key in the PEM file which would mean that the callback never gets called (and cannot set the 'passphrase_idx' to -1). This patch can be backported to 3.3.	2026-01-26 14:09:13 +01:00
Remi Tricot-Le Breton	d2ccc19fde	BUG/MINOR: ssl: Properly manage alloc failures in SSL passphrase callback Some error paths in 'ssl_sock_passwd_cb' (allocation failures) did not set the 'passphrase_idx' to -1 which is the way for the caller to know not to call the callback again so in some memory contention contexts we could end up calling the callback 'infinitely' (or until memory is finally available). This patch must be backported to 3.3.	2026-01-26 14:08:50 +01:00
Willy Tarreau	1a3252e956	MEDIUM: pools: better check for size rounding overflow on registration Certain object sizes cannot be controlled at declaration time because the resulting object size may be slightly extended (tag, caller), aligned and rounded up, or even doubled depending on pool settings (e.g. if backup is used). This patch addresses this by enlarging the type in the pool registration to 64-bit so that no info is lost from the declaration, and extra checks for overflows can be performed during registration after various rounding steps. This allows to catch issues such as these ones and to report a suitable error: global tune.http.logurilen 2147483647 frontend capture request header name len 2147483647 http-request capture src len 2147483647 tcp-request content capture src len 2147483647	2026-01-26 11:54:14 +01:00
Willy Tarreau	e9e4821db5	BUG/MINOR: stick-tables: abort startup on stk_ctr pool creation failure Since 3.3 with commit `945aa0ea82` ("MINOR: initcalls: Add a new initcall stage, STG_INIT_2"), stkt_late_init() calls stkt_create_stk_ctr_pool() but doesn't check its return value, so if the pool creation fails, the process still starts, which is not correct. This patch adds a check for the return value to make sure we fail to start in this case. This was not an issue before 3.3 because the function was called as a post-check handler which did check for errors in the returned values.	2026-01-26 11:45:49 +01:00
Willy Tarreau	4e7c07736a	BUG/MINOR: config: check capture pool creations for failures A few capture pools can fail in case of too large values for example. These include the req_uri, capture, and caphdr pools, and may be triggered with "tune.http.logurilen 2147483647" in the global section, or one of these in a frontend: capture request header name len 2147483647 http-request capture src len 2147483647 tcp-request content capture src len 2147483647 These seem to be the only occurrences where create_pool()'s return value is assigned without being checked, so let's add the proper check for errors there. This can be backported as a hardening measure though the risks and impacts are extremely low.	2026-01-26 11:45:49 +01:00
Christopher Faulet	c267d24f57	BUG/MINOR: proto_tcp: Properly report support for HAVE_TCP_MD5SIG feature Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details Condition to report the support for HAVE_TCP_MD5SIG feature was inverted. It is only an issue for the reg-test related to this feature. This patch must be backported to 3.3.	2026-01-23 11:40:54 +01:00
Christopher Faulet	a3e9a04435	BUG/MEDIUM: mux-h1: Skip UNUSED htx block when formating the start line UNUSED blocks were not properly handled when the H1 multiplexer was formatting the start line of a request or a response. UNUSED was ignored but not removed from HTX message. So the mux can loop infinitly on such block. It could be seen a a major issue but in fact it happens only if a very specific case on the reponse processing (at least I think so): the server must send an interim message (a 100-continue for intance) with the final response. HAProxy must receive both in same time and the final reponse must be intercepted (via a http-response return action for instance), In that case, the interim message is fowarded and the server final reponse is removed and replaced by a proxy error message. Now UNUSED htx blocks are properly skipped and removed. This patch must be backported as far as 3.0.	2026-01-23 11:40:54 +01:00
Aurelien DARRAGON	a66b4881d7	BUG/MINOR: hlua: consume error object if ignored after a failing lua_pcall() We frequently use lua_pcall() to provide safe alternative functions (encapsulated helpers) that prevent the process from crashing in case of Lua error when Lua is executed from an unsafe environment. However, some of those safe helpers don't handle errors properly. In case of error, the Lua API will always put an error object on top of the stack as stated in the documentation. This error object can be used to retrieve more info about the error. But in some cases when we ignore it, we should still consume it to prevent the stack from being altered with an extra object when returning from the helper function. It should be backported to all stable versions. If the patch doesn't apply automatically, all that's needed is to check for lua_pcall() in hlua.c and for other cases than 'LUA_OK', make sure that the error object is popped from the stack before the function returns.	2026-01-23 11:23:37 +01:00
Aurelien DARRAGON	9e9083d0e2	BUG/MEDIUM: hlua: fix invalid lua_pcall() usage in hlua_traceback() Since commit `365ee28` ("BUG/MINOR: hlua: prevent LJMP in hlua_traceback()") we now use lua_pcall() to protect sensitive parts of hlua_traceback() function, and this to prevent Lua from crashing the process in case of unexpected Lua error. This is still relevant, but an error was made, as lua_pcall() was given the nresult argument '1' when _hlua_traceback() internal function doesn't push any argument on the stack. Because of this, it seems Lua API still tries to push garbage object on top of the stack before returning. This may cause functions that leverage hlua_traceback() in the middle of stack manipulation to end up having a corrupted stack when continuing after the hlua_traceback(). There doesn't seem to be many places where this could be a problem, as this was discovered using the reproducer documented in `f535d3e` ("BUG/MEDIUM: debug: only dump Lua state when panicking"). Indeed, when hlua_traceback() was used from the signal handler while the thread was previously executing Lua, when returning to Lua after the handler the Lua stack would be corrupted. To fix the issue, we emphasize on the fact that the _hlua_traceback() function doesn't push anything on the stack, returns 0, thus lua_pcall() is given 0 'nresult' argument to prevent anything from being pushed after the execution, preserving the original stack state. This should be backported to all stable versions (because `365ee28` was backported there)	2026-01-23 11:23:31 +01:00
Amaury Denoyelle	b52c60d366	MEDIUM: proxy: implement persistent named defaults This patch changes the handling of named defaults sections. Prior to this patch, every unreferenced defaults proxies were removed on post parsing. Now by default, these sections are kept after postparsing and only purged on deinit. The objective is to allow reusing them as base configuration for dynamic backends. To implement this, refcount of every still addressable named sections is incremented by one after parsing. This ensures that they won't be removed even if referencing proxies are removed at runtime. This is done via the new function proxy_ref_all_defaults(). To ensure defaults instances are still properly removed on deinit, the inverse operation is performed : refcount is decremented by one on every defaults sections via proxy_unref_all_defaults(). The original behavior can still be used by using the new global keyword tune.defaults.purge. This is useful for users using configuration with large number of defaults and not interested in dynamic backends creation.	2026-01-22 18:06:42 +01:00
Amaury Denoyelle	116983ad94	MEDIUM: cfgparse: do not store unnamed defaults in name tree Defaults section are indexed by their name in defproxy_by_name tree. For named sections, there is no duplicate : if two instances have the same name, the older one is removed from the tree. However, this was not the case for unnamed defaults which are all stored inconditionnally in defproxy_by_name. This commit introduces a new approach for unnamed defaults. Now, these instances are never inserted in the defproxy_by_name tree. Indeed, this is not needed as no tree lookup is performed with empty names. This may optimize slightly config parsing with a huge number of named and unnamed defaults sections, as the first ones won't fill up the tree needlessly. However, defproxy_by_name tree is also used to purge unreferenced defaults instances, both on postparsing and deinit. Thus, a new approach is needed for unnamed sections cleanup. Now, each time a new defaults is parsed, if the previous instance is unnamed, it is freed unless if referenced by a proxy. When config parsing is ended, a similar operation is performed to ensure the last unnamed defaults section won't stay in memory. To implement this, last_defproxy static variable is now set to global. Unnamed sections which cannot be removed due to proxies referencing proxies will still be removed when such proxies are freed themselves, at runtime or on deinit.	2026-01-22 17:57:16 +01:00
Amaury Denoyelle	848e0cd052	MINOR: proxy: simplify defaults proxies list storage Defaults proxies instance are stored in a global name tree. When there is a name conflict and the older entry cannot be simply discarded as it is already referenced, the older entry is instead removed from the name tree and inserted into the orphaned list. The purpose of the orphaned list was to guarantee that any remaining unreferenced defaults are purged either on postparsing or deinit. However, this is in fact completely useless. Indeed on postparsing, orphaned entries are always referenced. On deinit instead, defaults are already freed along the cleanup of all frontend/backend instances clean up, thanks to their refcounting. This patch streamlines this by removing orphaned list. Instead, a defaults section is inserted into a new global defaults_list during their whole lifetime. This is not strictly necessary but it ensures that defaults instances can still be accessed easily in the future if needed even if not present in the name tree. On deinit, a BUG_ON() is added to ensure that defaults_list is indeed emptied. Another benefit from this patch is to simplify the defaults deletion procedure. Orphaned simple list is replaced by a proper double linked list implementation, so a single LIST_DELETE() is now performed. This will be notably useful as defaults may be removed at runtime in the future if backends deletion at runtime is implemented.	2026-01-22 17:57:09 +01:00
Amaury Denoyelle	434e979046	MINOR: proxy: refactor defaults proxies API This patch renames functions which deal with defaults section. A common "defaults_px_" prefix is defined. This serves as a marker to identify functions which can only be used with proxies defaults capability. New BUG_ON() are enforced to ensure this is valid. Also, older proxy_unref_or_destroy_defaults() is renamed defaults_px_detach().	2026-01-22 17:55:47 +01:00
Amaury Denoyelle	6c0ea1fe73	MINOR: proxy: remove proxy_preset_defaults() Function proxy_preset_defaults() purpose has evolved over time. Originally, it was only used to initialize defaults proxies instances. Until today, it was extended so that all proxies use it. Its objective is to initialize settings to common default values. To remove the confusion, this function is now removed. Its content is integrated directly into init_new_proxy().	2026-01-22 16:20:25 +01:00
Willy Tarreau	f535d3e031	BUG/MEDIUM: debug: only dump Lua state when panicking For a long time, we've tried to show the Lua state and backtrace when dumping threads so as to be able to figure is (and which) Lua code was misbehaving, e.g. by performing expensive library calls. Since 3.1 with commit `365ee28510` ("BUG/MINOR: hlua: prevent LJMP in hlua_traceback()"), it appears that the approach is more fragile (though that fix addressed a real issue about out-of-memory), and it's possible to occasionally observe crashes or CPU loops with "show threads" while running Lua heavily. While users of "show threads" are rare, the watchdog warnings, which were also enabled on 3.1, also trigger these issues, which is even more of a concern. This patch goes the simple way to address this for now: since the purpose of the Lua backtrace was to help locate Lua call places upon a panic, let's only call the backtrace on panic but not in other situations. After a panic we obviously don't care that the Lua stack might be corrupted since it's never going to be resumed anyway. This may be relaxed in the future if a solution is found to reliably produce harmless Lua backtraces. The commit above was backported to all stable branches, so this patch will be needed everywhere. However, TAINTED_PANIC only appeared in 2.8, and given the rarety of this bug before 3.1, it's probably not needed to make any extra effort to go beyond 2.8. It's easy enough to test a version for being subject to this issue, by running the following Lua code: local function stress(txn) for _, backend in pairs(core.backends) do for _, server in pairs(backend.servers) do local stats = server:get_stats() end end end core.register_fetches("stress", stress) in the following config file: global stats socket /tmp/haproxy.stat level admin mode 666 tune.lua.bool-sample-conversion normal lua-load-per-thread "stress.lua" listen stress bind :8001 mode http timeout client 5s timeout server 5s timeout connect 5s http-request return status 200 content-type text/plain lf-string %[lua.stress()] server s1 127.0.0.1:8000 and stressing port 8001 with 100+ connections requesting / in loop, then issuing "show threads" on the CLI using socat in loops as well. Normally it instantly segfaults (sometimes during the first "show").	2026-01-22 15:47:42 +01:00
Amaury Denoyelle	ac877a25dd	BUG/MINOR: proxy: fix deinit crash on defaults with duplicate name A defaults proxy instance may be move into the orphaned list when it is replaced by a newer section with the same name. This is attached via <next> member as a single linked list entry. However, proxy free does not clear <next> attach point. This causes a crash on deinit if orphaned list is not empty. First, all frontend/backend instances are freed. This triggers the release of every referenced defaults instances as their refcount reach zero, but orphaned list is not clean up. A loop is then conducted on orphaned list via proxy_destroy_all_unref_defaults(). This causes a segfault due to access on already freed entries. To fix this, this patch extends proxy_destroy_defaults(). If orphaned list is not empty, a loop is performed to remove a possible entry of the currently released defaults instance. This ensures that loop over orphaned list won't be able to access to already freed entries. This bug is pretty rare as it requires to have duplicate name in defaults sections, and also to use settings which forces defaults referencing, such as TCP/HTTP rules. This can be reproduced with the minimal config here : defaults def http-request return status 200 frontend fe bind :20080 defaults def Note that in fact orphaned list looping is not strictly necessary, as defaults instances are automatically removed via refcounting. This will be the purpose of a future patch. However, to limit the risk of regression on stable releases during backport, this patch uses the more direct approach for now. This must be backported up to 3.1.	2026-01-22 15:40:01 +01:00
Amaury Denoyelle	c7004be964	BUG/MEDIUM: mux-quic: prevent BUG_ON() on aborted uni stream close Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details When a QCS instance is fully closed on qcs_close_remote() invokation, it is moved into purg_list for later cleanup. This reuses <el_send> list element, so a BUG_ON() ensures that QCS is not already present in send_list. This code is safe for bidirectional streams, as local channel is only closed after FIN or RESET_STREAM emission completion, so such QCS won't be present in the send_list on full closure. However, things are different for remote uni streams. As such streams do not have any local channel, qcs_close_remote() will always proceed to full closure. Most of the time this is fine, but the aformentionned BUG_ON() could be triggered if emission is required on a remote uni stream : this only happens after read was aborted and a STOP_SENDING frame is prepared. Fix this by adding an extra operation in qcs_close_remote() : on full close, STOP_SENDING is cancelled if it was prepared and the QCS instance is removed from send_list. This is safe as STOP_SENDING is unnecessary after the remote channel is closed. This operation is performed before purg_list insertion which prevents the BUG_ON() crash issue. This patch must be backported up to 3.1.	2026-01-21 14:01:12 +01:00
William Lallemand	eb5279b154	BUG/MEDIUM: ssl: fix generate-certificates option when SNI greater than 64bytes Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details The problem is that the certificate is generated with a CN greater than 64 bytes when the SNI is too long, which is not suppose to be supported, and will end up with a handshake failure. The patch fixes the issue by avoiding to add a CN when the SNI is longer than 64 bytes. Indeed this is not a mandatory field anymore and was deprecated more than 20 years ago. The SAN DNS is enough for this case. Must be backported in every stable branches.	2026-01-21 10:45:22 +01:00
William Lallemand	fbc98ebcda	BUG/MEDIUM: ssl: fix error path on generate-certificates It was reported by Przemyslaw Bromber that using the "generate-certificates" option combined with AWS-LC would crash HAProxy when a request is done with a SNI longer than 64 bytes. The problem is that the certificate is generated with a CN greater than 64 bytes which results in ssl_sock_do_create_cert() returning NULL. This NULL value being passed to SSL_set_SSL_CTX. With OpenSSL, passing a NULL SSL_CTX does not seem to be an issue as it would just ignore it. With AWS_LC, passing a NULL seems to crash the function. This was reported to upstream AWS-LC and fixed in patch 7487ad1dcd8 https://github.com/aws/aws-lc/pull/2946. This must be backported in every branches.	2026-01-21 10:45:22 +01:00
Hyeonggeun Oh	9f766b2056	MINOR: vars: implement dump_all_vars() sample fetch This patch implements dump_all_vars([scope],[prefix]) sample fetch function that dumps all variables in a given scope, optionally filtered by name prefix. Output format: var1=value1, var2=value2, ... - String values are quoted and escaped (", , \r, \n, \b, \0) - All sample types are supported via sample_convert() - Scope can be: sess, txn, req, res, proc - Prefix filtering is optional Example usage: http-request return string %[dump_all_vars(txn)] http-request return string %[dump_all_vars(txn,user)] This addresses GitHub issue #1623.	2026-01-21 10:44:19 +01:00
Hyeonggeun Oh	95e8483b35	MINOR: vars: store variable names for runtime access Currently, variable names are only used during parsing and are not stored at runtime. This makes it impossible to iterate through variables and retrieve their names. This patch adds infrastructure to store variable names: - Add 'name' and 'name_len' fields to var_desc structure - Add 'name' field to var structure - Add VDF_NAME_ALLOCATED flag to track memory ownership - Store names in vars_fill_desc(), var_set(), vars_check_arg(), and parse_store() - Free names in var_clear() and release_store_rule() - Add ARGT_VAR handling in release_sample_arg() to free the allocated name when the flag is set This prepares the ground for implementing dump_all_vars() in the next commit. Tested with: - ASAN-enabled build on Linux (TARGET=linux-glibc USE_OPENSSL=1 ARCH_FLAGS="-g -fsanitize=address") - Regression tests: reg-tests/sample_fetches/vars.vtc - Regression tests: reg-tests/startup/default_rules.vtc	2026-01-21 10:44:19 +01:00
Hyeonggeun Oh	25564b6075	MINOR: tools: add chunk_escape_string() helper function This function takes a string appends it to a buffer in a format compatible with most languages (double-quoted, with special characters escaped). It handles standard escape sequences like \n, \r, \", \\. This generic utility is desined to be used for logging or debugging purposes where arbitrary string data needs to be safely emitted without breaking the output format. It will be primarily used by the upcoming dump_all_vars() sample fetch to dump variable contents safely.	2026-01-21 10:44:19 +01:00
Hyeonggeun Oh	7e85391a9e	REORG: cfgparse: move peers parsing to cfgparse-peers.c Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details This patch move the peers section parsing code from src/cfgparse.c to a dedicated src/cfgparse-peers.c file. This seperation improves code organization and prepares for further refactoring of the "peers" keyword registration system. No functional changes in this patch - the code is moved as-is with only the necessary adjustments for compliation (adding SPDX header and updating Makefile for build). This is the first patch in a series to address issue #3221, which reports that "peers" section keywords are not displayed with -dKall.	2026-01-20 17:17:37 +01:00
Aurelien DARRAGON	9156d5f775	BUG/MEDIUM: log: parsing log-forward options may result in segfault Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details As reported by GH user @HiggsTeilchen on #3250, the use of "option dont-parse-log" may result in segmentation fault when parsing the configuration. In fact, "option assume-rfc6587-ntf" is also affected. The reason behind this is that cfg_parse_log_forward() leverages the cfg_parse_listen_match_option() function to check for generic proxy options that are relevant in the PR_MODE_SYSLOG context. And while it is not documented, this function assumes that the currently evaluated proxy is stored in the global variable 'curproxy', which cfg_parse_log_forward() doesn't offer. cfg_parse_listen_match_option() uses curproxy to check the currently evaluated proxy's capabilities is compatible with the option, so if a proxy with the frontend capability was defined earlier in the config, parsing would succeed, if curproxy points to proxy without the frontend capabilty (ie: backend), a warning would be emitted to tell that the option would be ignored while it is perfectly valid for the log-forward proxy, and if no proxy was defined earlier in the config a segfault would be triggered. To fix the issue, we explicitly make "curproxy" global variable point to the log-forward proxy being parsed in cfg_parse_log_forward() before leveraging cfg_parse_listen_match_option() to check for compatible options. It must be backported with `834e9af8` ("MINOR: log: add options eval for log-forward"), which was introduced in 3.2 precisely.	2026-01-19 16:53:00 +01:00
Aurelien DARRAGON	d38b918da1	BUG/MINOR: server: ensure server is detached from proxy list before being freed There remained some cases (on error paths) were a server could be freed while still attached on the parent proxy server list. In 3.3 this can be problematic because new_server() automatically adds the server to the parent proxy list. The bug is insignificant because it is on errors paths during init and often haproxy exits right after. But let's fix that to ensure no UAF or undefined behavior occurs because of that. This patch depends on ("MINOR: cli: use srv_drop() when server was created using new_server()") It must be backported in 3.3 with the above mentioned patch.	2026-01-19 14:24:04 +01:00
Aurelien DARRAGON	12dc9325a7	MINOR: cli: use srv_drop() when server was created using new_server() Now that new_server() is becoming more and more complex, we need to take care that servers created using new_server() must be released using the corresponding release function srv_drop() which takes care of properly de-initing the server and its members.	2026-01-19 14:23:58 +01:00
Egor Shestakov	a3ee35cbfc	REORG/MINOR: cfgparse: eliminate code duplication by lshift_args() Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details There were similar parts of the code in "no" and "default" prefix keywords handling. This duplication caused the bug once. No backport needed.	2026-01-16 09:09:24 +01:00
Egor Shestakov	447d73dc99	BUG/MINOR: cfgparse: fix "default" prefix parsing Fix the left shift of args when "default" prefix matches. The cause of the bug was the absence of zeroing of the right element during the shift. The same bug for "no" prefix was fixed by commit `0f99e3497`, but missed for "default". The shift of ("default", "option", "dontlog-normal") produced ("option", "dontlog-normal", "dontlog-normal") instead of ("option", "dontlog-normal", "") As an example, a valid config line: default option dontlog-normal caused a parse error: [ALERT] (32914) : config : parsing [bug-default-prefix.cfg:22] : 'option dontlog-normal' cannot handle unexpected argument 'dontlog-normal'. The patch should be backported to all stable versions, since the absence of zeroing was introduced with "default" keyword.	2026-01-16 09:09:19 +01:00
Remi Tricot-Le Breton	aba18bac71	MINOR: jwe: Some algorithms not supported by AWS-LC AWS-LC does not have EVP_aes_128_wrap or EVP_aes_192_wrap so the A128KW and A192KW algorithms will not be supported for JWE token decryption.	2026-01-15 10:56:28 +01:00
Remi Tricot-Le Breton	e3a782adb5	MINOR: jwe: Add new jwt_decrypt_cert converter This converter checks the validity and decrypts the content of a JWE token that has an asymetric "alg" algorithm (RSA). In such a case, we must provide a path to an already loaded certificate and private key that has the "jwt" option set to "on".	2026-01-15 10:56:27 +01:00
Remi Tricot-Le Breton	416b87d5db	MINOR: jwe: Add new jwt_decrypt_secret converter This converter checks the validity and decrypts the content of a JWE token that has a symetric "alg" algorithm. In such a case, we only require a secret as parameter in order to decrypt the token.	2026-01-15 10:56:27 +01:00
Remi Tricot-Le Breton	c431034037	MINOR: ssl: Add new aes_cbc_enc/_dec converters Those converters allow to encrypt or decrypt data with AES in Cipher Block Chaining mode. They work the same way as the already existing aes_gcm_enc/_dec ones apart from the AEAD tag notion which is not supported in CBC mode.	2026-01-15 10:56:27 +01:00
Remi Tricot-Le Breton	f0e64de753	MINOR: ssl: Factorize AES GCM data processing The parameter parsing and processing and the actual crypto part of the aes_gcm converter are interleaved. This patch puts the crypto parts in a dedicated function for better reuse in the upcoming JWE processing.	2026-01-15 10:56:27 +01:00
Amaury Denoyelle	6870551a57	MEDIUM: proxy: force traffic on unpublished/disabled backends A recent patch has introduced a new state for proxies : unpublished backends. Such backends won't be eligilible for traffic, thus use_backend/default_backend rules which target them won't match and content switching rules processing will continue. This patch defines a new frontend keywords 'force-be-switch'. This keyword allows to ignore unpublished or disabled state. Thus, use_backend/default_backend will match even if the target backend is unpublished or disabled. This is useful to be able to test a backend instance before exposing it outside. This new keyword is converted into a persist rule of new type PERSIST_TYPE_BE_SWITCH, stored in persist_rules list proxy member. This is the only persist rule applicable to frontend side. Prior to this commit, pure frontend proxies persist_rules list were always empty. This new features requires adjustment in process_switching_rules(). Now, when a use_backend/default_backend rule matches with an non eligible backend, frontend persist_rules are inspected to detect if a force-be-switch is present so that the backend may be selected.	2026-01-15 09:08:19 +01:00
Amaury Denoyelle	16f035d555	MINOR: cfgparse: adapt warnif_cond_conflicts() error output Utility function warnif_cond_conflicts() is used when parsing an ACL. Previously, the function directly calls ha_warning() to report an error. Change the function so that it now takes the error message as argument. Caller can then output it as wanted. This change is necessary to use the function when parsing a keyword registered as cfg_kw_list. The next patch will reuse it.	2026-01-15 09:08:18 +01:00
Amaury Denoyelle	82907d5621	MINOR: stats: report BE unpublished status A previous patch defines a new proxy status : unpublished backends. This patch extends this by changing proxy status reported in stats. If unpublished is set, an extra "(UNPUB)" is added to the field. Also, HTML stats is also slightly updated. If a backend is up but unpublished, its status will be reported in orange color.	2026-01-15 09:08:18 +01:00
Amaury Denoyelle	797ec6ede5	MEDIUM: proxy: implement publish/unpublish backend CLI Define a new set of CLI commands publish/unpublish backend <be>. The objective is to be able to change the status of a backend to unpublished. Such a backend is considered ineligible to traffic : this allows to skip use_backend rules which target it. Note that contrary to disabled/stopped proxies, an unpublished backend still has server checks running on it. Internally, a new proxy flags PR_FL_BE_UNPUBLISHED is defined. CLI commands handler "publish backend" and "unpublish backend" are executed under thread isolation. This guarantees that the flag can safely be set or remove in the CLI handlers, and read during content-switching processing.	2026-01-15 09:08:18 +01:00
Amaury Denoyelle	21fb0a3f58	MEDIUM: proxy: do not select a backend if disabled A proxy can be marked as disabled using the keyword with the same name. The doc mentions that it won't process any traffic. However, this is not really the case for backends as they may still be selected via switching rules during stream processing. In fact, currently access to disabled backends will be conducted up to assign_server(). However, no eligible server is found at this stage, resulting in a connection closure or an HTTP 503, which is expected. So in the end, servers in disabled backends won't receive any traffic. But this is only because post-parsing steps are not performed on such backends. Thus, this can be considered as functional but only via side-effects. This patch clarifies the handling of disable backends, so that they are never selected via switching rules. Now, process_switching_rules() will ignore disable backends and continue rules evaluation. As this is a behavior change, this patch is labelled as medium. The documentation manuel for use_backend is updated accordingly.	2026-01-15 09:08:18 +01:00
Amaury Denoyelle	12975c5c37	MEDIUM: stream: refactor switching-rules processing This commit rewrites process_switching_rules() function. The objective is to simplify backend selection so that a single unified stream_set_backend() call is kept, both for regular and default backends case. This patch will be useful to add new capabilities on backends, in the context of dynamic backend support implementation.	2026-01-15 09:08:18 +01:00
Amaury Denoyelle	2f6aab9211	BUG/MINOR: proxy: free persist_rules force-persist proxy keyword is converted into a persist_rule, stored in proxy persist_rules list member. Each new rule is dynamically allocated during parsing. This commit fixes the memory leak on deinit due to a missing free on persist_rules list entries. This is done via deinit_proxy() modification. Each rule in the list is freed, along with its associated ACL condition type. This can be backported to every stable version.	2026-01-15 09:08:18 +01:00
Olivier Houchard	a209c35f30	MEDIUM: thread: Turn the group mask in thread set into a group counter Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details If we want to be able to have more than 64 thread groups, we can no longer use thread group masks as long. One remaining place where it is done is in struct thread_set. However, it is not really used as a mask anywhere, all we want is a thread group counter, so convert that mask to a counter.	2026-01-15 05:24:53 +01:00
Olivier Houchard	6249698840	BUG/MEDIUM: queues: Fix arithmetic when feeling non_empty_tgids Fix the arithmetic when pre-filling non_empty_tgids when we still have more than 32/64 thread groups left, to get the right index, we of course have to divide the number of thread groups by the number of bits in a long. This bug was introduced by commit `7e1fed4b7a`, but hopefully was not hit because it requires to have at least as much thread groups as there are bits in a long, which is impossible on 64bits machines, as MAX_TGROUPS is still 32.	2026-01-15 04:28:04 +01:00
Olivier Houchard	1397982599	MINOR: threads: Eliminate all_tgroups_mask. Now that it is unused, eliminate all_tgroups_mask, as we can't 64bits masks to represent thread groups, if we want to be able to have more than 64 thread groups.	2026-01-15 03:46:57 +01:00
Olivier Houchard	7e1fed4b7a	MINOR: queues: Turn non_empty_tgids into a long array. In order to be able to have more than 64 thread groups, turn non_empty_tgids into a long array, so that we have enough bits to represent everty thread group, and manipulate it with the ha_bit_* functions.	2026-01-15 03:46:57 +01:00
Aurelien DARRAGON	2ec387cdc2	BUG/MINOR: http_act: fix deinit performed on uninitialized lf_expr in release_http_map() Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details As reported by GH user @Lzq-001 on issue #3245, the config below would cause haproxy to SEGFAULT after having reported an error: frontend 0000000 http-request set-map %[hdr(0000)0_ Root cause is simple, in parse_http_set_map(), we define the release function (which is responsible to clear lf_expr expressions used by the action), prior to initializing the expressions, while the release function assumes the expressions are always initialized. For all similar actions, we already perform the init prior to setting the related release function, but this was not the case for parse_http_set_map(). We fix the bug by initializing the expressions earlier. Thanks to @Lzq-001 for having reported the issue and provided a simple reproducer. It should be backported to all stable versions, note for versions prior to 3.0, lf_expr_init() should be replace by LIST_INIT(), see `6810c41` ("MEDIUM: tree-wide: add logformat expressions wrapper")	2026-01-14 20:05:39 +01:00
Olivier Houchard	7f4b053b26	MEDIUM: counters: mostly revert `da813ae4d7` Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Contrarily to what was previously believed, there are corner cases where the counters may not be allocated, and we may want to make them optional at a later date, so we have to check if those counters are there. However, just checking that shared.tg is non-NULL is enough, we can then assume that shared.tg[tgid - 1] has properly been allocated too. Also modify the various COUNTER_SHARED_* macros to make sure they check for that too.	2026-01-14 12:39:14 +01:00
Amaury Denoyelle	7aa839296d	BUG/MEDIUM: quic: fix ACK ECN frame parsing Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details ACK frames are either of type 0x02 or 0x03. The latter is an indication that it contains extra ECN related fields. In haproxy QUIC stack, this is considered as a different frame type, set to QUIC_FT_ACK_ECN, with its own set of builder/parser functions. This patch fixes ACK ECN parsing function. Indeed, the latter suffered from two issues. First, 'first ACK range' and 'ACK ranges' were inverted. Then, the three remaining ECN fields were simply ignored by the parsing function. This issue can cause desynchronization in the frames parsing code, which may result in various result. Most of the time, the connection will be aborted by haproxy due to an invalid frame content read. Note that this issue was not detected earlier as most clients do not enable ECN support if the peer is not able to emit ACK ECN frame first, which haproxy currently never sends. Nevertheless, this is not the case for every client implementation, thus proper ACK ECN parsing is mandatory for a proper QUIC stack support. Fix this by adjusting quic_parse_ack_ecn_frame() function. The remaining ECN fields are parsed to ensure correct packet parsing. Currently, they are not used by the congestion controller. This must be backported up to 2.6.	2026-01-13 15:08:02 +01:00
Olivier Houchard	82196eb74e	BUG/MEDIUM: threads: Fix binding thread on bind. The code to parse the "thread" keyword on bind lines was changed to check if the thread numbers were correct against the value provided with max-threads-per-group, if any were provided, however, at the time those thread keywords have been set, it may not yet have been set, and that breaks the feature, so revert to check against MAX_THREADS_PER_GROUP instead, it should have no major impact.	2026-01-13 11:45:46 +01:00
Olivier Houchard	da813ae4d7	MEDIUM: counters: Remove some extra tests Before updating counters, a few tests are made to check if the counters exits. but those counters should always exist at this point, so just remmove them. This commit should have no impact, but can easily be reverted with no functional impact if various crashes appear.	2026-01-13 11:12:34 +01:00
Olivier Houchard	5495c88441	MEDIUM: counters: Dynamically allocate per-thread group counters Instead of statically allocating the per-thread group counters, based on the max number of thread groups available, allocate them dynamically, based on the number of thread groups actually used. That way we can increase the maximum number of thread groups without using an unreasonable amount of memory.	2026-01-13 11:12:34 +01:00
Willy Tarreau	37057feb80	BUG/MINOR: net_helper: fix IPv6 header length processing The IPv6 header contains a payload length that excludes the 40 bytes of IPv6 packet header, which differs from IPv4's total length which includes it. As a result, the parser was wrong and would only see the IP part and not the TCP one unless sufficient options were present tocover it. This issue came in 3.4-dev2 with recent commit `e88e03a6e4` ("MINOR: net_helper: add ip.fp() to build a simplified fingerprint of a SYN"), so no backport is needed.	2026-01-13 08:42:36 +01:00
Aurelien DARRAGON	fcd4d4a7aa	BUG/MINOR: hlua_fcn: ensure Patref:add_bulk() is given a table object before using it Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details As reported by GH user @kanashimia in GH #3241, providing anything else than a table to Patref:add_bulk() method could cause a segfault because we were calling lua_next() with the lua object without ensuring it actually is a table. Let's add the missing lua_istable() check on the stack object before calling lua_next() function on it. It should be backported up to 3.2 with `884dc62` ("MINOR: hlua_fcn: add Patref:add_bulk()")	2026-01-12 17:30:54 +01:00
Aurelien DARRAGON	04545cb2b7	BUG/MINOR: hlua_fcn: fix broken yield for Patref:add_bulk() In GH #3241, GH user @kanashimia reported that the Patref:add_bulk() method would raise a Lua exception when called with more than 101 elements at once. As identified by @kanashimia there was an error in the way the add_bulk() method was forced to yield after 101 elements precisely. The yield is there to ensure Lua doesn't eat too much ressources at once and doesn't impact haproxy's core responsiveness, but the check for the yield was misplaced resulting in improper stack content upon resume. Thanks to user @kanashimia who even provided a reproducer which helped a lot to troubleshoot the issue. This fix should be backported up to 3.2 with `884dc62` ("MINOR: hlua_fcn: add Patref:add_bulk()") where the bug was introduced.	2026-01-12 17:30:52 +01:00
Olivier Houchard	b1cfeeef21	BUG/MINOR: stats-file: Use a 16bits variable when loading tgid Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Now that the tgid stored in the stats file has been increased to 16bits by commit `022cb3ab7f`, don't forget to increase the variable size when reading it from the file, too. This should have no impact given the maximum thread group limit is still 32.	2026-01-12 09:48:54 +01:00
Olivier Houchard	c0f64fc36a	MINOR: receiver: Dynamically alloc the "members" field of shard_info Instead of always allocating MAX_TGROUPS members, allocate them dynamically, using the number of thread groups we'll use, so that increasing MAX_TGROUPS will not have a huge impact on the structure size.	2026-01-12 09:32:27 +01:00
Willy Tarreau	2560cce7c5	MINOR: tcp-sample: permit retrieving tcp_info from the connection/session stage Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details The fc_xxx info that are retrieved over tcp_info could currently not be accessed before a stream is created due to a test that verified the existence of a stream. The rationale here was that the function works both for frontend and backend. Let's always retrieve these info from the session for the frontend case so that it now becomes possible to set variables at connection/session time. The doc did not mention this limitation so this could almost be considered as a bug.	2026-01-11 15:48:20 +01:00
Willy Tarreau	880bbeeda4	MINOR: sample: also support retrieving fc.timer.handshake without a stream Some timers, like the handshake timer, are stored in the session and are only copied to the logs struct when a stream is created. But this means we can't measure it without a stream, nor store it once for all in a variable at session creation time. Let's extend the sample fetch function to retrieve it from the session when no stream is present. The doc did not mention this limitation so this could almost be considered as a bug.	2026-01-11 15:48:19 +01:00
Amaury Denoyelle	875bbaa7fc	MINOR: cfgparse: remove duplicate "force-persist" in common kw list Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details "force-persist" proxy keyword is listed twice in common_kw_list. This patch removes the duplicated occurence. This could be backported up to 2.4.	2026-01-09 16:45:54 +01:00
Willy Tarreau	46088b7ad0	MEDIUM: config: warn if some userlist hashes are too slow It was reported in GH #2956 and more recently in GH #3235 that some hashes are way too slow. The former triggers watchdog warnings during checks, the second sees the config parsing take 20 seconds. This is always due to the use of hash algorithms that are not suitable for use in low-latency environments like web. They might be fine for a local auth though. The difficulty, as explained by Philipp Hossner, is that developers are not aware of this cost and adopt this without suspecting any side effect. The proposal here is to measure the crypt() call time and emit a warning if it takes more than 10ms (which is already extreme). This was tested by Philipp and confirmed to catch his case. This is marked medium as it might start to report warnings on config suffering from this problem without ever detecting it till now.	2026-01-09 14:56:18 +01:00
akarl10	a203ce6854	BUG/MINOR: ech/quic: enable ech configuration also for quic listeners Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Patch `dba4fd24` ("MEDIUM: ssl/ech: config and load keys") introduced ECH configuration for bind lines, but the QUIC configuration parsers still suffers from not using the same code as the TCP/TLS one, so the init for QUIC was missed. Must be backported in 3.3.	2026-01-08 17:34:28 +01:00
William Lallemand	623aa725a2	BUG/MINOR: cli/stick-tables: argument to "show table" is optional Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Discussed in issue #3187, the CLI help is confusing for the "show table" command as it seems that the argument is mandatory. This patch adds the arguments between square brackets to remove the confusion.	2026-01-08 11:54:01 +01:00
Willy Tarreau	dbba442740	BUILD: sockpair: fix build issue on macOS related to variable-length arrays In GH issue #3226, Sergey Fedorov (@barracuda156) reported that since commit `10c14a1ed0` ("MINOR: proto_sockpair: send_fd_uxst: init iobuf, cmsghdr, cmsgbuf to zeros"), macOS 10.6.8 with gcc 14.3.0 doesn't build anymore: src/proto_sockpair.c: In function 'send_fd_uxst': src/proto_sockpair.c:246:49: error: variable-sized object may not be initialized except with an empty initializer 246 \| char cmsgbuf[CMSG_SPACE(sizeof(int))] = {0}; \| ^ src/proto_sockpair.c:247:45: error: variable-sized object may not be initialized except with an empty initializer 247 \| char buf[CMSG_SPACE(sizeof(int))] = {0}; \| ^ Upon investigation, it appears that the CMSG_SPACE() macro on this OS looks too complex for gcc to consider it as a constant, so it takes these buffers for variable-length arrays and cannot initialize them. Let's move to a simple memset() instead, which Sergey confirmed fixes the problem. This needs to be backported as far as 3.1. Thanks to Sergey for the report, the bisect and testing the fix.	2026-01-08 09:26:22 +01:00
Hyeonggeun Oh	c17ed69bf3	MINOR: cfgparse: Refactor "userlist" parser to print it in -dKall operation Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details This patch covers issue https://github.com/haproxy/haproxy/issues/3221. The parser for the "userlist" section did not use the standard keyword registration mechanism. Instead, it relied on a series of strcmp() comparisons to identify keywords such as "group" and "user". This had two main drawbacks: 1. The keywords were not discoverable by the "-dKall" dump option, making it difficult for users to see all available keywords for the section. 2. The implementation was inconsistent with the parsers for other sections, which have been progressively refactored to use the standard cfg_kw_list infrastructure. This patch refactors the userlist parser to align it with the project's standard conventions. The parsing logic for the "group" and "user" keywords has been extracted from the if/else block in cfg_parse_users() into two new dedicated functions: - cfg_parse_users_group() - cfg_parse_users_user() These two keywords are now registered via a dedicated cfg_kw_list, making them visible to the rest of the HAPorxy ecosystem, including the -dKall dump.	2026-01-07 18:25:09 +01:00
William Lallemand	91cff75908	BUG/MINOR: cfgparse: wrong section name upon error When a unknown keyword was used in the "userlist" section, the error was mentioning the "users" section, instead of "userlist". Could be backported in every branches.	2026-01-07 18:13:12 +01:00
William Lallemand	4aff6d1c25	BUILD: tools: memchr definition changed in C23 New gcc and clang versions from fedora rawhide seems to use the C23 standard by default. This version changes the definition of some string.h functions, which now return a const char * instead of a char . src/tools.c: In function ‘fgets_from_mem’: src/tools.c:7200:17: warning: assignment discards ‘const’ qualifier from pointer target type [-Wdiscarded-qualifiers] 7200 \| new_pos = memchr(position, '\n', size); \| ^ Strangely, -Wdiscarded-qualifiers does not seem to catch all the memchr. Should fix issue #3228. This could be backported in previous versions.	2026-01-07 14:51:26 +01:00
William Lallemand	5322bd3785	BUILD: ssl: strchr definition changed in C23 New gcc and clang versions from fedora rawhide seems to use the C23 standard by default. This version changes the definition of some string.h functions, which now return a const char * instead of a char *. src/ssl_sock.c: In function ‘SSL_CTX_keylog’: src/ssl_sock.c:4475:17: error: assignment discards ‘const’ qualifier from pointer target type [-Werror=discarded-qualifiers] 4475 \| lastarg = strrchr(line, ' '); Strangely, -Wdiscarded-qualifiers does not seem to catch all the strrchr. Should fix issue #3228. This could be backported in previous versions.	2026-01-07 14:51:26 +01:00
Willy Tarreau	71b00a945d	[RELEASE] Released version 3.4-dev2 Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Released version 3.4-dev2 with the following main changes : - BUG/MEDIUM: mworker/listener: ambiguous use of RX_F_INHERITED with shards - BUG/MEDIUM: http-ana: Properly detect client abort when forwarding response (v2) - BUG/MEDIUM: stconn: Don't report abort from SC if read0 was already received - BUG/MEDIUM: quic: Don't try to use hystart if not implemented - CLEANUP: backend: Remove useless test on server's xprt - CLEANUP: tcpcheck: Remove useless test on the xprt used for healthchecks - CLEANUP: ssl-sock: Remove useless tests on connection when resuming TLS session - REGTESTS: quic: fix a TLS stack usage - REGTESTS: list all skipped tests including 'feature cmd' ones - CI: github: remove openssl no-deprecated job - CI: github: add a job to test the master branch of OpenSSL - CI: github: openssl-master.yml misses actions/checkout - BUG/MEDIUM: backend: Do not remove CO_FL_SESS_IDLE in assign_server() - CI: github: use git prefix for openssl-master.yml - BUG/MEDIUM: mux-h2: synchronize all conditions to create a new backend stream - REGTESTS: fix error when no test are skipped - MINOR: cpu-topo: Turn the cpu policy configuration into a struct - MEDIUM: cpu-topo: Add a "threads-per-core" keyword to cpu-policy - MEDIUM: cpu-topo: Add a "cpu-affinity" option - MEDIUM: cpu-topo: Add a new "max-threads-per-group" global keyword - MEDIUM: cpu-topo: Add the "per-thread" cpu_affinity - MEDIUM: cpu-topo: Add the "per-ccx" cpu_affinity - BUG/MINOR: cpu-topo: fix -Wlogical-not-parentheses build with clang - DOC: config: fix number of values for "cpu-affinity" - MINOR: tools: add a secure implementation of memset - MINOR: mux-h2: add missing glitch count for non-decodable H2 headers - MINOR: mux-h2: perform a graceful close at 75% glitches threshold - MEDIUM: mux-h1: implement basic glitches support - MINOR: mux-h1: perform a graceful close at 75% glitches threshold - MEDIUM: cfgparse: acknowledge that proxy ID auto numbering starts at 2 - MINOR: cfgparse: remove useless checks on no server in backend - OPTIM/MINOR: proxy: do not init proxy management task if unused - MINOR: patterns: preliminary changes for reorganization - MEDIUM: patterns: reorganize pattern reference elements - CLEANUP: patterns: remove dead code - OPTIM: patterns: cache the current generation - MINOR: tcp: add new bind option "tcp-ss" to instruct the kernel to save the SYN - MINOR: protocol: support a generic way to call getsockopt() on a connection - MINOR: tcp: implement the get_opt() function - MINOR: tcp_sample: implement the fc_saved_syn sample fetch function - CLEANUP: assorted typo fixes in the code, commits and doc - BUG/MEDIUM: cpu-topo: Don't forget to reset visited_ccx. - BUG/MAJOR: set the correct generation ID in pat_ref_append(). - BUG/MINOR: backend: fix the conn_retries check for TFO - BUG/MINOR: backend: inspect request not response buffer to check for TFO - MINOR: net_helper: add sample converters to decode ethernet frames - MINOR: net_helper: add sample converters to decode IP packet headers - MINOR: net_helper: add sample converters to decode TCP headers - MINOR: net_helper: add ip.fp() to build a simplified fingerprint of a SYN - MINOR: net_helper: prepare the ip.fp() converter to support more options - MINOR: net_helper: add an option to ip.fp() to append the TTL to the fingerprint - MINOR: net_helper: add an option to ip.fp() to append the source address - DOC: config: fix the length attribute name for stick tables of type binary / string - MINOR: mworker/cli: only keep positive PIDs in proc_list - CLEANUP: mworker: remove duplicate list.h include - BUG/MINOR: mworker/cli: fix show proc pagination using reload counter - MINOR: mworker/cli: extract worker "show proc" row printer - MINOR: cpu-topo: Factorize code - MINOR: cpu-topo: Rename variables to better fit their usage - BUG/MEDIUM: peers: Properly handle shutdown when trying to get a line - BUG/MEDIUM: mux-h1: Take care to update <kop> value during zero-copy forwarding - MINOR: threads: Avoid using a thread group mask when stopping. - MINOR: hlua: Add support for lua 5.5 - MEDIUM: cpu-topo: Add an optional directive for per-group affinity - BUG/MEDIUM: mworker: can't use signals after a failed reload - BUG/MEDIUM: stconn: Move data from <kip> to <kop> during zero-copy forwarding - DOC: config: fix a few typos and refine cpu-affinity - MINOR: receiver: Remove tgroup_mask from struct shard_info - BUG/MINOR: quic: fix deprecated warning for window size keyword	2026-01-07 11:02:12 +01:00
Amaury Denoyelle	e061547d9d	BUG/MINOR: quic: fix deprecated warning for window size keyword QUIC configuration was cleaned up in the previous release. Several global keyword names were changed to unify the configuration. For each of them the older keyword is marked as deprecated, with a warning to mention the newer alternative. This patch fixes the warning for 'tune.quic.frontend.default-max-size' as the alternative proposed was not correct. The proper value now is 'tune.quic.fe.cc.max-win-size'. This must be backported up to 3.3.	2026-01-07 09:54:31 +01:00
Olivier Houchard	41cd589645	MINOR: receiver: Remove tgroup_mask from struct shard_info The only purpose from tgroup_mask seems to be to calculate how many tgroups share the same shard, but this is an information we can calculate differently, we just have to increment the number when a new receiver is added to the shard, and decrement it when one is detached from the shard. Removing thread group masks will allow us to increase the maximum number of thread groups past 64.	2026-01-07 09:27:12 +01:00
Christopher Faulet	83457b9e38	BUG/MEDIUM: stconn: Move data from <kip> to <kop> during zero-copy forwarding Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details The <kip> of producer was not forwarded to <kop> of consumer when zero-copy data forwarding was tried. Because of the issue, the chunking of emitted H1 messages could be invalid. To fix the bug, sc_ep_fwd_kip() must be called at this stage. This fix is related to the previous one (`529a8dbfb` "BUG/MEDIUM: mux-h1: Take care to update <kop> value during zero-copy forwarding"). Both are required to fully fix the issue #3230. This patch must be backported to 3.3.	2026-01-06 15:41:50 +01:00
William Lallemand	97490a7789	BUG/MEDIUM: mworker: can't use signals after a failed reload In issue #3229 it was reported that the master couldn't reload after a failed reload following a wrong configuration. It is still possible to do a reload using the "reload" command of the master CLI. But every signals are blocked. The problem was introduced in `709cde6d0` ("BUG/MEDIUM: mworker: signals inconsistencies during startup and reload") which fixes the blocking of signals during the reload. However the patch missed a case, indeed, the run_master_in_recovery_mode() is not being called when the worker failed to parse the configuration, it is only failing when the master is failing. To handle this case, the mworker_unblock_signals() function must be called upon mworker_on_new_child_failure(). But since this is called in an haproxy signal handler it would mess with the signals. Instead, the patch adds a task which is started by the signal handler, and restores the signals outside of it. This must be backported as far as 3.1.	2026-01-06 14:27:53 +01:00
Olivier Houchard	56fd0c1a5c	MEDIUM: cpu-topo: Add an optional directive for per-group affinity Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details When using per-group affinity, add an optional new directive. It accepts the values of "auto", where when multiple thread groups are created, the available CPUs are split equally across the groups, and is the new default, and "loose", where all groups are bound to all available CPUs, this is the old default.	2026-01-06 11:32:45 +01:00
Mike Lothian	1c0f781994	MINOR: hlua: Add support for lua 5.5 Lua 5.5 adds an extra argument to lua_newstate(). Since there are already a few other ifdefs in hlua.c checking for the Lua version, and there's a single call place, let's do the same here. This should be safe for backporting if needed. Signed-off-by: Mike Lothian <mike@fireburn.co.uk>	2026-01-06 11:05:02 +01:00
Olivier Houchard	853604f87a	MINOR: threads: Avoid using a thread group mask when stopping. Remove the "stopped_tgroup_mask" variable, that indicated which thread groups were stopping, and instead just use "stopped_tgroups", a counter indicating how many thread groups are stopping. We want to remove all thread group masks, so that we can increase the maximum number of thread groups past 64.	2026-01-06 08:30:55 +01:00
Christopher Faulet	529a8dbfba	BUG/MEDIUM: mux-h1: Take care to update <kop> value during zero-copy forwarding Since the extra field was removed from the HTX structure, a regression was introduced when forwarding of chunked messages. The <kop> value was not decreased as it should be when data were sent via the zero-copy forwarding. Because of this bug, it was possible to announce a chunk size larger than the chunk data sent. To fix the bug, an helper function was added to properly update the <kop> value when a chunk size is emitted. This function is now called when new chunk is announced, including during zero-copy forwarding. As a workaround, "tune.disable-zero-copy-forwarding" or just "tune.h1.zero-copy-fwd-send off" can be set in the global section. This patch should fix the issue #3230. It must be backported to 3.3.	2026-01-06 07:39:05 +01:00
Christopher Faulet	0b29b76a52	BUG/MEDIUM: peers: Properly handle shutdown when trying to get a line Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details When a shutdown was reported to a peer applet, the event was not properly handled if it failed to receive data. The function responsible to get data was exiting too early if the applet buffer was empty, without testing the sedesc status. Because of this issue, it was possible to have frozen peer applets. For instance, it happend on client timeout. With too many frozen applets, it was possible to reach the maxconn. This patch should fix the issue #3234. It must be backported to 3.3.	2026-01-05 13:46:57 +01:00
Olivier Houchard	196d16f2b1	MINOR: cpu-topo: Rename variables to better fit their usage Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Rename "visited_tsid" and "visited_ccx" to "touse_tsid" and "touse_ccx". They are not there to remember which tsid/ccx we alreaday visited, contrarily to visited_ccx_set and visited_cl_set, they are there to know which tsid/ccx we should use, so make that clear.	2026-01-05 09:25:48 +01:00
Olivier Houchard	bbf5c30a87	MINOR: cpu-topo: Factorize code Factorize the code common to cpu_policy_group_by_ccx() and cpu_policy_group_by_cluster() into a new function, cpu_policy_assign_threads().	2026-01-05 09:24:44 +01:00
Alexander Stephan	e241144e70	MINOR: mworker/cli: extract worker "show proc" row printer Introduce cli_append_worker_row() to centralize formatting of a single worker row. Also, replace duplicated row-printing code in both current and old workers loops with the helper. Motivation: Reduces LOC and improves readability by removing duplication.	2026-01-05 08:59:45 +01:00
Alexander Stephan	4c10d9c70c	BUG/MINOR: mworker/cli: fix show proc pagination using reload counter After commit `594408cd61` ("BUG/MINOR: mworker/cli: 'show proc' is limited by buffer size"), related to ticket #3204, the "show proc" logic has been fixed to be able to print more than 202 processes. However, this fix can lead to the omission of entries in case they have the same timestamp. To fix this, we use the unique reload counter instead of the timestamp. On partial flush, set ctx->next_reload = child->reloads. On resume skip entries with child->reloads >= ctx->next_reload. Finally, we clear ctx->next_reload at the end of a complete dump so subsequent show proc starts from the top. Could be backported in all stable branches.	2026-01-05 08:59:34 +01:00
Alexander Stephan	a5f274de92	CLEANUP: mworker: remove duplicate list.h include Drop the second #include <haproxy/list.h> from mworker.c. No functional change; reduces redundancy and keeps includes tidy.	2026-01-05 08:59:34 +01:00
Alexander Stephan	c30eeb2967	MINOR: mworker/cli: only keep positive PIDs in proc_list Change mworker_env_to_proc_list() to if (child->pid > 0) before LIST_APPEND, avoiding invalid PIDs (0/-1) in the process list. This has no functional impact beyond stricter validation and it aligns with existing kill safeguards.	2026-01-05 08:59:14 +01:00
Willy Tarreau	a206f85f96	MINOR: net_helper: add an option to ip.fp() to append the source address The new value 4 will permit to append the source address to the fingerprint, making it easier to build rules checking a specific path.	2026-01-01 10:32:16 +01:00
Willy Tarreau	70ffae3614	MINOR: net_helper: add an option to ip.fp() to append the TTL to the fingerprint With mode value 1, the TTL will be appended immediately after the 7 bytes, making it a 8-byte fingerprint.	2026-01-01 10:19:48 +01:00
Willy Tarreau	2c317cfed7	MINOR: net_helper: prepare the ip.fp() converter to support more options It can make sense to support extra components in the fingerprint to ease configuration, so let's change the 0/1 value to a bit field. We also turn the current 1 (TCP options list) to 2 so that we'll reuse 1 for the TTL.	2026-01-01 10:19:20 +01:00
Willy Tarreau	e88e03a6e4	MINOR: net_helper: add ip.fp() to build a simplified fingerprint of a SYN Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Here we collect all the stuff that depends on the sender's settings, such as TOS, IP version, TTL range, presence of DF bit or IP options, presence of DATA in the SYN, CWR+ECE flags, TCP header length, wscale, initial window, mss, as well as the list of TCP extension kinds. It's obviously fairly limited but can allows to avoid blacklisting certain valid clients sharing the same IP address as a misbehaving one. It supports both a short and a long mode depending on the argument. These can be used with the tcp-ss bind option. The doc was updated accordingly.	2025-12-31 17:17:38 +01:00
Willy Tarreau	6e46d1345b	MINOR: net_helper: add sample converters to decode TCP headers This adds the following converters, used to decode fields in an incoming tcp header: tcp.dst, tcp.flags, tcp.seq, tcp.src, tcp.win, tcp.options.mss, tcp.options.tsopt, tcp.options.tsval, tcp.options.wscale, tcp.options_list, These can be used with the tcp-ss bind option. The doc was updated accordingly.	2025-12-31 17:17:23 +01:00
Willy Tarreau	e0a7a7ca43	MINOR: net_helper: add sample converters to decode IP packet headers This adds a few converters that help decode parts of IP packets: - ip.data : returns the next header (typically TCP) - ip.df : returns the dont-fragment flags - ip.dst : returns the destination IPv4/v6 address - ip.hdr : returns only the IP header - ip.proto: returns the upper level protocol (udp/tcp) - ip.src : returns the source IPv4/v6 address - ip.tos : returns the TOS / TC field - ip.ttl : returns the TTL/HL value - ip.ver : returns the IP version (4 or 6) These can be used with the tcp-ss bind option. The doc was updated accordingly.	2025-12-31 17:16:29 +01:00
Willy Tarreau	90d2f157f2	MINOR: net_helper: add sample converters to decode ethernet frames This adds a few converters that help decode parts of ethernet frame headers: - eth.data : returns the next header (typically IP) - eth.dst : returns the destination MAC address - eth.hdr : returns only the ethernet header - eth.proto: returns the ethernet proto - eth.src : returns the source MAC address - eth.vlan : returns the VLAN ID when present These can be used with the tcp-ss bind option. The doc was updated accordingly.	2025-12-31 17:15:36 +01:00
Willy Tarreau	933cb76461	BUG/MINOR: backend: inspect request not response buffer to check for TFO In 2.6, do_connect_server() was introduced by commit `0a4dcb65f` ("MINOR: stream-int/backend: Move si_connect() in the backend scope") and changed the approach to work with a stream instead of a stream-interface. However si_oc(si) was wrongly turned to &s->res instead of &s->req, which breaks TFO by always inspecting the response channel to figure whether there are data pending. This fix can be backported to all versions till 2.6.	2025-12-31 13:03:53 +01:00
Willy Tarreau	799653d536	BUG/MINOR: backend: fix the conn_retries check for TFO In 2.6, the retries counter on a stream was changed from retries left to retries done via commit `731c8e6cf` ("MINOR: stream: Simplify retries counter calculation"). However, one comparison fell through the cracks in order to detect whether or not we can use TFO (only first attempt), resulting in TFO never working anymore. This may be backported to all versions till 2.6.	2025-12-31 13:03:53 +01:00
Maxime Henrion	51592f7a09	BUG/MAJOR: set the correct generation ID in pat_ref_append(). Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details This fixes crashes when creating more than one new revision of a map or acl file and purging the previous version.	2025-12-31 00:29:47 +01:00
Olivier Houchard	54f59e4669	BUG/MEDIUM: cpu-topo: Don't forget to reset visited_ccx. Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details We want to reset visited_ccx, as introduced by commit `8aef5bec1e`, each time we run the loop, otherwise the chances of its content being correct are very low, and will likely end up being bound to the wrong threads. This was reported in github issue #3224.	2025-12-26 23:55:57 +01:00
Ilia Shipitsin	f8a77ecf62	CLEANUP: assorted typo fixes in the code, commits and doc Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details	2025-12-25 19:45:29 +01:00
Willy Tarreau	6fb521d2f6	MINOR: tcp_sample: implement the fc_saved_syn sample fetch function Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details This function retrieves the copy of a SYN packet that the system has kept for us when bind option "tcp-ss" was set to 1 or above. It's recommended to copy it to a local variable because it will be freed after being read. It allows to inspect all parts of an incoming SYN packet, provided that it was preserved (e.g. not possible with SYN cookies). The doc provides examples of how to use it.	2025-12-24 18:39:37 +01:00
Willy Tarreau	52d60bf9ee	MINOR: tcp: implement the get_opt() function It relies on the generic sock_conn_get_opt() function and will permit sample fetch functions to retrieve generic TCP-level info.	2025-12-24 18:38:51 +01:00
Willy Tarreau	6d995e59e9	MINOR: protocol: support a generic way to call getsockopt() on a connection It's regularly needed to call getsockopt() on a connection, but each time the calling code has to do all the job by itself. This commit adds a "get_opt()" callback on the protocol struct, that directly calls getsockopt() on the connection's FD. A generic implementation for standard sockets is provided, though QUIC would likely require a different approach, or maybe a mapping. Due to the overlap between IP/TCP/socket option values, it is necessary for the caller to indicate both the level and the option. An abstraction of the level could be done, but the caller would nonetheless have to know the optname, which is generally defined in the same include files. So for now we'll consider that this callback is only for very specific use. The levels and optnames are purposely passed as signed ints so that it is possible to further extend the API by using negative levels for internal namespaces.	2025-12-24 18:38:51 +01:00
Willy Tarreau	44c67a08dd	MINOR: tcp: add new bind option "tcp-ss" to instruct the kernel to save the SYN This option enables TCP_SAVE_SYN on the listening socket, which will cause the kernel to try to save a copy of the SYN packet header (L2, IP and TCP are supported). This can permit to check the source MAC address of a client, or find certain TCP options such as a source address encapsulated using RFC7974. It could also be used as an alternate approach to retrieving the source and destination addresses and ports. For now setting the option is enabled, but sample fetch functions and converters will be needed to extract info.	2025-12-24 11:35:09 +01:00
Maxime Henrion	1fdccbe8da	OPTIM: patterns: cache the current generation Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details This makes a significant difference when loading large files and during commit and clear operations, thanks to improved cache locality. In the measurements below, master refers to the code before any of the changes to the patterns code, not the code before this one commit. Timing the replacement of 10M entries from the CLI with this command which also reports timestamps at start, end of upload and end of clear: $ (echo "prompt i"; echo "show activity"; echo "prepare acl #0"; awk '{print "add acl @1 #0",$0}' < bad-ip.map; echo "show activity"; echo "commit acl @1 #0"; echo "clear acl @0 #0";echo "show activity") \| socat -t 10 - /tmp/sock1 \| grep ^uptim master, on a 3.7 GHz EPYC, 3 samples: uptime_now: 6.087030 uptime_now: 25.981777 => 21.9 sec insertion time uptime_now: 29.286368 => 3.3 sec commit+clear uptime_now: 5.748087 uptime_now: 25.740675 => 20.0s insertion time uptime_now: 29.039023 => 3.3 s commit+clear uptime_now: 7.065362 uptime_now: 26.769596 => 19.7s insertion time uptime_now: 30.065044 => 3.3s commit+clear And after this commit: uptime_now: 6.119215 uptime_now: 25.023019 => 18.9 sec insertion time uptime_now: 27.155503 => 2.1 sec commit+clear uptime_now: 5.675931 uptime_now: 24.551035 => 18.9s insertion uptime_now: 26.652352 => 2.1s commit+clear uptime_now: 6.722256 uptime_now: 25.593952 => 18.9s insertion uptime_now: 27.724153 => 2.1s commit+clear Now timing the startup time with a 10M entries file (on another machine) on master, 20 samples: Standard Deviation, s: 0.061652677408033 Mean: 4.217 And after this commit: Standard Deviation, s: 0.081821371548669 Mean: 3.78	2025-12-23 21:17:39 +01:00
Maxime Henrion	99e625a41d	CLEANUP: patterns: remove dead code Situations where we are iterating over elements and find one with a different generation ID cannot arise anymore since the elements are kept per-generation.	2025-12-23 21:17:39 +01:00
Maxime Henrion	545cf59b6f	MEDIUM: patterns: reorganize pattern reference elements Instead of a global list (and tree) of pattern reference elements, we now have an intermediate pat_ref_gen structure and store the elements in those. This simplifies the logic of some operations such as commit and clear, and improves performance in some cases - numbers to be provided in a subsequent commit after one important optimization is added. A lot of the changes are due to adding an extra level of indirection, changing many cases where we iterate over all elements to an outer loop iterating over the generation and an inner one iterating over the elements of the current generation. It is therefore easier to read this patch using 'git diff -w'.	2025-12-23 21:17:39 +01:00
Maxime Henrion	5547bedebb	MINOR: patterns: preliminary changes for reorganization Safe and non-functional changes that only add currently unused structures, field, functions and macros, in preparation of larger changes that alter the way pattern reference elements are stored. This includes code to create and lookup generation objects, and macros to iterate over the generations of a pattern reference.	2025-12-23 21:17:39 +01:00
Amaury Denoyelle	a4a17eb366	OPTIM/MINOR: proxy: do not init proxy management task if unused Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details Each proxy has its owned task for internal purpose. Currently, it is only used either by frontends or if a stick-table is present. This commit rendres the task allocation optional to only the required case. Thus, it is not allocated anymore for backend only proxies without stick-table.	2025-12-23 16:35:49 +01:00
Amaury Denoyelle	c397f6fc9a	MINOR: cfgparse: remove useless checks on no server in backend A legacy check could be activated at compile time to reject backends without servers. In practice this is not used anymore and does not have much sense with the introduction of dynamic servers.	2025-12-23 16:35:49 +01:00
Amaury Denoyelle	b562602044	MEDIUM: cfgparse: acknowledge that proxy ID auto numbering starts at 2 Each frontend/backend/listen proxies is assigned an unique ID. It can either be set explicitely via 'id' keyword, or automatically assigned on post parsing depending on the available values. It was expected that the first automatically assigned value would start at '1'. However, due to a legacy bug this is not the case as this value is always skipped. Thus, automatically assigned proxies always start at '2' or more. To avoid breaking the current existing state, this situation is now acknowledged with the current patch. The code is rewritten with an explicit warning to ensure that this won't be fixed without knowing the current status. A new regtest also ensures this.	2025-12-23 16:35:49 +01:00
Willy Tarreau	5904f8279b	MINOR: mux-h1: perform a graceful close at 75% glitches threshold Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details This avoids hitting the hard wall for connections with non-compliant peers that are accumulating errors. We recycle the connection early enough to permit to reset the counter. Example below with a threshold set to 100: Before, 1% errors: $ h1load -H "Host : blah" -c 1 -n 10000000 0:4445 # time conns tot_conn tot_req tot_bytes err cps rps bps ttfb 1 1 1039 103872 6763365 1038 1k03 103k 54M1 9.426u 2 1 2128 212793 14086140 2127 1k08 108k 58M5 8.963u 3 1 3215 321465 21392137 3214 1k08 108k 58M3 8.982u 4 1 4307 430684 28735013 4306 1k09 109k 58M6 8.935u 5 1 5390 538989 36016294 5389 1k08 108k 58M1 9.021u After, no more errors: $ h1load -H "Host : blah" -c 1 -n 10000000 0:4445 # time conns tot_conn tot_req tot_bytes err cps rps bps ttfb 1 1 1509 113161 7487809 0 1k50 113k 59M9 8.482u 2 1 3002 225101 15114659 0 1k49 111k 60M9 8.582u 3 1 4508 338045 22809911 0 1k50 112k 61M5 8.523u 4 1 5971 447785 30286861 0 1k46 109k 59M7 8.772u 5 1 7472 560335 37955271 0 1k49 112k 61M2 8.537u	2025-12-20 19:29:37 +01:00
Willy Tarreau	05b457002b	MEDIUM: mux-h1: implement basic glitches support We now count glitches for each parsing error, including those that have been accepted via accept-unsafe-violations-*. Front and back are considered and the connection gets killed on error once if the threshold is reached or passed and the CPU usage is beyond the configured limit (0 by default). This was tested with: curl -ivH "host : blah" 0:4445{,,,,,,,,,} which sends 10 requests to a configuration having a threshold of 5. The global keywords are named similarly to H2 and quic: tune.h1.be.glitches-threshold xxxx tune.h1.fe.glitches-threshold xxxx The glitches count of each connection is also reported when non-null in the connection dumps (e.g. "show fd").	2025-12-20 19:29:33 +01:00
Willy Tarreau	0901f60cef	MINOR: mux-h2: perform a graceful close at 75% glitches threshold This avoids hitting the hard wall for connections with non-compliant peers that would be accumulating errors over long connections. We now permit to recycle the connection early enough to reset the connection counter. This was tested artificially by adding this to h2c_frt_handle_headers(): h2c_report_glitch(h2c, 1, "new stream"); or this to h2_detach(): h2c_report_glitch(h2c, 1, "detaching"); and injecting using h2load -c 1 -n 1000 0:4445 on a config featuring tune.h2.fe.glitches-threshold 1000: finished in 8.74ms, 85802.54 req/s, 686.62MB/s requests: 1000 total, 751 started, 751 done, 750 succeeded, 250 failed, 250 errored, 0 timeout status codes: 750 2xx, 0 3xx, 0 4xx, 0 5xx traffic: 6.00MB (6293303) total, 132.57KB (135750) headers (space savings 29.84%), 5.86MB (6144000) data min max mean sd +/- sd time for request: 9us 178us 10us 6us 99.47% time for connect: 139us 139us 139us 0us 100.00% time to 1st byte: 339us 339us 339us 0us 100.00% req/s : 87477.70 87477.70 87477.70 0.00 100.00% The failures are due to h2load not supporting reconnection.	2025-12-20 19:26:29 +01:00
Willy Tarreau	52adeef7e1	MINOR: mux-h2: add missing glitch count for non-decodable H2 headers One rare error case could produce a protocol error on the stream when not being able to decode response headers wasn't being accounted as a glitch, so let's fix it.	2025-12-20 19:11:16 +01:00
Maxime Henrion	c8750e4e9d	MINOR: tools: add a secure implementation of memset Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details This guarantees that the compiler will not optimize away the memset() call if it detects a dead store. Use this to clear SSL passphrases. No backport needed.	2025-12-19 17:42:57 +01:00
William Lallemand	03340748de	BUG/MINOR: cpu-topo: fix -Wlogical-not-parentheses build with clang src/cpu_topo.c:1325:15: warning: logical not is only applied to the left hand side of this bitwise operator [-Wlogical-not-parentheses] 1325 \| } else if (!cpu_policy_conf.flags & CPU_POLICY_ONE_THREAD_PER_CORE) \| ^ ~ src/cpu_topo.c:1325:15: note: add parentheses after the '!' to evaluate the bitwise operator first 1325 \| } else if (!cpu_policy_conf.flags & CPU_POLICY_ONE_THREAD_PER_CORE) \| ^ \| ( ) src/cpu_topo.c:1325:15: note: add parentheses around left hand side expression to silence this warning 1325 \| } else if (!cpu_policy_conf.flags & CPU_POLICY_ONE_THREAD_PER_CORE) \| ^ \| ( ) src/cpu_topo.c:1533:15: warning: logical not is only applied to the left hand side of this bitwise operator [-Wlogical-not-parentheses] 1533 \| } else if (!cpu_policy_conf.flags & CPU_POLICY_ONE_THREAD_PER_CORE) \| ^ ~ src/cpu_topo.c:1533:15: note: add parentheses after the '!' to evaluate the bitwise operator first 1533 \| } else if (!cpu_policy_conf.flags & CPU_POLICY_ONE_THREAD_PER_CORE) \| ^ \| ( ) src/cpu_topo.c:1533:15: note: add parentheses around left hand side expression to silence this warning 1533 \| } else if (!cpu_policy_conf.flags & CPU_POLICY_ONE_THREAD_PER_CORE) \| ^ \| ( ) No backport needed.	2025-12-19 10:15:17 +01:00
Olivier Houchard	8aef5bec1e	MEDIUM: cpu-topo: Add the "per-ccx" cpu_affinity Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details Add a new cpu-affinity keyword, "per-ccx". If used, each thread will be bound to all the hardware threads available in one CCX of the threads group.	2025-12-18 18:52:52 +01:00
Olivier Houchard	c524b181a2	MEDIUM: cpu-topo: Add the "per-thread" cpu_affinity Add a new cpu-affinity keyword, "per-thread". If used, each thread will be bound to only one hardware thread of the thread group. If used in conjonction with the "threads-per-core 1" cpu_policy, then each thread will be bound on a different core.	2025-12-18 18:52:52 +01:00
Olivier Houchard	7e22d9c484	MEDIUM: cpu-topo: Add a new "max-threads-per-group" global keyword Add a new global keyword, max-threads-per-group. It sets the maximum number of threads a thread group can contain. Unless the number of thread groups is fixed with "thread-groups", haproxy will just create more thread groups as needed. The default and maximum value is 64.	2025-12-18 18:52:52 +01:00
Olivier Houchard	3865f6c5c6	MEDIUM: cpu-topo: Add a "cpu-affinity" option Add a new global option, "cpu-affinity", which controls how threads are bound. It currently accepts three values, "per-core", which will bind one thread to each hardware thread of a given core, and "per-group" which will use all the available hardware threads of the thread group, and "auto", the default, which will use "per-group", unless "threads-per-core 1" has been specified in cpu_policy, in which case it will use per-core.	2025-12-18 18:52:52 +01:00
Olivier Houchard	3671652bc9	MEDIUM: cpu-topo: Add a "threads-per-core" keyword to cpu-policy Add a new, optional key-word to "cpu-policy", "threads-per-core". It takes one argument, "1" or "auto". If "1" is used, then only one thread per core will be created, no matter how many hardware thread each core has. If "auto" is used, then one thread will be created per hardware thread, as is the case by default. for example: cpu-policy performance threads-per-core 1	2025-12-18 18:52:52 +01:00
Olivier Houchard	58f04b4615	MINOR: cpu-topo: Turn the cpu policy configuration into a struct Turn the cpu policy configuration into a struct. Right now it just contains an int, that represents the policy used, but will get more information soon.	2025-12-18 18:52:52 +01:00
Willy Tarreau	9a046fc3ad	BUG/MEDIUM: mux-h2: synchronize all conditions to create a new backend stream In H2 the conditions to create a new stream differ for a client and a server when a GOAWAY was exchanged. While on the server, any stream whose ID is lower than or equal to the one advertised in GOAWAY is valid, for a client it's forbidden to create any stream after receipt of a GOAWAY, even if its ID is lower than or equal to the last one, despite the server not being able to tell the difference from the number of streams in flight. Unfortunately, the logic in the code did not always reflect this specificity of the client (the backend code in our case), and most often considered that it was still permitted to create a new stream until the max_id was greater than or equal to the advertised last_id. This is for example what h2c_is_dead() and h2c_streams_left() do. In other places, such as h2_avail_streams(), the rule is properly taken into account. Very often the advertised last_id is the same, and this is also what haproxy does (which explains why it's impossible to reproduce the issue by chaining two haproxy layers), but a server may wish to advertise any ID including 2^31-1 as mentioned in the spec, and in this case the functions would behave differently. This discrepancy results in a corner case where a GOAWAY received on an idle connection will cause the next stream creation to be initially accepted but then rejected via h2_avail_streams(), and the connection left in a bad state, still attached to the session due to http-reuse safe, but not reinserted into idle list, since the backend code currently is not able to properly recover from this situation. Worse, the idle flags are no longer on it but TASK_F_USR1 still is, and this makes the recently added BUG_ON() rightfully trigger since this case is not supposed to happen. Admittedly more of the backend recovery code needs to be reworked, however the mux must consistently decide whether or not a connection may be reused or needs to be released. This commit fixes the affected logic by introducing a new function "h2c_reached_last_stream()" which says if a connection has reached its last stream, regardless of the side, and using this one everywhere max_id was compared to last_id. This is sufficient to address the corner case that be_reuse_connection() currently cannot recover from. This is in relation to GH issue #3215 and it should be sufficient to fix the issue there. Thanks to Chris Staite for reporting the issue and kudos to Amaury for spotting the events sequence that can lead to this situation. This patch must be backported to 3.3 first, then to older versions later. It's worth noting that it's much more difficult to observe the issue before 3.3 because the BUG_ON() is not there, and the possibly non-released connection might end up being killed for other reasons (timeouts etc). But one possible visible effect might be the impossibility to delete a server (which Chris observed in 3.3).	2025-12-18 17:01:32 +01:00
Olivier Houchard	40d16af7a6	BUG/MEDIUM: backend: Do not remove CO_FL_SESS_IDLE in assign_server() Back in the mists of time, commit `e91a526c8f` decided that if we were trying to stay on the same server than the previous request, and if there were a connection available in the session, we'd remove its CO_FL_SESS_IDLE. The reason for doing that has been long lost, probably it fixed a bug at some point, but it was most probably not the right place to do that. And starting with 3.3, this triggers a BUG_ON() because that flag is expected later on. So just revert the commit, if the ancient bug shows up again, it will be fixed another way. This should be backported to 3.3. There is little reason to backport it to previous versions, unless other patches depend on it.	2025-12-18 16:09:34 +01:00
Christopher Faulet	a25394b6c8	CLEANUP: ssl-sock: Remove useless tests on connection when resuming TLS session In ssl_sock_srv_try_reuse_sess(), the connection is always defined, to TCP and QUIC connections. No reason to test it. Because it is not so obvious for the QUIC part, a BUG_ON() could be added here. For now, just remove useless tests. This patch should fix a Coverity report from #3213.	2025-12-15 08:16:59 +01:00
Christopher Faulet	d6b1d5f6e9	CLEANUP: tcpcheck: Remove useless test on the xprt used for healthchecks The xprt used to perform a healthcheck is always defined and cannot be NULL. So there is no reason to test it. It could lead to wrong assumptions later in the code. This patch should fix a Coverity report from #3213.	2025-12-15 08:01:21 +01:00
Christopher Faulet	5c5914c32e	CLEANUP: backend: Remove useless test on server's xprt The server's xprt is always defined and cannot be NULL. So there is no reason to test it. It could lead to wrong assumptions later in the code. This patch should fix a Coverity report from #3213.	2025-12-15 07:56:53 +01:00
Olivier Houchard	a08bc468d2	BUG/MEDIUM: quic: Don't try to use hystart if not implemented Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Not every CC algos implement hystart, so only call the method if it is actually there. Failure to do so will cause crashes if hystart is on, and the algo doesn't implement it. This should fix github issue #3218 This should be backported up to 3.0.	2025-12-14 16:46:12 +01:00
Christopher Faulet	54e58103e5	BUG/MEDIUM: stconn: Don't report abort from SC if read0 was already received Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details SC_FL_ABRT_DONE flag should never be set when SC_FL_EOS was already set. These both flags were introduced to replace the old CF_SHUTR and to have a flag for shuts driven by the stream and a flag for the read0 received by the mux. So both flags must not be seen at same time on a SC. It is espeically important because some processing are performed when these flags are set. And wrong decisions may be made. This patch must be backproted as far as 2.8.	2025-12-12 08:41:08 +01:00
Christopher Faulet	a483450fa2	BUG/MEDIUM: http-ana: Properly detect client abort when forwarding response (v2) The first attempt to fix this issue (`c672b2a29` "BUG/MINOR: http-ana: Properly detect client abort when forwarding the response") was not fully correct and could be responsible to false report of client abort during the response forwarding. I guess it is possible to truncate the response. Instead, we must also take care that the client closed on its side, by checking SC_FL_EOS flag on the front SC. Indeed, if the client has aborted, this flag should be set. This patch should be backported as far as 2.8.	2025-12-12 08:41:08 +01:00
William Lallemand	5b19d95850	BUG/MEDIUM: mworker/listener: ambiguous use of RX_F_INHERITED with shards Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details The RX_F_INHERITED flag was ambiguous, as it was used to mark both listeners inherited from the parent process and listeners duplicated from another local receiver. This could lead to incorrect behavior concerning socket unbinding and suspension. This commit refactors the handling of inherited listeners by splitting the RX_F_INHERITED flag into two more specific flags: - RX_F_INHERITED_FD: Indicates a listener inherited from the parent process via its file descriptor. These listeners should not be unbound by the master. - RX_F_INHERITED_SOCK: Indicates a listener that shares a socket with another one, either by being inherited from the parent or by being duplicated from another local listener. These listeners should not be suspended or resumed individually. Previously, the sharding code was unconditionally using RX_F_INHERITED when duplicating a file descriptor. In HAProxy versions prior to 3.1, this led to a file descriptor leak for duplicated unix stats sockets in the master process. This would eventually cause the master to crash with a BUG_ON in fd_insert() once the file descriptor limit was reached. This must be backported as far as 3.0. Branches earlier than 3.0 are affected but would need a different patch as the logic is different.	2025-12-11 18:09:47 +01:00
Willy Tarreau	3ec5818807	MINOR: h2/trace: emit a trace of the received RST_STREAM type Right now we don't get any state trace when receiving an RST_STREAM, and this is not convenient because RST_STREAM(0) is not visible at all, except in developer level because the function is entered and left. Let's extract the RST code first and always log it using TRACE_PRINTF() (along with h2c/h2s) so that it's possible to detect certain codes being used.	2025-12-10 15:58:56 +01:00
Amaury Denoyelle	5b8e6d6811	BUG/MEDIUM: h3: fix access to QCS <sd> definitely Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details The previous patch tried to fix access to QCS <sd> member, as the latter is not always allocated anymore on the frontend side. `a15f0461a0` BUG/MEDIUM: h3: do not access QCS <sd> if not allocated In particular, access was prevented after HEADERS parsing in case h3_req_headers_to_htx() returned an error, which indicates that the stream-endpoint allocation was not performed. However, this still is not enough when QCS instance is already closed at this step. Indeed, in this case, h3_req_headers_to_htx() returns OK but stream-endpoint allocation is skipped as an optimization as no data exchange will be performed. To definitely fix this kind of problems, add checks on qcs <sd> member before accessing it in H3 layer. This method is the safest one to ensure there is no NULL dereferencement. This should fix github issue #3211. This must be backported along the above mentionned patch.	2025-12-10 12:04:37 +01:00
Maxime Henrion	6eedd0d485	CLEANUP: more conversions and cleanups for alignment Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details - Convert additional cases to use the automatic alignment feature for the THREAD_ALIGN(ED) macros. This includes some cases that are less obviously correct where it seems we wanted to align only in the USE_THREAD case but were not using the thread specific macros. - Also move some alignment requirements to the structure definition instead of having it on variable declaration.	2025-12-09 17:40:58 +01:00
Maxime Henrion	bc8e14ec23	CLEANUP: use the automatic alignment feature - Use the automatic alignment feature instead of hardcoding 64 all over the code. - This also converts a few bare __attribute__((aligned(X))) to using the ALIGNED macro.	2025-12-09 17:14:58 +01:00
Olivier Houchard	420b42df1c	BUG/MEDIUM: ssl: Don't resume session for check connections Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Don't attempt to use stored sessions when creating new check connections, as the check SSL parameters might be different from the server's ones. This has not been proven to be a problem yet, but it doesn't mean it can't be, and this should be backported up to 2.8 along with `dcce936912` if it is.	2025-12-09 16:45:54 +01:00
Olivier Houchard	be4e1220c2	BUG/MEDIUM: ssl: Don't store the ALPN for check connections When establishing check connections, do not store the negociated ALPN into the server's path_param if the connection is a check connection, as it may use different SSL parameters than the regular connections. To do so, only store them if the CO_FL_SSL_NO_CACHED_INFO is not set. Otherwise, the check ALPN may be stored, and the wrong mux can be used for regular connections, which will end up generating 502s. This should fix Github issue #3207 This should be backported to 3.3.	2025-12-09 16:43:31 +01:00
Olivier Houchard	dcce936912	MINOR: connections: Add a new CO_FL_SSL_NO_CACHED_INFO flag Add a new flag to connections, CO_FL_SSL_NO_CACHED_INFO, and set it for checks. It lets the ssl layer know that he should not use cached informations, such as the ALPN as stored in the server, or cached sessions. This wlil be used for checks, as checks may target different servers, or used a different SSL configuration, so we can't assume the stored informations are correct. This should be backported to 3.3, and may be backported up to 2.8 if the attempts to do session resume by checks is proven to be a problem.	2025-12-09 16:43:31 +01:00
Olivier Houchard	260d64d787	BUG/MEDIUM: ssl: Always check the ALPN after handshake Move the code that is responsible for checking the ALPN, and updating the one stored in the server's path_param, from after we created the mux, to after we did an handshake. Once we did it once, the mux will not be created by the ssl code anymore, as when we know which mux to use thanks to the ALPN, it will be done earlier in connect_server(), so in the unlikely event it changes, we would not detect it anymore, and we'd keep on creating the wrong mux. This can be reproduced by doing a first request, and then changing the ALPN of the server without haproxy noticing (ie without haproxy noticing that the server went down). This should be backported to 3.3.	2025-12-09 16:43:31 +01:00
William Lallemand	594408cd61	BUG/MINOR: mworker/cli: 'show proc' is limited by buffer size In ticket #3204, it was reported that "show proc" is not able to display more than 202 processes. Indeed the bufsize is 16k by default in the master, and can't be changed anymore since 3.1. This patch allows the 'show proc' to start again to dump when the buffer is full, based on the timestamp of the last PID it attempted to dump. Using pointers or count the number of processes might not be a good idea since the list can change between calls. Could be backported in all stable branche.	2025-12-09 16:09:10 +01:00
William Lallemand	dabe8856ad	CLEANUP: mworker/cli: remove useless variable The msg variable is declared and free but never used, this patch removes it.	2025-12-09 16:09:10 +01:00
Amaury Denoyelle	a15f0461a0	BUG/MEDIUM: h3: do not access QCS <sd> if not allocated Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Since the following commit, allocation of QCS stream-endpoint on FE side has been delayed. The objective is to allocate it only for QCS attached to an upper stream object. Stream-endpoint allocation is now performed on qcs_attach_sc() called during HEADERS parsing. commit `e6064c5616` OPTIM: mux-quic: delay FE sedesc alloc to stream creation Also, stream-endpoint is accessed through the QCS instance after HEADERS or DATA frames parsing, to update the known input payload length. The above patch triggered regressions as in some code paths, <sd> field is dereferenced while still being NULL. This patch fixes this by restricting access to <sd> field after newer conditions. First, after HEADERS parsing, known input length is only updated if h3_req_headers_to_htx() previously returned a success value, which guarantee that qcs_attach_sc() has been executed. After DATA parsing, <sd> is only accessed after the frame validity check. This ensures that HEADERS were already parsed, thus guaranteing that stream-endpoint is allocated. This should fix github issue #3211. This must be backported up to 3.3. This is sufficient, unless above patch is backported to previous releases, in which case the current one must be picked with it.	2025-12-09 15:00:23 +01:00
Christopher Faulet	3cf4e7afb9	BUG/MEDIUM: http-ana: Don't close server connection on read0 in TUNNEL mode It is a very old bug (2012), dating from the introduction of the keep-alive support to HAProxy. When a request is fully received, the SC on backend side is switched to NOHALF mode. It means that when the read0 is received from the server, the server connection is immediately closed. It is expected to do so at the end of a classical request. However, it must not be performed if the session is switched to the TUNNEL mode (after an HTTP/1 upgrade or a CONNECT). The client may still have data to send to the server. And closing brutally the server connection this way will be handled as an error on client side. This bug is especially visible when a H2 connection on client side because a RST_STREAM is emitted and a "SD--" is reported in logs. Thanks to @chrisstaite This patch should fix the issue #3205. It must be backported to all stable versions.	2025-12-08 15:22:01 +01:00

... 2 3 4 5 6 ...

20717 commits