haproxy

mirror of https://github.com/haproxy/haproxy.git synced 2026-04-20 22:01:49 -04:00

Author	SHA1	Message	Date
Amaury Denoyelle	940e1820f6	MEDIUM: quic/mux-quic: adjust app-ops install This patch reworks the installation of app-ops layer by QUIC MUX. Previously, app_ops field was stored directly into the quic_conn structure. Then the MUX reused it directly during its qmux_init(). This patch removes app_ops field from quic_conn and replaces it with a copy of the negotiated ALPN. By using quic_alpn_to_app_ops(), it ensures it remains compatible with a known application layer. On the MUX layer, qcc_install_app_ops() now uses the standard conn_get_alpn() to retrieve the ALPN from the transport layer. This is done via the newly defined <get_alpn> QUIC xprt callback. This new architecture should be cleaner as it better highlights the responsibility of each layers in the ALPN/app negotiation.	2026-03-03 16:22:57 +01:00
Amaury Denoyelle	9c7cf1c684	MINOR: mux-quic: add function for ALPN to app-ops conversion Extract the conversion from ALPN to qcc_app_ops type from quic_conn source file into QUIC MUX. The newly created function is named quic_alpn_to_app_ops(). This will serve as a central point to identify which ALPNs are currently supported in our QUIC stack. This patch is purely a small refactoring. It will be useful for the next one which rework MUX app-ops layer init. The current cleanup allows notably to remove H3/hq-interop headers from quic_conn source file.	2026-03-03 16:20:16 +01:00
Amaury Denoyelle	4120faf289	MINOR: quic/h3: reorganize stream reject after MUX closure The QUIC MUX layer is closed after its transport counterpart. This may be necessary then to reject any new streams opened by the remote peer. This operation is dependent however from the application protocol. Previously, a function qc_h3_request_reject() was directly implemented in quic_conn source file for use when HTTP/3 was previously negotiated. However, this solution was not evolutive and broke layering. This patch introduces a new proper separation with a <strm_reject> callback defined in quic_conn structure. When set, it will be used to preemptively close any new stream. QUIC MUX is responsible to set it just before its closure. No functional change. This patch is purely a refactoring with a better architecture design. Especially, H3 specific code from transport layer is now completely removed.	2026-03-03 16:19:13 +01:00
Amaury Denoyelle	58830990d0	MINOR: quic: use signed char type for ALPN manipulation In most of haproxy code, ALPN is used as a signed char pointer. In QUIC code instead, it is manipulated as unsigned. Unifies this by using signed type in QUIC code. This allows to remove a bunch of unnecessary casts.	2026-03-03 16:11:58 +01:00
Christopher Faulet	13c3445163	BUG/MEDIUM: stream: Handle TASK_WOKEN_RES as a stream event The conversion of TASK_WOKEN_RES to a stream event was missing. Among other things, this wakeup reason is used when a stream is dequeued. So it was possible to skip the connection establishment if the stream was also woken up for a timer reason. When this happened, the stream was blocked till the queue timeout expiration. Converting TASK_WOKEN_RES to STRM_EVT_RES fixes the issue. This patch should fix the issue #3290. It must be backported as far as 3.2.	2026-03-03 15:16:10 +01:00
Amaury Denoyelle	f58b2698ce	MEDIUM: proxy: add lock for global accesses during proxy free Define a new lock with label PROXIES_DEL_LOCK. Its purpose is to protect operations performed on global lists or trees while a proxy is freed. Currently, this lock is unneeded as proxies are only freed on single-thread init or deinit. However, with the incoming dynamic backend deletion, this operation will be also performed at runtime, outside of thread isolation.	2026-03-02 14:09:25 +01:00
Amaury Denoyelle	98c8c5e16e	MINOR: cli: implement wait on be-removable Implement be-removable argument to CLI wait. This is implemented via be_check_for_deletion() invokation, also used by "del backend" handler. The objective is to test whether a backend instance can be removed. If this is not the case, the command may returns immediately if the target proxy is incompatible with dynamic removal or if a user action is required. Else, the command will wait until the temporary restriction is lifted.	2026-03-02 14:08:30 +01:00
Amaury Denoyelle	7f725f0754	MINOR: proxy: prevent deletion of backend referenced by config elements Define a new proxy flag PR_FL_NON_PURGEABLE. This is used to mark every proxy instance explicitely referenced in the config. Such instances cannot be deleted at runtime. Static use_backend/default_backend rules are handled in proxy_finalize(). Also, sample expression proxy references are protected via smp_resolve_args(). Note that this last case also incidentally protects any proxies referenced via a CLI "set var" expression. This should not be the case as in this case variable value is instantly resolved so the proxy reference is not needed anymore. This also affects dynamic servers.	2026-03-02 14:08:30 +01:00
Amaury Denoyelle	ee1f0527c6	MINOR: proxy: rename default refcount to avoid confusion Rename proxy conf <refcount> to <def_ref>. This field only serves for defaults proxy instances. The objective is to avoid confusion with the newly introduced <refcount> field used for dynamic backends. As an optimization, it could be possible to remove <def_ref> and only use <refcount> also for defaults proxies usage. However for now the simplest solution is implemented. This patch does not bring any functional change.	2026-03-02 14:07:40 +01:00
Amaury Denoyelle	f3127df74d	MINOR: proxy: add refcount to proxies Implement refcount notion into proxy structure. The objective is to be able to increment refcount on proxy to prevent its deletion temporarily. This is similar to the server refcount : "del backend" is not blocked and will remove the targetted instance from the global proxies_list. However, the final free operation is delayed until the refcount is null. As stated above, the API is similar to servers. Proxies are initialized with a refcount of 1. Refcount can be incremented via proxy_take(). When no longer useful, refcount is decremented via proxy_drop() which replaces the older free_proxy(). Deinit is only performed once refcount is null. This commit also defines flag PR_FL_DELETED. It is set when a proxy instance has been removed via a "del backend" CLI command. This should serve as indication to modules which may still have a refcount on the target proxy so that they can release it as soon as possible. Note that this new refcount is completely ignored for a default proxy instance. For them, proxy_take() is pure noop. Free is immediately performed on first proxy_drop() invokation.	2026-03-02 10:44:59 +01:00
Amaury Denoyelle	ebbdfc5915	MINOR: lua: use watcher for proxies iterator Ensures proxies iteration via lua functions is safe via a new watcher member. The principle is similar to the one already used for servers iteration.	2026-03-02 10:36:21 +01:00
Amaury Denoyelle	20376c54e2	MINOR: stats: protect proxy iteration via watcher Define a new <px_watch> watcher member in stats applet context. It is used to register the applet on a proxy when iterating over the proxies list. <obj1> is automatically updated via the watcher interaction. Watcher is first initialized prior to stats_dump_proxies() invocation. This guarantees that stats dump is safe even if applet yields and a backend is removed in parallel.	2026-02-27 10:28:24 +01:00
Amaury Denoyelle	4bcfc09acf	MINOR: proxy: define proxy watcher member Define a new member watcher_list in proxy. It will be used to register modules which iterate over the proxies list. This will ensure that the operation is safe even if a backend is removed in parallel.	2026-02-27 10:28:24 +01:00
Amaury Denoyelle	08623228a1	MINOR: proxy: define a basic "del backend" CLI Add "del backend" handler which is restricted to admin level. Along with it, a new function be_check_for_deletion() is used to test if the backend is removable.	2026-02-27 10:28:24 +01:00
Amaury Denoyelle	d166894fef	MINOR: server: refactor srv_detach() Correct documentation for srv_detach() which previously stated that this function could be called for a server even if not stored in its proxy list. In fact there is a BUG_ON() which detects this case.	2026-02-26 18:24:36 +01:00
Amaury Denoyelle	dd55f2246e	MINOR: proxy: convert proxy flags to uint Proxy flags member were of type char. This will soon enough not be sufficient as new flags will be defined. As such, convert flags member to unsigned int type.	2026-02-26 18:24:36 +01:00
Willy Tarreau	b604064980	MEDIUM: counters: make EXTRA_COUNTERS_GET() consider tgid Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Now we store and retrieve only counters for the current tgid when more than one is supported. This allows to significantly reduce contention on shared stats. The haterm utility saw its performance increase from 4.9 to 5.8M req/s in H1, and 6.0 to 7.6M for H2, both with 5 groups of 16 threads, showing that we don't necessarily need insane amounts of groups.	2026-02-26 17:03:53 +01:00
Willy Tarreau	9019a5db93	MEDIUM: counters: return aggregate extra counters in ->fill_stats() Now thanks to new macro EXTRA_COUNTERS_AGGR() we can iterate over all thread groups storages when returning the data for a given metric. This remains convenient and mostly transparent. The caller continues to pass the pointer to the metric in the first group, and offsets are calculated for all other groups and data summed. For now all groups except the first one contain only zeroes but reported values are nevertheless correct.	2026-02-26 17:03:53 +01:00
Willy Tarreau	de0eddf512	MINOR: counters: add EXTRA_COUNTERS_BASE() to retrieve extra_counters base storage The goal is to always retrieve the storage address of the first thread group for the given module. This will be used to iterate over all thread groups. For now it returns the same value as EXTRA_COUNTERS_GET().	2026-02-26 17:03:53 +01:00
Willy Tarreau	a60e1fcf7f	MEDIUM: counters: store the number of thread groups accessing extra_counters In order to be able to properly allocate all storage and retrieve data from there, we'll need to know how many thread groups are supposed to access it. Let's store the number of thread groups at init time. If the tgrp_step is zero, there's always only one tg though. Now EXTRA_COUNTERS_ALLOC() takes this number of thread groups in argument and stores it in the structure. It also allocates as many areas as needed, incrementing the datap pointer by the step for each of them. EXTRA_COUNTERS_FREE() uses this info to free all allocated areas. EXTRA_COUNTERS_INIT() initializes all allocated areas, this is used elsewhere to clear/preset counters, e.g. in proxy_stats_clear_counters(). It involves a memcpy() call for each array, which is normally preset to something empty but might also be used to preset certain non-scalar fields such as an instance name.	2026-02-26 17:03:53 +01:00
Willy Tarreau	7ac47910a2	MINOR: counters: store a tgroup step for extra_counters to access multiple tgroups We'll need to permit any user to update its own tgroup's extra counters instead of the global ones. For this we now store the per-tgroup step between two consecutive data storages, for when they're stored in a tgroup array. When shared (e.g. resolvers or listeners), we just store zero to indicate that it doesn't scale with tgroups. For now only the registration was handled, it's not used yet.	2026-02-26 17:03:53 +01:00
Willy Tarreau	04a9f86a85	MEDIUM: counters: add a dedicated storage for extra_counters in various structs Servers, proxies, listeners and resolvers all use extra_counters. We'll need to move the storage to per-tgroup for those where it matters. Now we're relying on an external storage, and the data member of the struct was replaced with a pointer to that pointer to data called datap. When the counters are registered, these datap are set to point to relevant locations. In the case of proxies and servers, it points to the first tgrp's storage. For listeners and resolvers, it points to a local storage. The rationale here is that listeners are limited to a single group anyway, and that resolvers have a low enough load so that we do not care about contention there. Nothing should change for the user at this point.	2026-02-26 17:03:47 +01:00
Willy Tarreau	95a9f472d2	MEDIUM: counters: change the fill_stats() API to pass the module and extra_counters We'll soon need to iterate over thread groups in the fill_stats() functions, so let's first pass the extra_counters and stats_module pointers to the fill_stats functions. They now call EXTRA_COUNTERS_GET() themselves with these elements in order to retrieve the required pointer. Nothing else changed, and it's getting even a bit more transparent for callers. This doesn't change anything visible however.	2026-02-26 08:24:03 +01:00
Willy Tarreau	56fc12d6fa	CLEANUP: stats: drop stats.h / stats-t.h where not needed A number of C files include stats.h or stats-t.h, many of which were just to access the counters. Now those which really need counters rely on counters.h or counters-t.h, which already reduces the amount of preprocessed code to be built (~3000 lines or about 0.05%).	2026-02-26 08:24:03 +01:00
Willy Tarreau	2b463e9b1f	REORG: stats/counters: move extra_counters to counters not stats It was always difficult to find extra_counters when the rest of the counters are now in counters-t.h. Let's move the types to counters-t.h and the macros to counters.h. Stats include them since they're used there. But some users could be cleaned from the stats definitions now.	2026-02-26 08:24:03 +01:00
Willy Tarreau	9910af6117	CLEANUP: quic-stats: include counters from quic_stats There's something a bit awkward in the way stats counters are inherited through the QUIC modules: quic_conn-t includes quic_stats-t.h, which declares quic_stats_module as extern from a type that's not known from this file. And anyway externs should not be exported from type defintions since they're not part of the ABI itself. This commit moves the declaration to quic_stats.h which now takes care to include stats-t.h to get the definition of struct stats_module. The few users who used to learn it through quic_conn-t.h now include it explicitly. As a bonus this reduces the number of preprocessed lines by 5000 (~0.1%). By the way, it looks like struct stats_module could benefit from being moved off stats-t.h since it's only used at places where the rest of the stats is not needed. Maybe something to consider for a future cleanup.	2026-02-26 08:24:03 +01:00
Frederic Lecaille	89c75b0777	BUG/MINOR: quic: missing app ops init during backend 0-RTT sessions The QUIC mux requires "application operations" (app ops), which are a list of callbacks associated with the application level (i.e., h3, h0.9) and derived from the ALPN. For 0-RTT, when the session cache cannot be reused before activation, the current code fails to reach the initialization of these app ops, causing the mux to crash during its initialization. To fix this, this patch restores the behavior of ssl_sock_srv_try_reuse_sess(), whose purpose was to reuse sessions stored in the session cache regardless of whether 0-RTT was enabled, prior to this commit: MEDIUM: quic-be: modify ssl_sock_srv_try_reuse_sess() to reuse backend sessions (0-RTT) With this patch, this function now does only one thing: attempt to reuse a session, and that's it! This patch allows ignoring whether a session was successfully reused from the cache or not. This directly fixes the issue where app ops initialization was skipped upon a session cache reuse failure. From a functional standpoint, starting a mux without reusing the session cache has no negative impact; the mux will start, but with no early data to send. Finally, there is the case where the ALPN is reset when the backend is stopped. It is critical to continue locking read access to the ALPN to secure shared access, which this patch does. It is indeed possible for the server to be stopped between the call to connect_server() and quic_reuse_srv_params(). But this cannot prevent the mux to start without app ops. This is why a 'TODO' section was added, as a reminder that a race condition regarding the ALPN reset still needs to be fixed. Must be backported to 3.3	2026-02-25 11:13:52 +01:00
Maxime Henrion	a9dc8e2587	MINOR: quic: add a new metric for ncbuf failures Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details This counts the number of times we failed to add data to the ncbuf buffer because of the gap size limit.	2026-02-23 17:47:45 +01:00
Willy Tarreau	868dd3e88b	MINOR: traces: always mark trace_source as thread-aligned Some perf profiles occasionally show that reading the trace source's state can take some time, which is not expected at all. It just happens that the trace_source is not cache-aligned so depending on linkage, it may share a cache line with a more active variable, thereby inducing a slow down to all threads trying to read the variable. Let's always mark it aligned to avoid this. For now the problem was not observed again.	2026-02-23 16:22:59 +01:00
Amaury Denoyelle	b8cb8e1a65	BUG/MINOR: quic: fix counters used on BE side quic_conn is initialized with a pointer to its proxy counters. These counters are then updated during the connection lifetime. Counters pointer was incorrect for backend quic_conn, as it always referenced frontend counters. For pure backend, no stats would be updated. For listen instances, this resulted in incorrect stats reporting. Fix this by correctly set proxy counters based on the connection side. This must be backported up to 3.3.	2026-02-20 14:08:27 +01:00
Amaury Denoyelle	de5fc2f515	BUG/MINOR: server: set auto SNI for dynamic servers Auto SNI configuration is configured during check config validity. However, nothing was implemented for dynamic servers. Fix this by implementing auto SNI configuration during "add server" CLI handler. Auto SNI configuration code is moved in a dedicated function srv_configure_auto_sni() called both for static and dynamic servers. Along with this, allows the keyword "no-sni-auto" on dynamic servers, so that this process can be deactivated if wanted. Note that "sni-auto" remains unavailable as it only makes sense with default-servers which are never used for dynamic server creation. This must be backported up to 3.3.	2026-02-20 09:02:47 +01:00
Aurelien DARRAGON	2b7849fd02	BUG/MINOR: stats-file: manipulate shm-stats-file heartbeat using unsigned int shm-stats-file heartbeat is derived from now_ms with an extra time added to it, thus it should be handled using the same time as now_ms is. Until now, we used to handle heartbeat using signed integer. This was not found to cause severe harm but it could result in improper handling due to early wrapping because of signedness for instance, so let's better fix that before it becomes a real issue. It should be backported in 3.3	2026-02-19 16:13:55 +01:00
Frederic Lecaille	c9d47804d1	MINOR: haterm: add haterm HTTP server Contrary to haproxy, httpterm does not support all the HTTP protocols. Furthermore, it has become easier to handle inbound/outbound connections / streams since the rework done at conn_stream level. This patch implements httpterm HTTP server services into haproxy. To do so, it proceeds the same way as for the TCP checks which use only one stream connector, but on frontend side. The makefile is modified to handle haterm.c in additions to all the C files for haproxy to build new haterm program into haproxy, the haterm server also instantiates a haterm stream (hstream struct) attached to a stream connector for each incoming connection without backend stream connector. This is the role of sc_new_from_endp() called by the muxes to instantiate streams/hstreams. As for stream_new(), hstream_new() instantiates a task named process_hstream() (see haterm.c) which has the same role as process_stream() but for haterm streams. haterm into haproxy takes advantage of the HTTP muxes and HTX API to support all the HTTP protocols supported by haproxy.	2026-02-19 15:10:37 +01:00
Frederic Lecaille	2bf091e9da	MINOR: stconn: stream instantiation from proxy callback Add a pointer to function to proxies as ->stream_new_from_sc proxy struct member to instantiate stream from connection as this is done by all the muxes when they call sc_new_from_endp(). The default value for this pointer is obviously stream_new() which is exported by this patch.	2026-02-19 14:46:49 +01:00
Frederic Lecaille	6013f4baeb	MINOR: init: allow a fileless init mode This patch provides the possibility to initialize haproxy without configuration file. This may be identified by the new global and exported <fileless_mode> and <fileless_cfg> variables which may be used to provide a struct cfgfile to haproxy by others means than a physical file (built in memory). When enabled, this fileless mode skips all the configuration files parsing.	2026-02-19 14:46:49 +01:00
Frederic Lecaille	234ce775c3	MINOR: trace: add definitions for haterm streams Add definitions for haterm stream as arguments to be used by the TRACE API. This will be used by the haterm module to come which will have to handle hstream struct objects (in place of stream struct objects).	2026-02-19 14:46:49 +01:00
Frederic Lecaille	5d3bca4b17	MINOR: ssl/ckch: certificates generation from "load" "crt-store" directive Add "generate-dummy" on/off type keyword to "load" directive to automatically generate dummy certificates as this is done for ACME from ckch_conf_load_pem_or_generate() function which is called if a "crt" keyword is also provide for this directive. Also implement "keytype" to specify the key type used for these certificates. Only "RSA" or "ECDSA" is accepted. This patch also implements "bits" keyword for the "load" directive to specify the private key size used for RSA. For ECDSA, a new "curves" keyword is also provided by this patch to specify the curves to be used for the EDCSA private keys generation. ckch_conf_load_pem_or_generate() is modified to use these parameters provided by "keytype", "bits" and "curves" to generate the private key with ssl_gen_EVP_PKEY() before generating the X509 certificate calling ssl_gen_x509().	2026-02-19 14:46:49 +01:00
Frederic Lecaille	36b1fba871	MINOR: ssl/ckch: Move EVP_PKEY and cert code generation from acme Move acme_EVP_PKEY_gen() implementation to ssl_gencrt.c and rename it to ssl_EVP_PKEY_gen(). Also extract from acme_gen_tmp_x509() the generic part to implement ssl_gen_x509() into ssl_gencrt.c. To generate a self-signed expired certificate ssl_EVP_PKEY_gen() must be used to generate the private key. Then, ssl_gen_x509() must be called with the private key as argument. acme_gen_tmp_x509() is also modified to called these two functions to generate a temporary certificate has done before modifying this part. Such an expired self-signed certificate should not be use on the field but only during testing and development steps.	2026-02-19 14:46:47 +01:00
Nenad Merdanovic	5a079d1811	MEDIUM: Add connect/queue/tarpit timeouts to set-timeout Add the ability to set connect, queue and tarpit timeouts from the set-timeout action. This is especially useful when using set-dst to dynamically connect to servers. This patch also adds the relevant fe_/be_/cur_ sample fetches for these timeouts.	2026-02-19 08:20:37 +01:00
Christopher Faulet	a324616cdb	MINOR: dynbuf: Add helpers to know if a buffer is a default or a large buffer b_is_default() and b_is_large() can now be used to know if a buffer is a default buffer or a large one. _b_free() now relies on it. These functions are also used when possible (stream_free(), stream_release_buffers() and http_wait_for_msg_body()).	2026-02-18 13:26:21 +01:00
Christopher Faulet	5737fc9518	MEDIUM: http-ana: Use a large buffer if necessary when waiting for body Thanks to previous patches, it is now possible to allocate a large buffer to store the message payload in the context of the "wait-for-body" action. To do so, "use-large-buffer" option must be set. It means now it is no longer necessary to increase the regular buffer size to be able to get message payloads of some requests or responses.	2026-02-18 13:26:21 +01:00
Christopher Faulet	ce912271db	MEDIUM: chunk: Add support for large chunks Because there is now a memory pool for large buffers, we must also add the support for large chunks. So, if large buffers are configured, a dedicated memory pool is created to allocate large chunks. alloc_large_trash_chunk() must be used to allocate a large chunk. alloc_trash_chunk_sz() can be used to allocate a chunk with the best size. However free_trash_chunk() remains the only way to release a chunk, regular or large. In addition, large trash buffers are also created, using the same mechanism than for regular trash buffers. So three thread-local trash buffers are created. get_large_trash_chunk() must be used to get a large trash buffer. And get_trash_chunk_sz() may be used to get a trash buffer with the best size.	2026-02-18 13:26:21 +01:00
Christopher Faulet	d89ec33a34	MEDIUM: dynbuf: Add a pool for large buffers with a configurable size Add the support for large bufers. A dedicated memory pool is added. The size of these buffers must be explicitly configured by setting "tune.bufsize.large" directive. If it is not set, the pool is not created. In addition, if the size for large buffers is the same than for regular buffer, the feature is automatically disable. For now, large buffers remain unused.	2026-02-18 13:26:21 +01:00
Christopher Faulet	4f27a72d19	MEDIUM: htx: Improve detection of fragmented/unordered HTX messages First, an HTX flags was added to know when blocks are unordered. It may happen when a header is added while part of the payload was already received or when the start-line is replaced by an new one. In these cases, the blocks indexes are in the right order but not the blocks payload. Knowing a message is unordered can be useful to trigger a defragmentation, mainly to be able to append data properly for instance. Then, detection of fragmented messages was improved, especially when a header or a start-line is replaced by a new one. Finally, when data are added in a message and cannot be appended into the previous DATA block because the message is not aligned, a defragmentation is performed to realign the message and append data.	2026-02-18 13:26:21 +01:00
Christopher Faulet	53b7150357	MEDIUM: stream: Limit number of synchronous send per stream wakeup It is not a bug fix, because there is no way to hit the issue for now. But there is nothing preventing a loop of synchronous sends in process_stream(). Indead, when a synchronous send is successfully performed, we restart the SCs evaluation and at the end another synchronous send is attempted. So with an endpoint consuming data bit by bit or with a filter fowarding few bytes at each call, it is possible to loop for a while in process_stream(). Because it is not expected, we now limit the number of synchronous send per wakeup to two calls. In a nominal case, it should never be more. This commit is mandatory to be able to handle large buffers on channels There is no reason to backport this commit except if the large buffers support on channels are backported.	2026-02-18 13:26:21 +01:00
Christopher Faulet	6bf450b7fe	MINOR: tree-wide: Use the buffer size instead of global setting when possible At many places, we rely on global.tune.bufsize value instead of using the buffer size. For now, it is not a problem. But if we want to be able to deal with buffers of different sizes, it is good to reduce as far as possible dependencies on the global value. most of time, we can use b_size() or c_size() functions. The main change is performed on the error snapshot where the buffer size was added into the error_snapshot structure.	2026-02-18 13:26:20 +01:00
Christopher Faulet	806c8c830d	REORG: stconn: Move functions related to channel buffers to sc_strm.h sc_have_buff(), sc_need_buff(), sc_have_room() and sc_need_room() are related to the buffer's channel. So we can move them in sc_strm.h header file. In addition, this will be mandatory for the next commit.	2026-02-18 09:44:16 +01:00
Christopher Faulet	1b1a0b3bae	MINOR: stconn: Add missing SC_FL_NO_FASTFWD flag in sc_show_flags SC_FL_NO_FASTFWD flag was not listed in sc_show_flags() function. Let's do so. This patch could be backported as far as 3.0.	2026-02-18 09:44:16 +01:00
Amaury Denoyelle	f3003d1508	BUG/MAJOR: Revert "MEDIUM: mux-quic: add BUG_ON if sending on locally closed QCS" This reverts commit `235e8f1afd`. Prior to the above commit, snd_buf callback for QUIC MUX was able to deal with data even after stream closure. The excess was simply discarded, as no STREAM frame can be emitted after FIN/RESET_STREAM. This code was later removed and replaced by a BUG_ON() to ensure snd_buf is never called after stream closure. However, this approach is too strict. Indeed, there is nothing in the haproxy stream architecture which forbids this scheduling, in part because QUIC MUX is the sole responsible of the stream closure. As such, it is preferable to revert to the old code to prevent any triggering of a BUG_ON() failure. Note that nego_ff does not implement data draining if called after stream closure. This will be done in a future patch. Thanks to Mike Walker for his investigation on the subject. This must be backported up to 2.8.	2026-02-17 18:18:44 +01:00
William Lallemand	df8e05815c	BUG/MINOR: ssl: lack crtlist_dup_ssl_conf() declaration Add lacking crtlist_dup_ssl_conf() declaration in ssl_crt-list.h. Could be backported if needed.	2026-02-16 16:04:35 +01:00
Aurelien DARRAGON	d71e2e73ea	MEDIUM: filters: use per-channel filter list when relevant Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details In the historical implementation, all filter related information where stored at the stream level (using struct strm_flt * context), and filters iteration was performed at the stream level also. We identified that this was not ideal and would make the implementation of future filters more complex since filters ordering should be handled in a different order during request and response handling for decompression for instance. To make such thing possible, in this commit we migrate some channel specific filter contexts in the channel directly (request or response), and we implement 2 additional filter lists, one on the request channel and another on the response channel. The historical stream filter list is kept as-is because in some contexts only the stream is available and we have to iterate on all filters. But for functions where we only are interested in request side or response side filters, we now use dedicated channel filters list instead. The only overhead is that the "struct filter" was expanded by two "struct list". For now, no change of behavior is expected.	2026-02-13 12:24:13 +01:00
Aurelien DARRAGON	bb6cfbe754	MINOR: filters: rework filter iteration for channel related callback functions Multiple channel related functions have the same construction: they use list_for_each_entry() to work on a given filter from the stream+channel combination. In future commits we will try to use filter list from dedicated channel list instead of the stream one, thus in this patch we need as a prerequisite to implement and use the flt_list_{start,next} API to iterate over filter list, giving the API the responsibility to iterate over the correct list depending on the context, while the calling function remains free to use the iteration construction it needs. This way we will be able to easily change the way we iterate over filter list without duplicating the code for requests and responses.	2026-02-13 12:24:07 +01:00
Egor Shestakov	f5f9c008b1	CLEANUP: initcall: adjust comments to INITCALL{0,1} macros Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details It is likely that copy-pasted text was not initially adjusted in the comments, that is misleading regarding number of arguments. No backport needed.	2026-02-11 09:10:56 +01:00
William Lallemand	ea92b0ef01	BUG/MINOR: ssl: SSL_CERT_DIR environment variable doesn't affect haproxy Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details The documentation of @system-ca specifies that one can overwrite the value provided by the SSL Library using SSL_CERT_DIR. However it seems like X509_get_default_cert_dir() is not affected by this environment variable, and X509_get_default_cert_dir_env() need to be used in order to get the variable name, and get the value manually. This could be backported in every stable branches. Note that older branches don't have the memprintf in ssl_sock.c.	2026-02-10 21:34:45 +01:00
Willy Tarreau	e2631ee5f7	MEDIUM: activity: apply and use new finegrained task profiling settings In continuity of previous patch, this one makes use of the new profiling flags. For this, based on the global "profiling" setting, when switching profiling on, we set or clear two flags on the thread context, TH_FL_TASK_PROFILING_L and TH_FL_TASK_PROFILING_M to indicate whether lock profiling and/or malloc profiling are desired when profiling is enabled. These flags are checked along with TH_FL_TASK_PROFILING to decide when to collect time around a lock or a malloc. And by default we're back to the behavior of 3.2 in that neither lock nor malloc times are collected anymore. This is sufficient to see the CPU usage spent in the VDSO to significantly drop from 22% to 2.2% on a highly loaded system. This should be backported to 3.3 along with the previous patch.	2026-02-10 17:52:59 +01:00
Willy Tarreau	a7b2353cb3	MINOR: activity: support setting/clearing lock/memory watching for task profiling Damien Claisse reported in issue #3257 a performance regression between 3.2 and 3.3 when task profiling is enabled, more precisely in relation with the following patches were merged: `98cc815e3e` ("MINOR: activity: collect time spent with a lock held for each task") `503084643f` ("MINOR: activity: collect time spent waiting on a lock for each task") `9d8c2a888b` ("MINOR: activity: collect CPU time spent on memory allocations for each task") The issue mostly comes from the first patches. What happens is that the local time is taken when entering and leaving each lock, which costs a lot on a contended system. The problem here is the lack of finegrained settings for lock and malloc profiling. This patch introduces a better approach. The task profiler goes back to its default behavior in on/auto modes, but the configuration now accepts new extra options "lock", "no-lock", "memory", "no-memory" to precisely indicate other timers to watch for each task when profiling turns on. This is achieved by setting two new flags HA_PROF_TASKS_LOCK and HA_PROF_TASKS_MEM in the global "profiling" variable. This patch only parses the new values and assigns them to the global variable from the config file for now. The doc was updated.	2026-02-10 17:47:02 +01:00
Willy Tarreau	64c5d45a26	BUG/MEDIUM: lb-chash: always properly initialize lb_nodes with dynamic servers Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details An issue was introduced in 3.0 with commit `faa8c3e024` ("MEDIUM: lb-chash: Deterministic node hashes based on server address"): the new server_key field and lb_nodes entries initialization were not updated for servers added at run time with "add server": server_key remains zero and the key used in lb_node remains the one depending only on the server's ID. This will cause trouble when adding new servers with consistent hashing, because the hash-key will be ignored until the server's weight changes and the key difference is detected, leading to its recalculation. This is essentially caused by the poorly placed lb_nodes initialization that is specific to lb-chash and had to be replicated in the code dealing with server addition. This commit solves the problem by adding a new ->server_init() function in the lbprm proxy struct, that is called by the server addition code. This also allows to abandon the complex check for LB algos that was placed there for that purpose. For now only lb-chash provides such a function, and calls it as well during initial setup. This way newly added servers always use the correct key now. While it should also theoretically have had an impact on servers added with the "random" algorithm, it's unlikely that the difference between proper server keys and those based on their ID could have had any visible effect. This patch should be backported as far as 3.0. The backport may be eased by a preliminary backport of previous commit "CLEANUP: lb-chash: free lb_nodes from chash's deinit(), not global", though this is not strictly necessary if context is manually adjusted.	2026-02-10 07:22:54 +01:00
Amaury Denoyelle	5753c14e84	MINOR: proxy: assign dynamic proxy ID Implement proxy ID generation for dynamic backends. This is performed through the already function existing proxy_get_next_id(). As an optimization, lookup will performed starting from a global variable <dynpx_next_id>. It is initialized to the greatest ID assigned after parsing, and updated each time a backend instance is created. When backend deletion will be implemented, it could be lowered to the newly available slot.	2026-02-06 17:28:27 +01:00
Amaury Denoyelle	e152913327	MINOR: proxy: parse mode on dynamic backend creation Add an optional "mode" argument to "add backend" CLI command. This argument allows to specify if the backend is in TCP or HTTP mode. By default, it is mandatory, unless the inherited default proxy already explicitely specifies the mode. To differentiate if TCP mode is implicit or explicit, a new proxy flag PR_FL_DEF_EXPLICIT_MODE is defined. It is set for every defaults instances which explicitely defined their mode.	2026-02-06 17:27:50 +01:00
Amaury Denoyelle	817003aa31	MINOR: backend: add function to check support for dynamic servers Move backend compatibility checks performed during 'add server' in a dedicated function be_supports_dynamic_srv(). This should simplify addition of future restriction. This function will be reused when implementing backend creation at runtime.	2026-02-06 14:35:19 +01:00
Amaury Denoyelle	dc6cf224dd	MINOR: proxy: refactor mode parsing Define a new utility function str_to_proxy_mode() which is able to convert a string into the corresponding proxy mode if possible. This new function is used for the parsing of "mode" configuration proxy keyword. This patch will be reused for dynamic backend implementation, in order to parse a similar "mode" argument via a CLI handler.	2026-02-06 14:35:18 +01:00
Amaury Denoyelle	87ea407cce	MINOR: proxy: refactor proxy inheritance of a defaults section If a proxy is referencing a defaults instance, some checks must be performed to ensure that inheritance will be compatible. Refcount of the defaults instance may also be incremented if some settings cannot be copied. This operation is performed when parsing a new proxy of defaults section which references a defaults, either implicitely or explicitely. This patch extracts this code into a dedicated function named proxy_ref_defaults(). This in turn may call defaults_px_ref() (previously called proxy_ref_defaults()) to increment its refcount. The objective of this patch is to be able to reuse defaults inheritance validation for dynamic backends created at runtime, outside of the parsing code.	2026-02-06 14:35:18 +01:00
Amaury Denoyelle	a8bc83bea5	MINOR: cfgparse: move proxy post-init in a dedicated function A lot of proxies initialization code is delayed on post-parsing stage, as it depends on the configuration fully parsed. This is performed via a loop on proxies_list. Extract this code in a dedicated function proxy_finalize(). This patch will be useful for dynamic backends creation. Note that for the moment the code has been extracted as-is. With each new features, some init code was added there. This has become a giant loop with no real ordering. A future patch may provide some cleanup in order to reorganize this.	2026-02-06 14:35:18 +01:00
Amaury Denoyelle	2c8ad11b73	MINOR: cfgparse: validate defaults proxies separately Default proxies validation occurs during post-parsing. The objective is to report any tcp/http-rules which could not behave as expected. Previously, this was performed while looping over standard proxies list, when such proxy is referencing a default instance. This was enough as only named referenced proxies were kept after parsing. However, this is not the case anymore in the context of dynamic backends creation at runtime. As such, this patch now performs validation on every named defaults outside of the standard proxies list loop. This should not cause any behavior difference, as defaults are validated without using the proxy which relies on it. Along with this change, PR_FL_READY proxy flag is now removed. Its usage was only really needed for defaults, to avoid validating a same instance multiple times. With the validation of defaults in their own loop, it is now redundant.	2026-02-06 14:35:18 +01:00
Olivier Houchard	f26562bcb7	MINOR: quic: Fix build with USE_QUIC_OPENSSL_COMPAT Commit `fa094d0b61` changed the msg callback args, but forgot to fix quic_tls_msg_callback() accordingly, so do that, and remove the unused struct connection paramter.	2026-02-03 04:05:34 +01:00
William Lallemand	23e8ed6ea6	MEDIUM: ssl: porting to X509_STORE_get1_objects() for OpenSSL 4.0 OpenSSL 4.0 is deprecating X509_STORE_get0_objects(). Every occurence of X509_STORE_get0_objects() was first replaced by X509_STORE_get1_objects(). This changes the ref count of the STACK_OF(X509_OBJECT) everywhere, and need it to be sk_X509_OBJECT_pop_free(objs, X509_OBJECT_free) each time. X509_STORE_get1_objects() is not available in AWS-LC, OpenSSL < 3.2, LibreSSL and WolfSSL, so we need to still be compatible with get0. To achieve this, 2 macros were added X509_STORE_getX_objects() and sk_X509_OBJECT_popX_free(), these macros will use either the get0 or the get1 macro depending on their availability. In the case of get0, sk_X509_OBJECT_popX_free() will just do nothing instead of trying to free. Don't backport that unless really needed if we want to be compatible with OpenSSL 4.0. It changes all the refcounts.	2026-01-29 17:08:41 +01:00
Amaury Denoyelle	fa094d0b61	MEDIUM: ssl: remove connection from msg callback args Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details SSL msg callbacks are used for notification about sent/received SSL messages. Such callbacks are registered via ssl_sock_register_msg_callback(). Prior to this patch, connection was passed as first argument of these callbacks. However, most of them do not use it. Worst, this may lead to confusion as connection can be NULL in QUIC context. This patch cleans this by removing connection argument. As an alternative, connection can be retrieved in callbacks if needed using ssl_sock_get_conn() but the code must be ready to deal with potential NULL instances. As an example, heartbeat parsing callback has been adjusted in this manner.	2026-01-29 11:14:09 +01:00
Willy Tarreau	35d63cc3c7	MEDIUM: h1: strictly verify quoting in chunk extensions Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details As reported by Ben Kallus in the following thread: https://www.mail-archive.com/haproxy@formilux.org/msg46471.html there exist some agents which mistakenly accept CRLF inside quoted chunk extensions, making it possible to fool them by injecting one extra chunk they won't see for example, or making them miss the end of the body depending on how it's done. Haproxy, like most other agents nowadays, doesn't care at all about chunk extensions and just drops them, in agreement with the spec. However, as discussed, since chunk extensions are basically never used except for attacks, and that the cost of just matching quote pairs and checking backslashed quotes is escape consistency remains relatively low, it can make sense to add such a check to abort the message parsing when this situation is encountered. Note that it has to be done at two places, because there is a fast path and a slow path for chunk parsing. Also note that it will cause transfers using improperly formatted chunk extensions to fail, but since these are really not used, and that the likelihood of them being used but improperly quoted certainly is much lower than the risk of crossing a broken parser on the client's request path or on the server's response path, we consider the risk as acceptable. The test is not subject to the configurable parser exceptions and it's very unlikely that it will ever be needed. Since this is done in 3.4 which will be LTS, this patch will have to be backported to 3.3 so that any unlikely trouble gets a chance to be detected before users upgrade to 3.4. Thanks to Ben for the discussion, and to Rajat Raghav for sparking it in the first place even though the original report was mistaken. Cc: Ben Kallus <benjamin.p.kallus.gr@dartmouth.edu> Cc: Rajat Raghav <xclow3n@gmail.com> Cc: Christopher Faulet <cfaulet@haproxy.com>	2026-01-28 18:54:23 +01:00
Willy Tarreau	a9df6947b4	OPTIM: proxy: separate queues fields from served There's still a lot of contention when accessing the backend's totpend and queueslength for every request in may_dequeue_tasks(), even when queues are not used. This only happens because it's stored in the same cache line as >beconn which is being written by other threads: 0.01 \| call sess_change_server 0.02 \| mov 0x188(%r15),%esi ## s->queueslength \| if (may_dequeue_tasks(srv, s->be)) 0.00 \| mov 0xa8(%r12),%rax 0.00 \| mov -0x50(%rbp),%r11d 0.00 \| mov -0x60(%rbp),%r10 0.00 \| test %esi,%esi \| jne 3349 0.01 \| mov 0xa00(%rax),%ecx ## p->queueslength 8.26 \| test %ecx,%ecx 4.08 \| je 288d This patch moves queueslength and totpend to their own cache line, thus adding 64 bytes to the struct proxy, but gaining 3.6% of RPS on a 64-core EPYC thanks to the elimination of this false sharing. process_stream() goes down from 3.88% to 3.26% in perf top, with the next top users being inc/dec (s->served) and be->beconn.	2026-01-28 16:07:27 +00:00
Willy Tarreau	3ca2a83fc0	OPTIM: server: move queueslength in server struct This field is shared by all threads and must be in the shared area instead, because where it's placed, it slows down access to other fields of the struct by false sharing. Just moving this field gives a steady 2% gain on the request rate (1.93 to 1.96 Mrps) on a 64-core EPYC.	2026-01-28 16:07:27 +00:00
William Lallemand	6995fe60c3	MINOR: ssl: allow to disable certificate compression This option allows to disable the certificate compression (RFC 8879) using OpenSSL >= 3.2.0. This feature is known to permit some denial of services by causing extra memory allocations of approximately 22MiB and extra CPU work per connection with OpenSSL versions affected by CVE-2025-66199. ( https://openssl-library.org/news/vulnerabilities/index.html#CVE-2025-66199 ) Setting this to "off" permits to mitigate the problem. Must be backported to every stable branches.	2026-01-27 16:10:41 +01:00
Remi Tricot-Le Breton	9b1faee4c9	BUG/MINOR: ssl: Encrypted keys could not be loaded when given alongside certificate The SSL passphrase callback function was only called when loading private keys from a dedicated file (separate from the corresponding certificate) but not when both the certificate and the key were in the same file. We can now load them properly, regardless of how they are provided. A flas had to be added in the 'passphrase_cb_data' structure because in the 'ssl_sock_load_pem_into_ckch' function, when calling 'PEM_read_bio_PrivateKey' there might be no private key in the PEM file which would mean that the callback never gets called (and cannot set the 'passphrase_idx' to -1). This patch can be backported to 3.3.	2026-01-26 14:09:13 +01:00
Willy Tarreau	1a3252e956	MEDIUM: pools: better check for size rounding overflow on registration Certain object sizes cannot be controlled at declaration time because the resulting object size may be slightly extended (tag, caller), aligned and rounded up, or even doubled depending on pool settings (e.g. if backup is used). This patch addresses this by enlarging the type in the pool registration to 64-bit so that no info is lost from the declaration, and extra checks for overflows can be performed during registration after various rounding steps. This allows to catch issues such as these ones and to report a suitable error: global tune.http.logurilen 2147483647 frontend capture request header name len 2147483647 http-request capture src len 2147483647 tcp-request content capture src len 2147483647	2026-01-26 11:54:14 +01:00
Amaury Denoyelle	b52c60d366	MEDIUM: proxy: implement persistent named defaults This patch changes the handling of named defaults sections. Prior to this patch, every unreferenced defaults proxies were removed on post parsing. Now by default, these sections are kept after postparsing and only purged on deinit. The objective is to allow reusing them as base configuration for dynamic backends. To implement this, refcount of every still addressable named sections is incremented by one after parsing. This ensures that they won't be removed even if referencing proxies are removed at runtime. This is done via the new function proxy_ref_all_defaults(). To ensure defaults instances are still properly removed on deinit, the inverse operation is performed : refcount is decremented by one on every defaults sections via proxy_unref_all_defaults(). The original behavior can still be used by using the new global keyword tune.defaults.purge. This is useful for users using configuration with large number of defaults and not interested in dynamic backends creation.	2026-01-22 18:06:42 +01:00
Amaury Denoyelle	116983ad94	MEDIUM: cfgparse: do not store unnamed defaults in name tree Defaults section are indexed by their name in defproxy_by_name tree. For named sections, there is no duplicate : if two instances have the same name, the older one is removed from the tree. However, this was not the case for unnamed defaults which are all stored inconditionnally in defproxy_by_name. This commit introduces a new approach for unnamed defaults. Now, these instances are never inserted in the defproxy_by_name tree. Indeed, this is not needed as no tree lookup is performed with empty names. This may optimize slightly config parsing with a huge number of named and unnamed defaults sections, as the first ones won't fill up the tree needlessly. However, defproxy_by_name tree is also used to purge unreferenced defaults instances, both on postparsing and deinit. Thus, a new approach is needed for unnamed sections cleanup. Now, each time a new defaults is parsed, if the previous instance is unnamed, it is freed unless if referenced by a proxy. When config parsing is ended, a similar operation is performed to ensure the last unnamed defaults section won't stay in memory. To implement this, last_defproxy static variable is now set to global. Unnamed sections which cannot be removed due to proxies referencing proxies will still be removed when such proxies are freed themselves, at runtime or on deinit.	2026-01-22 17:57:16 +01:00
Amaury Denoyelle	848e0cd052	MINOR: proxy: simplify defaults proxies list storage Defaults proxies instance are stored in a global name tree. When there is a name conflict and the older entry cannot be simply discarded as it is already referenced, the older entry is instead removed from the name tree and inserted into the orphaned list. The purpose of the orphaned list was to guarantee that any remaining unreferenced defaults are purged either on postparsing or deinit. However, this is in fact completely useless. Indeed on postparsing, orphaned entries are always referenced. On deinit instead, defaults are already freed along the cleanup of all frontend/backend instances clean up, thanks to their refcounting. This patch streamlines this by removing orphaned list. Instead, a defaults section is inserted into a new global defaults_list during their whole lifetime. This is not strictly necessary but it ensures that defaults instances can still be accessed easily in the future if needed even if not present in the name tree. On deinit, a BUG_ON() is added to ensure that defaults_list is indeed emptied. Another benefit from this patch is to simplify the defaults deletion procedure. Orphaned simple list is replaced by a proper double linked list implementation, so a single LIST_DELETE() is now performed. This will be notably useful as defaults may be removed at runtime in the future if backends deletion at runtime is implemented.	2026-01-22 17:57:09 +01:00
Amaury Denoyelle	434e979046	MINOR: proxy: refactor defaults proxies API This patch renames functions which deal with defaults section. A common "defaults_px_" prefix is defined. This serves as a marker to identify functions which can only be used with proxies defaults capability. New BUG_ON() are enforced to ensure this is valid. Also, older proxy_unref_or_destroy_defaults() is renamed defaults_px_detach().	2026-01-22 17:55:47 +01:00
Amaury Denoyelle	6c0ea1fe73	MINOR: proxy: remove proxy_preset_defaults() Function proxy_preset_defaults() purpose has evolved over time. Originally, it was only used to initialize defaults proxies instances. Until today, it was extended so that all proxies use it. Its objective is to initialize settings to common default values. To remove the confusion, this function is now removed. Its content is integrated directly into init_new_proxy().	2026-01-22 16:20:25 +01:00
Hyeonggeun Oh	95e8483b35	MINOR: vars: store variable names for runtime access Currently, variable names are only used during parsing and are not stored at runtime. This makes it impossible to iterate through variables and retrieve their names. This patch adds infrastructure to store variable names: - Add 'name' and 'name_len' fields to var_desc structure - Add 'name' field to var structure - Add VDF_NAME_ALLOCATED flag to track memory ownership - Store names in vars_fill_desc(), var_set(), vars_check_arg(), and parse_store() - Free names in var_clear() and release_store_rule() - Add ARGT_VAR handling in release_sample_arg() to free the allocated name when the flag is set This prepares the ground for implementing dump_all_vars() in the next commit. Tested with: - ASAN-enabled build on Linux (TARGET=linux-glibc USE_OPENSSL=1 ARCH_FLAGS="-g -fsanitize=address") - Regression tests: reg-tests/sample_fetches/vars.vtc - Regression tests: reg-tests/startup/default_rules.vtc	2026-01-21 10:44:19 +01:00
Hyeonggeun Oh	25564b6075	MINOR: tools: add chunk_escape_string() helper function This function takes a string appends it to a buffer in a format compatible with most languages (double-quoted, with special characters escaped). It handles standard escape sequences like \n, \r, \", \\. This generic utility is desined to be used for logging or debugging purposes where arbitrary string data needs to be safely emitted without breaking the output format. It will be primarily used by the upcoming dump_all_vars() sample fetch to dump variable contents safely.	2026-01-21 10:44:19 +01:00
Remi Tricot-Le Breton	416b87d5db	MINOR: jwe: Add new jwt_decrypt_secret converter This converter checks the validity and decrypts the content of a JWE token that has a symetric "alg" algorithm. In such a case, we only require a secret as parameter in order to decrypt the token.	2026-01-15 10:56:27 +01:00
Amaury Denoyelle	6870551a57	MEDIUM: proxy: force traffic on unpublished/disabled backends A recent patch has introduced a new state for proxies : unpublished backends. Such backends won't be eligilible for traffic, thus use_backend/default_backend rules which target them won't match and content switching rules processing will continue. This patch defines a new frontend keywords 'force-be-switch'. This keyword allows to ignore unpublished or disabled state. Thus, use_backend/default_backend will match even if the target backend is unpublished or disabled. This is useful to be able to test a backend instance before exposing it outside. This new keyword is converted into a persist rule of new type PERSIST_TYPE_BE_SWITCH, stored in persist_rules list proxy member. This is the only persist rule applicable to frontend side. Prior to this commit, pure frontend proxies persist_rules list were always empty. This new features requires adjustment in process_switching_rules(). Now, when a use_backend/default_backend rule matches with an non eligible backend, frontend persist_rules are inspected to detect if a force-be-switch is present so that the backend may be selected.	2026-01-15 09:08:19 +01:00
Amaury Denoyelle	16f035d555	MINOR: cfgparse: adapt warnif_cond_conflicts() error output Utility function warnif_cond_conflicts() is used when parsing an ACL. Previously, the function directly calls ha_warning() to report an error. Change the function so that it now takes the error message as argument. Caller can then output it as wanted. This change is necessary to use the function when parsing a keyword registered as cfg_kw_list. The next patch will reuse it.	2026-01-15 09:08:18 +01:00
Amaury Denoyelle	797ec6ede5	MEDIUM: proxy: implement publish/unpublish backend CLI Define a new set of CLI commands publish/unpublish backend <be>. The objective is to be able to change the status of a backend to unpublished. Such a backend is considered ineligible to traffic : this allows to skip use_backend rules which target it. Note that contrary to disabled/stopped proxies, an unpublished backend still has server checks running on it. Internally, a new proxy flags PR_FL_BE_UNPUBLISHED is defined. CLI commands handler "publish backend" and "unpublish backend" are executed under thread isolation. This guarantees that the flag can safely be set or remove in the CLI handlers, and read during content-switching processing.	2026-01-15 09:08:18 +01:00
Amaury Denoyelle	21fb0a3f58	MEDIUM: proxy: do not select a backend if disabled A proxy can be marked as disabled using the keyword with the same name. The doc mentions that it won't process any traffic. However, this is not really the case for backends as they may still be selected via switching rules during stream processing. In fact, currently access to disabled backends will be conducted up to assign_server(). However, no eligible server is found at this stage, resulting in a connection closure or an HTTP 503, which is expected. So in the end, servers in disabled backends won't receive any traffic. But this is only because post-parsing steps are not performed on such backends. Thus, this can be considered as functional but only via side-effects. This patch clarifies the handling of disable backends, so that they are never selected via switching rules. Now, process_switching_rules() will ignore disable backends and continue rules evaluation. As this is a behavior change, this patch is labelled as medium. The documentation manuel for use_backend is updated accordingly.	2026-01-15 09:08:18 +01:00
Olivier Houchard	a209c35f30	MEDIUM: thread: Turn the group mask in thread set into a group counter Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details If we want to be able to have more than 64 thread groups, we can no longer use thread group masks as long. One remaining place where it is done is in struct thread_set. However, it is not really used as a mask anywhere, all we want is a thread group counter, so convert that mask to a counter.	2026-01-15 05:24:53 +01:00
Olivier Houchard	1397982599	MINOR: threads: Eliminate all_tgroups_mask. Now that it is unused, eliminate all_tgroups_mask, as we can't 64bits masks to represent thread groups, if we want to be able to have more than 64 thread groups.	2026-01-15 03:46:57 +01:00
Olivier Houchard	7f4b053b26	MEDIUM: counters: mostly revert `da813ae4d7` Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Contrarily to what was previously believed, there are corner cases where the counters may not be allocated, and we may want to make them optional at a later date, so we have to check if those counters are there. However, just checking that shared.tg is non-NULL is enough, we can then assume that shared.tg[tgid - 1] has properly been allocated too. Also modify the various COUNTER_SHARED_* macros to make sure they check for that too.	2026-01-14 12:39:14 +01:00
Olivier Houchard	da813ae4d7	MEDIUM: counters: Remove some extra tests Before updating counters, a few tests are made to check if the counters exits. but those counters should always exist at this point, so just remmove them. This commit should have no impact, but can easily be reverted with no functional impact if various crashes appear.	2026-01-13 11:12:34 +01:00
Olivier Houchard	5495c88441	MEDIUM: counters: Dynamically allocate per-thread group counters Instead of statically allocating the per-thread group counters, based on the max number of thread groups available, allocate them dynamically, based on the number of thread groups actually used. That way we can increase the maximum number of thread groups without using an unreasonable amount of memory.	2026-01-13 11:12:34 +01:00
Olivier Houchard	022cb3ab7f	MINOR: stats: Increase the tgid from 8bits to 16bits Increase the size of the stored tgid in the stat file from 8bits to 32bits, so that we can have more than 256 thread group. 65536 should be enough for some time. This bumps thet stat file minor version, as the structure changes.	2026-01-12 09:39:52 +01:00
Olivier Houchard	c0f64fc36a	MINOR: receiver: Dynamically alloc the "members" field of shard_info Instead of always allocating MAX_TGROUPS members, allocate them dynamically, using the number of thread groups we'll use, so that increasing MAX_TGROUPS will not have a huge impact on the structure size.	2026-01-12 09:32:27 +01:00
Tim Duesterhus	96faf71f87	CLEANUP: connection: Remove outdated note about CO_FL `0x00002000` being unused Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details This flag is used as of commit `dcce936912` ("MINOR: connections: Add a new CO_FL_SSL_NO_CACHED_INFO flag"). This patch should be backported to 3.3. Apparently `dcce936912` has been backported to 3.2 and 3.1 already, with that change already applied, so no need for a backport there.	2026-01-12 03:22:15 +01:00
Olivier Houchard	41cd589645	MINOR: receiver: Remove tgroup_mask from struct shard_info The only purpose from tgroup_mask seems to be to calculate how many tgroups share the same shard, but this is an information we can calculate differently, we just have to increment the number when a new receiver is added to the shard, and decrement it when one is detached from the shard. Removing thread group masks will allow us to increase the maximum number of thread groups past 64.	2026-01-07 09:27:12 +01:00
Ilia Shipitsin	f8a77ecf62	CLEANUP: assorted typo fixes in the code, commits and doc Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details	2025-12-25 19:45:29 +01:00
Willy Tarreau	6d995e59e9	MINOR: protocol: support a generic way to call getsockopt() on a connection It's regularly needed to call getsockopt() on a connection, but each time the calling code has to do all the job by itself. This commit adds a "get_opt()" callback on the protocol struct, that directly calls getsockopt() on the connection's FD. A generic implementation for standard sockets is provided, though QUIC would likely require a different approach, or maybe a mapping. Due to the overlap between IP/TCP/socket option values, it is necessary for the caller to indicate both the level and the option. An abstraction of the level could be done, but the caller would nonetheless have to know the optname, which is generally defined in the same include files. So for now we'll consider that this callback is only for very specific use. The levels and optnames are purposely passed as signed ints so that it is possible to further extend the API by using negative levels for internal namespaces.	2025-12-24 18:38:51 +01:00
Willy Tarreau	44c67a08dd	MINOR: tcp: add new bind option "tcp-ss" to instruct the kernel to save the SYN This option enables TCP_SAVE_SYN on the listening socket, which will cause the kernel to try to save a copy of the SYN packet header (L2, IP and TCP are supported). This can permit to check the source MAC address of a client, or find certain TCP options such as a source address encapsulated using RFC7974. It could also be used as an alternate approach to retrieving the source and destination addresses and ports. For now setting the option is enabled, but sample fetch functions and converters will be needed to extract info.	2025-12-24 11:35:09 +01:00
Maxime Henrion	1fdccbe8da	OPTIM: patterns: cache the current generation Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details This makes a significant difference when loading large files and during commit and clear operations, thanks to improved cache locality. In the measurements below, master refers to the code before any of the changes to the patterns code, not the code before this one commit. Timing the replacement of 10M entries from the CLI with this command which also reports timestamps at start, end of upload and end of clear: $ (echo "prompt i"; echo "show activity"; echo "prepare acl #0"; awk '{print "add acl @1 #0",$0}' < bad-ip.map; echo "show activity"; echo "commit acl @1 #0"; echo "clear acl @0 #0";echo "show activity") \| socat -t 10 - /tmp/sock1 \| grep ^uptim master, on a 3.7 GHz EPYC, 3 samples: uptime_now: 6.087030 uptime_now: 25.981777 => 21.9 sec insertion time uptime_now: 29.286368 => 3.3 sec commit+clear uptime_now: 5.748087 uptime_now: 25.740675 => 20.0s insertion time uptime_now: 29.039023 => 3.3 s commit+clear uptime_now: 7.065362 uptime_now: 26.769596 => 19.7s insertion time uptime_now: 30.065044 => 3.3s commit+clear And after this commit: uptime_now: 6.119215 uptime_now: 25.023019 => 18.9 sec insertion time uptime_now: 27.155503 => 2.1 sec commit+clear uptime_now: 5.675931 uptime_now: 24.551035 => 18.9s insertion uptime_now: 26.652352 => 2.1s commit+clear uptime_now: 6.722256 uptime_now: 25.593952 => 18.9s insertion uptime_now: 27.724153 => 2.1s commit+clear Now timing the startup time with a 10M entries file (on another machine) on master, 20 samples: Standard Deviation, s: 0.061652677408033 Mean: 4.217 And after this commit: Standard Deviation, s: 0.081821371548669 Mean: 3.78	2025-12-23 21:17:39 +01:00
Maxime Henrion	545cf59b6f	MEDIUM: patterns: reorganize pattern reference elements Instead of a global list (and tree) of pattern reference elements, we now have an intermediate pat_ref_gen structure and store the elements in those. This simplifies the logic of some operations such as commit and clear, and improves performance in some cases - numbers to be provided in a subsequent commit after one important optimization is added. A lot of the changes are due to adding an extra level of indirection, changing many cases where we iterate over all elements to an outer loop iterating over the generation and an inner one iterating over the elements of the current generation. It is therefore easier to read this patch using 'git diff -w'.	2025-12-23 21:17:39 +01:00
Maxime Henrion	5547bedebb	MINOR: patterns: preliminary changes for reorganization Safe and non-functional changes that only add currently unused structures, field, functions and macros, in preparation of larger changes that alter the way pattern reference elements are stored. This includes code to create and lookup generation objects, and macros to iterate over the generations of a pattern reference.	2025-12-23 21:17:39 +01:00
Maxime Henrion	c8750e4e9d	MINOR: tools: add a secure implementation of memset Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details This guarantees that the compiler will not optimize away the memset() call if it detects a dead store. Use this to clear SSL passphrases. No backport needed.	2025-12-19 17:42:57 +01:00
Olivier Houchard	7e22d9c484	MEDIUM: cpu-topo: Add a new "max-threads-per-group" global keyword Add a new global keyword, max-threads-per-group. It sets the maximum number of threads a thread group can contain. Unless the number of thread groups is fixed with "thread-groups", haproxy will just create more thread groups as needed. The default and maximum value is 64.	2025-12-18 18:52:52 +01:00
William Lallemand	5b19d95850	BUG/MEDIUM: mworker/listener: ambiguous use of RX_F_INHERITED with shards Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details The RX_F_INHERITED flag was ambiguous, as it was used to mark both listeners inherited from the parent process and listeners duplicated from another local receiver. This could lead to incorrect behavior concerning socket unbinding and suspension. This commit refactors the handling of inherited listeners by splitting the RX_F_INHERITED flag into two more specific flags: - RX_F_INHERITED_FD: Indicates a listener inherited from the parent process via its file descriptor. These listeners should not be unbound by the master. - RX_F_INHERITED_SOCK: Indicates a listener that shares a socket with another one, either by being inherited from the parent or by being duplicated from another local listener. These listeners should not be suspended or resumed individually. Previously, the sharding code was unconditionally using RX_F_INHERITED when duplicating a file descriptor. In HAProxy versions prior to 3.1, this led to a file descriptor leak for duplicated unix stats sockets in the master process. This would eventually cause the master to crash with a BUG_ON in fd_insert() once the file descriptor limit was reached. This must be backported as far as 3.0. Branches earlier than 3.0 are affected but would need a different patch as the logic is different.	2025-12-11 18:09:47 +01:00
Maxime Henrion	6eedd0d485	CLEANUP: more conversions and cleanups for alignment Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details - Convert additional cases to use the automatic alignment feature for the THREAD_ALIGN(ED) macros. This includes some cases that are less obviously correct where it seems we wanted to align only in the USE_THREAD case but were not using the thread specific macros. - Also move some alignment requirements to the structure definition instead of having it on variable declaration.	2025-12-09 17:40:58 +01:00
Maxime Henrion	bc8e14ec23	CLEANUP: use the automatic alignment feature - Use the automatic alignment feature instead of hardcoding 64 all over the code. - This also converts a few bare __attribute__((aligned(X))) to using the ALIGNED macro.	2025-12-09 17:14:58 +01:00
Maxime Henrion	74719dc457	CLEANUP: improvements to the alignment macros - It is now possible to use the THREAD_ALIGN and THREAD_ALIGNED macros without a parameter. In this case, we automatically align on the cache line size. - The cache line size is set to 64 by default to match the current code, but it can be overridden on the command line. - This required moving the DEFVAL/DEFNULL/DEFZERO macros to compiler.h instead of tools-t.h, to avoid namespace pollution if we included tools-t.h from compiler.h.	2025-12-09 17:05:52 +01:00
Olivier Houchard	dcce936912	MINOR: connections: Add a new CO_FL_SSL_NO_CACHED_INFO flag Add a new flag to connections, CO_FL_SSL_NO_CACHED_INFO, and set it for checks. It lets the ssl layer know that he should not use cached informations, such as the ALPN as stored in the server, or cached sessions. This wlil be used for checks, as checks may target different servers, or used a different SSL configuration, so we can't assume the stored informations are correct. This should be backported to 3.3, and may be backported up to 2.8 if the attempts to do session resume by checks is proven to be a problem.	2025-12-09 16:43:31 +01:00
Christopher Faulet	be998b590e	MEDIUM: ssl/server: No longer store the SNI of cached TLS sessions Thanks to the previous patch, "BUG/MEDIUM: ssl: Don't reuse TLS session if the connection's SNI differs", it is no useless to store the SNI of cached TLS sessions. This SNI is no longer tested and new connections reusing a session must have the same SNI. The main change here is for the ssl_sock_set_servername() function. It is no longer possible to compare the SNI of the reused session with the one of the new connection. So, the SNI is always set, with no other processing. Mainly, the session is not destroyed when SNIs don't match. It means the commit `119a4084bf` ("BUG/MEDIUM: ssl: for a handshake when server-side SNI changes") is implicitly reverted. It is good to note that it is unclear for me when and why the reused session should be destroyed. Because I'm unable to reproduce any issue fixed by the commit above. This patch could be backported as far as 3.0 with the commit above.	2025-12-08 15:22:01 +01:00
Christopher Faulet	28654f3c9b	MINOR: connection/ssl: Store the SNI hash value in the connection itself When a SNI is set on a new connection, its hash is now saved in the connection itself. To do so, a dedicated field was added into the connection strucutre, called sni_hash. For now, this value is only used when the TLS session is cached.	2025-12-08 15:22:01 +01:00
Christopher Faulet	9794585204	MINOR: ssl: Store hash of the SNI for cached TLS sessions For cached TLS sessions, in addition to the SNI itself, its hash is now also saved. No changes are expected here because this hash is not used for now. This commit relies on: * MINOR: ssl: Add a function to hash SNIs	2025-12-08 15:22:00 +01:00
Christopher Faulet	d993e1eeae	MINOR: ssl: Add a function to hash SNIs This patch only adds the function ssl_sock_sni_hash() that can be used to get the hash value corresponding to an SNI. A global seed, sni_hash_seed, is used.	2025-12-08 15:22:00 +01:00
Christopher Faulet	a83ed86b78	MEDIUM: quic: Add connection as argument when qc_new_conn() is called This patch reverts the commit `efe60745b` ("MINOR: quic: remove connection arg from qc_new_conn()"). The connection will be mandatory when the QUIC connection is created on backend side to fix an issue when we try to reuse a TLS session. So, the connection is again an argument of qc_new_conn(), the 4th argument. It is NULL for frontend QUIC connections but there is no special check on it.	2025-12-08 15:22:00 +01:00
Frederic Lecaille	90064ac88b	BUG/MINOR: quic: do not set first the default QUIC curves This patch impacts both the QUIC frontends and listeners. Note that "ssl-default-bind-ciphersuites", "ssl-default-bind-curves", are not ignored by QUIC by the frontend. This is also the case for the backends with "ssl-default-server-ciphersuites" and "ssl-default-server-curves". These settings are set by ssl_sock_prepare_ctx() for the frontends and by ssl_sock_prepare_srv_ssl_ctx() for the backends. But ssl_quic_initial_ctx() first sets the default QUIC frontends (see <quic_ciphers> and <quic_groups>) before these ssl_sock.c function are called, leading some TLS stack to refuse them if they do not support them. This is the case for some OpenSSL 3.5 stack with FIPS support. They do not support X25519. To fix this, set the default QUIC ciphersuites and curves only if not already set by the settings mentioned above. Rename <quic_ciphers> global variable to <default_quic_ciphersuites> and <quic_groups> to <default_quic_curves> to reflect the OpenSSL API naming. These options are taken into an account by ssl_quic_initial_ctx() which inspects these four variable before calling SSL_CTX_set_ciphersuites() with <default_quic_ciphersuites> as parameter and SSL_CTX_set_curves() with <default_quic_curves> as parameter if needed, that is to say, if no ciphersuites and curves were set by "ssl-default-bind-ciphersuites", "ssl-default-bind-curves" as global options or "ciphersuites", "curves" as "bind" line options. Note that the bind_conf struct is not modified when no "ciphersuites" or "curves" option are used on "bind" lines. On backend side, rely on ssl_sock_init_srv() to set the server ciphersuites and curves. This function is modified to use respectively <default_quic_ciphersuites> and <default_quic_curves> if no ciphersuites and curves were set by "ssl-default-server-ciphersuites", "ssl-default-server-curves" as global options or "ciphersuites", "curves" as "server" line options. Thank to @rwagoner for having reported this issue in GH #3194 when using an OpenSSL 3.5.4 stack with FIPS support. Must be backported as far as 2.6	2025-12-08 10:40:59 +01:00
Frederic Lecaille	c36e27d10e	BUG/MINOR: quic-be: handshake errors without connection stream closure This bug was revealed on backend side by reg-tests/ssl/del_ssl_crt-list.vtc when run wich QUIC connections. As expected by the test, a TLS alert is generated on servsr side. This latter sands a CONNECTION_CLOSE frame with a CRYPTO error (>= 0x100). In this case the client closes its QUIC connection. But the stream connection was not informed. This leads the connection to be closed after the server timeout expiration. It shouls be closed asap. This is the reason why reg-tests/ssl/del_ssl_crt-list.vtc could succeeds or failed, but only after a 5 seconds delay. To fix this, mimic the ssl_sock_io_cb() for TCP/SSL connections. Call the same code this patch implements with ssl_sock_handle_hs_error() to correctly handle the handshake errors. Note that some SSL counters were not incremented for both the backends and frontends. After such errors, ssl_sock_io_cb() start the mux after the connection has been flagged in error. This has as side effect to close the stream in conn_create_mux(). Must be backported to 3.3 only for backends. This is not sure at this time if this bug may impact the frontends.	2025-12-08 10:40:59 +01:00
Amaury Denoyelle	47dff5be52	MINOR: quic: implement cc-algo server keyword Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details Extend QUIC server configuration so that congestion algorithm and maximum window size can be set on the server line. This can be achieved using quic-cc-algo keyword with a syntax similar to a bind line. This should be backported up to 3.3 as this feature is considered as necessary for full QUIC backend support. Note that this relies on the serie of previous commits which should be picked first.	2025-12-01 15:53:58 +01:00
Amaury Denoyelle	979588227f	MINOR: quic: define quic_cc_algo as const Each QUIC congestion algorithm is defined as a structure with callbacks in it. Every quic_conn has a member pointing to the configured algorithm, inherited from the bind-conf keyword or to the default CUBIC value. Convert all these definitions to const. This ensures that there never will be an accidental modification of a globally shared structure. This also requires to mark quic_cc_algo field in bind_conf and quic_cc as const.	2025-12-01 15:05:41 +01:00
Willy Tarreau	36133759d3	[RELEASE] Released version 3.4-dev0 Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details Released version 3.4-dev0 with the following main changes : - MINOR: version: mention that it's development again	2025-11-26 16:12:45 +01:00
Willy Tarreau	e8d6ffb692	MINOR: version: mention that it's development again This essentially reverts `d8ba9a2a92`.	2025-11-26 16:11:47 +01:00
Willy Tarreau	d8ba9a2a92	MINOR: version: mention that 3.3 is stable now This version will be maintained up to around Q1 2027. The INSTALL file also mentions it.	2025-11-26 15:54:30 +01:00
Amaury Denoyelle	49e6fca51b	MINOR: quic: use separate global quic_conns FE/BE lists Each quic_conn instance is stored in a global list. Its purpose is to be able to loop over all known connections during "show quic". Split this into two separate lists for frontend and backend usage. Another change is that closing backend connections do not move into quic_conns_clo list. They remain instead in their original list. The objective of this patch is to reduce the contention between the two sides. Note that this prevents backend connections to be listed in "show quic" now. This will be adjusted in a future patch.	2025-11-25 14:30:18 +01:00
Amaury Denoyelle	a5801e542d	MINOR: quic: split global CID tree between FE and BE sides QUIC CIDs are stored in a global tree. Prior to this patch, CIDs used on both frontend and backend sides were mixed together. This patch implement CID storage separation between FE and BE sides. The original tre quic_cid_trees is splitted as quic_fe_cid_trees/quic_be_cid_trees. This patch should reduce contention between frontend and backend usages. Also, it should reduce the risk of random CID collision.	2025-11-25 14:30:18 +01:00
Jacques Heunis	91eb9b082b	BUG/MINOR: freq_ctr: Prevent possible signed overflow in freq_ctr_overshoot_period Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details All of the other bandwidth-limiting code stores limits and intermediate (byte) counters as unsigned integers. The exception here is freq_ctr_overshoot_period which takes in unsigned values but returns a signed value. While this has the benefit of letting the caller know how far away from overshooting they are, this is not currently leveraged anywhere in the codebase, and it has the downside of halving the positive range of the result. More concretely though, returning a signed integer when all intermediate values are unsigned (and boundaries are not checked) could result in an overflow, producing values that are at best unexpected. In the case of flt_bwlim (the only usage of freq_ctr_overshoot_period in the codebase at the time of writing), an overflow could cause the filter to wait for a large number of milliseconds when in fact it shouldn't wait at all. This is a niche possibility, because it requires that a bandwidth limit is defined in the range [2^31, 2^32). In this case, the raw limit value would not fit into a signed integer, and close to the end of the period, the `(elapsed * freq)/period` calculation could produce a value which also doesn't fit into a signed integer. If at the same time `curr` (the number of events counted so far in the current period) is small, then we could get a very large negative value which overflows. This is undefined behaviour and could produce surprising results. The most obvious outcome is flt_bwlim sometimes waiting for a large amount of time in a case where it shouldn't wait at all, thereby incorrectly slowing down the flow of data. Converting just the return type from signed to unsigned (and checking for the overflow) prevents this undefined behaviour. It also makes the range of valid values consistent between the input and output of freq_ctr_overshoot_period and with the input and output of other freq_ctr functions, thereby reducing the potential for surprise in intermediate calculations: now everything supports the full 0 - 2^32 range.	2025-11-24 14:10:13 +01:00
Christopher Faulet	8e08a635eb	MINOR: muxes: Support an optional ALPN string when defining mux protocols When a multiplexer protocol is defined, it is now possible to specify the ALPN it supports, in binary format. This info is optionnal. For now only the h2 and the h1 multiplexers define an ALPN because this will be mandatory for a fix. But this could be used in future for different purpose. This patch will be mandatory for the next fix.	2025-11-20 16:14:52 +01:00
Willy Tarreau	91d4f4f618	MINOR: limits: keep a copy of the rough estimate of needed FDs in global struct It's always a pain to guess the number of FDs that can be needed by listeners, checks, threads, pollers etc. We have this estimate in global.maxsock before calling set_global_maxconn(), but we lose it the line after. Let's copy it into global.est_fd_usage and keep it. This will be helpful to try to provide more accurate suggestions for maxconn.	2025-11-20 08:44:52 +01:00
Frederic Lecaille	a88fdf8669	MINOR: quic/flags: add missing QUIC flags for flags dev tool. Add missing QUIC_FL_CONN_XPRT_CLOSED quic_conn flags definition.	2025-11-20 08:10:58 +01:00
Amaury Denoyelle	d54d78fe9a	BUG/MINOR: quic: fix FD usage for quic_conn_closed on backend side Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details On the frontend side, QUIC transfer can be performed either via a connection owned FD or multiplex on the listener one. When a quic_conn is freed and converted to quic_conn_closed instance, its FD if open is closed and all exchanges are now multiplex via the listener FD. This is different for the backend as connections only has the choice to use their owned FD. Thus, special care care must be taken when freeing a connection and converting it to a quic_conn_closed instance. In this case, qc_release_fd() is delayed to the quic_conn_closed release. Furthermore, when the FD is transferred, its iocb and owner fields are updated to the new quic_conn_closed instance. Without it, a crash will occur when accessing the freed quic_conn tasklet. A newly dedicated handler quic_conn_closed_sock_fd_iocb is used to ensure access to quic_conn_closed members only.	2025-11-19 16:02:22 +01:00
Amaury Denoyelle	e55bcf5746	BUG/MINOR: mux-quic: implement max-reuse server parameter Properly implement support for max-reuse server keyword. This is done by adding a total count of streams seen for the whole connection. This value is used in avail_streams callback.	2025-11-19 16:02:22 +01:00
Amaury Denoyelle	c67a614e45	MINOR: quic: remove <ipv4> arg from qc_new_conn() Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details Remove <ipv4> argument from qc_new_conn(). This parameter is unnecessary as it can be derived from the family type of the addresses also passed as argument.	2025-11-17 10:20:54 +01:00
Amaury Denoyelle	133f100467	MINOR: quic: refactor qc_new_conn() prototype The objective of this patch is to streamline qc_new_conn() usage so that it is similar for frontend and backend sides. Previously, several parameters were set only for frontend connections. These arguments are replaced by a single quic_rx_packet argument, which represents the INITIAL packet triggering the connection allocation on the server side. For a QUIC client endpoint, it remains NULL. This usage is consider more explicit. As a minor change, <target> is moved as the first argument of the function. This is considered useful as this argument determines whether the connection is a frontend or backend entry. Along with these changes, qc_new_conn() documentation has been reworded so that it is now up-to-date with the newest usage.	2025-11-17 10:13:40 +01:00
Amaury Denoyelle	49edaca513	MINOR: quic: try to clarify quic_conn CIDs fields direction quic_conn has two fields named <dcid> and <scid>. It may cause confusion as it is not obvious how these fields are related to the connection direction. Try to improve this by extending the documentation of these two fields.	2025-11-17 10:11:04 +01:00
Amaury Denoyelle	8720130cc7	MINOR: quic: do not use quic_newcid_from_hash64 on BE side quic_newcid_from_hash64 is an external callback. If defined, it serves as a CID method generation, as an alternative to the default random implementation. This mechanism was not correctly implemented on the backend side. Indeed, <hash64> quic_conn member is only setted for frontend connections. The simplest solution would be to properly define it also for backend ones. However, quic_newcid_from_hash64 derivation is really only useful for the frontend side for now. Thus, this patch disables using it on the backend side in favor of the default random generator. To implement this, quic_cid_generate() is splitted in two functions, for both methods of CIDs generation. This is the responsibility of the caller to select the proper method. On backend side, only random implementation is now used.	2025-11-17 10:11:04 +01:00
Christopher Faulet	fc6e3e9081	MINOR: stick-tables: Rename stksess shards to use buckets The shard keyword is already used by the peers and on the server lines. And it is unrelated with the session keys distribution. So instead of talking about shard for the session key hashing, we now use the term "bucket".	2025-11-17 07:42:51 +01:00
Willy Tarreau	675c86c4aa	DEBUG: add BUG_ON_STRESS(): a BUG_ON() implemented only when DEBUG_STRESS > 0 The purpose of this new BUG_ON is beyond BUG_ON_HOT(). While BUG_ON_HOT() is meant to be light but placed on very hot code paths, BUG_ON_STRESS() might be heavy and only used under stress-testing, to try to detect early that something bad is starting to happen. This one is not even type-checked when not defined because we don't want to risk the compiler emitting the slightest piece of code there in production mode, so as to give enough freedom to the developers.	2025-11-14 16:42:53 +01:00
Willy Tarreau	3d441e78e5	DEBUG: extend DEBUG_STRESS to ease testing and turn on extra checks DEBUG_STRESS is currently used only to expose "stress-level". With this patch, we go a bit further, by automatically forcing DEBUG_STRICT and DEBUG_STRICT_ACTION to their highest values in order to enable all BUG_ON levels, and make all of them result in a crash. In addition, care is taken to always only have 0 or 1 in the macro, so that it can be tested using "#if DEBUG_STRESS > 0" as well as "if (DEBUG_STRESS) { }" everywhere. The goal will be to ease insertion of extra tests for builds dedicated to stress-testing that enable possibly expensive extra checks on certain code paths that cannot reasonably be compiled in for production code right now.	2025-11-14 16:38:04 +01:00
Amaury Denoyelle	d79295d89b	Revert "BUG/MEDIUM: connections: permit to permanently remove an idle conn" The target patch fixes a rare race condition which happen when a MUX IO handler is working on a connection already moved into the purge list. In this case, the handler will incorrectly moved back the connection into the idle list. To fix this, conn_delete_from_tree() was extended to remove flags along with the connection from the idle list. This was performed when the connection is moved into the purge list. However, it introduces another issue related to the idle server connection accounting. Thus it is necessary to revert it prior to the incoming newer fix. This patch must be backported to every version where the original commit is.	2025-11-14 16:06:34 +01:00
William Lallemand	3d15c07ed0	MINOR: cfgcond: add "awslc_api_atleast" and "awslc_api_before" AWS-LC features are not easily tested with just the openssl version constant. AWS-LC uses its own API versioning stored in the AWSLC_API_VERSION constant. This patch add the two awslc_api_atleast and awslc_api_before predicates that help to check the AWS-LC API.	2025-11-14 11:01:45 +01:00
Amaury Denoyelle	8415254cea	MINOR: check: clarify check-reuse-pool interaction with reuse policy check-reuse-pool can only perform as expected if reuse policy on the backend is set to aggressive or higher. Update the documentation to reflect this and implement a server diag warning.	2025-11-14 10:44:05 +01:00
William Lallemand	2bdf5a7937	BUG/MEDIUM: acme: move from mt_list to a rwlock + ebmbtree The current ACME scheduler suffers from problems due to the way the tasks are stored: - MT_LIST are not scalables when having a lot of ACME tasks and having to look for a specific one. - the acme_task pointer was stored in the ckch_store in order to not passing through the whole list. But a ckch_store can be updated and the pointer lost in the previous one. - when a task fails, the ptr in the ckch_store was not removed because we only work with a copy of the original ckch_store, it would need to lock the ckchs_tree and remove this pointer. This patch fixes the issues by removing the MT_LIST-based architecture, and replacing it by a simple ebmbtree + rwlock design. The pointer to the task is not stored anymore in the ckch_store, but instead it is stored in the acme_tasks tree. Finding a task is done by doing a lookup on this tree with a RDLOCK. Instead of checking if store->acme_task is not NULL, a lookup is also done. This allow to remove the stuck "acme_task" pointer in the store, which was preventing to restart an acme task when the previous failed for this specific certificate. Must be backported in 3.2.	2025-11-13 15:18:12 +01:00
Frederic Lecaille	d84463f9f6	MINOR: quic-be: validate the 0-RTT transport parameters During 0-RTT sessions, some server transport parameters are reused after having been save from previous sessions. These parameters must not be reduced when it resends them. The client must check this is the case when some early data are accepted by the server. This is what is implemented by this patch. Implement qc_early_tranport_params_validate() which checks the new server parameters are not reduced. Also implement qc_ssl_eary_data_accepted() which was not implemented for TLS stack without 0-RTT support (for instance wolfssl). That said this function was no more used. This is why the compilation against wolfssl could not fail.	2025-11-13 14:04:31 +01:00
Frederic Lecaille	6419b9f204	MEDIUM: quic-be: enable the use of 0-RTT This patch allows the use of 0-RTT feature on QUIC server lines with "allow-0rtt" option. In fact 0-RTT is really enabled only if ssl_sock_srv_try_reuse_sess() successfully manages to reuse the SSL session and the chosen application protocol from previous connections. Note that, at this time, 0-RTT works only with quictls and aws-lc as TLS stack. (0-RTT does not work at all (even for QUIC frontends) with libressl).	2025-11-13 14:04:31 +01:00
Frederic Lecaille	a4bbbc75db	MINOR: quic-be: Send post handshake frames from list of frames (0-RTT) This patch is required to make 0-RTT work. It modifies the prototype of quic_build_post_handshake_frames() to send post handshake frames from a list of frames in place of the application encryption level (used as <qc->ael> local variable). This patch does not modify at all the current QUIC stack behavior (even for QUIC frontends). It must be considered as a preparation for the code to come about 0-RTT support for QUIC backends.	2025-11-13 14:04:31 +01:00
Frederic Lecaille	6e14365a5b	MEDIUM: quic-be: modify ssl_sock_srv_try_reuse_sess() to reuse backend sessions (0-RTT) This function is called for both TCP and QUIC connections to reuse SSL sessions saved by ssl_sess_new_srv_cb() callback called upon new SSL session creation. In addition to this, a QUIC SSL session must reuse the ALPN and some specific QUIC transport parameters. This is what is added by this patch for QUIC 0-RTT sessions. Note that for now on, ssl_sock_srv_try_reuse_sess() may fail for QUIC connections if it did not managed to reuse the ALPN. The caller must be informed of such an issue. It must not enable 0-RTT for the current session in this case. This is impossible without ALPN which is required to start a mux. ssl_sock_srv_try_reuse_sess() is modified to always succeeds for TCP connections.	2025-11-13 14:04:31 +01:00
Frederic Lecaille	5309dfb56b	MINOR: quic-be: Save the backend 0-RTT parameters For both TCP and QUIC connections, this is ssl_sess_new_srv_cb() callback which is called when a new SSL session is created. Its role is to save the session to be reused for the next sessions. This patch modifies this callback to save the QUIC parameters to be reused for the next 0-RTT sessions (or during SSL session resumption). The already existing path_params->nego_alpn member is used to store the ALPN as this is done for TCP alongside path_params->tps new quic_early_transport_params struct used to save the QUIC transport parameters to be reused for 0-RTT sessions.	2025-11-13 14:04:31 +01:00
Frederic Lecaille	41e40eb431	MINOR: quic-be: helper quic_reuse_srv_params() function to reuse server params (0-RTT) Implement quic_reuse_srv_params() whose role is to reuse the ALPN negotiated during a first connection to a QUIC backend alongside its transport parameters.	2025-11-13 14:04:31 +01:00
Frederic Lecaille	33564ca54c	MINOR: quic-be: helper functions to save/restore transport params (0-RTT) Define quic_early_transport_params new struct for QUIC transport parameters in relation with 0-RTT. This parameters must be saved during a first session to be reused for 0-RTT next sessions. qc_early_transport_params_cpy() copies the 0-RTT transport parameters to be saved during a first connection to a backend. The copy is made from a quic_transport_params struct to a quic_ealy_transport_params struct. On the contrary, qc_early_transport_params_reuse() copies the transport parameters to be reused for a 0-RTT session from a previous one. The copy is made from a quic_early_transport_params strcut to a quic_transport_params struct. Also add QUIC_EV_EARLY_TRANSP_PARAMS trace event to dump such 0-RTT transport parameters from traces.	2025-11-13 14:04:31 +01:00
Frederic Lecaille	80070fe51c	MEDIUM: quic-be: Parse, store and reuse tokens provided by NEW_TOKEN Add a per thread ist struct to srv_per_thread struct to store the QUIC token to be reused for subsequent sessions. Parse at packet level (from qc_parse_ptk_frms()) these tokens and store them calling qc_try_store_new_token() newly implemented function. This is this new function which does its best (may fail) to update the tokens. Modify qc_do_build_pkt() to resend these tokens calling quic_enc_token() implemented by this patch.	2025-11-13 14:04:31 +01:00
Frederic Lecaille	8f23d4d287	MINOR: quic-be: Parse the NEW_TOKEN frame Rename ->data qf_new_token struct field to ->w_data to distinguish it from ->r_data new field used to parse the NEW_TOKEN frame. Indeed to build the NEW_TOKEN we need to write it to a static buffer into the frame struct. To parse it we only need to store the address of the token field into the RX buffer.	2025-11-13 14:04:31 +01:00
Amaury Denoyelle	5a8728d03a	MEDIUM/OPTIM: quic: alloc quic_conn after CID collision check Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details On Initial packet parsing, a new quic_conn instance is allocated via qc_new_conn(). Then a CID is allocated with its value derivated from client ODCID. On CID tree insert, a collision can occur if another thread was already parsing an Initial packet from the same client. In this case, the connection is released and the packet will be requeued to the other thread. Originally, CID collision check was performed prior to quic_conn allocation. This was changed by the commit below, as this could cause issue on quic_conn alloc failure. commit `4ae29be18c` BUG/MINOR: quic: Possible endless loop in quic_lstnr_dghdlr() However, this procedure is less optimal. Indeed, qc_new_conn() performs many steps, thus it could be better to skip it on Initial CID collision, which can happen frequently. This patch restores the older order of operations, with CID collision check prior to quic_conn allocation. To ensure this does not cause again the same bug, the CID is removed in case of quic_conn alloc failure. This should prevent any loop as it ensures that a CID found in the global tree does not point to a NULL quic_conn, unless if CID is attach to a foreign thread. When this thread will parse a re-enqueued packet, either the quic_conn is already allocated or the CID has been removed, triggering a fresh CID and quic_conn allocation procedure.	2025-11-10 12:10:14 +01:00
Amaury Denoyelle	2623e0a0b7	BUG/MEDIUM: quic: handle collision on CID generation CIDs are provided by haproxy so that the peer can use them as DCID of its packets. Their value is set via a random generator. It happens on several occasions during connection lifetime: * via ODCID derivation if haproxy is the server * on quic_conn init if haproxy is the client * during post-handshake if haproxy is the server * on RETIRE_CONNECTION_ID frame parsing CIDs are stored in a global tree. On ODCID derivation, a check is performed to ensure the CID is not a duplicate value. This is mandatory to properly handle multiple INITIAL packets from the same client on different thread. However, for the other cases, no check is performed for CID collision. As _quic_cid_insert() is silent, the issue is not detected at all. This results in a CID advertized to the peer but not stored in the global one. In the end, this may cause two issues. The first one is that packets from the client which use the new CID will be rejected by haproxy, most probably with a STATELESS_RESET. The second issue is that it can cause a crash during quic_conn release. Indeed, the CID is stored in the quic_conn local tree and thus eb_delete() for the global tree will be performed. As <leaf_p> member is uninit, this results in a segfault. Note that this issue is pretty rare. It can only be observed if running with a high number of concurrent connections in parallel, so that the random generator will provide duplicate values. Patch is still labelled as MEDIUM as this modifies code paths used frequently. To fix this, _quic_cid_insert() unsafe function is completely removed. Instead, quic_cid_insert() can be used, which reports an error code if a collision happens. CID are then stored in the quic_conn tree only after global tree insert success. Here is the solution for each steps if a collision occurs : * on init as client: the connection is completely released * post-handshake: the CID is immediately released. The connection is kept, but it will miss an extra CID. * on RETIRE_CONNECTION_ID parsing: a loop is implemented to retry random generation. It it fails several times, the connection is closed in error. A small convenience change is made to quic_cid_insert(). Output parameter <new_tid> can now be NULL, which is useful as most of the times caller do not care about it. This must be backported up to 2.6.	2025-11-10 12:10:14 +01:00
Amaury Denoyelle	419e5509d8	MINOR: quic: split CID alloc/generation function Split new_quic_cid() function into multiple ones. This patch should not introduce any visible change. The objective is to render CID allocation and generation more modular. The first advantage of this patch is to bring code simplication. In particular, conn CID sequence number increment and insertion into connection tree is simpler than before. Another improvment is also that errors could now be handled easier at each different steps of the CID init. This patch is a prerequisite for the fix on CID collision, thus it must be backported prior to it to every affected version.	2025-11-10 12:10:14 +01:00
Christopher Faulet	ecc2c3a35d	MEDIUM: peers: Remove commitupdate field on stick-tables This stick-table field was atomically updated with the last update id pushed and dumped on the CLI but never used otherwise. And all peer sessions share the same id because it is a stick-table info. So the info in peers dump is pretty limited. So, let's remove it.	2025-11-07 12:17:53 +01:00
Ben Kallus	d5ca3bb3b4	IMPORT: cebtree: Replace offset calculation with offsetof to avoid UB Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details This is the same as the equivalent fix in ebtree: The C standard specifies that it's undefined behavior to dereference NULL (even if you use & right after). The hand-rolled offsetof idiom &(((s*)NULL)->f) is thus technically undefined. This clutters the output of UBSan and is simple to fix: just use the real offsetof when it's available. This is cebtree commit 2d08958858c2b8a1da880061aed941324e20e748.	2025-11-07 07:32:58 +01:00
Willy Tarreau	14087e48b9	MINOR: tools: add env_suggest() to suggest alternate variable names The purpose here is to look in the environment for a variable whose name looks like the provided one. This will be used to try to auto- correct misspelled environment variables that would silently be turned to an empty string.	2025-11-06 19:57:44 +01:00
Willy Tarreau	a4d78dd4f5	MINOR: tools: add support for ist to the word fingerprinting functions The word fingerprinting functions are used to compare similar words to suggest a correctly spelled one that looks like what the user proposed. Currently the functions only support const char*, but there's no reason for this, and it would be convenient to support substrings extracted from random pieces of configurations. Here we're adding new variants "_with_len" that take these ISTs and which are in fact a slight change of the original ones that the old ones now rely on.	2025-11-06 19:57:44 +01:00
Willy Tarreau	0144426dfb	BUG/MEDIUM: server: close a race around ready_srv when deleting a server When a server is being disabled or deleted, in case it matches the backend's ready_srv, this one is reset. However it's currently done in a non-atomic way when the server goes down, and that could occasionally reset the entry matching another server, but more importantly if in parallel some requests are dequeued for that server, it may re-appear there after having been removed, leading to a possible crash once it is fully removed, as shown in issue #3177. Let's make sure we reset the pointer when detaching the server from the proxy, and use a CAS in both cases to only reset this server. This fix needs to be backported to 3.2. There, srv_detach() is in server.c instead of server.h. Thanks to Basha Mougamadou for the detailed report and the useful backtraces.	2025-11-06 19:57:44 +01:00
Christopher Faulet	a1b5325a7a	MINOR: channel: Remove total field from channels The <total> field in the channel structure is now useless, so it can be removed. The <bytes_in> field from the SC is used instead. This patch is related to issue #1617.	2025-11-06 15:01:29 +01:00
Christopher Faulet	1effe0fc0a	MINOR: applet: Add function to get amount of data in the output buffer The helper function applet_output_data() returns the amount of data in the output buffer of an applet. For applets using the new API, it is based on data present in the outbuf buffer. For legacy applets, it is based on input data present in the input channel's buffer. The HTX version, applet_htx_output_data(), is also available This patch is related to issue #1617.	2025-11-06 15:01:29 +01:00
Christopher Faulet	4991a51208	MINOR: stats: Add stats about request and response bytes received and sent In previous patches, these counters were added per frontend, backend, server and listener. With this patch, these counters are reported on stats, including promex. Note that the stats file minor version was incremented by one because the shm_stats_file_object struct size has changed. This patch is related to issue #1617.	2025-11-06 15:01:29 +01:00
Christopher Faulet	0084baa6ba	MINOR: counters: Remove bytes_in and bytes_out counter from fe/be/srv/li bytes_in and bytes_out counters per frontend, backend, listener and server were removed and we now rely on, respectively on, req_in and res_in counters. This patch is related to issue #1617.	2025-11-06 15:01:29 +01:00
Christopher Faulet	567df50d91	MINOR: stream: Remove bytes_in and bytes_out counters from stream per-stream bytes_in and bytes_out counters was removed and replaced by req.in and res.in. Coorresponding samples still exists but replies on new counters. This patch is related to issue #1617.	2025-11-06 15:01:29 +01:00
Christopher Faulet	1c62a6f501	MINOR: counters: Add req_in/req_out/res_in/res_out counters for fe/be/srv/li Thanks to the previous patch, and based on info available on the stream, it is now possible to have counters for frontends, backends, servers and listeners to report number of bytes received and sent on both sides. This patch is related to issue #1617.	2025-11-06 15:01:29 +01:00
Christopher Faulet	ac9201f929	MINOR: stream: Add samples to get number of bytes received or sent on each side req.in and req.out samples can now be used to get the number of bytes received by a client and send to the server. And res.in and res.out samples can be used to get the number of bytes received by a server and send to the client. These info are stored in the logs structure inside a stream. This patch is related to issue #1617.	2025-11-06 15:01:28 +01:00
Christopher Faulet	629fbbce19	MINOR: stconn: Add counters to SC to know number of bytes received and sent <bytes_in> and <bytes_out> counters were added to SC to count, respectively, the number of bytes received from an endpoint or sent to an endpoint. These counters are updated for connections and applets. This patch is related to issue #1617.	2025-11-06 15:01:28 +01:00
Willy Tarreau	5fe4677231	MINOR: server: move the lock inside srv_add_idle() Almost all callers of _srv_add_idle() lock the list then call the function. It's not the most efficient and it requires some care from the caller to take care of that lock. Let's change this a little bit by having srv_add_idle() that takes the lock and calls _srv_add_idle() that is now inlined. This way callers don't have to handle the lock themselves anymore, and the lock is only taken around the sensitive parts, not the function call+return. Interestingly, perf tests show a small perf increase from 2.28-2.32M RPS to 2.32-2.37M RPS on a 128-thread system.	2025-11-06 13:16:24 +01:00
William Lallemand	546c67d137	MINOR: acme: generate a temporary key pair This patch provides two functions acme_gen_tmp_pkey() and acme_gen_tmp_x509(). These functions generates a unique keypair and X509 certificate that will be stored in tmp_x509 and tmp_pkey. If the key pair or certificate was already generated they will return the existing one. The key is an RSA2048 and the X509 is generated with a expiration in the past. The CN is "expired". These are just placeholders to be used if we don't have files.	2025-11-06 11:56:27 +01:00
William Lallemand	1df55b441b	MEDIUM: ssl/ckch: use ckch_store instead of ckch_data for ckch_conf_kws This is an API change, instead of passing a ckch_data alone, the ckch_conf_kws.func() is called with a ckch_store. This allows the callback to access the whole ckch_store, with the ckch_conf and the ckch_data. But it requires the ckch_conf to be actually put in the ckch_store before.	2025-11-06 11:56:27 +01:00
Amaury Denoyelle	b9809fe0d0	MINOR: quic: remove <mux_state> field Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details This patch removes <mux_state> field from quic_conn structure. The purpose of this field was to indicate if MUX layer above quic_conn is not yet initialized, active, or already released. It became tedious to properly set it as initialization order of the various quic_conn/conn/MUX layers now differ between the frontend and backend sides, and also depending if 0-RTT is used or not. Recently, a new change introduced in connect_server() will allow to initialize QUIC MUX earlier if ALPN is cached on the server structure. This had another level of complexity. Thus, this patch removes <mux_state> field completely. Instead, a new flag QUIC_FL_CONN_XPRT_CLOSED is defined. It is set at a single place only on close XPRT callback invokation. It can be mixed with the new utility functions qc_wait_for_conn()/qc_is_conn_ready() to determine the status of conn/MUX layers now without an extra quic_conn field.	2025-11-05 14:03:34 +01:00
Willy Tarreau	096999ee20	BUG/MEDIUM: connections: permit to permanently remove an idle conn There's currently a function conn_delete_from_tree() which is used to detach an idle connection from the tree it's currently attached to so that it is no longer found. This function is used in three circumstances: - when picking a new connection that no longer has any avail stream - when temporarily working on the connection from an I/O handler, in which case it's re-added at the end - when killing a connection The 2nd case above is quite specific, as it requires to preserve the CO_FL_LIST_MASK flags so that the connection can be re-inserted into the proper tree when leaving the handler. However, there's a catch. When killing a connection, we want to be certain it will not be reinserted into the tree. The flags preservation is causing a tiny race if an I/O happens while the connection is in the kill list, because in this case the I/O handler will note the connection flags, do its work, then reinsert the connection where it believed it was, then the connection gets purged, and another user can find it in the tree. The issue is very difficult to reproduce. On a 128-thread machine it happens in H2 around 500k req/s after around 50M requests. In H1 it happens after around 1 billion requests. The fix here consists in passing an extra argument to the function to indicate if the removal is permanent or not. When it's permanent, the function will clear the associated flags. The callers were adjusted so that all those dequeuing a connection in order to kill it do it permanently and all other ones do it only temporarily. A slightly different approach could have worked: the function could always remove all flags, and the callers would need to restore them. But this would require trickier modifications of the various call places, compared to only passing 0/1 to indicate the permanent status. This will need to be backported to all stable versions. The issue was at least reproduced since 3.1 (not tested before). The patch will need to be adjusted for 3.2 and older, because a 2nd argument "thr" was added in 3.3, so the patch will not apply to older versions as-is.	2025-11-05 11:08:25 +01:00
Olivier Houchard	7d4aa7b22b	BUG/MEDIUM: server: Add a rwlock to path parameter Add a rwlock to control the server's path_parameter, to make sure multiple threads don't set it at the same time, and it can't be seen in an inconsistent state. Also don't set the parameter every time, only set them if they have changed, to prevent needless writes. This does not need to be backported.	2025-11-04 18:47:34 +01:00
Amaury Denoyelle	efe60745b3	MINOR: quic: remove connection arg from qc_new_conn() Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details This patch is similar to the previous one, this time dealing with qc_new_conn(). This function was asymetric on frontend and backend side, as connection argument was set only in the latter case. This was required prior due to qc_alloc_ssl_sock_ctx() signature. This has changed with the previous patch, thus qc_new_conn() can also be realigned on both FE and BE sides. <conn> member of quic_conn instance is always set outside it, in qc_xprt_start() on the backend case.	2025-11-04 17:47:42 +01:00
Amaury Denoyelle	5a17cade4f	MINOR: quic: do not set conn member if ssl_sock_ctx ssl_sock_ctx is a generic object used both on TCP/SSL and QUIC stacks. Most notably it contains a <conn> member which is a pointer to struct connection. On QUIC frontend side, this member is always set to NULL. Indeed, connection is only created after handshake completion. However, this has changed for backend side, where the connection is instantiated prior to its quic_conn counterpart. Thus, ssl_sock_ctx member would be set in this case as a convenience for use later in qc_ssl_do_hanshake(). However, this method was unsafe as the connection can be released, without resetting ssl_sock_ctx member. Thus, the previous patch fixes this by using on <conn> member through the quic_conn instance which is the proper way. Thus, this patch resets ssl_sock_ctx <conn> member to NULL. This is deemed the cleanest method as it ensures that both frontend and backend sides must not use it anymore.	2025-11-04 17:38:09 +01:00
Willy Tarreau	fd012b6c59	OPTIM: proxy: move atomically access fields out of the read-only ones Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Perf top showed that h1_snd_buf() was having great difficulties accessing the proxy's server_id_hdr_name field in the middle of the headers loop. Moving the assignment out of the loop to a local variable moved the problem there as well: \| if (!(h1m->flags & H1_MF_RESP) && isttest(h1c->px->server_id_hdr_n 0.10 \|20b0: mov -0x120(%rbp),%rdi 1.33 \| mov 0x60(%rdi),%r10 0.01 \| test %eax,%eax 0.18 \| jne 2118 12.87 \| mov 0x350(%r10),%rdi 0.01 \| test %rdi,%rdi 0.05 \| je 2118 \| mov 0x358(%r10),%r11 It turns out that there are several atomically accessed fields in its vicinity, causing the cache line to bounce all the time. Let's collect the few frequently changed fields and place them together at the end of the structure, and plug the 32-bit hole with another isolated field. Doing so also reduced a little bit the cost of decrementing be->be_conn in process_stream(), and overall the HTTP/1 performance increased by about 1% both on ARM and x86_64.	2025-11-03 13:54:49 +01:00
Amaury Denoyelle	6bfabfdc77	OPTIM: backend: skip conn reuse for incompatible proxies Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details When trying to reuse a backend connection, a connection hash is calculated to match an entry with similar parameters. Previously, this operation was skipped if the stream content wasn't based on HTTP, as it would have been incompatible with http-reuse. With the introduction of SPOP backends, this condition was removed, so that it can also benefit from connection reuse. However, this means that now hash calcul is always performed when connecting to a server, even for TCP or log backends. This is unnecessary as these proxies cannot perform connection reuse. Note also that reuse mode is resetted on postparsing for incompatible backends. This at least guarantees that no tree lookup will be performed via be_reuse_connection(). However, connection lookup is still performed in the session via session_get_conn() which is another unnecessary operation. Thus, this patch restores the condition so that reuse operations are now entirely skipped if a backend mode is incompatible. This is implemented via a new utility function named be_supports_conn_reuse(). This could be backported up to 3.1, as this commit could be considered as a performance regression for tcp/log backend modes.	2025-11-03 10:43:50 +01:00
Amaury Denoyelle	14a6468df5	MINOR: quic: reject conf with QUIC servers if not compiled Ensure that QUIC support is compiled into haproxy when a QUIC server is configured. This check is performed during _srv_parse_finalize() so that it is detected both on configuration parsing and when adding a dynamic server via the CLI. Note that this changes the behavior of srv_is_quic() utility function. Previously, it always returned false when QUIC support wasn't compiled. With this new check introduced, it is now guaranteed that a QUIC server won't exist if compilation support is not active. Hence srv_is_quic() does not rely anymore on USE_QUIC define.	2025-10-31 11:32:20 +01:00
Willy Tarreau	b0e8edaef2	MEDIUM: mux-h2: do not needlessly refrain from sending data early The mux currently refrains from sending data before H2_CS_FRAME_H, i.e. before the peer's SETTINGS frame was received. While it makes sense on the frontend, it's causing harm on the backend because it forces the first request to be sent in two halves over an extra RTT: first the preface and settings, second the request once the settings are received. This is totally contrary to the philosophy of the H2 protocol, consisting in permitting the client to send as soon as possible. Actually what happens is the following: - process_stream() calls connect_server() - connect_server() creates a connection, and if the proto/alpn is guessed or known, the mux is instantiated for the current request. - the H2 init code wakes the h2 tasklet up and returns - process_stream() tries to send the request using h2_snd_buf(), but that one sees that we're before H2_CS_FRAME_H, refrains from doing so and returns. - process_stream() subscribes and quits - the h2 tasklet can now execute to send the preface and settings, which leave as a first TCP segment. The connection is ready. - the iocb is woken again once the server's SETTINGS frame is received, turning the connection to the H2_CS_FRAME_H state, and the iocb wake up process_stream(). - process_stream() executes again and can try to send again. - h2_snd_buf() is called and finally sends the request as a second TCP segment. Not only this is inefficient, but it also renders 0-RTT and TFO impossible on H2 connections. When 0-RTT is used, only the preface and settings leave as early data (the very first data of that connection), which is totally pointless. In order to fix this, we have to go through a few steps: - first we need to let data be sent to a server immediately after the SETTINGS frame was sent (i.e. in H2_CS_SETTINGS1 state instead of H2_CS_FRAME_H). However, some protocol extensions are advertised by the server using SETTINGS (e.g. RFC8441) and some requests might need to know the existence of such extensions. For this reason we're adding a new h2c flag, H2_CF_SETTINGS_NEEDED, which indicates that some operations were not done because a server's SETTINGS frame is needed. This is set when trying to send a protocol upgrade or extended CONNECT during H2_CS_SETTINGS1, indicating that it's needed to wait for H2_CS_FRAME_H in this case. The flag is always set on frontend connections. This is what is being done in this patch. - second, we need to be able to push the preface opportunistically with the first h2_snd_buf() so that it's not needed to wake the tasklet up just to send that and wake process_stream() again. This will be in a separate patch. By doing the first step, we're at least saving one needless tasklet wakeup per connection (~9%), which results in ~5% backend connection rate increase.	2025-10-30 18:16:54 +01:00
William Lallemand	1e2f920be6	MINOR: listener: implement bind_conf_find_by_name() Returns a pointer to the first bind_conf matching <name> in a frontend <front>. When name is prefixed by a @ (@<filename>:<linenum>), it tries to look for the corresponding filename and line of the configuration file. NULL is returned if no match is found.	2025-10-30 10:37:42 +01:00
sftcd	23f5cbb411	MINOR: ssl/ech: add logging and sample fetches for ECH status and outer SNI This patch adds functions to expose Encrypted Client Hello (ECH) status and outer SNI information for logging and sample fetching. Two new helper functions are introduced in ech.c: - conn_get_ech_status() places the ECH processing status string into a buffer. - conn_get_ech_outer_sni() retrieves the outer SNI value if ECH succeeded. Two new sample fetch keywords are added: - "ssl_fc_ech_status" returns the ECH status string. - "ssl_fc_ech_outer_sni" returns the outer SNI value seen during ECH. These allow ECH information to be used in HAProxy logs, ACLs, and captures.	2025-10-30 10:37:30 +01:00
sftcd	dba4fd248a	MEDIUM: ssl/ech: config and load keys This patch introduces the USE_ECH option in the Makefile to enable support for Encrypted Client Hello (ECH) with OpenSSL. A new function, load_echkeys, is added to load ECH keys from a specified directory. The SSL context initialization process in ssl_sock.c is updated to load these keys if configured. A new configuration directive, `ech`, is introduced to allow users to specify the ECH key directory in the listener configuration.	2025-10-30 10:37:12 +01:00
Remi Tricot-Le Breton	dc35a3487b	MINOR: ssl: Do not dump decrypted privkeys in 'dump ssl cert' A private keys that is password protected and was decoded during init thanks to the password obtained thanks to 'ssl-passphrase-cmd' should not be dumped via 'dump ssl cert' CLI command.	2025-10-29 10:54:17 +01:00
Remi Tricot-Le Breton	478dd7bad0	MEDIUM: ssl: Add certificate password callback that calls external command When a certificate is protected by a password, we can provide the password via the dedicated pem_password_cb param provided to PEM_read_bio_PrivateKey. HAProxy will fetch the password automatically during init by calling a user-defined external command that should dump the right password on its standard output (see new 'ssl-passphrase-cmd' global option).	2025-10-29 10:54:17 +01:00
Remi Tricot-Le Breton	1ec59d3426	MINOR: init: Make devnullfd global and create it earlier in init The devnull fd might be needed during configuration parsing, if some options require to fork/exec for instance. So we now create it much earlier in the init process and without depending on the '-q' or '-d' parameters.	2025-10-29 10:54:17 +01:00
Willy Tarreau	2d7e3ddd4a	BUG/MEDIUM: cli: do not return ACKs one char at a time Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Since 3.0 where the CLI started to use rcv_buf, it appears that some external tools sending chained commands are randomly experiencing failures. Each time this happens when the whole command is sent as a single packet, immediately followed by a close. This is not a correct way to use the CLI but this has been working for ages for simple netcat-based scripts, so we should at least try to preserve this. The cause of the failure is that the first LF that acks a command is immediately sent back to the client and rejected due to the closed connection. This in turn forwards the error back to the applet which aborts its processing. Before 3.0 the responses would be queued into the buffer, then sent back to the channel, and would all fail at once. This changed when snd_buf/rcv_buf were implemented because the applets are much more responsive and since they yield between each command, they can deliver one ACK at a time that is immediately forwarded down the chain. An easy way to observe the problem is to send 5 map updates, a shutdown, and immediately close via tcploop, and in parallel run a periodic "show map" to count the number of elements: $ tcploop -U /tmp/sock1 C S:"add map #0 1 1; add map #0 2 2; add map #0 3 3; add map #0 4 4; add map #0 5 5\n" F K Before 3.0, there would always be 5 elements. Since 3.0 and before `20ec1de214` ("MAJOR: cli: Refacor parsing and execution of pipelined commands"), almost always 2. And since that commit above in 3.2, almost always one. Doing the same using socat or netcat shows almost always 5... It's entirely timing-dependent, and might even vary based on the RTT between the client and haproxy! The approach taken here consists in doing the same principle as MSG_MORE or Nagle but on the response buffer: the applet doesn't need to send a single ACK for each command when it has already been woken up and is scheduled to come back to work. It's fine (and even desirable) that ACKs are grouped in a single packet as much as possible. For this reason, this patch implements APPCTX_CLI_ST1_YIELD, a new CLI flag which indicates that the applet left in yielding condition, i.e. it has not finished its work. This flag is used by .rcv_buf to hold pending data. This way we won't return partial responses for no reason, and we can continue to emulate the previous behavior. One very nice benefit to this is that it saves huge amounts of CPU on the client. In the test below that tries to update 1M map entries, the CPU used by socat went from 100% to 0% and the total transfer time dropped by 28%: before: $ time awk 'BEGIN{ printf "prompt i\n"; for (i=0;i<1000000;i++) { \ printf "add map #0 %d %d\n",i,i,i }}' \| socat /tmp/sock1 - >/dev/null real 0m2.407s user 0m1.485s sys 0m1.682s after: $ time awk 'BEGIN{ printf "prompt i\n"; for (i=0;i<1000000;i++) { \ printf "add map #0 %d %d\n",i,i,i }}' \| socat /tmp/sock1 - >/dev/null real 0m1.721s user 0m0.952s sys 0m0.057s The difference is also quite visible on the number of syscalls during the test (for 1k updates): before: % time seconds usecs/call calls errors syscall ------ ----------- ----------- --------- --------- ---------------- 100.00 0.071691 0 100001 sendmsg after: % time seconds usecs/call calls errors syscall ------ ----------- ----------- --------- --------- ---------------- 100.00 0.000011 1 9 sendmsg This patch will need to be backported to 3.0, and depends on these two patches to be backported as well: MINOR: applet: do not put SE_FL_WANT_ROOM on rcv_buf() if the channel is empty MINOR: cli: create cli_raw_rcv_buf() from the generic applet_raw_rcv_buf()	2025-10-27 16:57:07 +01:00
Olivier Houchard	837351245a	BUG/MEDIUM: mt_list: Use atomic operations to prevent compiler optims As a folow-up to `f40f5401b9`, explicitely use atomic operations to set the prev and next fields, to make sure the compiler can't assume anything about it, and just does it. This should be backported after `f40f5401b9` up to 2.8.	2025-10-24 13:34:41 +02:00
Willy Tarreau	2ec6df59bf	BUILD: openssl-compat: fix build failure with OPENSSL=0 and KTLS=1 The USE_KTLS test is currently being done outside of the USE_OPENSSL guard so disabling USE_OPENSSL still results in build failures on libcs built with support for kernels before 4.17, because we enable KTLS by default on linux. Let's move the KTLS block inside the USE_OPENSSL guard instead. No backport is needed since KTLS is only in 3.3.	2025-10-24 10:45:02 +02:00
Aurelien DARRAGON	d655ed5f14	BUG/MAJOR: stats-file: ensure shm_stats_file_object struct mapping consistency (2nd attempt) Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details This is a second attempt at fixing issues on 32bits systems which would trigger the following BUG_ON() statement: FATAL: bug condition "sizeof(struct shm_stats_file_object) != 544" matched at src/stats-file.c:825 shm_stats_file_object struct size changed, is is part of the exported API: ensure all precautions were taken (ie: shm_stats_file version change) before adjusting this This is a drop-in replacement for `d30b88a6c` + `4693ee0ff`, as suggested by Willy. Indeed, on supported platforms unsigned int can be assumed to be 4 bytes long, and long can be assumed to be 8 bytes long. As such, the previous attempt was overkill and added unecessary maintenance complexity which could result in bugs if not used properly. Moreover, it would only partially solve the issue, since on little endian vs big endian architectures, the provisioned memory areas (originating from the same shm stats file) could be read differently by the host. Instead we fix the aligments issues, and this alone helps to ensure struct memory consistency on 64 vs 32bits platforms. It was tested on both i386 and i586. last_change and last_sess counters are now stored as unsigned int, as it helped to fix the alignment issues and they were found to be used as 32bits integers anyway. Thanks to Willy for problem analysis and the patch proposal. No backport needed.	2025-10-24 09:35:38 +02:00
Aurelien DARRAGON	a931779dde	Revert "MINOR: compiler: add FIXED_SIZE(size, type, name) macro" This reverts commit `466a603b59`. Due to the last 2 commits, this macro is now unused, and will probably never be used, so let's get rid of that for now.	2025-10-24 09:35:34 +02:00
Aurelien DARRAGON	8277f891d2	Revert "MEDIUM: freq-ctr: use explicit-size types for freq-ctr struct" This reverts commit `4693ee0ff7`. As discussed in GH #3168, this works but it is not the proper way to fix the issue. See following commits.	2025-10-24 09:35:29 +02:00
Aurelien DARRAGON	c0d952ccc1	Revert "BUG/MAJOR: stats-file: ensure shm_stats_file_object struct mapping consistency" This reverts commit `d30b88a6cc`. As discussed in GH #3168, this works but it is not the proper way to fix the issue. See following commits.	2025-10-24 09:35:25 +02:00
Amaury Denoyelle	7ba4b0ad5f	BUG/MINOR: quic: rename and duplicate stream settings Some checks failed Contrib / build (push) Has been cancelled Details alpine/musl / gcc (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details Several settings can be set to control stream multiplexing and associated receive window. Previously, all of these settings were configured using prefix "tune.quic.frontend.", despite being applied blindly on both sides. Fix this by duplicating these settings specific to frontend and backend side. Options are also renamed to use the standardize prefix "tune.quic.[be\|fe].stream." notation. Also, each option is individually renamed to better reflect its purpose and hide technical details relative to QUIC transport parameter naming : * max-data-size -> stream.rxbuf * max-streams-bidi -> stream.max-concurrent * stream-data-ratio -> stream.data-ratio No need to backport.	2025-10-23 16:49:20 +02:00
Amaury Denoyelle	d5142706f8	BUG/MINOR: quic: split option for congestion max window size	2025-10-23 16:49:20 +02:00
Amaury Denoyelle	33afba0dda	BUG/MINOR: quic: split max-idle-timeout option for FE/BE usage Streamline max-idle-timeout option. Rename it to use the newer cohesive naming scheme 'tune.quic.fe\|be.'. Two different fields were already defined in global struct. These fields are moved into quic_tune along with other QUIC settings. However, no parser was defined for backend option, this commit fixes this. No need to backport this.	2025-10-23 16:49:20 +02:00
Amaury Denoyelle	5bc659a4a2	MINOR: quic: rename frontend sock-per-conn setting On frontend side, a quic_conn can have a dedicated FD or use the listener one. These different modes can be activated via a global QUIC tune setting. This patch adjusts the option. First, it is renamed to the more meaningful name 'tune.quic.fe.sock-per-conn'. Also, arguments are now either 'default-on' or 'force-off'. The objective is to better highlight reliationship with 'quic-socket' bind option. The older option is deprecated and will be removed in 3.5.	2025-10-23 16:49:20 +02:00
Amaury Denoyelle	a14c6cee17	MINOR: quic: rename retry-threshold setting A QUIC global tune setting is defined to be able to force Retry emission prior to handshake. By definition, this ability is only supported by QUIC servers, hence it is a frontend option only. Rename the option to use "fe" prefix. The old option name is deprecated and will be removed in 3.5	2025-10-23 16:49:20 +02:00
Amaury Denoyelle	d248c5bd21	MINOR: quic: rename max Tx mem setting QUIC global memory can be limited across the entire process via a global tune setting. Previously, this setting used to misleading "frontend" prefix. As this is applied as a sum between all QUIC connections, both from frontend and backend sides, remove the prefix. The new option name is "tune.quic.mem.tx-max". The older option name is deprecated and will be removed in 3.5.	2025-10-23 16:49:20 +02:00
Amaury Denoyelle	9bfe9b9e21	MINOR: quic: split Tx options for FE/BE usage This patch is similar to the previous one, except that it is focused on Tx QUIC settings. It is now possible to toggle GSO and pacing on frontend and backend sides independently. As with previous patch, option are renamed to use "fe/be" unified prefixes. This is part of the current serie of commits which unify QUI settings. Older options are deprecated and will be removed on 3.5 release.	2025-10-23 16:49:20 +02:00
Amaury Denoyelle	33a8cb87a9	MINOR: quic: split congestion controler options for FE/BE usage Various settings can be configured related to QUIC congestion controler. This patch duplicates them to be able to set independent values on frontend and backend sides. As with previous patch, option are renamed to use "fe/be" unified prefixes. This is part of the current serie of commits which unify QUIC settings. Older options are deprecated and will be removed on 3.5 release.	2025-10-23 16:49:20 +02:00
Amaury Denoyelle	7640e9a9ee	MINOR: quic: duplicate glitches FE option on BE side Previously, QUIC glitches support was only implemented for frontend side. Extend this so that the option can be specified separately both on frontend and backend sides. Function _qcc_report_glitch() now retrieves the relevant max value based on connection side. In addition to this, option has been renamed to use "fe/be" prefixes. This is part of the current serie of commits which unify QUIC settings. Older options are deprecated and will be removed on 3.5 release.	2025-10-23 16:49:20 +02:00
Amaury Denoyelle	b34cd0b506	MINOR: quic: rename "no-quic" to "tune.quic.listen" Rename the option to quickly enable/disable every QUIC listeners. It now takes an argument on/off. The documentation is extended to reflect the fact that QUIC backend are not impacted by this option. The older keyword is simply removed. Deprecation is considered unnecessary as this setting is only useful during debugging.	2025-10-23 16:47:58 +02:00
Amaury Denoyelle	42e5ec6519	MINOR: quic: prepare support for options on FE/BE side A major reorganization of QUIC settings is going to be performed. One of its objective is to clearly define options which can be separately configured on frontend and backend proxy sides. To implement this, quic_tune structure is extended to support fe and be options. A set of macros/functions is also defined : it allows to retrieve an option defined on both sides with unified code, based on proxy side of a quic_conn/connection instance.	2025-10-23 15:06:01 +02:00
Olivier Houchard	f40f5401b9	BUG/MEDIUM: mt_lists: Avoid el->prev = el->next = el Avoid setting both el->prev and el->next on the same line. The goal is to set both el->prev and el->next to el, but a naive compiler, such as when we're using -O0, will set el->next first, then will set el->prev to the value of el->next, but if we're unlucky, el->next will have been set to something else by another thread. So explicitely set both to what we want. This should be backported up to 2.8.	2025-10-23 14:43:51 +02:00

... 2 3 4 5 6 ...

9024 commits