haproxy

mirror of https://github.com/haproxy/haproxy.git synced 2026-06-10 17:32:03 -04:00

Author	SHA1	Message	Date
Willy Tarreau	2f61566b03	CLEANUP: mux-h1: remove the unneeded test on conn->owner in h1s_finish_detach() Some checks are pending Contrib / admin/halog/ (push) Waiting to run Details Contrib / dev/flags/ (push) Waiting to run Details Contrib / dev/haring/ (push) Waiting to run Details Contrib / dev/hpack/ (push) Waiting to run Details Contrib / dev/poll/ (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details There was a test below the "release" label on conn->owner to decide whether to kill the connection or not. But this test is not needed, because: - for frontends, it's always set so the test never matches - for backends, it was NULL on the second stream once a request was being reused from an idle pool, so it couldn't be used to discriminate between connections. In practice, the goal was to try to detect certain dead connections but all cases leading to such connections are either already handled in the tests before (which don't reach this label), or are handled by the other conditions. Thus, let's remove this confusing test.	2026-04-21 08:45:46 +02:00
Willy Tarreau	2e26e427a2	CLEANUP: mux-h1: avoid using conn->owner in uncertain areas Some places use conn->owner to retrieve the session. It's valid because each time it is done, it's on the frontend, though it's not always 100% obvious and sometimes requires deep code analysis. Let's clarify these points and even rely on an intermediary variable to make it clearer. One case where the owner couldn't differ from the session without being NULL was also eliminated.	2026-04-21 08:45:46 +02:00
Willy Tarreau	d93c53b0df	MEDIUM: session: always reset the conn->owner on backend when installing mux When installing a mux on the backend, unless we have a good reason for keeping the session set in conn->owner, we must reset it. Having the session there just hides potential bugs and prevents certain tests from being properly done. Now it is much clearer: conn->owner remains set to the session on frontend connections, is set to the session when the connection is private or assimilated private and belongs to the session list, or is NULL.	2026-04-21 08:45:46 +02:00
Willy Tarreau	90b2154d93	MEDIUM: muxes: always set conn->owner to the session that owns the connection When an idle connection is private or considered private, session_add_conn() is called to add it to the list of connections owned by the session. But in case of allocation failure, the session is not set, which results in a long list of possible situations that are all corner cases which are difficult to test (and debug). This commit relies on the fact that it is already permitted to have conn->owner pointing to a session even if the connection couldn't be added to the session's list, as this was already the case in conn_backend_get() when dealing with HOL_RISK. Also as seen in commit `3aab17bd56` added in 2.4, it is already possible to have conn->owner set with the connection not being in a list, and only the list element is checked for this. This commit modifies session_add_conn() to always set conn->onwer, even if the list element couldn't be allocated. This way it's possible to always refer to conn->owner to find the session owning a private conn even in case of failure to allocate an entry. This requires to change the checks on conn->owner to a check of the list element to see if the connection belongs to a session, the pre-assignment of sess to conn->owner in conn_backend_get() is no longer needed, same for the pre-assignment in http_wait_for_response(), and that's all. The H1 mux remained unchanged because since it cannot multiplex, in case it fails to allocate a pconn, it instantly kills the connection.	2026-04-21 08:45:46 +02:00
Willy Tarreau	9141d87830	BUG/MINOR: sample: adjust dependencies for channel output bytes counters The bytes_in, bytes_out, {req,res}.bytes_{in,out} sample fetch functions are marked as internal dependencies only. But that's not exact, they are statistics. Request traffic (bytes_in, req.bytes) is usable starting from the request, while response traffic (bytes_out, res.bytes) is usable as soon as a response begins to be received, and all are valid till the end of the transaction. The impact is that the log-format below: log-format "req.bytes_in=%[req.bytes_in] req.bytes_out=%[req.bytes_out] res.bytes_in=%[res.bytes_in] res.bytes_out=%[res.bytes_out]" is emitted too early and only logs zeroes when uploading 1MB and downloading 1MB: req.bytes_in=0 req.bytes_out=0 res.bytes_in=15288 res.bytes_out=0 This patch marks the request stats RQFIN and the response stats RSFIN, so that they're valid at any moment and the logs backend knows it must wait for the latest moment to emit such a line. With this change, the line above now correctly produces: req.bytes_in=1000157 req.bytes_out=1000157 res.bytes_in=1048629 res.bytes_out=1048629 This should be backported as far as the latest LTS probably, along with these 2 previous patches: BUG/MINOR: log: consider format expression dependencies to decide when to log MINOR: sample: make RQ/RS stats available everywhere	2026-04-21 08:01:07 +02:00
Willy Tarreau	6df10d0802	MINOR: sample: make RQ/RS stats available everywhere Sample fetch functions working on the request/response stats were marked as being only compatible with the log phase. This is a mistake because by definitions, stats can be consulted anywhere from the moment they start to appear. It's only that they are valid as far as the logs. At the moment, no sample fetch function depends on RQFIN, and only res.timer.data depends on RSFIN. But this will be needed to relax certain sample fetch functions (and will need to be backported along with a few other patches).	2026-04-21 08:01:07 +02:00
Willy Tarreau	e51be30f78	BUG/MINOR: log: consider format expression dependencies to decide when to log Log-format properly takes into account the LW_* flags set by the log aliases, however its consideration for the sample fetch expressions is very minimalistic (HTTP y/n). It poses a problem because logging some statistics doesn't work unless some log aliases are involved to force the log to wait till the end. Before this change, the following log-format: log-format "res.timer.data=%[res.timer.data]" would log "res.timer.data=0" regardless of the time taken to transfer data, and the log would be emitted instantly. However, this line: log-format "res.timer.data=%[res.timer.data] %B" would properly log the time taken to transfer the data because %B which carries the log flag LW_BYTES forces the log to wait till the end. This patch makes sure that anything requiring response (headers or body) waits for at least the response, and that anything requiring response body or end of transfer (req/res) waits till the end (LW_BYTES). Thanks to this, the log above is now correct even without the "%B" hack. This should be backported at least till the latest LTS.	2026-04-21 08:01:07 +02:00
William Lallemand	95c400d08e	MINOR: acme: allow IP SAN in certificate request Some checks are pending Contrib / admin/halog/ (push) Waiting to run Details Contrib / dev/flags/ (push) Waiting to run Details Contrib / dev/haring/ (push) Waiting to run Details Contrib / dev/hpack/ (push) Waiting to run Details Contrib / dev/poll/ (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Implement IP in both requestOrder and CSR so a certificate with SAN IPs can be generated.	2026-04-20 18:10:47 +02:00
William Lallemand	0d14bb7473	MINOR: acme: implement draft-ietf-acme-profiles The ACME Profiles extension (draft-ietf-acme-profiles) allows a client to request a specific certificate profile by including a "profile" field in the newOrder request. This lets the CA select the appropriate certificate issuance policy (e.g. "classic", "shortlived") for a given order. A new "profile" keyword is added to the acme section. When set, its value is included in the newOrder JSON payload sent to the CA.	2026-04-20 18:10:35 +02:00
Olivier Houchard	78712c3898	BUG/MEDIUM: checks: Don't forget to set the "alt_proto" field The target address type has been added to checks in commit `d759e60a32`, but as part of that address type is the "alt_proto" field, that was not properly set for dynamic servers, That could lead to checks not working for any protocol that use a non-zero alt_proto, such as QUIC. So set it properly.	2026-04-20 11:59:44 +02:00
William Lallemand	95e9629530	BUILD: ssl/sample: potential null pointer dereference in sample_conv_aes gcc flags aead_tag_trash as potentially NULL at the chunk_memcpy call inside the (!dec && gcm) block, because it cannot correlate the condition with the allocation that only happens in that same branch. Add an explicit NULL check to silence the warning. This was caught by cross-zoo.yml: In file included from include/haproxy/connection.h:28, from src/ssl_sample.c:27: In function ‘b_orig’, inlined from ‘sample_conv_aes’ at src/ssl_sample.c:540:23: include/haproxy/buf.h:80:17: error: potential null pointer dereference [-Werror=null-dereference] 80 \| return b->area; \| ~^~~~~~ In function ‘b_data’, inlined from ‘sample_conv_aes’ at src/ssl_sample.c:540:3: include/haproxy/buf.h💯17: error: potential null pointer dereference [-Werror=null-dereference] 100 \| return b->data; \| ~^~~~~~	2026-04-20 11:00:24 +02:00
Amaury Denoyelle	1f435f031b	BUG/MINOR: xprt_qstrm: reduce max record length check When trying to read QMux transport parameters frame, the record length is checked to ensure it is not bigger than the buffer size. The objective is to detect as soon as possible when receiving data that cannot be handled and to close the connection. In fact, this check is not accurate, as it did not take into account the size of the Record length field itself. This patch fixes the comparison by substracting with the size of the decoded varint. No need to backport.	2026-04-20 10:21:30 +02:00
Amaury Denoyelle	0610b4487b	BUG/MINOR: xprt_qstrm: read record length in 64bits QMux record lengths are encoded as a QUIC varint. Thus in theory, it requires a 64bits integer to be able to read the whole value. In practice, if the record is bigger than bufsize, read operation cannot be completed and an error must be reported. This patch fixes record length decoding both in xprt_qstrm layer, which is now performed in two steps. The value is first read in a 64bits integer instead of a size_t whose size is dependent on the architecture. Result is then checked against bufsize and if inferior stored in the previously used variable (xprt ctx rxrlen member). This should partially fix build issue reported on github #3334. No need to backport.	2026-04-20 09:23:29 +02:00
Willy Tarreau	bb59ba0a98	BUILD: haterm: don't pass size_t to %lu in error messages It fails on 32-bit systems, let's cast it to ulong like in other places. No backport needed.	2026-04-18 11:25:30 +02:00
Amaury Denoyelle	1acf147e2a	MINOR: mux-quic: release BE idle conn after GOAWAY reception Some checks failed Contrib / dev/flags/ (push) Has been cancelled Details Contrib / dev/haring/ (push) Has been cancelled Details Contrib / dev/hpack/ (push) Has been cancelled Details Contrib / dev/poll/ (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details VTest / Alpine+musl, gcc (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details An idle backend connection is useless if a HTTP/3 GOAWAY frame has been received. Indeed, it is forbid to open new stream on such connection. Thus, this patch ensures such connections are removed as soon as possible. This is performed via a new check in qcc_is_dead() on QC_CF_CONN_SHUT flag for backend connections. This ensures that a shut connection is released instead of being inserted in idle list on detach operation. This commits also completes qcc_recv() with a new call to qcc_is_dead() on its ending. This is necessary if GOAWAY is received on an idle connection. For now, this is only checked for backend connections as a GOAWAY is without any real effect for frontend connections. Thus, this extra protection ensures that we do not break by incident QUIC frontend support. qcc_io_recv() also performs qcc_decode_qcs(). However, an extra qcc_is_dead() is not necessary in this case as the following qcc_io_process() already performs it.	2026-04-17 13:28:17 +02:00
Amaury Denoyelle	220b1bf6d9	MEDIUM: h3: prevent new streams on GOAWAY reception Implement the reception of a HTTP/3 GOAWAY frame. This is performed via the new function h3_parse_goaway_frm(). The advertised ID is stored in new <id_shut_r> h3c member. It serves to ensure that a bigger ID is not advertised when receiving multiple GOAWAY frames. GOAWAY frame reception is only really useful on the backend side for haproxy. When this occurs, h3c is now flagged with H3_CF_GOAWAY_RECV. Also, QCC is also updated with new flag QC_CF_CONN_SHUT. This flag indicates that no new stream may be opened on the connection. Callback avail_streams() is thus edited to report 0 in this case.	2026-04-17 13:28:17 +02:00
Amaury Denoyelle	5c8c9fc528	MINOR: h3: simplify GOAWAY local emission Rework GOAWAY emission handling at the HTTP/3 layer. Previously, h3c member <id_goaway> were updated during the connection on each new streams attach. This ID was finally reused when a GOAWAY was emitted. However, this is unnecessary to keep an updated ID during the connection lifetime. Indeed, <largest_bidi_r> QCC member can be used for the same purpose. Note that this is only useful for the frontend side. For a client connection, GOAWAY contains a PUSH ID, thus 0 can be used for now. Thus, <id_goaway> in h3c is renamed <id_shut_l>. Now it is only sent when the GOAWAY is emitted. This allows to reject any streams with a greater ID. This approach is considered simpler. Note that <largest_bidi_r> is not strictly similar to the obsolete <id_goaway>. Indeed, if an error occurs before the corresponding stream layer allocation, the former would still be incremented. However, this is not a real issue as GOAWAY specification is clear that lower IDs are not guaranteed to being handled well, until either the stream is closed or resetted, or the whole connection is teared down.	2026-04-17 11:36:01 +02:00
Amaury Denoyelle	143d0034c9	BUG/MINOR: mux_quic: limit avail_streams() to 2^62 QUIC streams ID are encoded as 62-bit integer and cannot reuse an ID within a connection. This is necessary to take into account this limitation for backend connections. This patch implements this via qmux_avail_streams() callback. In the case where the connection is approaching the encoding limit, reduce the advertised value until the limit is reached. Note that this is very unlikely to happen as the value is pretty high. This should be backported up to 3.3.	2026-04-17 11:36:01 +02:00
Aurelien DARRAGON	4945d02c99	MINOR: compression: prefix compression oriented functions with "comp_" add comp_ prefix to all compression related functions, in anticipation of decompression functions that will be integrated in the same file, so we don't get mixed up between the two. No change of behavior expected.	2026-04-17 08:26:56 +02:00
Willy Tarreau	a0541f5d21	BUG/MEDIUM: mux-h2: ignore conn->owner when deciding if a connection is dead Originally, valid backend connections always used to have conn->owner pointing to the owner session. In 1.9, commit 93c885 enforced this when implementing backend H2 support by making sure that no orphaned connection was left on its own with no remaining stream able to handle it. Later, idle connections were reworked so that they were no longer necessarily attached to a stream, but could be directly in the server, accessed via a hash, so it started to become possible to have conn->owner left to NULL when picking such a connection. It in fact happens for http-reuse always, when the second stream picks the connection because its owner is NULL and it's not changed. More recently, a case was identified where it could be theoretically possible to reinsert a dead connection into an idle list, and commit `59c599f3f0` ("BUG/MEDIUM: mux-h2: make sure not to move a dead connection to idle") addressed that possibility in 3.3 by adding the h2c_is_dead() test in h2_detach() before deciding to reinsert a connection into the idle list. Unfortunately, the combination of changes above results in the following sequence being possible: - a stream requires a connection, connect_server() creates one, sets conn->owner to the session, then when the session is being set up, the SSL stack calls conn_create_mux() which gets the session from conn->owner, passes it to mux->init() (h2_init), which in turn creates the backend stream and assigns it this session. - when the stream ends, it detaches (h2_detach), and the call to h2c_is_dead() returns false because h2c->conn->owner is set. The connection is thus added into the server's idle list. - a new stream comes, it finds the connection in the server's list, which doesn't require to set conn->owner, the stream is added via h2_attach() which passes the stream's session, and that one is properly set on h2s again, but never on conn->owner. - the stream finishes, detaches, and this time the call to h2c_is_dead() sees the owner is NULL, thus indicates that the connection seems dead so it's not added again to the idle list, and it's destroyed. Note that this most only happens at low loads (at most one active stream per connection, so typically at most than one active stream per thread), where the H2 reuse ratio on a server configured with http-reuse always or http-reuse aggressive is close to 50%. At high loads, this is much more rare, though looking at the reuse stats for a server, it's visible that a sustained load still shows around 1% of the connections being periodically renewed. Interestingly, for RHTTP the impact is more important because there was already a work around for this test in h2c_is_dead() but it uses conn_is_reverse(), which is never correct in this case (it should be called conn_to_reverse() because it says the conn must be reversed and has not yet been), so this extra test doesn't protect against the NULL check, and connections are closed after each stream is terminated (if there is no other stream left). After a long analysis with Amaury and Olivier, it was concluded that: - the h2c_is_dead() addition is finally not the best solution and could be refined, however in the current state it's a bit tricky. - the conn->owner test in h2c_is_dead() is no longer relevant, probably since 2.4 when connections were stored using hash_nodes in the servers and would no longer depend on a session, so that test should be removed. - the test conn_is_reverse() on the same line, that was added to ignore the former for RHTTP, and which doesn't properly work either should be removed as well. Some further cleanups should be performed to clarify this situation. This patch implements the points above, and it should be backported wherever commit `59c599f3f0` was backported.	2026-04-16 18:27:15 +02:00
Willy Tarreau	0af603f46f	MEDIUM: threads: change the default max-threads-per-group value to 16 A lot of our subsystems start to be shared by thread groups now (listeners, queues, stick-tables, stats, idle connections, LB algos). This has allowed to recover the performance that used to be out of reach on losely shared platforms (typically AMD EPYC systems), but in parallel other large unified systems (Xeon and large Arm in general) still suffer from the remaining contention when placing too many threads in a group. A first test running on a 64-core Neoverse-N1 processor with a single backend with one server and no LB algo specifiied shows 1.58 Mrps with 64 threads per group, and 1.71 Mrps with 16 threads per group. The difference is essentially spent updating stats counters everywhere. Another test is the connection:close mode, delivering 85 kcps with 64 threads per group, and 172 kcps (202%) with 16 threads per group. In this case it's mostly the more numerous listeners which improve the situation as the change is mostly in the kernel: max-threads-per-group 64: # perf top Samples: 244K of event 'cycles', 4000 Hz, Event count (approx.): 61065854708 los Overhead Shared Object Symbol 10.41% [kernel] [k] queued_spin_lock_slowpath 10.36% [kernel] [k] _raw_spin_unlock_irqrestore 2.54% [kernel] [k] _raw_spin_lock 2.24% [kernel] [k] handle_softirqs 1.49% haproxy [.] process_stream 1.22% [kernel] [k] _raw_spin_lock_bh # h1load time conns tot_conn tot_req tot_bytes err cps rps bps ttfb 1 1024 84560 83536 4761666 0 84k5 83k5 38M0 11.91m 2 1024 168736 167713 9559698 0 84k0 84k0 38M3 11.98m 3 1024 253865 252841 14412165 0 85k0 85k0 38M7 11.84m 4 1024 339143 338119 19272783 0 85k1 85k1 38M8 11.80m 5 1024 424204 423180 24121374 0 84k9 84k9 38M7 11.86m max-threads-per-group 16: # perf top Samples: 1M of event 'cycles', 4000 Hz, Event count (approx.): 375998622679 lost Overhead Shared Object Symbol 15.20% [kernel] [k] queued_spin_lock_slowpath 4.31% [kernel] [k] _raw_spin_unlock_irqrestore 3.33% [kernel] [k] handle_softirqs 2.54% [kernel] [k] _raw_spin_lock 1.46% haproxy [.] process_stream 1.12% [kernel] [k] _raw_spin_lock_bh # h1load time conns tot_conn tot_req tot_bytes err cps rps bps ttfb 1 1020 172230 171211 9759255 0 172k 171k 78M0 5.817m 2 1024 343482 342460 19520277 0 171k 171k 78M0 5.875m 3 1021 515947 514926 29350953 0 172k 172k 78M5 5.841m 4 1024 689972 688949 39270207 0 173k 173k 79M2 5.783m 5 1024 863904 862881 49184274 0 173k 173k 79M2 5.795m So let's change the default value to 16. It also happens to match what's used by default on EPYC systems these days. This change was marked MEDIUM as it will increase the number of listening sockets on some systems, to match their counter parts from other vendors, which is easier for capacity planning.	2026-04-16 10:48:43 +02:00
Willy Tarreau	d7c747b572	BUG/MINOR: threads: properly set the number of tgroups when non using policy When nbthread is set, the CPU policies are not used and do not set nbthread nor nbtgroups. When back into thread_detect_count(), these are set respectively to thr_max and 1. The problem which becomes very visible with max-threads-per-group, is that setting this one in combination with nbthreads results in only one group with the calculated number of threads per group. And there's not even a warning. So basically a configuration having: global nbthread 64 max-threads-per-group 8 would only start 8 threads. In this case, grp_min remains valid and should be used, so let's just change the assignment so that the number of groups is always correct. A few ifdefs had to move because the calculations were only made for the USE_CPU_AFFINITY case. Now these parts have been refined so that all the logic continues to apply even without USE_CPU_AFFINITY. One visible side effect is that setting nbthread above 64 will automatically create the associated number of groups even when USE_CPU_AFFINITY is not set. Previously it was silently changed to match the per-group limit. Ideally this should be backported to 3.2 where the issue was introduced, though it may change the behavior of configs that were silently being ignored (e.g. "nbthread 128"), so the backport should be considered with care. At least 3.3 should have it because it uses cpu-policy by default so it's only for failing cases that it would be involved.	2026-04-15 17:47:26 +02:00
William Lallemand	794737cc8d	CLEANUP: acme: no need to reset ctx state and http_state before nextreq The nextreq label already implement setting http_state to ACME_HTTP_REQ and setting ctx->state to st. It is only needed to set the st variable before jumping to nextreq.	2026-04-15 16:17:39 +02:00
William Lallemand	69211b869f	BUG/MINOR: acme: fix fallback state after failed initial DNS check When the opportunistic initial DNS check (ACME_INITIAL_RSLV_READY) fails, the state machine was incorrectly transitioning to ACME_RSLV_RETRY_DELAY instead of ACME_CLI_WAIT. This caused the challenge to enter the DNS retry loop rather than falling back to the normal cond_ready flow that waits for the CLI signal. Also reorder ACME_CLI_WAIT in the state enum and trace switch to reflect the actual execution order introduced in the previous commit: it comes after ACME_INITIAL_RSLV_READY, not before ACME_INITIAL_RSLV_TRIGGER. No backport needed.	2026-04-15 16:06:59 +02:00
William Lallemand	c295a5c861	MINOR: acme: opportunistic DNS check for dns-persist-01 to skip challenge-ready steps For dns-persist-01, the "_validation-persist.<domain>" TXT record is set once and never changes between renewals. Add an initial opportunistic DNS check (ACME_INITIAL_RSLV_TRIGGER / ACME_INITIAL_RSLV_READY states) that runs before the challenge-ready conditions are evaluated. If all domains already have the TXT record, the challenge is submitted immediately without going through the cli/delay/dns challenge-ready steps, making renewals faster once the record is in place. The new ACME_RDY_INITIAL_DNS flag is automatically set for dns-persist-01 in cond_ready.	2026-04-15 15:57:57 +02:00
Willy Tarreau	5fe0579d49	MEDIUM: threads: start threads by groups Till now, threads were all started one at a time from thread 1. This will soon cause us limitations once we want to reduce shared stuff between thread groups. Let's slightly change the startup sequence so that the first thread starts one initial thread for each group, and that each of these threads then starts all other threads from their group before switching to the final task. Since it requires an intermediary step, we need to store that threads' start function to access it from the group, so it was put into the tgroup_info which still has plenty of room available. It could also theoretically speed up the boot sequence, though in practice it doesn't change anything because each thead's initialization is made one at a time to avoid races during the early boot. However ther is now a function in charge of starting all extra threads of a group, and whih is called from this group.	2026-04-15 15:53:56 +02:00
Amaury Denoyelle	e2dbcd20f2	MINOR: mux-quic: close connection when reaching max-total streams This commit completes the previous one which implements a new setting to limit the number of streams usable by a client on a QUIC connection. When the connection becomes idle after reaching this limit, it is immediately closed. This is implemented by extending checks in qcc_is_dead(). This results in a CONNECTION_CLOSE emission, which is useful to free resources as soon as possible.	2026-04-15 15:18:37 +02:00
Amaury Denoyelle	497cabd9e5	MEDIUM: quic: implement fe.stream.max-total Implement a new setting to limit the total number of bidirectional streams that the client may use on a single connection. By default, it is set to 0 which means it is not limited at all. If a positive value is configured, the client can only open a fixed number of request streams per QUIC connection. Internally, this is implemented in two steps : * First, MAX_STREAMS_BIDI flow control advertizing will be reduced when approaching the limit before being completely turned off when reaching it. This guarantees that the client cannot exceed the limit without violating the flow control. * Second, when attaching the latest stream with ID matching max-total setting, connection graceful shutdown is initiated. In HTTP/3, this results in a GOAWAY emission. This allows the remaining streams to be completed before the connection becomes completely idle.	2026-04-15 15:18:37 +02:00
Amaury Denoyelle	a7e1c82648	MINOR: mux-quic: perform app init in case of early shutdown Adds a qcc_app_init() call in qcc_app_shutdown(). This is necessary if shutdown is performed early, before any invokation of qcc_io_send(). Currently, this should never occur in practice. However, this will become necessary with the new settings tune.quic.fe.stream.max-total. Indeed, when using a very small value, app-ops layer may be closed early in the connection lifetime.	2026-04-15 14:34:13 +02:00
Amaury Denoyelle	1038720675	MINOR: mux-quic: reorganize code for app init/shutdown Refactor code related to app-layer init/shutdown operations. In short, qcc_shutdown() is renamed to qcc_app_shutdown(). It is also moved next to qcc_app_init() to better reflect their link.	2026-04-15 14:34:13 +02:00
Amaury Denoyelle	6c837723bf	MINOR: mux-quic: improve documentation for qcs_attach_sc() Complete function doc for qcs_attach_sc() by using the proper terminology related to stream/stconn/sedesc. The purpose of this function should be clearer now.	2026-04-15 14:34:13 +02:00
Emeric Brun	8f06c8fda4	BUG/MEDIUM: peers: trash of expired entries delayed after fullresync stksess_new has set the entry expire to the table expire delay, if it is a new entry, set_entry inserts at that position in the expire tree. There was a touch_remote updating the expire setting but the tree's re-ordering is not designed to set back in the past resulting to an entry that will be trashed only after a full table's expire delay regardless the expire set on the stktsess. This patch sets the newts expire before the call of 'set_entry'. This way a new inserted entry is set directly at the right position in the tree to trash the entry in time. This patch should be backported on all supported branches and at least v2.8	2026-04-15 10:03:17 +02:00
Willy Tarreau	90e8ccd9c2	MINOR: sample: add new sample fetch functions reporting current CPU usage Some checks are pending Contrib / dev/flags/ (push) Waiting to run Details Contrib / dev/haring/ (push) Waiting to run Details Contrib / dev/hpack/ (push) Waiting to run Details Contrib / dev/poll/ (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details VTest / Alpine+musl, gcc (push) Waiting to run Details Windows / Windows, gcc, all features (push) Waiting to run Details Some features can automatically turn on or off depending on CPU usage, but it's not easy to measure it. Let's provide 3 new sample fetch functions reporting the CPU usage as measured inside haproxy during the previous polling loop, and reported in "idle" stats header / "show info", or used by tune.glitches.kill.cpu-usage, or maxcompcpuusage: - cpu_usage_thr: CPU usage between 0 and 100 of the current thread, used by functions above - cpu_usage_grp: CPU usage between 0 and 100, averaged over all threads of the same group as the current one. - cpu_usage_proc: CPU usage between 0 and 100, averaged over all threads of the current process Note that the value will fluctuate since it only covers a few tens to hundreds of requests of the last polling loop, but it reports what is being used to take decisions. It could also be used to disable some non-essential debugging/processing under too high loads for example.	2026-04-14 17:47:18 +02:00
Willy Tarreau	630ef96f92	MINOR: sample: return the number of the current thread group Just like we have a sample fetch function that returns the number of the current thread, let's have the same with the thread group number. This can be useful for troubleshooting, given that certain things are currently per thread-group (e.g. idle backend connections, certain LB algos etc).	2026-04-14 17:05:34 +02:00
Willy Tarreau	b943d2a7eb	CLEANUP: sample: fix the comment regarding the range of the thread sample fetch The comment says "between 1 and nbthread" while it's in fact between 0 and nbthread-1 and this is also documented like this in the config manual. No backport needed though it cannot hurt.	2026-04-14 16:59:56 +02:00
Willy Tarreau	9c6e07c43f	MINOR: stats: report the number of thread groups in "show info" Since thread groups were enabled by default in 3.3, it has become an important element of diagnostic that we're missing in "show info". Let's add it under "NbThreadGroups".	2026-04-14 16:48:16 +02:00
William Lallemand	f28dd158ed	MINOR: ssl: add TLS 1.2 values in HAPROXY_KEYLOG_XX_LOG_FMT Add the CLIENT_RANDOM line for TLS1.2 in HAPROXY_KEYLOG_FC_LOG_FMT and HAPROXY_KEY_LOG_BC_FMT. These are useful to produce a keylog file compatible with both TLS1.3 and TLS1.2.	2026-04-14 16:03:25 +02:00
Christopher Faulet	7270bfcff5	BUG/MEDIUM: htx: Don't count delta twice when block value is replaced A regression was introduced by the commit `a8887e55a` ("BUG/MEDIUM: htx: Fix function used to change part of a block value when defrag"). When a block value was replaced and a defragmentation was performed, the delta between the old value and the new one was counted twice. htx_defrag() already is responsible to set the new size for the HTX message. So it must not be performed in htx_replace_blk_value(). This patch must be backported with the commit above. So theorically to all stable versions.	2026-04-14 14:07:21 +02:00
Christopher Faulet	d899f23017	BUG/MEDIUM: htx: Properly handle block modification during defragmentation A regression was introcuded by the commit `0c6f2207f` ("MEDIUM: htx: Refactor htx defragmentation to merge data blocks"). When a defragmentation is performed, it is possible to alter a block size. The main usage is to prepare a block value replacement. However, since the commit above, the change is no longer handled. The block info are changed but the size of the message is not modified accordingly. This patch depends on the commit "MINOR: htx: Add helper function to get type and size from the block info field" No backport needed.	2026-04-14 14:07:21 +02:00
William Lallemand	3415abe56d	MINOR: mjson: reintroduce mjson_next() The lack of mjson_next() prevents to iterate easily and need to hack by iterating on a loop of snprintf + $.field[XXX] combined with mjson_find(). This reintroduce mjson_next() so we could iterate without having to build the string. The patch does not reintroduce MJSON_ENABLE_NEXT so it could be used without having to define it.	2026-04-14 10:57:21 +02:00
William Lallemand	cf72132f22	MINOR: acme: display the type of challenge in ACME_INITIAL_DELAY Some checks failed Contrib / build (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details alpine/musl / gcc (push) Has been cancelled Details The ACME_INITIAL_DELAY state displays a message about 'dns-01', but this state is also used for 'dns-persist-01'. This patch displays the challenge that was configured instead of dns-01	2026-04-14 10:16:11 +02:00
Tim Duesterhus	ed0c51d2c0	MINOR: http_fetch: Add support for checks to `unique-id` fetch Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details This allows to use the `unique-id` fetch within `tcp-check` or `http-check` ruleset. The format is taken from the checked server's backend (which is naturally inherited from the corresponding `defaults` section). This is particularly useful with http-check send ... hdr request-id %[unique-id] to ensure all requests sent by HAProxy have a unique ID header attached. This resolves GitHub Issue #3307. Reviewed-by: Volker Dusch <github@wallbash.com>	2026-04-13 20:02:21 +02:00
Tim Duesterhus	2c748125f5	MINOR: check: Support generating a `unique_id` for checks This implementation is directly modeled after `stream_generate_unique_id()` and the corresponding `unique_id` field on `struct stream`. It will be used in a future commit to enable the use of the `%[unique-id]` fetch in check rules.	2026-04-13 20:01:42 +02:00
Tim Duesterhus	7ff2627112	CLEANUP: log: Stop touching `struct stream` internals for `%ID` Use the return value of `stream_generate_unique_id()` instead of relying on the `unique_id` field of `struct stream` when handling the `%ID` log placeholder. This also allowed to unify the "stream available" and "stream not available" paths. Reviewed-by: Volker Dusch <github@wallbash.com>	2026-04-13 20:01:42 +02:00
Tim Duesterhus	38796d4c06	MINOR: Allow inlining of `stream_generate_unique_id()` With the introduction of the `generate_unique_id()` helper, the actual complicated logic is sitting in a different file. Allow inlining of `stream_generate_unique_id()`, so that callers can benefit from an abstraction without hiding away the access of `strm->unique_id` behind a function call.	2026-04-13 20:01:42 +02:00
Tim Duesterhus	73040e3a8e	MINOR: Add `generate_unique_id()` helper This new function will handle the actual generation of the unique ID according to a format. The caller is responsible to check that no unique ID is stored yet.	2026-04-13 20:01:02 +02:00
Tim Duesterhus	4cf06a7d23	CLEANUP: Make `lf_expr` parameter of `sess_build_logline_orig()` const Since this is safely possible without making any changes, we can provide this hint to the compiler.	2026-04-13 19:59:12 +02:00
Willy Tarreau	9a5db56a36	BUG/MINOR: haterm: don't apply the default pipe size margin twice Commit `6d16b11022` ("BUG/MINOR: haterm: preserve the pipe size margin for splicing") solved the issue of pipe size being sufficient for the vmsplice() call, but as Christopher pointed out, the ratio was applied to the default size of 64k, so now it's applied twice, giving 100k instead of 80k. Let's drop it from there. No backport needed.	2026-04-13 19:38:48 +02:00
Egor Shestakov	79c54d28b0	BUG/MINOR: acme: don't pass NULL into format string Printing a "(null)" when NULL passed with the %s format specifier is a GNU extension, so it must be avoided for portability reasons. Must be backported as far as 3.2	2026-04-13 18:56:13 +02:00
William Lallemand	53679fe5f6	BUG/MINOR: acme: read the wildcard flag from the authorization response Some checks are pending Contrib / build (push) Waiting to run Details alpine/musl / gcc (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details The wildcard field was declared and used when building the dns-persist-01 TXT record value (policy=wildcard suffix), but was never populated from the server's authorization response. Add the missing mjson_get_bool() call to read $.wildcard before saving auth->dns.	2026-04-13 18:49:53 +02:00

1 2 3 4 5 ...

21080 commits