haproxy

mirror of https://github.com/haproxy/haproxy.git synced 2026-07-13 11:09:54 -04:00

Author	SHA1	Message	Date
William Lallemand	dfb6daca1f	BUG/MEDIUM: auth: fix unconfigured password NULL deref Fix a case of dereference NULL pointer when trying to use an user from an userlist which does not have a password configured. The check_user() function tries to do an strcmp of the password, howver u->pass is NULL and the strcmp would crash when trying. Must be backported in every stable branches.	2026-05-26 14:13:23 +02:00
Amaury Denoyelle	e4a5a64198	BUG/MINOR: h3: reject server MAX_PUSH_ID frame Previously, MAX_PUSH_ID frames were silently ignored both on client and server sides. However, such frame cannot be emitted by the server. This patch fixes this by properly issuing connection error FRAME_UNEXPECTED when receiving a MAX_PUSH_ID frame as a client. This is implemented by extending h3_check_frame_valid(). This must be backported up to 3.3.	2026-05-26 13:52:04 +02:00
Amaury Denoyelle	4a8bb2fe5f	BUG/MINOR: h3: adjust error on PUSH_PROMISE frame reception HTTP/3 PUSH_PROMISE frames are systematically rejected with H3 error FRAME_UNEXPECTED. This is adapted on the server side as a client can never emit them. This patch adapts error reporting when haproxy runs as a client. In this case, server is still forbidden to emit any PUSH_PROMISE as MAX_PUSH_ID frames are never emitted. In this case, ID_ERROR must be used as an error code. This must be backported up to 3.3.	2026-05-26 13:52:03 +02:00
Amaury Denoyelle	d8460a5339	BUG/MINOR: h3: reject client CANCEL_PUSH frame CANCEL_PUSH frames are silently ignored on both client and server sides. However, as push support is not implemented by haproxy, clients are thus forbidden to emit any of those frames. Fix this by closing the connection with ID_ERROR when receiving a client CANCEL_PUSH as a server. On client side, the frame is still silently discarded. This must be backported up to 2.6.	2026-05-26 13:52:03 +02:00
Amaury Denoyelle	8e77620616	BUG/MINOR: h3: reject server push stream Push streams are not supported by haproxy as a client. Thus, it never emits any MAX_PUSH_ID frame. In this case, the server is not allowed to initiate any push stream. This patch ensures that such stream is closed with error H3_ID_ERROR, as specified by HTTP/3 RFC. This must be backported up to 3.3.	2026-05-26 13:52:03 +02:00
Amaury Denoyelle	433cce7af1	BUG/MEDIUM: h3: reject client push stream HTTP/3 push streams can only be opened by a server instance. The specification mandates that the connection must be closed if a server receives a client-initiated push stream. This patch should ensure that it is not possible to exploit unidirectional streams for an unexpected usage. This must be backported up to 2.6.	2026-05-26 13:52:03 +02:00
Willy Tarreau	4a9ec66fd8	MEDIUM: tools: switch the main PRNG to a thread-local xoshiro256 The current PRNG is xoroshiro128, it was introduced in 2.2 with commit `52bf83939` ("BUG/MEDIUM: random: implement a thread-safe and process-safe PRNG"). It features a 2^128 sequence and can perform 2^64 or 2^96 jumps, though only the 2^96 jump is implemented. It was initially designed to support both processes and threads, and implements a shared state between threads instead of allocating distinct sequences based on PID and thread numbers. Since then, the PRNG's usage grew and processes have disappeared, but the lock or the DWCAS are still there due to its shared nature, and it's possible to trigger watchdog warnings by issuing 100 UUIDs in a single log-format string. Also, UUID and QUIC retry tokens now consume 128 bits from the PRNG in two 64-bit calls, and used to weaken the PRNG by rapidly disclosing its internal state on reasonably idle systems. This indicates that most of the time we now need 128 bits. This patch modernizes the internal generator by switching to xoshiro256**, which has comparable properties (it's even faster), and features even longer 2^256 periods, still returning 64 bits per call. It can be initialized with 2^128 and 2^192 jumps. More details here: https://prng.di.unimi.it/ https://prng.di.unimi.it/xoshiro256starstar.c Here we implement a thread-local state instead of the old shared one, so there is no more need for synchronization. The state is seeded at boot, and each thread performs as many 2^192 jumps as their TID is large. The master process performs a 2^128 jump where it used to perform a 2^96 jump so that it doesn't overlap with any worker thread. However a cleaner approach could be to perform a 2^128 jump for each fork() (here the worker) and 2^192 for each thread. This might be for a future improvement. ha_random64_internal() is now the new PRNG, so that everything else remains totally transparent. _ha_random64_pair_hashed() continues to hash the first 128 bits of the state. A simple config generating 100 UUID on 20 threads jumps from 135k to 1.25M req/s, which translates to a bump from 13.5M to 125M UUID/s, or 9 times faster. And there is no more DWCAS can be seen anymore in perf top: Before: 13.5M/s Overhead Shared Object Symbol 99.04% haproxy [.] ha_random64_internal 0.66% haproxy [.] _ha_random64_pair_hashed 0.03% libc-2.42.so [.] __printf_buffer 0.02% [kernel] [k] _raw_spin_lock 0.01% libc-2.42.so [.] __strchrnul_avx2 0.01% [kernel] [k] ktime_get 0.01% [kernel] [k] lapic_next_deadline 0.01% haproxy [.] sample_process 0.01% haproxy [.] chunk_printf 0.01% libc-2.42.so [.] __printf_buffer_write 0.01% [kernel] [k] hrtimer_active 0.01% libc-2.42.so [.] __memmove_avx_unaligned_erms 0.01% libc-2.42.so [.] _itoa_word After: 125M/s 18.84% libc-2.42.so [.] __printf_buffer 9.84% haproxy [.] sample_process 8.33% libc-2.42.so [.] __strchrnul_avx2 6.61% libc-2.42.so [.] __memmove_avx_unaligned_erms 6.06% libc-2.42.so [.] __printf_buffer_write 4.43% haproxy [.] strlcpy2 4.09% libc-2.42.so [.] _itoa_word 2.62% haproxy [.] sess_build_logline_orig 2.12% haproxy [.] _ha_random64_pair_hashed 1.28% haproxy [.] pool_put_to_cache 1.06% haproxy [.] __pool_alloc 1.00% haproxy [.] smp_fetch_uuid 0.93% haproxy [.] lf_text_len 0.82% haproxy [.] ha_generate_uuid_v4	2026-05-26 13:13:24 +02:00
Willy Tarreau	73b5f0eed4	MEDIUM: quic: use ha_random64_pair_hashed() to generate the QUIC retry tokens The QUIC retry tokens used to directly return ha_random64(), making the next tokens easily predictable on low-load systems before the XXH64 call. Let's now switch to the faster and safer ha_random64_pair_hashed() instead.	2026-05-26 13:13:24 +02:00
Willy Tarreau	7ac4d7d69f	MEDIUM: h1: use ha_random64_pair_hashed() for the WebSocket key Instead of using two consecutive calls to ha_random64(), let's use the cleaner and safer ha_random64_pair_hashed(). This way the internal PRNG state will not leak into the emitted headers.	2026-05-26 13:13:24 +02:00
Willy Tarreau	85003563c5	MEDIUM: tools: use the hashed random pair for UUID generation The UUID generation used to emit the internal PRNG state, which allows to predict previous and next ones, or disclose the internal PRNG state. While not critical, it may eventually become an issue. This patch uses the new ha_random64_pair_hashed() function that returns a pair of u64 that are hashed from the internal PRNG state. It's almost twice as fast on 20 threads (14.1M UUID/s vs 7.8M/s).	2026-05-26 13:13:24 +02:00
Willy Tarreau	f932863484	MEDIUM: init: fall back to ha_random64_pair_hashed() for the cluster secret The cluster secret, when SSL is not working, used to involve a mix of calls to ha_random64() and random() to mask the bits that we didn't want to see leaked. Let's now simply fall back to ha_random64_pair_hashed() that does a much better job.	2026-05-26 13:13:24 +02:00
Willy Tarreau	26c3b3f41d	MINOR: tools: provide a function to generate a hashed random pair A lot of places call two ha_random64() in a row to generate a 128-bit random. While it's now safe against linear analysis thanks to the XXH64 call, it's still particularly expensive due to the lock. Here we introduce a new function ha_random64_pair_hashed(), that feeds two uint64_t with a hash of the PRNG's internal state, and make it advance. This will cut in half the number of calls to ha_random64() and should recover a part of the performance lost in the lock. For now it's not used.	2026-05-26 13:13:24 +02:00
Willy Tarreau	9b6389c8a0	BUG/MEDIUM: tools: insert an XXH64 layer on the PRNG output Consuming randoms in pairs directly exposes the internal PRNG's state on moderately idle system. It can allow to predict next (or previous) UUIDs, QUIC retry tokens, and WS keys for example. Let's insert an XXH64 call on the ha_random64() output to avoid this. We expand the boot seed as the secret at boot, and use now_ns as the seed for each call. The original ha_random64() function was renamed to ha_random64_internal() for use cases where it's not a problem to directly use the internal state. The performance loss is only measurable when single-threaded. It drops from 7.32M UUID per second to 7.16M. Above that there is no longer any difference due to the DWCAS loop which reaches up to 98.5% CPU at 20 threads. This will need to be backported to stable releases after a period of observation.	2026-05-26 13:13:24 +02:00
Willy Tarreau	2a47cab7f3	BUG/MINOR: resolvers: switch to a better PRNG for query IDs The PRNG used by the DNS currently is easily predictable once an observer can collect a few consecutive IDs from the same thread, since it's a 32-bit xorshift reduced to 16 bits output. Let's switch it to ha_random32() instead. This should be backported, however on older releases the ha_random32() cost is higher due to the lock involved.	2026-05-26 13:13:24 +02:00
Willy Tarreau	c41c731f5e	BUG/MINOR: ssl-hello: make use of the null-terminated servername In ssl_sock_switchctx_cbk(), the servername is copied into the trash and null-terminated, but later in the call to strncpy() it's still used as-is, so anything that follows it will be copied as well, which is not really expected. Let's make the servername point to the trash after sanitizing it, like ssl_sock_switchcbk_wolfSSL_cbk() does. This can be backported to 2.6 since it was introduced with commit `a996763619` ("BUG/MINOR: ssl: Store client SNI in SSL context in case of ClientHello error").	2026-05-26 13:13:24 +02:00
Willy Tarreau	2653936510	BUG/MINOR: payload: fix the handshake length bounds check smp_client_hello_parse() After reading the handshake length, which is covered by the previous 4 bytes check, the size was not subtracted before being compared to the retrieved handshake length, making it possible to accept a handshake that claims to be 4 bytes larger than it really is. Similarly, a few lines later, data[34] is accessed without checking that it is present, because the test is made on the second hs_len, which doesn't guarantee that the data are there. This fix adds both tests. It can be backported to all stable versions as it was introduced in 1.6 with commit `bb2acf589f` ("MINOR: payload: add support for tls session ticket ext").	2026-05-26 13:13:24 +02:00
Willy Tarreau	997c99df9c	BUG/MINOR: base64: return empty string for empty input in base64dec() Right now no special case is made of size zero and the parser assumes that it can read the last two chars, which do not exist in this case. Let's check for this empty string situation and return zero (empty) as well. This should be backported to all versions.	2026-05-26 13:13:24 +02:00
Willy Tarreau	076655e18d	BUG/MINOR: http-ext: always check remaining data when reading rfc7239 nodeport http_7239_extract_nodeport() reads the first byte of the passed string but the caller doesn't check that it's not empty, which can happen if passed as 'host="127.0.0.1:"'. In that case the function would read and return garbage that is present in the buffer after the colon. Let's just check the remaining length before reading. This can be backported to 2.8 as it was introduced with commit `b2bb9257d2` ("MINOR: proxy/http_ext: introduce proxy forwarded option").	2026-05-26 13:13:24 +02:00
Willy Tarreau	8cb0a0c53d	BUG/MEDIUM: acme: protect against risk of null-deref on connection failure 7 ACME state handlers iterate over hc->res.hdrs, but they can be called after an error was detected, and the HTTP client will leave res.hdrs NULL on connection errors before headers are received. Let's check this inside the loop, like the chkorder handler already does. Most of them, if not all, need to be backported to 3.2.	2026-05-26 13:13:24 +02:00
Willy Tarreau	e583b38c63	BUG/MINOR: http-fetch: check against the whole token in get_http_auth() In 1.4, Basic authentication support was added by commit `f9423ae43a` ("[MINOR] acl: add http_auth and http_auth_group"). Interestingly, a mistake there consisted in taking the length of the comparison from the input token, so "b" matches "Basic". It was later propagated to Bearer in 2.5 with commit `f5dd337b12` ("MINOR: http: Add http_auth_bearer sample fetch"). Let's just compare the entire tokens. This may be backported though it is very minor.	2026-05-26 13:13:24 +02:00
Willy Tarreau	ffdc91c4a1	BUG/MINOR: sample: request an extra output byte for the url_dec converter A dynamic chunk size is now being allocated for output since commit `dfc4085413` ("MEDIUM: sample: Get chunks with a size dependent on input data when necessary"). However this one missed the need for the trailing zero when specifying the size, let's add it. No backport is needed, this is only in 3.4.	2026-05-26 13:13:24 +02:00
Willy Tarreau	4f58fef3d4	BUG/MINOR: resolvers: relax size checks in authority record parsing Both boundary checks in the authority record parsing loop of resolv_validate_dns_response() use >= bufend where they should use > bufend, causing valid DNS responses with exactly enough bytes to be rejected as invalid. The first one, "reader + offset + 10 >= bufend" is too strict since it prevents 10-byte responses from being accepted as valid while they are. The second one, "reader + len >= bufend" has the same issue, when exactly len bytes remain, the check rejects it even though dns_max_name() already validated it. It may be backported though it is unlikely to ever be noticed.	2026-05-26 13:13:24 +02:00
Willy Tarreau	73472025f2	BUG/MINOR: cache: also recognize directives in the form "token=" The caching RFC (9111, but was present since 2616) indicate that cache-control supports both the "token" and "token=..." forms and that consumers are supposed to recognize both. In addition, "private=..." is explicitly mentioned, so servers could very well emit it. However, haproxy only recognizes the short form without argument, except for "no-cache" where it also supports it followed by the beginning of a set-cookie argument. Thus it could miss "private=" or "no-store=". Let's refine the checks. Now we explicitly recognize the form no-cache="set-cookie", and all variants of "token" or "token=" as identical to disable caching. It will more reliably catch such edge cases and make sure we never cache a response marked like this. This should be backported, at least to the latest LTS (3.2), maybe further after some observation.	2026-05-26 13:13:24 +02:00
Willy Tarreau	5cb932826d	BUG/MEDIUM: cache: always verify the primary hash in get_secondary_entry() When checking for secondary entries, the tree is walked within duplicates of the primary key, only indexed on the first 32 bits, which means that in case of hash collision, we could start looking for an object and switch to another one while visiting secondaries. In order to avoid this we simply need to always check the full primary hash of the entry that was found. This should be backported to all stable versions.	2026-05-26 13:13:24 +02:00
Willy Tarreau	8bdcc55163	BUG/MEDIUM: h1: limit status codes to 3 digits by default By default, HTTP/1 status codes are not limited in the parser. However, the value is stored in a 16-bit field, meaning that it may be truncated if too large. Let's just restrict to 3-digits by default, and permit to relax the check when accept-unsafe-violations is set, provided that the value still fits in 16 bits. This could be backported to latest LTS release.	2026-05-26 13:13:24 +02:00
Willy Tarreau	b9aaf3c18a	BUG/MEDIUM: h1: drop headers whose names contain invalid chars Originally with "option accept-invalid-http-request", we couldn't really edit the request on the fly to remove offending headers. But since we have HTX and the headers are indexed one at a time, it has become trivial. A non-negligible number of violations are conditioned by the now renamed "option accept-unsafe-violations-in-http-request", and a controversial one could definitely be reporting and passing invalid header names containing control chars or spaces. The option was placed so as not to block requests/responses containing them, but there's no point in passing them to the other side. Most of the time it will be totally harmless since the other side will reject them. But in case haproxy is placed in front of a non-compliant server, it would fail to protect it. This patch implements a name check for all headers when a parsing error was detected. It's cheap enough (especially since only done after an error), and will skip the header if its name is invalid. This may also remove some possibilities of confusion in logs, or when encoding headers names for example. This should be backported at least till the latest LTS.	2026-05-26 13:13:18 +02:00
Willy Tarreau	635652c5aa	MINOR: haterm: do not emit a warning when not using SSL Latest commit `04811943b5` ("MINOR: haterm: enable h3 for TCP bindings") produces a warning when SSL is not enabled due to the addition of expose-experimental-directives. Let's condition it to the use of SSL.	2026-05-26 13:11:35 +02:00
Frederic Lecaille	04811943b5	MINOR: haterm: enable h3 for TCP bindings Add "h3" as ALPN identifier to be supported by TCP "bind" lines. So, QMUX is transparently enabled for such bindings.	2026-05-26 10:56:18 +02:00
Willy Tarreau	e8c9aabd62	BUG/MINOR: haterm: fix the random suffix multiplication Some checks are pending Contrib / admin/halog/ (push) Waiting to run Details Contrib / dev/flags/ (push) Waiting to run Details Contrib / dev/haring/ (push) Waiting to run Details Contrib / dev/hpack/ (push) Waiting to run Details Contrib / dev/poll/ (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details Passing a size or anything with suffix "r" is supposed to apply a random factor form 0 to 1. However due to the replacement of random() with ha_random64(), all 64 bits are random before the divide, so the end result is a random 32-bit value. In addition, ha_random64() is slow since shared between threads. Let's use statistical_prng() which is designed for this purpose and is much cheaper. No backport is needed, this is only in 3.4.	2026-05-25 20:49:22 +02:00
Willy Tarreau	32fc35ef09	CLEANUP: resolvers: fix comment typos and wrong filenames in file headers Some checks are pending Contrib / admin/halog/ (push) Waiting to run Details Contrib / dev/flags/ (push) Waiting to run Details Contrib / dev/haring/ (push) Waiting to run Details Contrib / dev/hpack/ (push) Waiting to run Details Contrib / dev/poll/ (push) Waiting to run Details VTest / Generate Build Matrix (push) Waiting to run Details VTest / (push) Blocked by required conditions Details Windows / Windows, gcc, all features (push) Waiting to run Details A few asorted comment fixes for resolvers (incorrect file name etc).	2026-05-25 10:57:14 +02:00
Willy Tarreau	6bb8cb51e6	CLEANUP: resolvers: remove pool_free(NULL) in SRV additional record matching In resolv_validate_dns_response(), when matching an additional A/AAAA record to an SRV record, the code checked tmp_record->ar_item == NULL then called pool_free(resolv_answer_item_pool, tmp_record->ar_item). This is a copy-paste mistake from similar patterns elsewhere since the pointer is confirmed to be NULL a few lines above, so let's just drop the confusing pool_free.	2026-05-25 10:57:14 +02:00
Willy Tarreau	8fe8d5fbe3	CLEANUP: resolvers: use read_n32() instead of open-coded big-endian read In resolv_validate_dns_response(), the second DNS record parsing path manually constructs a 32-bit big-endian TTL value from four individual bytes using the expression: reader[0] * 16777216 + reader[1] * 65536 + reader[2] * 256 + reader[3] We have read_n32() to do this, and it's more robust against unexpected signedness surprises (which should not happen right here since reader is unsigned char and we use -fwrapv so the result is defined). Also, let's make the ttl an uint instead of an int. The TTL is only retrieved and not used for now, so better clean it now.	2026-05-25 10:57:13 +02:00
Willy Tarreau	b78b023d55	BUG/MINOR: sample: limit the be2hex converter's chunk size In 2.5, commit `da0264a96` ("MINOR: sample: Add be2hex converter") introduced the be2hex() converter, which reads input data of a given chunk size, processes it as a big endian block and turns it to hex output. There's an issue if the configured chunk_size (2nd argument) is larger than tune.bufsize/2, because the max_size calculation will underflow, and the later loop will always match since it compares a size_t to an int (BTW, compilers love to annoy us with useless warnings but I never found how to see some for these ones). This can result in overflowing the output trash if the input sample is at least as large as half a buffer. Let's add an explicit check for this, and change the max_size type to size_t so that the comparison is always right. While we're at it, let's ask the trash buffer to be twice as large, just like bin2hex() does, as it may result in offering a larger buffer in 3.4. thanks to the large buffers support. Despite the risk, this is marked as minor because a config with that large an argument in the converter makes absolutely no sense. This should be backported to 2.6. The *2 for the trash allocation will conflict and have to be dropped in stable versions, which is safe.	2026-05-25 10:57:13 +02:00
Willy Tarreau	7d182a2ed5	BUG/MINOR: init: use more than ha_random64() for the cluster secret When not set, the cluster secret is randomly generated by two consecutive calls to ha_random64(). However, the random64 PRNG may be partially observed on a fully idle machine (QUIC retry tokens, UUID, WS key), and it could be rolled back to the initial call that produced the secret. This is purely theoretical as a normally loaded system wouldn't reveal meaningful sequences, but better address this while it's still easy. The first here consists in isolating the cluster_secret from the PRNG sequence. When RAND_bytes() is available and works, it's used. Otherwise ha_random64() is mixed with uncorrelated bits from random(). This could be backported to stable releases.	2026-05-25 10:52:42 +02:00
Willy Tarreau	c0e302fe79	BUG/MINOR: dict: fix refcount race on insert collision In dict_insert(), when ebis_insert() returns an existing node n indicating that another thread inserted the same key concurrently, the code freed its own newly-allocated entry and returned the winner without bumping its refcount. Both callers then held a reference with refcount=1 instead of 2, so when one expires the other becomes a use-after-free or double-free. The bug likely comes from the fact that new_dict_entry() creates an entry with a refcount preset to 1 (saves an atomic op) and that because of this there is no refcount increment upon a successful insertion in the tree, resulting in requiring different code paths for collision and normal insertion. A simple fix consists in bumping the refcount under the lock and unlocking only at the end, but this would mean performing two free() calls under a lock, which we always try to avoid. The code was slightly rearranged so that we can now bump the existing entry's refcount under the lock in case of duplicate, or unlock immediately in the common case, so that the free() call is done out of the lock. The probably of the race is very low (at peers connection setup only), reason why it's marked low. This should be backported to all versions.	2026-05-25 10:52:42 +02:00
Willy Tarreau	478e7e52cb	BUG/MINOR: log: look for the end of priority before the end of the buffer In parse_log_message(), the first loop looks for '>' that finishes the priority field, and unfortunately it stops once it has checked the first byte after the end of the buffer. This means that a priority made only of digits for the whole buffer would read one extra byte. In practice since pools have a tag at the end this is only detectable when using ASAN, but this should be fixed nevertheless. This can be backported to all versions. It's worth noting that RFC5424 now says that the PRI field is 1..3 digits only, so maybe at some point we could seriously limit the length as well.	2026-05-25 10:52:42 +02:00
Willy Tarreau	8e1d33a648	BUG/MINOR: mux-h2: validate HEADERS frame length before reading stream dep When the PRIORITY flag is present on a HEADERS frame, the frame must contain a stream dependency and a weight, for a total of 5 bytes. The length is checked after reading the stream dep field so theoretically such a frame could cause up to 4-byte OOB read at the end of the buffer, though in practice buffers allocated from pools never end on a page boundary (one extra word at the end) and the anomaly is still detected after reading the stream ID and the connection aborted with the glitch count incremented. Thus while not technically correct, practically speaking it's harmless. This should be backported to all stable releases.	2026-05-25 10:52:42 +02:00
Willy Tarreau	49d6306de3	BUG/MINOR: resolvers: fix risk of appending garbage past the domain name The previous fix `75f72c2eb` ("BUG/MEDIUM: resolvers: Fix test on dn label size in resolv_dn_label_to_str()") may still leave garbage from the input buffer into the response: if a component length is passed as zero, it should mark the end, but instead a dot will be emitted, and whatever follows it in the input buffer would continue to be appended as extra components. While having no direct consequences beyond the domain not being properly decoded, it could at least complicate troubleshooting. This should be backported where the fix above is backported.	2026-05-25 10:52:42 +02:00
Willy Tarreau	01ebb668a4	BUG/MINOR: resolvers: fix room for trailing zero in resolv_dn_label_to_str() The previous fix `75f72c2eb` ("BUG/MEDIUM: resolvers: Fix test on dn label size in resolv_dn_label_to_str()") can still be fooled by an input exactly the size of str_len, in which case the trailing zero appended at the end was not being accounted for. Let's add 1 to the condition to prepare for it. This needs to be backported wherever the fix above is backported.	2026-05-25 10:52:42 +02:00
Willy Tarreau	340cc86efb	BUG/MINOR: log: free logformat expr on compile failure in cfg_parse_log_profile When lf_expr_compile() fails in cfg_parse_log_profile, the code leaves without freeing the previously strdup()'d strings in target_lf->str and target_lf->conf.file. Let's add a call to lf_expr_deinit() there to release it. It was harmless anyway since the startup will abort when this happens, but better clean it because with increasingly dynamic setups, one day it could become a runtime leak. No backport is needed.	2026-05-25 10:52:42 +02:00
Willy Tarreau	f62d020140	BUG/MEDIUM: cache: fix a refcount leak for missed secondary entries When a primary cache hit has a Vary secondary_key_signature, the code calls retain_entry() and shctx_row_detach() before performing the secondary lookup. If get_secondary_entry() returns NULL (no stored variant matches), res is set to NULL and the function falls through to return ACT_RET_CONT without calling release_entry() or shctx_row_reattach(). Each such request leaks one refcount and pins one shctx row permanently, eventually exhausting the cache if this happens to all objects. This is visible when requesting a secondary key covered by vary for an object that is already stored without that key. "show cache" then shows the object's refcount increasing after each request. In order to fix this we must do like when no secondary key could be built and release everything. We only reattach to the row if we previously detached. The issue was introduced in 2.4 with commit `1785f3dd9` ("MEDIUM: cache: Add the Vary header support"). The code changed a bit in 2.9 with commit `48f81ec09` ("MAJOR: cache: Delay cache entry delete in reserve_hot function"), so in order to backport to 2.8 and older, the patch will have to be manually applied (no test on detached).	2026-05-25 10:52:42 +02:00
Willy Tarreau	bbef74fb21	BUG/MEDIUM: tcpcheck/spoe: bound the SPOP error code to valid values tcpcheck_spop_expect_hello() stores the SPOA agent-supplied status-code varint directly into check->code (signed short) without range validation. The code is later used as an index into spop_err_reasons[100]. Let's just replace invalid status codes with SPOP_ERR_UNKNOWN to avoid any problem. The SPOP tcp-check was introduced in 3.1 so this fix must be backported to 3.2.	2026-05-25 10:16:06 +02:00
Willy Tarreau	608951844e	BUG/MEDIUM: regex: allocate a large enough pcre2 match for all matches In 3.3 with commit `fda6dc959` ("MINOR: regex: use a thread-local match pointer for pcre2") we got a thread-local match that saves us from having to allocate a match array with each match. However something was clearly overlooked or misunderstood in the pcre2 API because the local match array was initialized via pcre2_match_data_create() for MAX_MATCH-1 entries instead of MAX_MATCH, despite the commit message mentioning MAX_MATCH entries. It was possibly confused with an index. Due to this there is a risk of crash when matching more than 9 groups in a regex. This fix must be backported to 3.3.	2026-05-25 10:16:06 +02:00
Willy Tarreau	f9088a5d75	BUG/MEDIUM: log-forward: make sure the month is unsigned In 2.3, in preparation for log forwarding, commit `546488559` ("MEDIUM: log/sink: re-work and merge of build message API.") extended the log send API to be able to use metadata from an existing header. However the month number is parsed from the passed meta-data and compared against 11 but there's no check for negative values which could in theory cause a negative monthname[] index. It can be a problem when the date is received as RFC5424 and forced to RFC3164 because certain characters in the month field could result in a negative month value. Let's fix it by turning the month to unsigned to make sure we only accept months 0..11. This should be backported to all branches.	2026-05-25 10:16:06 +02:00
CyberpsychoJacob	4db85fc53e	BUG/MEDIUM: acme: NUL terminate response buffer before PEM parsing acme_res_certificate() passes the httpclient response buffer to ssl_sock_load_pem_into_ckch(), which will then call BIO_new_mem_buf(buf, -1). The "-1" flag will make the OpenSSL PEM parser determine the length by using strlen(). However, the httpclient populates the response buffer with __b_putblk() without writing a trailing NUL to it. The byte at area[data] is whatever data previously resided there in the memory pool. Thus, a malicious or compromised ACME CA can perform an arbitrary-length out-of-bounds read until hitting the first NULL byte past the response body. The OpenSSL PEM loader will try to iterate to load the chain certificates, thus the PEM-looking garbage found in freed memory chunks can be erroneously loaded as additional intermediate certificates. The presence of a single NUL inside the valid response body will result in silent truncation of the certificate. Make sure that the area[data] contains a terminating NULL before passing the buffer to the parser. Fail on insufficient room for the NUL terminator. No backport required: The ACME client has been added in 3.x and this code path didn't exist in 2.x.	2026-05-23 18:09:59 +02:00
Christopher Faulet	41bb1c24f6	BUG/MEDIUM: cli: Fix parsing of pattern finishing a command payload Some checks failed Contrib / admin/halog/ (push) Has been cancelled Details Contrib / dev/flags/ (push) Has been cancelled Details Contrib / dev/haring/ (push) Has been cancelled Details Contrib / dev/hpack/ (push) Has been cancelled Details Contrib / dev/poll/ (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details When the dedidacted buffer to store the command payload was added (`c5ae0da62` "MEDIUM: cli: Make a buffer for the command payload"), an bug was introduced. When the pattern finishing the command payload is found, it is removed from the buffer. A NULL-bytes is added before it, skipping the previous newline character. It worked well in all cases before the commit above, because the commandline was already parsed and was placed at the beginning of the cmdline buffer. So, there is always a line before the payload. Now, the payload is stored in a dedicated buffer. So there is nothing preceeding it in a buffer. If the payload is empty, we cannot rewind to the previous line to set the NULL-byte character. We must handle this case to avoid integer underflow on the payload buffer length. It is a 3.4-specific bug. No backport needed.	2026-05-22 17:17:01 +02:00
Christopher Faulet	9091cfa617	BUG/MEDIUM: hlua: Fix integer underflow when receiving line from lua cosocket In hlua_socket_receive_yield(), when we try to get a line, the trailing CRLF is stripped by decrementing the block length. The '\n' is first skipped, then, possible a preceeding '\r'. But the block lenght is never checked. If an empty line is returned, this leads to an integer underflow and most probably to a crash because this length is used to copy data into a LUA string. To fix the issue, the block length is now properly tested against 0 before decrementing it. This patch must be backported to all stable versions.	2026-05-22 17:17:01 +02:00
Christopher Faulet	57b526e022	BUG/MINOR: tcpchecks: Limit parsing of agent-check reply to the buffer When parsing the agent-check reply, we first loop on the response to find the newline character, to add a NULL-byte at the end of the line. However, this loop is not bounded to the data available in the buffer. So it is possible to read bytes outside the buffer and eventually write a NULL-byte ouside the buffer. So let's check for the end of the buffer when looping on the agent-check reply. This patch must be backported to all stable versions.	2026-05-22 17:17:01 +02:00
Christopher Faulet	2644f9ddf9	BUG/MEDIUM: dict: hold lock while decrementing refcount in dict_entry_unref In dict_entry_unref(), the write lock on d->rwlock was only acquired after decrementing the refcount. However, between the decrement and the lock, another thread could increment it by calling dict_insert(). That could lead to a UAF. To fix the issue, the call to HA_ATOMIC_SUB_FETCH is moved inside the write lock. This patch must be backported to all stable versions.	2026-05-22 17:17:01 +02:00
Amaury Denoyelle	7cab3a3c3a	BUG/MINOR: quic: fix ODCID lookup from derived value Some checks failed Contrib / admin/halog/ (push) Has been cancelled Details Contrib / dev/flags/ (push) Has been cancelled Details Contrib / dev/haring/ (push) Has been cancelled Details Contrib / dev/hpack/ (push) Has been cancelled Details Contrib / dev/poll/ (push) Has been cancelled Details VTest / Generate Build Matrix (push) Has been cancelled Details Windows / Windows, gcc, all features (push) Has been cancelled Details VTest / (push) Has been cancelled Details In haproxy, when an Initial packet is received, a new connection may be created and a DCID must be attributed. This CID is derived from the original DCID used by the client in its first packet. This is an optimization to avoid storing two CIDs values in the CID tree. On CID lookup, if the DCID used is not found, derivation is performed again. This should permit to retrieve the DCID node. However, this operation is not performed as expected in quic_get_cid_tid(), as the wrong value is used on the second lookup. Fix this function by using derive CID for it. Note that retrieve_qc_conn_from_cid() performs the same lookup but the bug was not present there. The impact of this bug is relatively low as most clients send a single Initial packet. Even in case of multiple packets in a single datagram, this does not cause any issue as the current thread is assigned as default. This should be backported up to 2.8.	2026-05-22 16:03:10 +02:00

1 2 3 4 5 ...

21447 commits