haproxy

mirror of https://github.com/haproxy/haproxy.git synced 2026-04-25 16:17:21 -04:00

Author	SHA1	Message	Date
Frédéric Lécaille	a956d15118	MINOR: quic: Support transport parameters draft TLS extension If we want to run quic-tracker against haproxy, we must at least support the draft version of the TLS extension for the QUIC transport parameters (0xffa5). quic-tracker QUIC version is draft-29 at this time. We select this depending on the QUIC version. If draft, we select the draft TLS extension.	2021-11-19 14:37:35 +01:00
Frédéric Lécaille	28f51faf0b	MINOR: quic: Correctly pad UDP datagrams UDP datagrams with Initial packet were padded only for the clients (haproxy servers). But such packets MUST also be padded for the servers (haproxy listeners). Furthere, for servers, only UDP datagrams containing ack-eliciting Initial packet must be padded.	2021-11-19 14:37:35 +01:00
Frédéric Lécaille	8370c93a03	MINOR: quic: Possible wrong connection identification A client may send several Initial packets. This is the case for picoquic with -Q option. In this case we must identify the connection of incoming Initial packets thanks to the original destination connection ID.	2021-11-19 14:37:35 +01:00
Frédéric Lécaille	d169efe52b	MINOR: quic_sock: missing CO_FL_ADDR_TO_SET flag When allocating destination addresses for QUIC connections we did not set this flag which denotes these addresses have been set. This had as side effect to prevent the H3 request results from being returned to the QUIC clients. Note that this bug was revealed by this commit: "MEDIUM: backend: Rely on addresses at stream level to init server connection" Thanks to Christopher for having found the real cause of this issue.	2021-11-19 14:37:35 +01:00
Willy Tarreau	3a8bbcc38e	BUG/MEDIUM: mux-h2: always process a pending shut read During 2.4-dev, an issue with partial frames was fixed with commit `3d4631fec` ("BUG/MEDIUM: mux-h2: fix read0 handling on partial frames"). However this patch is not completely correct. It makes h2_recv() return 0 if the connection was shut for reads, but this not make h2_io_cb() call h2_process(), so if there are any pending data left in the demux buffer, they will never be processed, and the I/O callback will be called in loops forever from the poller. The correct return value there is 1, as is done at the end of the function to report a pending read0. This should definitely fix issue #1328. However even after a lot of tests I couldn't manage to reproduce it, the conditions to enter that situation are quite racy. This must be backported to 2.0 since the fix above was merged into 2.0.21 and 2.2.9.	2021-11-19 12:10:02 +01:00
William Lallemand	7980dff10c	BUG/MEDIUM: ssl: abort with the correct SSL error when SNI not found Since commit `c2aae74` ("MEDIUM: ssl: Handle early data with OpenSSL 1.1.1"), the codepath of the clientHello callback changed, letting an unknown SNI escape with a 'return 1' instead of passing through the abort label. An error was still emitted because the frontend continued the handshake with the initial_ctx, which can't be used to achieve an handshake. However, it had the ugly side effect of letting the request pass in the case of a TLS resume. Which could be surprising when combining strict-sni with the removing of a crt-list entry over the CLI for example. (like its done in the ssl/new_del_ssl_crlfile.vtc reg-test). This patch switches the code path of the allow_early and abort label, so the default code path is the abort one, letting the clientHello returns the correct SSL_AD_UNRECOGNIZED_NAME in case of errors. Which means the client will now receive: OpenSSL error[0x14094458] ssl3_read_bytes: tlsv1 unrecognized name Instead of: OpenSSL error[0x14094410] ssl3_read_bytes: sslv3 alert handshake failure Which was the error emitted before HAProxy 1.8. This patch must be carrefuly backported as far as 1.8 once we validated its impact.	2021-11-19 03:59:56 +01:00
William Lallemand	e18d4e8286	BUG/MEDIUM: ssl: backend TLS resumption with sni and TLSv1.3 When establishing an outboud connection, haproxy checks if the cached TLS session has the same SNI as the connection we are trying to resume. This test was done by calling SSL_get_servername() which in TLSv1.2 returned the SNI. With TLSv1.3 this is not the case anymore and this function returns NULL, which invalidates any outboud connection we are trying to resume if it uses the sni keyword on its server line. This patch fixes the problem by storing the SNI in the "reused_sess" structure beside the session itself. The ssl_sock_set_servername() now has a RWLOCK because this session cache entry could be accessed by the CLI when trying to update a certificate on the backend. This fix must be backported in every maintained version, however the RWLOCK only exists since version 2.4.	2021-11-19 03:58:30 +01:00
Willy Tarreau	ec347b1239	MINOR: config: support default values for environment variables Sometimes it is really useful to be able to specify a default value for an optional environment variable, like the ${name-value} construct in shell. In fact we're really missing this for a number of settings in reg tests, starting with timeouts. This commit simply adds support for the common syntax above. Other common forms like '+' to replace existing variables, or ':-' and ':+' to act on empty variables, were not implemented at this stage, as they are less commonly needed.	2021-11-18 17:54:49 +01:00
William Lallemand	002e2068cc	CLEANUP: ssl: fix wrong #else commentary The else is not for boringSSL but for the lack of Client Hello callback. Should have been changed in `1fc44d4` ("BUILD: ssl: guard Client Hello callbacks with HAVE_SSL_CLIENT_HELLO_CB macro instead of openssl version"). Could be backported in 2.4.	2021-11-18 15:38:42 +01:00
Amaury Denoyelle	10eed8ed03	BUG/MINOR: quic: fix version negotiation packet generation Fix wrong memcpy usage for source and connection ID in generated Version Negotiation packet.	2021-11-18 13:49:40 +01:00
William Lallemand	c4810b8cc8	BUG/MEDIUM: mworker: cleanup the listeners when reexecuting Previously, the cleanup of the listeners was done in mworker_loop(), which was called once the configuration file was parsed. HAProxy was switching in wait mode when the configuration failed to load, so no listeners where created. Since the latest change on the mworker mode, HAProxy switch to wait mode after successfuly loading the configuration, without cleaning its listeners, because it was done in mworker_loop, resulting in the master not closing its listeners and keeping them. The master needs its configuration to know which listeners it need to close, so that must be done before the exec(). This patch fixes the problem by cleaning the listeners in the mworker_reexec() function. No backport needeed.	2021-11-18 11:01:16 +01:00
Amaury Denoyelle	a22d860406	MEDIUM: quic: send version negotiation packet on unknown version If the client announced a QUIC version not supported by haproxy, emit a Version Negotiation Packet, according to RFC9000 6. Version Negotiation. This is required to be able to use the framework for QUIC interop testing from https://github.com/marten-seemann/quic-interop-runner. The simulator checks that the server is available by sending packets to force the emission of a Version Negotiation Packet.	2021-11-18 10:50:58 +01:00
Amaury Denoyelle	154bc7f864	MINOR: quic: support hq-interop Implement a new app_ops layer for quic interop. This layer uses HTTP/0.9 on top of QUIC. Implementation is minimal, with the intent to be able to pass interoperability test suite from https://github.com/marten-seemann/quic-interop-runner. It is instantiated if the negotiated ALPN is "hq-interop".	2021-11-18 10:50:58 +01:00
Amaury Denoyelle	71e588c8a7	MEDIUM: quic: inspect ALPN to install app_ops Remove the hardcoded initialization of h3 layer on mux init. Now the ALPN is looked just after the SSL handshake. The app layer is then installed if the ALPN negotiation returned a supported protocol. This required to add a get_alpn on the ssl_quic layer which is just a call to ssl_sock_get_alpn() from ssl_sock. This is mandatory to be able to use conn_get_alpn().	2021-11-18 10:50:58 +01:00
Amaury Denoyelle	abbe91e5e8	MINOR: quic: redirect app_ops snd_buf through mux This change is required to be able to use multiple app_ops layer on top of QUIC. The stream-interface will now call the mux snd_buf which is just a proxy to the app_ops snd_buf function. The architecture may be simplified in the structure to install the app_ops on the stream_interface and avoid the detour via the mux layer on the sending path.	2021-11-18 10:50:58 +01:00
Amaury Denoyelle	d1acaf9828	BUG/MINOR: h3: ignore unknown frame types When receiving an unknown h3 frame type, the frame must be discarded silently and the processing of the remaing frames must continue. This is according to the HTTP/3 draft34. This issue was detected when using the quiche client which uses GREASE frame to test interoperability.	2021-11-18 10:50:58 +01:00
Christopher Faulet	7530830414	BUG/MEDIUM: mux-h1: Handle delayed silent shut in h1_process() to release H1C The commit `a85c522d4` ("BUG/MINOR: mux-h1: Save shutdown mode if the shutdown is delayed") revealed several hidden bugs in connection's shutdown handling. One of them is about delayed silent shudown. If outgoing data are not fully sent, we delayed the shutdown. However, in h1_process(), only normal (or clean) shutdown are really detected. If a silent (or dirty) shutdown is performed, the H1 connection is not immediately released. Of course, in this situation, the client never acknowledged the shutdown. Thus, the H1 connection remains open till the client timeout. This patch should fix the issues #1448 and #1453. It must be backported as far as 2.0.	2021-11-15 15:03:21 +01:00
Christopher Faulet	1ccbe12f4a	DOC: log: Add comments to specify when session's listener is defined or not When a log message is emitted, The session's listener is always defined when the session's owner is an inbound connection while it is undefined for a health-check. It is not obvious. So, comments have been added to make it clear. This patch is related to the issue #1434.	2021-11-15 11:31:09 +01:00
Christopher Faulet	d9e6b35701	CLEANUP: peers: Remove useless test on peer variable in peer_trace() A useless test on peer variable was reported by cppcheck in peer_trace(). This patch should fix the issue #1165.	2021-11-15 09:41:00 +01:00
Christopher Faulet	b7c962b0c0	BUG/MINOR: stick-table/cli: Check for invalid ipv6 key When an ipv6 key is used to filter a CLI command on a stick table (clear/set/show table ...), the return value of inet_pton() call must be checked to be sure the key is valid. This patch should fix the issue #1163. It should be backported to all supported versions.	2021-11-15 09:17:27 +01:00
Willy Tarreau	fdf53b4962	BUG/MINOR: pools: don't mark ourselves as harmless in DEBUG_UAF mode When haproxy is built with DEBUG_UAF=1, some particularly slow allocation functions are used for each pool, and it was not uncommon to see the watchdog trigger during performance tests. For this reason the allocation functions were surrounded by a pair of thread_harmless calls to mention that the function was waiting in slow syscalls. The problem is that this also releases functions blocked in thread_isolate() which can then start their work. In order to protect against the accidental removal of a shared resource in this situation, in 2.5-dev4 with commit `ba3ab7907` ("MEDIUM: servers: make the server deletion code run under full thread isolation") was added thread_isolate_full() for functions which want to be totally protected due to being manipulating some data. But this is not sufficient, because there are still places where we can allocate/free (thus sleep) under a lock, such as in long call chains involving the release of an idle connection. In this case, if one thread asks for isolation, one thread might hang in pool_alloc_area_uaf() with a lock held (for example the conns_lock when coming from conn_backend_get()->h1_takeover()->task_new()), with another thread blocked on a lock waiting for that one to release it, both keeping their bit clear in the thread_harmless mask, preventing the first thread from being released, thus causing a deadlock. In addition to this, it was already seen that the "show fd" CLI handler could wake up during a pool_free_area_uaf() with an incompletely released memory area while deleting a file descriptor, and be fooled showing bad pointers, or during a pool_alloc() on another thread that was in the process of registering a freshly allocated connection to a new file descriptor. One solution could consist in replacing all thread_isolate() calls by thread_isolate_full() but then that makes thread_isolate() useless and only shifts the problem by one slot. A better approach could possibly consist in having a way to mark that a thread is entering an extremely slow section. Such sections would be timed so that this is not abused, and the bit would be used to make the watchdog more patient. This would be acceptable as this would only affect debugging. The approach used here for now consists in removing the harmless bits around the UAF allocator, thus essentially undoing commit `85b2cae63` ("MINOR: pools: make the thread harmless during the mmap/munmap syscalls"). This is marked as minor because nobody is expected to be running with DEBUG_UAF outside of development or serious debugging, so this issue cannot affect regular users. It must be backported to stable branches that have thread_harmless_now() around the mmap() call.	2021-11-12 11:17:37 +01:00
Christopher Faulet	47940c39e2	BUG/MINOR: mux-h2: Fix H2_CF_DEM_SHORT_READ value The value for H2_CF_DEM_SHORT_READ flag is wrong. 2 bits are erroneously set, 0x200 and 0x80000. It is not an issue because both bits are not used anywhere else. The typo was introduced in the commit `b5f7b5296` ("BUG/MEDIUM: mux-h2: Handle remaining read0 cases on partial frames"). Thus this patch must also be backported as far a 2.0.	2021-11-10 18:04:36 +01:00
William Lallemand	67b778418e	BUG/MEDIUM: httpclient/cli: free of unallocated hc->req.uri httpclient_new() sets the hc->req.uri ist without duplicating its memory, which is a problem since the string in the ist could be inaccessible at some point. The API was made to use a ist which was allocated dynamically, but httpclient_new() didn't do that, which result in a crash when calling istfree(). This patch fixes the problem by doing an istdup() Fix issue #1452.	2021-11-10 17:02:50 +01:00
William Lallemand	5f47b2e280	BUG/MINOR: mworker: doesn't launch the program postparser When in wait mode, the mworker-prog postparser is launched, but unfortunately the child structure doesn't contain all required information to be able to launch the test. This test is only required when doing a configuration parsing. Must be backported as far as 2.0.	2021-11-10 15:53:01 +01:00
William Lallemand	90034bba15	MINOR: mworker: change the way we set PROC_O_LEAVING Since the wait mode is always used once we successfuly loaded the configuration, every processes were marked as old workers. To fix this, the PROC_O_LEAVING flag is set only on the processes which have a number of reloads greater than the current processes.	2021-11-10 15:53:01 +01:00
William Lallemand	3ba7c7b5e1	MINOR: mworker: ReloadFailed shown depending on failedreload The ReloadFailed prompt in the master CLI is shown only when failedreloads > 0. It was previously using a check on the wait mode, but we always use the wait mode now.	2021-11-10 15:53:01 +01:00
William Lallemand	6883674084	MINOR: mworker: implement a reload failure counter Implement a reload failure counter which counts the number of failure since the last success. This counter is available in 'show proc' over the master CLI.	2021-11-10 15:53:01 +01:00
William Lallemand	ad221f4ece	MINOR: mworker: only increment the number of reload in wait mode Since the wait mode will be started in any case of succesful or failed reload, change the way haproxy computes the number of reloads of the processes.	2021-11-10 15:53:01 +01:00
William Lallemand	836bda226c	MINOR: mworker: clarify starting/failure messages Clarify the startup and reload messages: On a successful configuration load, haproxy will emit "Loading success." after successfuly forked the children. When it didn't success to load the configuration it will emit "Loading failure!". When trying to reload the master process, it will emit "Reloading HAProxy".	2021-11-10 15:53:01 +01:00
William Lallemand	fab0fdce98	MEDIUM: mworker: reexec in waitpid mode after successful loading Use the waitpid mode after successfully loading the configuration, this way the memory will be freed in the master, and will preserve the memory. This will be useful when doing a reload with a configuration which has large maps or a lot of SSL certificates, avoiding an OOM because too much memory was allocated in the master.	2021-11-10 15:53:01 +01:00
William Lallemand	5d71a6b0f1	CLEANUP: mworker: remove any relative PID reference nbproc was removed, it's time to remove any reference to the relative PID in the master-worker, since there can be only 1 current haproxy process. This patch cleans up the alerts and warnings emitted during the exit of a process, as well as the "show proc" output.	2021-11-10 15:53:01 +01:00
Christopher Faulet	99293b0380	MINOR: mux-h1: Slightly Improve H1 traces Connection and conn-stream pointers and flags are now dumped, if available, in each trace messages. In addition, shutr and shutw mode is now reported.	2021-11-10 11:45:27 +01:00
Christopher Faulet	4c5a591b10	Revert "BUG/MINOR: http-ana: Don't eval front after-response rules if stopped on back" This reverts commit `597909f4e6` http-after-response rules evaluation was changed to do the same that was done for http-response, in the code. However, the opposite must be performed instead. Only the rules of the current section must be stopped. Thus the above commit is reverted and the http-response rules evaluation will be fixed instead. Note that only "allow" action is concerned. It is most probably an uncommon action for an http-after-request rule. This patch must be backported as far as 2.2 if the above commit was backported.	2021-11-09 18:02:49 +01:00
Christopher Faulet	46f46df300	BUG/MINOR: http-ana: Apply stop to the current section for http-response rules A TCP/HTTP action can stop the rules evaluation. However, it should be applied on the current section only. For instance, for http-requests rules, an "allow" on a frontend must stop evaluation of rules defined in this frontend. But the backend rules, if any, must still be evaluated. For http-response rulesets, according the configuration manual, the same must be true. Only "allow" action is concerned. However, since the beginning, this action stops evaluation of all remaining rules, not only those of the current section. This patch may be backported to all supported versions. But it is not so critical because the bug exists since a while. I doubt it will break any existing configuration because the current behavior is counterintuitive.	2021-11-09 18:02:36 +01:00
William Dauchy	42d7c402d5	MINOR: promex: backend aggregated server check status - add new metric: `haproxy_backend_agg_server_check_status` it counts the number of servers matching a specific check status this permits to exclude per server check status as the usage is often to rely on the total. Indeed in large setup having thousands of servers per backend the memory impact is not neglible to store the per server metric. - realign promex_str_metrics array quite simple implementation - we could improve it later by adding an internal state to the prometheus exporter, thus to avoid counting at every dump. this patch is an attempt to close github issue #1312. It may bebackported to 2.4 if requested. Signed-off-by: William Dauchy <wdauchy@gmail.com>	2021-11-09 10:51:08 +01:00
William Lallemand	db8a1f391d	BUG/MEDIUM: httpclient: channel_add_input() must use htx->data The httpclient uses channel_add_input() to notify the channel layer that it must forward some data. This function was used with b_data(&req->buf) which ask to send the size of a buffer (because of the HTX metadata which fill the buffer completely). This is wrong and will have the consequence of trying to send data that doesn't exist, letting HAProxy looping at 100% CPU. When using htx channel_add_input() must be used with the size of the htx payload, and not the size of a buffer. When sending the request payload it also need to sets the buffer size to 0, which is achieved with a htx_to_buf() when the htx payload is empty.	2021-11-08 17:36:31 +01:00
William Lallemand	933fe394bb	BUG/MINOR: httpclient/lua: rcv freeze when no request payload This patch fixes the receive part of the lua httpclient when no payload was sent. The lua task was not awoken once it jumped into hlua_httpclient_rcv_yield(), which caused the lua client to freeze. It works with a payload because the payload push is doing the wakeup. A change in the state machine of the IO handler is also require to achieve correctly the change from the REQ state to the RES state, it has to detect if there is the right EOM flag in the request.	2021-11-08 17:36:31 +01:00
Willy Tarreau	1f38bdb3f6	BUG/MINOR: cache: properly ignore unparsable max-age in quotes When "max-age" or "s-maxage" receive their values in quotes, the pointer to the integer to be parsed is advanced by one, but the error pointer check doesn't consider this advanced offset, so it will not match a parse error such as max-age="a" and will take the value zero instead. This probably needs to be backported, though it's unsure it has any effect in the real world.	2021-11-08 12:09:27 +01:00
Willy Tarreau	49b0482ed4	CLEANUP: chunk: remove misleading chunk_strncat() function This function claims to perform an strncat()-like operation but it does not, it always copies the indicated number of bytes, regardless of the presence of a NUL character (what is currently done by chunk_memcat()). Let's remove it and explicitly replace it with chunk_memcat().	2021-11-08 12:08:26 +01:00
Tim Duesterhus	9f7ed8a60c	CLEANUP: Apply ist.cocci This is to make use of `chunk_istcat()`.	2021-11-08 12:08:26 +01:00
Tim Duesterhus	2471f5c2b2	CLEANUP: Apply ist.cocci Make use of the new rules to use `isttrim()`.	2021-11-08 12:08:26 +01:00
Frédéric Lécaille	c4becf5424	MINOR: quic: Fix potential null pointer dereference Fix compilation warnings about non initialized pointers. This partially address #1445 github issue.	2021-11-08 11:31:12 +01:00
Amaury Denoyelle	b9ce14e5a2	MINOR: h3: fix potential NULL dereference Fix potential allocation failure of HTX start-line during H3 request decoding. In this case, h3_decode_qcs returns -1 as error code. This addresses in part github issue #1445.	2021-11-08 09:17:24 +01:00
Amaury Denoyelle	7bb54f9906	MINOR: mux-quic: fix gcc11 warning Fix minor warnings about an unused variable. This addresses in part github issue #1445.	2021-11-08 08:59:30 +01:00
Amaury Denoyelle	3cae4049b0	MINOR: h3/qpack: fix gcc11 warnings Fix minor warnings about unused variables and mixed declarations. This addresses in part github issue #1445.	2021-11-08 08:59:30 +01:00
Tim Duesterhus	16cc16dd82	CLEANUP: Re-apply xalloc_size.cocci Use a consistent size as the parameter for the *alloc family.	2021-11-08 08:05:39 +01:00
Tim Duesterhus	4c8f75fc31	CLEANUP: Apply ist.cocci Make use of the new rules to use `istend()`.	2021-11-08 08:05:39 +01:00
Willy Tarreau	68574dd492	MEDIUM: log: add the client's SNI to the default HTTPS log format During a troublehooting it came obvious that the SNI always ought to be logged on httpslog, as it explains errors caused by selection of the default certificate (or failure to do so in case of strict-sni). This expectation was also confirmed on the mailing list. Since the field may be empty it appeared important not to leave an empty string in the current format, so it was decided to place the field before a '/' preceding the SSL version and ciphers, so that in the worst case a missing field leads to a field looking like "/TLSv1.2/AES...", though usually a missing element still results in a "-" in logs. This will change the log format for users who already deployed the 2.5-dev versions (hence the medium level) but no released version was using this format yet so there's no harm for stable deployments. The reg-test was updated to check for "-" there since we don't send SNI in reg-tests. Link: https://www.mail-archive.com/haproxy@formilux.org/msg41410.html Cc: William Lallemand <wlallemand@haproxy.org>	2021-11-06 09:20:07 +01:00
Willy Tarreau	579259d150	MINOR: ssl: make the ssl_fc_sni() sample-fetch function always available Its definition is enclosed inside an ifdef SSL_CTRL_SET_TLSEXT_HOSTNAME which is defined since OpenSSL 0.9.8. Having it conditioned like this prevents us from using it by default in a log format, which could cause an error on an old or exotic library. Let's just always define it and make the sample fetch fail to return anything on such libs instead.	2021-11-06 09:20:07 +01:00
Willy Tarreau	6f7497616e	MEDIUM: connection: rename fc_conn_err and bc_conn_err to fc_err and bc_err Commit `3d2093af9` ("MINOR: connection: Add a connection error code sample fetch") added these convenient sample-fetch functions but it appears that due to a misunderstanding the redundant "conn" part was kept in their name, causing confusion, since "fc" already stands for "front connection". Let's simply call them "fc_err" and "bc_err" to match all other related ones before they appear in a final release. The VTC they appeared in were also updated, and the alpha sort in the keywords table updated. Cc: William Lallemand <wlallemand@haproxy.org>	2021-11-06 09:20:07 +01:00
Christopher Faulet	44d34bfbe7	MINOR: compression: Warn for 'compression offload' in defaults sections This directive is documented as being ignored if set in a defaults section. But it is only mentionned in a small note in the configuration manual. Thus, now, a warning is emitted. To do so, the errors handling in parse_compression_options() function was slightly changed. In addition, this directive is now documented apart from the other compression directives. This way, it is clearly visible that it must not be used in a defaults section.	2021-11-05 16:36:42 +01:00
Christopher Faulet	34a3eb4c42	MINOR: backend: Get client dst address to set the server's one only if needful In alloc_dst_address(), the client destination address must only be retrieved when we are sure to use it. Most of time, this save a syscall to getsockname(). It is not a bugfix in itself. But it revealed a bug in the QUIC part. The CO_FL_ADDR_TO_SET flag is not set when the destination address is create for anew quic client connection.	2021-11-05 15:25:34 +01:00
Frédéric Lécaille	b0006eee09	MINOR: quic: Use QUIC_LOCK QUIC specific lock label. Very minor modifications without any impact.	2021-11-05 15:20:04 +01:00
Frédéric Lécaille	46ea033be0	MINOR: quic: Remove a useless lock for CRYPTO frames ->frms_rwlock is an old lock supposed to be used when several threads could handle the same connection. This is no more the case since this commit: "MINOR: quic: Attach the QUIC connection to a thread."	2021-11-05 15:20:04 +01:00
Frédéric Lécaille	324ecdafbb	MINOR: quic: Enhance the listener RX buffering part Add a buffer per QUIC connection. At this time the listener which receives the UDP datagram is responsible of identifying the underlying QUIC connection and must copy the QUIC packets to its buffer. ->pkt_list member has been added to quic_conn struct to enlist the packets in the order they have been copied to the connection buffer so that to be able to consume this buffer when the packets are freed. This list is locked thanks to a R/W lock to protect it from concurent accesses. quic_rx_packet struct does not use a static buffer anymore to store the QUIC packets contents.	2021-11-05 15:20:04 +01:00
Frédéric Lécaille	c5c69a0ad2	CLEANUP: quic: Remove useless code Remove old I/O handler implementation (listener and server). At this time keep a defined but not used function for servers (qc_srv_pkt_rcv()).	2021-11-05 15:20:04 +01:00
Frédéric Lécaille	c1029f6182	MINOR: quic: Allocate listener RX buffers At this time we allocate an RX buffer by thread. Also take the opportunity offered by this patch to rename TX related variable names to distinguish them from the RX part.	2021-11-05 15:20:04 +01:00
Tim Duesterhus	284fbe1214	CLEANUP: Apply ist.cocci Make use of the new rules to use `istnext()`.	2021-11-05 07:48:38 +01:00
Tim Duesterhus	025b93e3a2	CLEANUP: Apply ha_free.cocci Use `ha_free()` where possible.	2021-11-05 07:48:38 +01:00
Remi Tricot-Le Breton	7266350181	BUG/MINOR: jwt: Fix jwt_parse_alg incorrectly returning JWS_ALG_NONE jwt_parse_alg would mistakenly return JWT_ALG_NONE for algorithms "", "n", "no" and "non" because of a strncmp misuse. It now sees them as unknown algorithms. No backport needed. Cc: Tim Duesterhus <tim@bastelstu.be>	2021-11-03 17:19:48 +01:00
Emeric Brun	f8642ee826	MEDIUM: resolvers: rename dns extra counters to resolvers extra counters This patch renames all dns extra counters and stats functions, types and enums using the 'resolv' prefix/suffixes. The dns extra counter domain id used on cli was replaced by "resolvers" instead of "dns". The typed extra counter prefix dumping resolvers domain "D." was also renamed "N." because it points counters on a Nameserver. This was done to finish the split between "resolver" and "dns" layers and to avoid further misunderstanding when haproxy will handle dns load balancing. This should not be backported.	2021-11-03 17:16:46 +01:00
Emeric Brun	d174f0e59a	MINOR: resolvers/dns: split dns and resolver counters in dns_counter struct This patch add a union and struct into dns_counter struct to split application specific counters. The only current existing application is the resolver.c layer but in futur we could handle different application such as dns load balancing with others specific counters. This patch should not be backported.	2021-11-03 17:16:46 +01:00
Emeric Brun	0161d32df2	BUG/MINOR: resolvers: throw log message if trash not large enough for query Before this patch the sent error counter was increased for each targeted nameserver as soon as we were unable to build the query message into the trash buffer. But this counter is here to count sent errors at dns.c transport layer and this error is not related to a nameserver. This patch stops to increase those counters and sent a log message to signal the trash buffer size is not large enough to build the query. Note: This case should not happen except if trash size buffer was customized to a very low value. The function was also re-worked to return -1 in this error case as it was specified in comment. This function is currently called at multiple point in resolver.c but return code is still not yet handled. So to advert the user of the malfunction the log message was added. This patch should be backported on all versions including the layer split between dns.c and resolver.c (v >= 2.4)	2021-11-03 17:16:46 +01:00
Emeric Brun	c37caab21c	BUG/MINOR: resolvers: fix sent messages were counted twice The sent messages counter was increased at both resolver.c and dns.c layers. This patch let the dns.c layer count the sent messages since this layer handle a retry if transport layer is not ready (EAGAIN on udp or tcp session ring buffer full). This patch should be backported on all versions using a split of those layers for resolving (v >=2.4)	2021-11-03 17:16:46 +01:00
Amaury Denoyelle	f9d5957cd9	MINOR: server: add ws keyword Implement parsing for the server keyword 'ws'. This is used to configure the mode of selection for websocket protocol. The configuration documentation has been updated. A new regtest has been created to test the proper behavior of the keyword.	2021-11-03 16:24:48 +01:00
Amaury Denoyelle	9c3251d108	MEDIUM: server/backend: implement websocket protocol selection Handle properly websocket streams if the server uses an ALPN with both h1 and h2. Add a new field h2_ws in the server structure. If set to off, reuse is automatically disable on backend and ALPN is forced to http1.x if possible. Nothing is done if on. Implement a mechanism to be able to use a different http version for websocket streams. A new server member <ws> represents the algorithm to select the protocol. This can overrides the server <proto> configuration. If the connection uses ALPN for proto selection, it is updated for websocket streams to select the right protocol. Three mode of selection are implemented : - auto : use the same protocol between non-ws and ws streams. If ALPN is use, try to update it to "http/1.1"; this is only done if the server ALPN contains "http/1.1". - h1 : use http/1.1 - h2 : use http/2.0; this requires the server to support RFC8441 or an error will be returned by haproxy.	2021-11-03 16:24:48 +01:00
Amaury Denoyelle	ac03ef26e8	MINOR: connection: add alternative mux_ops param for conn_install_mux_be Add a new parameter force_mux_ops. This will be useful to specify an alternative to the srv->mux_proto field. If non-NULL, it will be use to force the mux protocol wether srv->mux_proto is set or not. This argument will become useful to install a mux for non-standard streams, most notably websocket streams.	2021-11-03 16:24:48 +01:00
Amaury Denoyelle	2454bda140	MINOR: connection: implement function to update ALPN Implement a new function to update the ALPN on an existing connection. on an existing connection. The ALPN from the ssl context can be checked to update the ALPN only if it is a subset of the context value. This method will be useful to change a connection ALPN for websocket, must notably if the server does not support h2 websocket through the rfc8441 Extended Connect.	2021-11-03 16:24:48 +01:00
Amaury Denoyelle	90ac605ef3	MINOR: stream/mux: implement websocket stream flag Define a new stream flag SF_WEBSOCKET and a new cs flag CS_FL_WEBSOCKET. The conn-stream flag is first set by h1/h2 muxes if the request is a valid websocket upgrade. The flag is then converted to SF_WEBSOCKET on the stream creation. This will be useful to properly manage websocket streams in connect_server().	2021-11-03 16:24:48 +01:00
Amaury Denoyelle	0df043608f	BUG/MEDIUM: mux-h2: reject upgrade if no RFC8441 support The RFC8441 was not respected by haproxy in regards with server support for Extended CONNECT. The Extended CONNECT method was used to convert an Upgrade header stream even if no SETTINGS_ENABLE_CONNECT_PROTOCOL was received, which is forbidden by the RFC8441. In this case, the behavior of the http/2 server is unspecified. Fix this by flagging the connection on receiption of the RFC8441 settings SETTINGS_ENABLE_CONNECT_PROTOCOL. Extended CONNECT is thus only be used if the flag is present. In the other case, the stream is immediatly closed as there is no way to handle it in http/2. It results in a http/1.1 502 or http/2 RESET_STREAM to the client side. The protocol-upgrade regtest has been extended to test that haproxy does not emit Extended CONNECT on servers without RFC8441 support. It must be backported up to 2.4.	2021-11-03 16:24:48 +01:00
Amaury Denoyelle	e0c258c84d	MINOR: mux-h2: add trace on extended connect usage Add a state trace to report that a protocol upgrade is converted using the rfc8441 Extended connect method. This is useful in regards with the recent changes to improve http/2 websockets.	2021-11-03 11:42:02 +01:00
Tim Duesterhus	ab896ee3f7	MINOR: jwt: Make invalid static JWT algorithms an error in `jwt_verify` converter It is not useful to start a configuration where an invalid static string is provided as the JWT algorithm. Better make the administrator aware of the suspected typo by failing to start.	2021-11-03 11:15:32 +01:00
Jaroslaw Rzesz�tko	c8637032a7	MINOR: vars: add "set-var" for "tcp-request connection" rules. Session struct is already allocated when "tcp-request connection" rules are evaluated so session-scoped variables turned out easy to support. This resolves github issue #1408.	2021-11-02 17:58:35 +01:00
Willy Tarreau	44c5ff69ac	MEDIUM: vars: make the var() sample fetch function really return type ANY A long-standing issue was reported in issue #1215. In short, var() was initially internally declared as returning a string because it was not possible by then to return "any type". As such, users regularly get trapped thinking that when they're storing an integer there, then the integer matching method automatically applies. Except that this is not possible since this is related to the config parser and is decided at boot time where the variable's type is not known yet. As such, what is done is that the output being declared as type string, the string match will automatically apply, and any value will first be converted to a string. This results in several issues like: http-request set-var(txn.foo) int(-1) http-request deny if { var(txn.foo) lt 0 } not working. This is because the string match on the second line will in fact compare the string representation of the variable against strings "lt" and "0", none of which matches. The doc says that the matching method is mandatory, though that's not the case in the code due to that default string type being permissive. There's not even a warning when no explicit match is placed, because this happens very deep in the expression evaluator and making a special case just for "var" can reveal very complicated. The set-var() converter already mandates a matching method, as the following will be rejected: ... if { int(12),set-var(txn.truc) 12 } while this one will work: ... if { int(12),set-var(txn.truc) -m int 12 } As such, this patch this modifies var() to match the doc, returning the type "any", and mandating the matching method, implying that this bogus config which does not work: http-request set-var(txn.foo) int(-1) http-request deny if { var(txn.foo) lt 0 } will need to be written like this: http-request set-var(txn.foo) int(-1) http-request deny if { var(txn.foo) -m int lt 0 } This will break some configs (and even 3 of our regtests relied on this), but except those which already match string exclusively, all other ones are already broken and silently fail (and one of the 3 regtests, the one on FIX, was bogus regarding this). In order to fix existing configs, one can simply append "-m str" after a "var()" in an ACL or "if" expression: http-request deny unless { var(txn.jwt_alg) "ES" } must become: http-request deny unless { var(txn.jwt_alg) -m str "ES" } Most commonly, patterns such as "le", "lt", "ge", "gt", "eq", "ne" in front of a number indicate that the intent was to match an integer, and in this case "-m int" would be desired: tcp-response content reject if ! { var(res.size) gt 3800 } ought to become: tcp-response content reject if ! { var(res.size) -m int gt 3800 } This must not be backported, but if a solution is found to at least detect this exact condition in the generic expression parser and emit a warning, this could probably help spot configuration bugs. Link: https://www.mail-archive.com/haproxy@formilux.org/msg41341.html Cc: Christopher Faulet <cfaulet@haproxy.com> Cc: Tim D�sterhus <tim@bastelstu.be>	2021-11-02 17:28:43 +01:00
Christopher Faulet	e8f3596cd0	MINOR: stream: Improve dump of bogus streams Stream flags and information about the HTTP txn, if defined, are now emitted. This will help us to identify bugs when such message is reported.	2021-11-02 17:25:48 +01:00
Christopher Faulet	9ed1a0601d	BUG/MEDIUM: resolvers: Track api calls with a counter to free resolutions The kill list introduced in commit `f766ec6b5` ("MEDIUM: resolvers: use a kill list to preserve the list consistency") contains a bug. The deatch_row must be initialized before calling resolv_process_responses() function. However, this function is called for the dns code. The death_row is not visible from the outside. So, it is possible to add a resolution in an uninitialized death_row, leading to a crash. But, with the current implementation, it is not possible to handle the death_row in resolv_process_responses() function because, internally, the kill list may be freed via a call to resolv_unlink_resolution(). At the end, we are unable to determine all call chains to guarantee a safe use of the kill list. It is a shameful observation, but unfortunatly true. So, to make the fix simple, we track all calls to the public resolvers api. A counter is incremented when we enter in the resolver code and decremented when we leave it. This way, we are able to track the recursions to init and release the kill list only once, at the edge. Following functions are incrementing/decrementing the recurse counter: * resolv_trigger_resolution() * resolv_srvrq_expire_task() * resolv_link_resolution() * resolv_unlink_resolution() * resolv_detach_from_resolution_answer_items() * resolv_process_responses() * process_resolvers() * resolvers_finalize_config() * resolv_action_do_resolve() This patch should fix the issue #1404. It must be backported everywhere the above commit was backported.	2021-11-02 16:55:01 +01:00
Christopher Faulet	69fad00ebf	BUG/MEDIUM: stream-int: Block reads if channel cannot receive more data First of all, we must be careful here because this part was modified and each time, this introduced a bug. But, in si_update_rx(), we must not re-enables receives if the channel buffer cannot receive more data. Otherwise the multiplexer will be wake up for nothing. Because the stream is woken up when the multiplexer is waiting for more room to move on, this may lead to a ping-pong loop between the stream and the mux. Note that for now, it does not fix any known bug. All reported issues in this area were fixed in another way. This patch must be backported with a special care. Technically speaking, it may be backported as far as 2.0.	2021-11-02 16:55:01 +01:00
William Lallemand	0f41c384ea	BUG/MINOR: httpclient: use a placeholder value for Host header A Host header must be present for http_update_host() to success. htx_add_header(htx, ist("Host"), IST_NULL) was used but this is not a good idea from a semantic point of view. It also tries to make a memcpy with a len of 0, which is unrequired. Use an ist("h") instead as a placeholder value. This patch fixes bug #1439.	2021-11-02 15:53:09 +01:00
William Lallemand	d1187eb3e1	BUG/MINOR: httpclient/lua: misplaced luaL_buffinit() Some luaL_buffinit() call was done before the push of the variable name, where it seems to work correctly with lua < 5.4.3, it brokes systematically on this version. This patch inverts the pushstring and the buffinit.	2021-11-02 10:40:06 +01:00
Remi Tricot-Le Breton	7da35bff9f	BUG/MINOR: http: http_auth_bearer fetch does not work on custom header name The http_auth_bearer sample fetch can take a header name as parameter, in which case it will try to extract a Bearer value out of the given header name instead of the default "Authorization" one. In this case, the extraction would not have worked because of a misuse of strncasecmp. This patch fixes this by replacing the standard string functions by ist ones. It also properly manages the multiple spaces that could be found between the scheme and its value. No backport needed, that's part of JWT which is only in 2.5. Co-authored-by: Tim Duesterhus <tim@bastelstu.be>	2021-10-29 17:40:17 +02:00
Remi Tricot-Le Breton	68c4eae87f	BUG/MINOR: http: Authorization value can have multiple spaces after the scheme As per RFC7235, there can be multiple spaces in the value of an Authorization header, between the scheme and the actual authentication parameters. This can be backported to all stable versions since basic auth has almost always been there.	2021-10-29 17:40:17 +02:00
Christopher Faulet	b0c87f1c61	BUG/MEDIUM: http-ana: Drain request data waiting the tarpit timeout expiration When a tarpit action is performed, we must be sure to drain data from the request channel. Otherwise, the mux on the frontend side may be blocked because the request channel buffer is full. This may lead to Two bugs. The first one is a HOL blocking on the H2 multiplexer. A tarpitted stream may block all the others because data are not drained for the whole tarpit timeout. The second bug is a ping-pong loop between the multiplexer and the stream. The mux is waiting for more space in the channel buffer, so it wakes up the stream. And the stream systematically re-enables receives. This last part is not pretty clean and it will be addressed with another fix. But draning request data is a good way to fix both bugs in same time. This patch must be backported as far as 2.0. The legacy HTTP mode is probably affected, but I don't know if same bugs may be experienced in this mode.	2021-10-29 15:06:31 +02:00
Christopher Faulet	bce6db6c3c	BUG/MEDIUM: resolvers: Don't recursively perform requester unlink When a requester is unlink from a resolution, by reading the code, we can have this call chain: _resolv_unlink_resolution(srv->resolv_requester) resolv_detach_from_resolution_answer_items(resolution, requester) resolv_srvrq_cleanup_srv(srv) _resolv_unlink_resolution(srv->resolv_requester) A loop on the resolution answer items is performed inside resolv_detach_from_resolution_answer_items(). But by reading the code, it seems possible to recursively unlink the same requester. To avoid any loop at this stage, the requester clean up must be performed before the call to resolv_detach_from_resolution_answer_items(). This way, the second call to _resolv_unlink_resolution() does nothing and returns immediately because the requester was already detached from the resolution. This patch is related to the issue #1404. It must be backported as far as 2.2.	2021-10-29 15:06:31 +02:00
Christopher Faulet	e76b4f055d	BUG/MEDIUM: mux-h1: Perform a connection shutdown when the h1c is released When the H1 connection is released, a connection shutdown is now performed. If it was already performed when the stream was detached, this action has no effect. But it is mandatory, when an idle H1C is released. Otherwise the xprt and the socket shutdown is never perfmed. It is especially important for SSL client connections, because it is the only way to perform a clean SSL shutdown. Without this patch, SSL_shutdown is never called, preventing, among other things, the SSL session caching. This patch depends on the commit "BUG/MINOR: mux-h1: Save shutdown mode if the shutdown is delayed". It should be backported as far as 2.0.	2021-10-29 15:06:31 +02:00
Christopher Faulet	a85c522d42	BUG/MINOR: mux-h1: Save shutdown mode if the shutdown is delayed The connection shutdown may be delayed if there are pending outgoing data. The action is performed once data are fully sent. In this case the mode (dirty/clean) was lost and a clean shutdown was always performed. Now, the mode is saved to be sure to perform the connection shutdown using the right mode. To do so, H1C_F_ST_SILENT_SHUT flag is introduced. This patch should be backported as far as 2.0.	2021-10-29 15:06:31 +02:00
William Lallemand	bd5739e93e	MINOR: httpclient/lua: handle the streaming into the lua applet With this feature the lua implementation of the httpclient is now able to stream a payload larger than an haproxy buffer. The hlua_httpclient_send() function is now split into: hlua_httpclient_send() which initiate the httpclient and parse the lua parameters hlua_httpclient_snd_yield() which will send the request and be called again to stream the request if the body is larger than an haproxy buffer hlua_httpclient_rcv_yield() which will receive the response and store it in the lua buffer.	2021-10-28 16:24:14 +02:00
William Lallemand	0da616ee18	MINOR: httpclient: request streaming with a callback This patch add a way to handle HTTP requests streaming using a callback. The end of the data must be specified by using the "end" parameter in httpclient_req_xfer().	2021-10-28 16:24:14 +02:00
Tim Duesterhus	8aee3030f8	CLEANUP: hlua: Remove obsolete branch in `hlua_alloc()` This branch is no longer required, because the `!nsize` case is handled for any value of `ptr` now. see `22586524e3` see `a5efdff93c`	2021-10-28 09:45:48 +02:00
Tim Duesterhus	e0c1d749a8	CLEANUP: jwt: Remove the use of a trash buffer in jwt_jwsverify_rsa_ecdsa() `trash` was completely unused within this function.	2021-10-28 09:45:48 +02:00
Tim Duesterhus	c87d3c21bf	CLEANUP: jwt: Remove the use of a trash buffer in jwt_jwsverify_hmac() The OpenSSL documentation (https://www.openssl.org/docs/man1.1.0/man3/HMAC.html) specifies: > It places the result in md (which must have space for the output of the hash > function, which is no more than EVP_MAX_MD_SIZE bytes). If md is NULL, the > digest is placed in a static array. The size of the output is placed in > md_len, unless it is NULL. Note: passing a NULL value for md to use the > static array is not thread safe. `EVP_MAX_MD_SIZE` appears to be defined as `64`, so let's simply use a stack buffer to avoid the whole memory management.	2021-10-28 09:45:48 +02:00
Willy Tarreau	14e7f29e86	MINOR: protocols: replace protocol_by_family() with protocol_lookup() At a few places we were still using protocol_by_family() instead of the richer protocol_lookup(). The former is limited as it enforces SOCK_STREAM and a stream protocol at the control layer. At least with protocol_lookup() we don't have this limitationn. The values were still set for now but later we can imagine making them configurable on the fly.	2021-10-27 17:41:07 +02:00
Willy Tarreau	e3b4518414	MINOR: protocols: make use of the protocol type to select the protocol Instead of using sock_type and ctrl_type to select a protocol, let's make use of the new protocol type. For now they always match so there is no change. This is applied to address parsing and to socket retrieval from older processes.	2021-10-27 17:31:20 +02:00
Willy Tarreau	337edfdbc5	MINOR: protocols: add a new protocol type selector The protocol selection is currently performed based on the family, control type and socket type. But this is often not enough, as both only provide DGRAM or STREAM, leaving few variants. Protocols like SCTP for example might be indistinguishable from TCP here. Same goes for TCP extensions like MPTCP. This commit introduces a new enum proto_type that is placed in each and every protocol definition, that will usually more or less match the sock_type, but being an enum, will support additional values.	2021-10-27 17:05:36 +02:00
Willy Tarreau	bdcee7fbc9	DEBUG: protocol: yell loudly during registration of invalid sock_domain The test on the sock_domain is a bit useless because the protocols are registered at boot time, and the test silently fails and returns no error. Use a BUG_ON() instead to make sure to catch such bugs in the code if any.	2021-10-27 15:50:49 +02:00
Christopher Faulet	52b28d2f30	BUILD: log: Fix compilation without SSL support When compiled without SSL support, a variable is reported as not used by GCC. src/log.c: In function ‘sess_build_logline’: src/log.c:2056:36: error: unused variable ‘conn’ [-Werror=unused-variable] 2056 \| struct connection *conn; \| ^~~~ This does not need to be backported.	2021-10-27 12:00:15 +02:00
Christopher Faulet	16f16afb31	MINOR: stream: Use backend stream-interface dst address instead of target_addr target_addr field in the stream structure is removed. The backend stream-interface destination address is now used.	2021-10-27 11:35:59 +02:00
Christopher Faulet	888cd700f4	MINOR: tcp-sample: Add samples to get original info about client connection Because source and destination address of the client connection are now updated at the appropriated level (connection, session or stream), original info about the client connection are preserved. src/src_port/src_is_local and dst/dst_port/dst_is_local return current info about the client connection. It is the info at the highest available level. Most of time, the stream. Any tcp/http rules may alter this info. To get original info, "fc_" prefix must be added. For instance "fc_src". Here, only "tcp-request connection" rules may alter source and destination address/port.	2021-10-27 11:35:59 +02:00
Christopher Faulet	1e83b70409	MINOR: tcp-act: Add set-src/set-src-port for "tcp-request content" rules This patch was reverted because it was inconsitent to change connection addresses at stream level. Especially in HTTP because all requests was affected by this change and not only the current one. In HTTP/2, it was worse. Several streams was able to change the connection addresses at the same time. It is no longer an issue, thanks to recent changes. With multi-level client source and destination addresses, it is possible to limit the change to the current request. Thus this patch can be reintroduced. If it possible to set source IP/Port from "tcp-request connection", "tcp-request session" and "http-request" rules but not from "tcp-request content" rules. There is no reason for this limitation and it may be a problem for anyone wanting to call a lua fetch to dynamically set source IP/Port from a TCP proxy. Indeed, to call a lua fetch, we must have a stream. And there is no stream when "tcp-request connection/session" rules are evaluated. Thanks to this patch, "set-src" and "set-src-port" action are now supported by "tcp_request content" rules. This patch is related to the issue #1303.	2021-10-27 11:35:59 +02:00
Christopher Faulet	d69377eb02	MEDIUM: tcp-act: Set addresses at the apprioriate level in set-(src/dst) actions When client source or destination addresses are changed via a tcp/http action, we update addresses at the appropriate level. When "tcp-request connection" rules are evaluated, we update addresses at the connection level. When "tcp-request session" rules is evaluated, we update those at the session level. And finally, when "tcp-request content" or "http-request" rules are evaluated, we update the addresses at the stream level. The same is performed when source or destination ports are changed. Of course, for now, not all level are supported. But thanks to this patch, it will be possible.	2021-10-27 11:35:59 +02:00
Christopher Faulet	e83e8821bb	MEDIUM: connection: Assign session addresses when NetScaler CIP proto is parsed Just like for the PROXY protocol, when the NetScaler Client IP insertion header is received, the retrieved client source and destination addresses are set at the session level. This leaves those at the connection level intact.	2021-10-27 11:35:59 +02:00
Christopher Faulet	c105c9213f	MEDIUM: connection: Assign session addresses when PROXY line is received When PROXY protocol line is received, the retrieved client source and destination addresses are set at the session level. This leaves those at the connection level intact.	2021-10-27 11:35:59 +02:00
Christopher Faulet	a8e95fed43	MEDIUM: backend: Rely on addresses at stream level to init server connection Client source and destination addresses at stream level are used to initiate the connections to a server. For now, stream-interface addresses are never set. So, thanks to the fallback mechanism, no changes are expected with this patch. But its purpose is to rely on addresses at the appropriate level when set instead of those at the connection level.	2021-10-27 11:35:59 +02:00
Christopher Faulet	b097aef2ef	MEDIUM: connection: Rely on addresses at stream level to make proxy line If the stream exists, the frontend stream-interface is used to get the client source and destination addresses when the proxy line is built. For now, stream-interface or session addresses are never set. So, thanks to the fallback mechanism, no changes are expected with this patch. But its purpose is to rely on addresses at the appropriate level when set instead of those at the connection level.	2021-10-27 11:35:57 +02:00
Christopher Faulet	c03be1a129	MEDIUM: tcp-sample: Rely on addresses at the appropriate level in tcp samples In src, src-port, dst and dst-port sample fetches, the client source and destination addresses are retrieved from the appropriate level. It means that, if the stream exits, we use the frontend stream-interface to get the client source and destination addresses. Otherwise, the session is used. For now, stream-interface or session addresses are never set. So, thanks to the fallback mechanism, no changes are expected with this patch. But its purpose is to rely on addresses at the appropriate level when set instead of those at the connection level.	2021-10-27 11:34:21 +02:00
Christopher Faulet	568008d199	MINOR: mux-fcgi: Rely on client addresses at stream level to set default params Client source and destination addresses at stream level are now used to emit SERVER_NAME/SERVER_PORT and REMOTE_ADDR/REMOTE_PORT parameters. For now, stream-interface addresses are never set. So, thanks to the fallback mechanism, no changes are expected with this patch. But its purpose is to rely on addresses at the stream level, when set, instead of those at the connection level.	2021-10-27 11:34:21 +02:00
Christopher Faulet	6fc817a28e	MINOR: http-fetch: Rely on addresses at stream level in HTTP sample fetches Client source and destination addresses at stream level are now used to compute base32+src and url32+src hashes. For now, stream-interface addresses are never set. So, thanks to the fallback mechanism, no changes are expected with this patch. But its purpose is to rely on addresses at the stream level, when set, instead of those at the connection level.	2021-10-27 11:34:21 +02:00
Christopher Faulet	8a104ba3e0	MINOR: http-ana: Rely on addresses at stream level to set xff and xot headers Client source and destination addresses at stream level are now used to emit X-Forwarded-For and X-Original-To headers. For now, stream-interface addresses are never set. So, thanks to the fallback mechanism, no changes are expected with this patch. But its purpose is to rely on addresses at the stream level, when set, instead of those at the connection level.	2021-10-27 11:34:21 +02:00
Christopher Faulet	c269f664bd	MINOR: session: Rely on client source address at session level to log error When an embryonic session is killed, if no log format is defined for this error, a generic error is emitted. When this happens, we now rely on the session to get the client source address. For now, session addresses are never set. So, thanks to the fallback mechanism, no changes are expected with this patch. But its purpose is to rely on addresses at the session level when set instead of those at the connection level.	2021-10-27 11:34:21 +02:00
Christopher Faulet	f9c4d8d5be	MINOR: log: Rely on client addresses at the appropriate level to log messages When a log message is emitted, if the stream exits, we use the frontend stream-interface to retrieve the client source and destination addresses. Otherwise, the session is used. For now, stream-interface or session addresses are never set. So, thanks to the fallback mechanism, no changes are expected with this patch. But its purpose is to rely on addresses at the appropriate level when set instead of those at the connection level.	2021-10-27 11:34:21 +02:00
Christopher Faulet	c9c8e1cc01	MINOR: frontend: Rely on client src and dst addresses at stream level For now, stream-interface or session addresses are never set. So, thanks to the fallback mechanism, no changes are expected with this patch. But its purpose is to rely on the client addresses at the stream level, when set, instead of those at the connection level. The addresses are retrieved from the frontend stream-interface.	2021-10-27 11:34:21 +02:00
Christopher Faulet	859ff84f8c	MINOR: stream-int: Add src and dst addresses to the stream-interface For now, these addresses are never set. But the idea is to be able to set, at least first, the client source and destination addresses at the stream level without updating the session or connection ones. Of course, because these addresses are carried by the strream-interface, it would be possible to set server source and destination addresses at this level too. Functions to fill these addresses have been added: si_get_src() and si_get_dst(). If not already set, these functions relies on underlying layers to fill stream-interface addresses. On the frontend side, the session addresses are used if set, otherwise the client connection ones are used. On the backend side, the server connection addresses are used. And just like for sessions and conncetions, si_src() and si_dst() may be used to get source and destination addresses or the stream-interface. And, if not set, same mechanism as above is used.	2021-10-27 11:34:21 +02:00
Christopher Faulet	f46e1ea1ad	MINOR: session: Add src and dst addresses to the session For now, these addresses are never set. But the idea is to be able to set client source and destination addresses at the session level without updating the connection ones. Functions to fill these addresses have been added: sess_get_src() and sess_get_dst(). If not already set, these functions relies on conn_get_src() and conn_get_dst() to fill session addresses. And just like for conncetions, sess_src() and sess_dst() may be used to get source and destination addresses. However, if not set, the corresponding address from the underlying client connection is returned. When this happens, the addresses is filled in the connection object.	2021-10-27 11:34:21 +02:00
Christopher Faulet	e6465b3b75	CLEANUP: lua: Use a const address to retrieve info about a connection hlua_socket_info() only extracts information about an address, there is no reason to not use a const.	2021-10-27 11:34:21 +02:00
Christopher Faulet	4bfce397b8	CLEANUP: connection: No longer export make_proxy_line_v1/v2 functions These functions are only used by the make_proxy_line() function. Thus, we can turn them as static.	2021-10-27 11:34:14 +02:00
vishnu	0af4bd7beb	BUG/MEDIUM: lua: fix invalid return types in hlua_http_msg_get_body hlua_http_msg_get_body must return either a Lua string or nil. For some HTTPMessage objects, HTX_BLK_EOT blocks are also present in the HTX buffer along with HTX_BLK_DATA blocks. In such cases, _hlua_http_msg_dup will start copying data into a luaL_Buffer until it encounters an HTX_BLK_EOT. But then instead of pushing neither the luaL_Buffer nor `nil` to the Lua stack, the function will return immediately. The end result will be that the caller of the HTTPMessage.body() method from a Lua filter will see whatever object was on top of the stack as return value. It may be either a userdata object if HTTPMessage.body() was called with only two arguments, or the third argument itself if called with three arguments. Hence HTTPMessage.body() would return either nil, or HTTPMessage body as Lua string, or a userdata objects, or number. This fix ensure that HTTPMessage.body() will always return either a string or nil. Reviewed-by: Christopher Faulet <cfaulet@haproxy.com>	2021-10-27 11:04:16 +02:00
William Lallemand	6137a9ee20	MINOR: httpclient/lua: return an error when it can't generate the request Add a check during the httpclient request generation which yield an lua error when the generation didn't work. The most common case is the lack of space in the buffer, it can because of too much headers or a too big body.	2021-10-27 10:19:58 +02:00
William Lallemand	dc2cc9008b	MINOR: httpclient/lua: support more HTTP methods Add support for HEAD/PUT/POST/DELETE method with the lua httpclient. This patch use the httpclient_req_gen() function with a different meth parameter to implement this. Also change the reg-test to support a POST request with a body.	2021-10-27 10:19:49 +02:00
William Lallemand	dec25c3e14	MINOR: httpclient: support payload within a buffer httpclient_req_gen() takes a payload argument which can be use to put a payload in the request. This payload can only fit a request buffer. This payload can also be specified by the "body" named parameter within the lua. httpclient. It is also used within the CLI httpclient when specified as a CLI payload with "<<".	2021-10-27 10:19:41 +02:00
Amaury Denoyelle	8e358af8a3	MINOR: connection: remove unneeded memset 0 for idle conns Remove the zeroing of an idle connection node on remove from a tree. This is not needed and should improve slightly the performance of idle connection usage. Besides, it breaks the memory poisoning feature.	2021-10-22 17:29:25 +02:00
Amaury Denoyelle	926712ab2d	MINOR: backend: improve perf with tcp proxies skipping idle conns Skip the hash connection calcul when reuse must not be used in connect_server() : this is the case for TCP proxies. This should result in slightly better performance when using this use-case.	2021-10-22 17:28:29 +02:00
Amaury Denoyelle	aee4fdbd17	BUG/MINOR: backend: fix improper insert in avail tree for always reuse In connect_server(), if http-reuse always is set, the backend connection is inserted into the available tree as soon as created. However, the hash connection field is only set later at the end of the function. This seems to have no impact as the hash connection field is always position before a lookup. However, this is not a proper usage of ebmb API. Fix this by setting the hash connection field before the insertion into the avail tree. This must be backported up to 2.4.	2021-10-22 17:26:22 +02:00
Amaury Denoyelle	1252b6f951	MINOR: backend: add traces for idle connections reuse Add traces in connect_server() to debug idle connection reuse. These are attached to stream trace module, as it's already in use in backend.c with the macro TRACE_SOURCE.	2021-10-22 17:21:14 +02:00
Willy Tarreau	1de51eb727	MINOR: memprof: add one pointer size to the size of allocations The current model causes an issue when trying to spot memory leaks, because malloc(0) or realloc(0) do not count as allocations since we only account for the application-usable size. This is the problem that made issue #1406 not to appear as a leak. What we're doing now is to account for one extra pointer (the one that memory allocators usually place before the returned area), so that a malloc(0) will properly account for 4 or 8 bytes. We don't need something exact, we just need something non-zero so that a realloc(X) followed by a realloc(0) without a free() gives a small non-zero result. It was verified that the results are stable including in the presence of lots of malloc/realloc/free as happens when stressing Lua. It would make sense to backport this to 2.4 as it helps in bug reports.	2021-10-22 16:40:09 +02:00
Willy Tarreau	8cce4d79ff	MINOR: memprof: report the delta between alloc and free on realloc() realloc() calls are painful to analyse because they have two non-zero columns and trying to spot a leaking one requires a bit of scripting. Let's simply append the delta at the end of the line when alloc and free are non-nul. It would be useful to backport this to 2.4 to help with bug reports.	2021-10-22 16:40:09 +02:00
Willy Tarreau	a5efdff93c	BUG/MEDIUM: lua: fix memory leaks with realloc() on non-glibc systems In issue #1406, Lev Petrushchak reported a nasty memory leak on Alpine since haproxy 2.4 when using Lua, that memory profiling didn't detect. After inspecting the code and Lua's code, it appeared that Lua's default allocator does an explicit free() on size zero, while since 2.4 commit `d36c7fa5e` ("MINOR: lua: simplify hlua_alloc() to only rely on realloc()"), haproxy only calls realloc(ptr,0) that performs a free() on glibc but not on other systems as it's not required by POSIX... This patch reinstalls the explicit test for nsize==0 to call free(). Thanks to Lev for the very documented report, and to Tim for the links to a musl thread on the same subject that confirms the diagnostic. This must be backported to 2.4.	2021-10-22 16:40:09 +02:00
Frédéric Lécaille	46be7e92b4	MINOR: quic: Increase the size of handshake RX UDP datagrams Some browsers may send Initial packets with sizes greater than 1252 bytes (QUIC_INITIAL_IPV4_MTU). Let us increase this size limit up to 2048 bytes. Also use this size for "max_udp_payload_size" transport parameter to limit the size of the datagrams we want to receive.	2021-10-22 15:48:19 +02:00
Willy Tarreau	dbb0bb59e3	CLEANUP: resolvers: get rid of single-iteration loop in resolv_get_ip_from_response() In issue 1424 Coverity reports that the loop increment is unreachable, which is true, the list_for_each_entry() was replaced with a for loop, but it was already not needed and was instead used as a convenient construct for a single iteration lookup. Let's get rid of all this now and replace the loop with an "if" statement.	2021-10-22 08:34:14 +02:00
Willy Tarreau	0b22247606	MINOR: mux-h2: perform a full cycle shutdown+drain on close While in H1 we can usually close quickly, in H2 a client might be sending window updates or anything while we're sending a GOAWAY and the pending data in the socket buffers at the moment the close() is performed on the socket results in the output data being lost and an RST being emitted. One example where this happens easily is with h2spec, which randomly reports connection resets when waiting for a GOAWAY while haproxy sends it, as seen in issue #1422. With h2spec it's not window updates that are causing this but the fact that h2spec has to upload the payload that comes with invalid frames to accommodate various implementations, and does that in two different segments. When haproxy aborts on the invalid frame header, the payload was not yet received and causes an RST to be sent. Here we're dealing with this two ways: - we perform a shutdown(WR) on the connection to forcefully push pending data on a front connection after the xprt is shut and closed ; - we drain pending data - then we close This totally solves the issue with h2spec, and the extra cost is very low, especially if we consider that H2 connections are not set up and torn down often. This issue was never observed with regular clients, most likely because this pattern does not happen in regular traffic. After more testing it could make sense to backport this, at least to avoid reporting errors on h2spec tests.	2021-10-21 22:24:31 +02:00
Willy Tarreau	20b622e04b	MINOR: connection: add a new CO_FL_WANT_DRAIN flag to force drain on close Sometimes we'd like to do our best to drain pending data before closing in order to save the peer from risking to receive an RST on close. This adds a new connection flag CO_FL_WANT_DRAIN that is used to trigger a call to conn_ctrl_drain() from conn_ctrl_close(), and the sock_drain() function ignores fd_recv_ready() if this flag is set, in order to catch latest data. It's not used for now.	2021-10-21 21:48:23 +02:00
Willy Tarreau	e6dc7a0129	BUG/MINOR: mux-h2: do not prevent from sending a final GOAWAY frame Some checks were added by commit `9a3d3fcb5` ("BUG/MAJOR: mux-h2: Don't try to send data if we know it is no longer possible") to make sure we don't loop forever trying to send data that cannot leave. But one of the conditions there is not correct, the one relying on H2_CS_ERROR2. Indeed, this state indicates that the error code was serialized into the mux buffer, and since the test is placed before trying to send the data to the socket, if the connection states only contains a GOAWAY frame, it may refrain from sending and may close without sending anything. It's not dramatic, as GOAWAY reports connection errors in situations where delivery is not even certain, but it's cleaner to make sure the error is properly sent, and it avoids upsetting h2spec, as seen in github issue #1422. Given that the patch above was backported as far as 1.8, this patch will also have to be backported that far. Thanks to Ilya for reporting this one.	2021-10-21 17:37:22 +02:00
Willy Tarreau	3193eb9907	BUG/MINOR: task: do not set TASK_F_USR1 for no reason This applicationn specific flag was added in 2.4-dev by commit `6fa8bcdc7` ("MINOR: task: add an application specific flag to the state: TASK_F_USR1") to help preserve a the idle connections status across wakeup calls. While the code to do this was OK for tasklets, it was wrong for tasks, as in an effort not to lose it when setting the RUNNING flag (that tasklets don't have), it ended up being inconditionally set. It just happens that for now no regular tasks use it, only tasklets. This fix makes sure we always atomically perform (state & flags \| running) there, using a CAS. It also does it for tasklets because it was possible to lose some such flags if set by another thread, even though this should not happen with current code. In order to make the code more readable (and avoid the previous mistake of repeated flags in the bit field), a new TASK_PERSISTENT aggregate was declared in task.h for this. In practice the CAS is cheap here because task states are stable or convergent so the loop will almost never be taken. This should be backported to 2.4.	2021-10-21 16:17:29 +02:00
Willy Tarreau	dde1b4499a	OPTIM: dns: use an atomic check for the list membership The crash that was fixed by commit `7045590d8` ("BUG/MAJOR: dns: attempt to lock globaly for msg waiter list instead of use barrier") was now completely analysed and confirmed to be partially a result of the debugging code added to LIST_INLIST(), which was looking at both pointers and their reciprocals, and that, if used in a concurrent context, could perfectly return false if a neighbor was being added or removed while the current one didn't change, allowing the LIST_APPEND to fail. As the LIST API was not designed to be used in a concurrent context, we should not rely on LIST_INLIST() but on the newly introduced LIST_INLIST_ATOMIC(). This patch simply reverts the commit above to switch to the new test, saving a lock during potentially long operations. It was verified that the check doesn't fail anymore. It is unsure what the performance impact of the fix above could be in some contexts. If any performance regression is observed, it could make sense to backport this patch, along with the previous commit introducing the LIST_INLIST_ATOMIC() macro.	2021-10-21 15:28:42 +02:00
Willy Tarreau	dcb696cd31	MEDIUM: resolvers: hash the records before inserting them into the tree We're using an XXH32() on the record to insert it into or look it up from the tree. This way we don't change the rest of the code, the comparisons are still made on all fields and the next node is visited on mismatch. This also allows to continue to use roundrobin between identical nodes. Just doing this is sufficient to see the CPU usage go down from ~60-70% to 4% at ~2k DNS requests per second for farm with 300 servers. A larger config with 12 backends of 2000 servers each shows ~8-9% CPU for 6-10000 DNS requests per second. It would probably be possible to go further with multiple levels of indexing but it's not worth it, and it's important to remember that tree nodes take space (the struct answer_list went back from 576 to 600 bytes).	2021-10-21 08:29:02 +02:00
Willy Tarreau	7893ae117f	MEDIUM: resolvers: replace the answer_list with a (flat) tree With SRV records, a huge amount of time is spent looking for records by walking long lists. It is possible to reduce this by indexing values in trees instead. However the whole code relies a lot on the list ordering, and even implements some round-robin on it to distribute IP addresses to servers. This patch starts carefully by replacing the list with a an eb32 tree that is still used like a list, with a constant key 0. Since ebtrees preserve insertion order for duplicates, the tree walk visits the nodes in the exact same order it did with the lists. This allows to implement the required infrastructure without changing the behavior.	2021-10-21 08:02:08 +02:00
Willy Tarreau	a89c19127d	BUG/MEDIUM: checks: fix the starting thread for external checks When cleaning up the code to remove most explicit task masks in commit `beeabf531` ("MINOR: task: provide 3 task_new_* wrappers to simplify the API"), a mistake was done with the external checks where the call does task_new_on(1) instead of task_new_on(0) due to the confusion with the previous mask 1. No backport is needed as that's only 2.5-dev.	2021-10-20 18:43:30 +02:00
Willy Tarreau	6878f80427	MEDIUM: resolvers: remove the last occurrences of the "safe" argument This one was used to indicate whether the callee had to follow particularly safe code path when removing resolutions. Since the code now uses a kill list, this is not needed anymore.	2021-10-20 17:54:27 +02:00
Willy Tarreau	f766ec6b53	MEDIUM: resolvers: use a kill list to preserve the list consistency When scanning resolution.curr it's possible to try to free some resolutions which will themselves result in freeing other ones. If one of these other ones is exactly the next one in the list, the list walk visits deleted nodes and causes memory corruption, double-frees and so on. The approach taken using the "safe" argument to some functions seems to work but it's extremely brittle as it is required to carefully check all call paths from process_ressolvers() and pass the argument to 1 there to refrain from deleting entries, so the bug is very likely to come back after some tiny changes to this code. A variant was tried, checking at various places that the current task corresponds to process_resolvers() but this is also quite brittle even though a bit less. This patch uses another approach which consists in carefully unlinking elements from the list and deferring their removal by placing it in a kill list instead of deleting them synchronously. The real benefit here is that the complexity only has to be placed where the complications are. A thread-local list is fed with elements to be deleted before scanning the resolutions, and it's flushed at the end by picking the first one until the list is empty. This way we never dereference the next element and do not care about its presence or not in the list. One function, resolv_unlink_resolution(), is exported and used outside, so it had to be modified to use this list as well. Internal code has to use _resolv_unlink_resolution() instead.	2021-10-20 17:54:22 +02:00
Willy Tarreau	aae7320b0d	CLEANUP: resolvers: replace all LIST_DELETE with LIST_DEL_INIT The code as it is uses crossed lists between many elements, and at many places the code relies on list iterators or emptiness checks, which does not work with only LIST_DELETE. Further, it is quite difficult to place debugging code and checks in the current situation, and gdb is helpless. This code replaces all LIST_DELETE calls with LIST_DEL_INIT so that it becomes possible to trust the lists.	2021-10-20 17:54:14 +02:00
Willy Tarreau	239675e4a9	CLEANUP: resolvers: simplify resolv_link_resolution() regarding requesters This function allocates requesters by hand for each and every type. This is complex and error-prone, and it doesn't even initialize the list part, leaving dangling pointers that complicate debugging. This patch introduces a new function resolv_get_requester() that either returns the current pointer if valid or tries to allocate a new one and links it to its destination. Then it makes use of it in the function above to clean it up quite a bit. This allows to remove complicated but unneeded tests.	2021-10-20 17:54:01 +02:00
Willy Tarreau	48664c048d	CLEANUP: always initialize the answer_list Similar to the previous patch, the answer's list was only initialized the first time it was added to a list, leading to bogus outdated pointer to appear when debugging code is added around it to watch it. Let's make sure it's always initialized upon allocation.	2021-10-20 17:53:54 +02:00
Willy Tarreau	25e010906a	BUG/MEDIUM: resolvers: always check a valid item in query_list The query_list is physically stored in the struct resolution itself, so we have a list that contains a list to items stored in itself (and there is a single item). But the list is first initialized in resolv_validate_dns_response(), while it's scanned in resolv_process_responses() later after calling the former. First, this results in crashes as soon as the code is instrumented a little bit for debugging, as elements from a previous incarnation can appear. But in addition to this, the presence of an element is checked by verifying that the return of LIST_NEXT() is not NULL, while it may never be NULL even for an empty list, resulting in bugs or crashes if the number of responses does not match the list's contents. This is easily triggered by testing for the list non-emptiness outside of the function. Let's make sure the list is always correct, i.e. it's initialized to an empty list when the structure is allocated, elements are checked by first verifying the list is not empty, they are deleted once checked, and in any case at end so that there are no dangling pointers. This should be backported, but only as long as the patch fits without modifications, as adaptations can be risky there given that bugs tend to hide each other.	2021-10-20 17:53:35 +02:00
Willy Tarreau	10c1a8c3bd	BUILD: resolvers: avoid a possible warning on null-deref Depending on the code that precedes the loop, gcc may emit this warning: src/resolvers.c: In function 'resolv_process_responses': src/resolvers.c:1009:11: warning: potential null pointer dereference [-Wnull-dereference] 1009 \| if (query->type != DNS_RTYPE_SRV && flags & DNS_FLAG_TRUNCATED) { \| ~~~~~^~~~~~ However after carefully checking, r_res->header.qdcount it exclusively 1 when reaching this place, which forces the for() loop to enter for at least one iteration, and <query> to be set. Thus there's no code path leading to a null deref. It's possibly just because the assignment is too far and the compiler cannot figure that the condition is always OK. Let's just mark it to please the compiler.	2021-10-20 17:53:35 +02:00
Willy Tarreau	2acc160c05	CLEANUP: resolvers: do not export resolv_purge_resolution_answer_records() This code is dangerous enough that we certainly don't want external code to ever approach it, let's not export unnecessary functions like this one. It was made static and a comment was added about its purpose.	2021-10-20 17:52:50 +02:00
Willy Tarreau	2a67aa0a51	BUG/MAJOR: resolvers: add other missing references during resolution removal There is a fundamental design bug in the resolvers code which is that a list of active resolutions is being walked to try to delete outdated entries, and that the code responsible for removing them also removes other elements, including the next one which will be visited by the list iterator. This randomly causes a use-after-free condition leading to crashes, infinite loops and various other issues such as random memory corruption. A first fix for the memory fix for this was brought by commit `0efc0993e` ("BUG/MEDIUM: resolvers: Don't release resolution from a requester callbacks"). While preparing for more fixes, some code was factored by commit `11c6c3965` ("MINOR: resolvers: Clean server in a dedicated function when removing a SRV item"), which inadvertently passed "0" as the "safe" argument all the time, missing one case of removal protection, instead of always using "safe". This patch reintroduces the correct argument. This must be backported with all fixes above. Cc: Christopher Faulet <cfaulet@haproxy.com>	2021-10-20 17:52:36 +02:00
Willy Tarreau	62e467c667	DEBUG: dns: add a few more BUG_ON at sensitive places A few places have been caught triggering late bugs recently, always cases of use-after-free because a freed element was still found in one of the lists. This patch adds a few checks for such elements in dns_session_free() before the final pool_free() and dns_session_io_handler() before adding elements to lists to make sure they remain consistent. They do not trigger anymore now.	2021-10-20 17:52:17 +02:00
Willy Tarreau	b56a878950	CLEANUP: dns: always detach the appctx from the dns session on release When dns_session_release() calls dns_session_free(), it was shown that it might still be attached there: Program terminated with signal SIGSEGV, Segmentation fault. #0 0x00000000006437d7 in dns_session_free (ds=0x7f895439e810) at src/dns.c:768 768 BUG_ON(!LIST_ISEMPTY(&ds->ring.waiters)); [Current thread is 1 (Thread 0x7f895bbe2700 (LWP 31792))] (gdb) bt #0 0x00000000006437d7 in dns_session_free (ds=0x7f895439e810) at src/dns.c:768 #1 0x0000000000643ab8 in dns_session_release (appctx=0x7f89545a4ff0) at src/dns.c:805 #2 0x000000000062e35a in si_applet_release (si=0x7f89545a5550) at include/haproxy/stream_interface.h:236 #3 0x000000000063150f in stream_int_shutw_applet (si=0x7f89545a5550) at src/stream_interface.c:1697 #4 0x0000000000640ab8 in si_shutw (si=0x7f89545a5550) at include/haproxy/stream_interface.h:437 #5 0x0000000000643103 in dns_session_io_handler (appctx=0x7f89545a4ff0) at src/dns.c:725 #6 0x00000000006d776f in task_run_applet (t=0x7f89545a5100, context=0x7f89545a4ff0, state=81924) at src/applet.c:90 #7 0x000000000068b82b in run_tasks_from_lists (budgets=0x7f895bbbf5c0) at src/task.c:611 #8 0x000000000068c258 in process_runnable_tasks () at src/task.c:850 #9 0x0000000000621e61 in run_poll_loop () at src/haproxy.c:2636 #10 0x0000000000622328 in run_thread_poll_loop (data=0x8d7440 <ha_thread_info+64>) at src/haproxy.c:2807 #11 0x00007f895c54a06b in start_thread () from /lib64/libpthread.so.0 #12 0x00007f895bf3772f in clone () from /lib64/libc.so.6 (gdb) p &ds->ring.waiters $1 = (struct list ) 0x7f895439e8a8 (gdb) p ds->ring.waiters $2 = { n = 0x7f89545a5078, p = 0x7f89545a5078 } (gdb) p ds->ring.waiters->n $3 = (struct list ) 0x7f89545a5078 (gdb) p *ds->ring.waiters->n $4 = { n = 0x7f895439e8a8, p = 0x7f895439e8a8 } Let's always detach it before freeing so that it remains possible to check the dns_session's ring before releasing it, and possibly catch bugs.	2021-10-20 17:52:13 +02:00
Emeric Brun	7045590d8a	BUG/MAJOR: dns: attempt to lock globaly for msg waiter list instead of use barrier The barrier is insufficient here to protect the waiters list as we can definitely catch situations where ds->waiter shows an inconsistency whereby the element is not attached when entering the "if" block and is already attached when attaching it later. This patch uses a larger lock to maintain consistency. Without it the code would crash in 30-180 minutes under heavy stress, always showing the same problem (ds->waiter->n->p != &ds->waiter). Now it seems to always resist, suggesting that this was indeed the problem. This will have to be backported to 2.4.	2021-10-20 17:52:07 +02:00
Emeric Brun	d20dc21eec	BUG/MAJOR: dns: tcp session can remain attached to a list after a free Using tcp, after a session release and free, the session can remain attached to the list of sessions with a response message waiting for a commit (ds->waiter). This results to a use after free of this session. Also, on some error path and after free, a session could remain attached to the lists of available idle/free sessions (ds->list). This patch ensure to remove the session from those external lists before a free. This patch should be backported to all version including the dns over tcp (2.4)	2021-10-20 17:52:02 +02:00
Christopher Faulet	d16e7dd0e4	BUG/MEDIUM: tcpcheck: Properly catch early HTTP parsing errors When an HTTP response is parsed, early parsing errors are not properly handled. When this error is reported by the multiplexer, nothing is copied into the input buffer. The HTX message remains empty but the HTX_FL_PARSING_ERROR flag is set. In addition CS_FL_EOI is set on the conn-stream. This last flag must be handled to prevent subscription for receive events. Otherwise, in the best case, a L7 timeout error is reported. But a transient loop is also possible if a shutdown is received because the multiplexer notifies the check of the event while the check never handles it and waits for more data. Now, if CS_FL_EOI flag is set on the conn-stream, expect rules are evaluated. Any error must be handled there. Thanks to @kazeburo for his valuable report. This patch should fix the issue #1420. It must be backported at least to 2.4. On 2.3 and 2.2, there is no loop but the wrong error is reported (empty response instead of invalid one). Thus it may also be backported as far as 2.2.	2021-10-20 14:35:38 +02:00
William Lallemand	34b3a93655	MINOR: httpclient/cli: access should be only done from expert mode Only enable the usage of the CLI HTTP client in expert mode.	2021-10-19 15:02:42 +02:00

1 2 3 4 5 ...

12621 commits