haproxy

mirror of https://github.com/haproxy/haproxy.git synced 2026-03-27 21:06:45 -04:00

Author	SHA1	Message	Date
Willy Tarreau	13843641e5	MINOR: pools: split the OS-based allocator in two Now there's one part dealing with the allocation itself and keeping counters up to date, and another one on top of it to return such an allocated pointer to the user and update the use count and stats. This is in anticipation for being able to group cache-related parts. The release code is still done at once.	2021-04-19 15:24:33 +02:00
Willy Tarreau	207c095098	MINOR: pools: move the fault injector to __pool_alloc() Till now it was limited to objects allocated from the OS which means it had little use as soon as pools were enabled. Let's move it upper in the layers so that any code can benefit from fault injection. In addition this allows to pass a new flag POOL_F_NO_FAIL to disable it if some callers prefer a no-failure approach.	2021-04-19 15:24:33 +02:00
Willy Tarreau	20f88abad5	MINOR: pools: use cheaper randoms for fault injections ha_random() is quite heavy and uses atomic ops or even a lock on some architectures. Here we don't seek good randoms, just statistical ones, so let's use the statistical prng instead.	2021-04-19 15:24:33 +02:00
Willy Tarreau	635cced32f	CLEANUP: pools: rename __pool_free() to pool_put_to_shared_cache() Now the multi-level cache becomes more visible: pool_get_from_local_cache() pool_put_to_local_cache() pool_get_from_shared_cache() pool_put_to_shared_cache()	2021-04-19 15:24:33 +02:00
Willy Tarreau	8c77ee5ae5	CLEANUP: pools: rename pool__{from,to}_cache() to _local_cache() The functions were rightfully called from/to_cache when the thread-local cache was considered as the only cache, but this is getting terribly confusing. Let's call them from/to local_cache to make it clear that it is not related with the shared cache. As a side note, since pool_evict_from_cache() used not to work for a particular pool but for all of them at once, it was renamed to pool_evict_from_local_caches() (plural form).	2021-04-19 15:24:33 +02:00
Willy Tarreau	8fe726f118	CLEANUP: pools: re-merge pool_refill_alloc() and __pool_refill_alloc() They were strictly equivalent, let's remerge them and rename them to pool_alloc_nocache() as it's the call which performs a real allocation which does not check nor update the cache. The only difference in the past was the former taking the lock and not the second but now the lock is not needed anymore at this stage since the pool's list is not touched. In addition, given that the "avail" argument is no longer used by the function nor by its callers, let's drop it.	2021-04-19 15:24:33 +02:00
Willy Tarreau	eb3cc29622	MEDIUM: pools: unify pool_refill_alloc() across all models Now we don't loop anymore trying to refill multiple items at once, and an allocated object is directly returned to the requester instead of being stored into the shared pool. This has multiple benefits. The first one is that no locking is needed anymore on the allocation path and the second one is that the loop will no longer cause latency spikes.	2021-04-19 15:24:33 +02:00
Willy Tarreau	64383b8181	MINOR: pools: make the basic pool_refill_alloc()/pool_free() update needed_avg This is a first step towards unifying all the fallback code. Right now these two functions are the only ones which do not update the needed_avg rate counter since there's currently no shared pool kept when using them. But their code is similar to what could be used everywhere except for this one, so let's make them capable of maintaining usage statistics. As a side effect the needed field in "show pools" will now be populated.	2021-04-19 15:24:33 +02:00
Willy Tarreau	53a7fe49aa	MINOR: pools: enable the fault injector in all allocation modes The mem_should_fail() call enabled by DEBUG_FAIL_ALLOC used to be placed only in the no-cache version of the allocator. Now we can generalize it to all modes and remove the exclusive test on CONFIG_HAP_NO_GLOBAL_POOLS.	2021-04-19 15:24:33 +02:00
Willy Tarreau	2d6f628d34	MINOR: pools: rename CONFIG_HAP_LOCAL_POOLS to CONFIG_HAP_POOLS We're going to make the local pool always present unless pools are completely disabled. This means that pools are always enabled by default, regardless of the use of threads. Let's drop this notion of "local" pools and make it just "pool". The equivalent debug option becomes DEBUG_NO_POOLS instead of DEBUG_NO_LOCAL_POOLS. For now this changes nothing except the option and dropping the dependency on USE_THREAD.	2021-04-19 15:24:33 +02:00
Willy Tarreau	d5140e7c6f	MINOR: pool: remove the size field from pool_cache_head Everywhere we have access to the pool so we don't need to cache a copy of the pool's size into the pool_cache_head. Let's remove it.	2021-04-19 15:24:33 +02:00
Willy Tarreau	9f3129e583	MEDIUM: pools: move the cache into the pool header Initially per-thread pool caches were stored into a fixed-size array. But this was a bit ugly because the last allocated pools were not able to benefit from the cache at all. As a work around to preserve performance, a size of 64 cacheable pools was set by default (there are 51 pools at the moment, excluding any addon and debugging code), so all in-tree pools were covered, at the expense of higher memory usage. In addition an index had to be calculated for each pool, and was used to acces the pool cache head into that array. The pool index was not even stored into the pools so it was required to determine it to access the cache when the pool was already known. This patch changes this by moving the pool cache head into the pool head itself. This way it is certain that each pool will have its own cache. This removes the need for index calculation. The pool cache head is 32 bytes long so it was aligned to 64B to avoid false sharing between threads. The extra cost is not huge (~2kB more per pool than before), and we'll make better use of that space soon. The pool cache head contains the size, which should probably be removed since it's already in the pool's head.	2021-04-19 15:24:33 +02:00
Willy Tarreau	3e970b11eb	MINOR: pools: drop the unused static history of artificially failed allocs When building with DEBUG_FAIL_ALLOC we call a random generator to decide whether the pool alloc should succeed or fail, and there was a preliminary debugging mechanism to keep sort of a history of the previous decisions. But it was never used, enforces a lock during the allocation, and forces to use static variables, all of which are limiting the ability to pursue the pools cleanups with no real benefit. Let's get rid of them now.	2021-04-19 15:24:33 +02:00
Willy Tarreau	a5b229d01d	BUG/MINOR: pools/buffers: make sure to always reserve the required buffers Since recent commit ae07592 ("MEDIUM: pools: add CONFIG_HAP_NO_GLOBAL_POOLS and CONFIG_HAP_GLOBAL_POOLS") the pre-allocation of all desired reserved buffers was not done anymore on systems not using the shared cache. This basically has no practical impact since these ones will quickly be refilled by all the ones used at run time, but it may confuse someone checking if they're allocated in "show pools". That's only 2.4-dev, no backport is needed.	2021-04-19 15:24:33 +02:00
Willy Tarreau	932dd19cc3	BUG/MINOR: pools: maintain consistent ->allocated count on alloc failures When running with CONFIG_HAP_NO_GLOBAL_POOLS, it's theoritically possible to keep an incorrect count of allocated entries in a pool because the allocated counter was used as a cumulated counter of alloc calls instead of a number of currently allocated items (it's possible the meaning has changed over time). The only impact in this mode essentially is that "show pools" will report incorrect values. But this would only happen on limited pools, which is not even certain still exist. This was added by recent commit `0bae07592` ("MEDIUM: pools: add CONFIG_HAP_NO_GLOBAL_POOLS and CONFIG_HAP_GLOBAL_POOLS") so no backport is needed.	2021-04-19 15:24:33 +02:00
Tim Duesterhus	5be6ab269e	MEDIUM: http_act: Rename uri-normalizers This patch renames all existing uri-normalizers into a more consistent naming scheme: 1. The part of the URI that is being touched. 2. The modification being performed as an explicit verb.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	a407193376	MINOR: uri_normalizer: Add a `percent-upper` normalizer This normalizer uppercases the hexadecimal characters used in percent-encoding. See GitHub Issue #714.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	d7b89be30a	MINOR: uri_normalizer: Add a `sort-query` normalizer This normalizer sorts the `&` delimited query parameters by parameter name. See GitHub Issue #714.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	560e1a6352	MINOR: uri_normalizer: Add support for supressing leading `../` for dotdot normalizer This adds an option to supress `../` at the start of the resulting path.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	9982fc2bbd	MINOR: uri_normalizer: Add a `dotdot` normalizer to http-request normalize-uri This normalizer merges `../` path segments with the predecing segment, removing both the preceding segment and the `../`. Empty segments do not receive special treatment. The `merge-slashes` normalizer should be executed first. See GitHub Issue #714.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	d371e99d1c	MINOR: uri_normalizer: Add a `merge-slashes` normalizer to http-request normalize-uri This normalizer merges adjacent slashes into a single slash, thus removing empty path segments. See GitHub Issue #714.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	d2bedcc4ab	MINOR: uri_normalizer: Add `http-request normalize-uri` This patch adds the `http-request normalize-uri` action that was requested in GitHub issue #714. Normalizers will be added in the next patches.	2021-04-19 09:05:57 +02:00
Tim Duesterhus	dbd25c34de	MINOR: uri_normalizer: Add uri_normalizer module This is in preparation for future patches.	2021-04-19 09:05:57 +02:00
Christopher Faulet	1d26f22e05	BUG/MINOR: logs: Report the true number of retries if there was no connection When the session is aborted before any connection attempt to any server, the number of connection retries reported in the logs is wrong. It happens because when the retries counter is not strictly positive, we consider the max number of retries was reached and the backend retries value is used. It is obviously wrong when no connectioh was performed. In fact, at this stage, the retries counter is initialized to 0. But the backend stream-interface is in the INI state. Once it is set to SI_ST_REQ, the counter is set to the backend value. And it is the only possible state transition from INI state. Thus it is safe to rely on it to fix the bug. This patch must be backported to all stable versions.	2021-04-19 08:52:17 +02:00
Christopher Faulet	a7d6cf24fb	BUG/MINOR: http_htx: Remove BUG_ON() from http_get_stline() function The http_get_stline() was designed to be called from HTTP analyzers. Thus before any data forwarding. To prevent any invalid usage, two BUG_ON() statements were added. However, it is not a good idea because it is pretty hard to be sure no HTTP sample fetch will never be called outside the analyzers context. Especially because there is at least one possible area where it may happens. An HTTP sample fetch may be used inside the unique-id format string. On the normal case, it is generated in AN_REQ_HTTP_INNER analyzer. But if an error is reported too early, the id is generated when the log is emitted. So, it is safer to remove the BUG_ON() statements and consider the normal behavior is to return NULL if the first block is not a start-line. Of course, this means all calling functions must test the return value or be sure the start-line is really there. This patch must be backported as far as 2.0.	2021-04-19 08:51:22 +02:00
Christopher Faulet	003df1cff9	MINOR: tcp_samples: Be able to call bc_src/bc_dst from the health-checks The new L4 sample fetches used to get source and destination info of the backend connection may now be called from an health-check.	2021-04-19 08:31:05 +02:00
Christopher Faulet	7d081f02a4	MINOR: tcp_samples: Add samples to get src/dst info of the backend connection This patch adds 4 new sample fetches to get the source and the destination info (ip address and port) of the backend connection : * bc_dst : Returns the destination address of the backend connection * bc_dst_port : Returns the destination port of the backend connection * bc_src : Returns the source address of the backend connection * bc_src_port : Returns the source port of the backend connection The configuration manual was updated accordingly.	2021-04-19 08:31:05 +02:00
Christopher Faulet	6f97a611c8	BUG/MINOR: http-fetch: Make method smp safe if headers were already forwarded When method sample fetch is called, if an exotic method is found (HTTP_METH_OTHER), when smp_prefetch_htx() is called, we must be sure the start-line is still there. Otherwise, HAproxy may crash because of a NULL pointer dereference, for instance if the method sample fetch is used inside a unique-id format string. Indeed, the unique id may be generated when the log message is emitted. At this stage, the request channel is empty. This patch must be backported as far as 2.0. But the bug exists in all stable versions for the legacy HTTP mode too. Thus it must be adapted to the legacy HTTP mode and backported to all other stable versions.	2021-04-19 08:31:05 +02:00
Christopher Faulet	4bef8d1d46	BUG/MINOR: ssl-samples: Fix ssl_bc_* samples when called from a health-check For all ssl_bc_* sample fetches, the test on the keyword when called from a health-check is inverted. We must be sure the 5th charater is a 'b' to retrieve a connection. This patch must be backported as far as 2.2.	2021-04-19 08:31:05 +02:00
Christopher Faulet	242f8ce060	MINOR: connection: Make bc_http_major compatible with tcp-checks bc_http_major sample fetch now works when it is called from a tcp-check. When it happens, the session origin is a check. The backend connection is retrieved from the conn-stream attached to the check. If required, this path may easily be backported as far as 2.2.	2021-04-19 08:31:05 +02:00
Christopher Faulet	f4dd9ae5c7	BUG/MINOR: connection: Fix fc_http_major and bc_http_major for TCP connections fc_http_major and bc_http_major sample fetches return the major digit of the HTTP version used, respectively, by the frontend and the backend connections, based on the mux. However, in reality, "2" is returned if the H2 mux is detected, otherwise "1" is inconditionally returned, regardless the mux used. Thus, if called for a raw TCP connection, "1" is returned. To fix this bug, we now get the multiplexer flags, if there is one, to be sure MX_FL_HTX is set. I guess it was made this way on purpose when the H2 multiplexer was introduced in the 1.8 and with the legacy HTTP mode there is no other solution at the connection level. Thus this patch should be backported as far as 2.2. For the 2.0, it must be evaluated first because of the legacy HTTP mode.	2021-04-19 08:24:38 +02:00
Christopher Faulet	fd81848c22	MINOR: logs: Add support of checks as session origin to format lf strings When a log-format string is built from an health-check, the session origin is the health-check itself and not a connection. In addition, there is no stream. It means for now some formats are not supported: %s, %sc, %b, %bi, %bp, %si and %sp. Thanks to this patch, the session origin is converted to a check. So it is possible to retrieve the backend and the backend connection. Note this session have no listener, thus %ft format must be guarded. This patch is light and standalone, thus it may be backported as far as 2.2 if required. However, because the error is human, it is probably better to wait a bit to be sure everything is properly protected.	2021-04-19 08:22:15 +02:00
Christopher Faulet	0f1fc23d4e	BUG/MINOR: checks: Set missing id to the dummy checks frontend The dummy frontend used to create the session of the tcp-checks is initialized without identifier. However, it is required because this id may be used without any guard, for instance in log-format string via "%f" or when fe_name sample fetch is called. Thus, an unset id may lead to crashes. This patch must be backported as far as 2.2.	2021-04-17 11:14:58 +02:00
Christopher Faulet	76b44195c9	MINOR: threads: Only consider running threads to end a thread harmeless period When a thread ends its harmeless period, we must only consider running threads when testing threads_want_rdv_mask mask. To do so, we reintroduce all_threads_mask mask in the bitwise operation (It was removed to fix a deadlock). Note that for now it is useless because there is no way to stop threads or to have threads reserved for another task. But it is safer this way to avoid bugs in the future.	2021-04-17 11:14:58 +02:00
Alex	51c8ad45ce	MINOR: sample: converter: Add json_query converter With the json_query can a JSON value be extacted from a header or body of the request and saved to a variable. This converter makes it possible to handle some JSON workload to route requests to different backends.	2021-04-15 17:07:03 +02:00
Alex	41007a6835	MINOR: sample: converter: Add mjson library. This library is required for the subsequent patch which adds the JSON query possibility. It is necessary to change the include statement in "src/mjson.c" because the imported includes in haproxy are in "include/import" orig: #include "mjson.h" new: #include <import/mjson.h>	2021-04-15 17:05:38 +02:00
Moemen MHEDHBI	848216f108	CLEANUP: sample: align samples list in sample.c	2021-04-13 17:28:22 +02:00
Moemen MHEDHBI	92f7d43c5d	MINOR: sample: add ub64dec and ub64enc converters ub64dec and ub64enc are the base64url equivalent of b64dec and base64 converters. base64url encoding is the "URL and Filename Safe Alphabet" variant of base64 encoding. It is also used in in JWT (JSON Web Token) standard. RFC1421 mention in base64.c file is deprecated so it was replaced with RFC4648 to which existing converters, base64/b64dec, still apply. Example: HAProxy: http-request return content-type text/plain lf-string %[req.hdr(Authorization),word(2,.),ub64dec] Client: Token=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJ1c2VyIjoiZm9vIiwia2V5IjoiY2hhZTZBaFhhaTZlIn0.5VsVj7mdxVvo1wP5c0dVHnr-S_khnIdFkThqvwukmdg $ curl -H "Authorization: Bearer ${TOKEN}" http://haproxy.local {"user":"foo","key":"chae6AhXai6e"}	2021-04-13 17:28:13 +02:00
Thayne McCombs	b28430591d	BUG/MEDIUM: sample: Fix adjusting size in field converter Adjust the size of the sample buffer before we change the "area" pointer. The change in size is calculated as the difference between the original pointer and the new start pointer. But since the `smp->data.u.str.area` assignment results in `smp->data.u.str.area` and `start` being the same pointer, we always ended up substracting zero. This changes it to change the size by the actual amount it changed. I'm not entirely sure what the impact of this is, but the previous code seemed wrong. [wt: from what I can see the only harmful case is when the output is converted to a stick-table key, it could result in zeroing past the end of the buffer; other cases do not touch beyond ->data]	2021-04-13 12:12:48 +02:00
Christopher Faulet	b15625a43b	MINOR: cfgparse/proxy: Group alloc error handling during proxy section parsing All allocation errors in cfg_parse_listen() are now handled in a unique place under the "alloc_error" label. This simplify a bit error handling in this function.	2021-04-12 22:04:19 +02:00
Christopher Faulet	b45a7d4b74	BUG/MINOR: cfgparse/proxy: Hande allocation errors during proxy section parsing At several places during the proxy section parsing, memory allocation was performed with no check. Result is now tested and an error is returned if the allocation fails. This patch may be backported to all stable version but it only fixes allocation errors during configuration parsing. Thus, it is not mandatory.	2021-04-12 21:35:12 +02:00
Christopher Faulet	0c6d1dcf7d	BUG/MINOR: listener: Handle allocation error when allocating a new bind_conf Allocation error are now handled in bind_conf_alloc() functions. Thus callers, when not already done, are also updated to catch NULL return value. This patch may be backported (at least partially) to all stable versions. However, it only fix errors durung configuration parsing. Thus it is not mandatory.	2021-04-12 21:33:43 +02:00
Christopher Faulet	2e848a9b75	BUG/MINOR: cfgparse/proxy: Fix some leaks during proxy section parsing Allocated variables are now released when an error occurred during use_backend, use-server, force/ignore-parsing, stick-table, stick and stats directives parsing. For some of these directives, allocation errors have been added. This patch may be backported to all stable version but it only fixes leaks or allocation errors during configuration parsing. Thus, it is not mandatory. It should fix issue #1119.	2021-04-12 21:33:39 +02:00
Christopher Faulet	3a9a12bb2a	BUG/MINOR: hlua: Fix memory leaks on error path when registering a cli keyword When an error occurred in hlua_register_cli(), the allocated lua function and keyword must be released to avoid memory leaks. This patch depends on "MINOR: hlua: Add function to release a lua function". It may be backported in all stable versions.	2021-04-12 19:05:05 +02:00
Christopher Faulet	5c028d7f9d	BUG/MINOR: hlua: Fix memory leaks on error path when registering a service When an error occurred in hlua_register_service(), the allocated lua function and keyword must be released to avoid memory leaks. This patch depends on "MINOR: hlua: Add function to release a lua function". It may be backported in all stable versions.	2021-04-12 19:04:42 +02:00
Christopher Faulet	4fc9da01d2	BUG/MINOR: hlua: Fix memory leaks on error path when registering an action When an error occurred in hlua_register_action(), the allocated lua function and keyword must be released to avoid memory leaks. This patch depends on "MINOR: hlua: Add function to release a lua function". It may be backported in all stable versions.	2021-04-12 19:04:42 +02:00
Christopher Faulet	528526f2cc	BUG/MINOR: hlua: Fix memory leaks on error path when parsing a lua action hen an error occurred in action_register_lua(), the allocated hlua rule and arguments must be released to avoid memory leaks. This patch may be backported in all stable versions.	2021-04-12 19:04:42 +02:00
Christopher Faulet	2567f18382	BUG/MINOR: hlua: Fix memory leaks on error path when registering a fetch When an error occurred in hlua_register_fetches(), the allocated lua function and keyword must be released to avoid memory leaks. This patch depends on "MINOR: hlua: Add function to release a lua function". It may be backported in all stable versions. It should fix #1112.	2021-04-12 19:04:42 +02:00
Christopher Faulet	aa22430bba	BUG/MINOR: hlua: Fix memory leaks on error path when registering a converter When an error occurred in hlua_register_converters(), the allocated lua function and keyword must be released to avoid memory leaks. This patch depends on "MINOR: hlua: Add function to release a lua function". It may be backported in all stable versions.	2021-04-12 19:04:42 +02:00
Christopher Faulet	5294ec0708	BUG/MINOR: hlua: Fix memory leaks on error path when registering a task When an error occurred in hlua_register_task(), the allocated lua context and task must be released to avoid memory leaks. This patch may be backported in all stable versions.	2021-04-12 19:04:42 +02:00
Christopher Faulet	dda44442d5	MINOR: hlua: Add function to release a lua function release_hlua_function() must be used to release a lua function. Some fixes depends on this function.	2021-04-12 15:46:53 +02:00
Christopher Faulet	147b8c919c	MINOIR: checks/trace: Register a new trace source with its events Add the trace support for the checks. Only tcp-check based health-checks are supported, including the agent-check. In traces, the first argument is always a check object. So it is easy to get all info related to the check. The tcp-check ruleset, the conn-stream and the connection, the server state...	2021-04-12 12:09:36 +02:00
Christopher Faulet	6d80b63e3c	MINOR: trace: Add the checks as a possible trace source To be able to add the trace support for the checks, a new kind of source must be added for this purpose.	2021-04-12 12:09:36 +02:00
Willy Tarreau	44982715ba	MEDIUM: time: make the clock offset global and no per-thread Since 1.8 for simplicity the time offset used to compensate for time drift and jumps had been stored per thread. But with a global time, the complexit has significantly increased. What this patch does in order to address this is to get back to the origins of the pre-thread time drift correction, and keep a single offset between the system's date and the current global date. The thread first verifies from the before_poll date if the time jumped backwards or forward, then either fixes it by computing the new most likely date, or applies the current offset to this latest system date. In the first case, if the date is out of range, the old one is reused with the max_wait offset or not depending on the interrupted flag. Then it compares its date to the global date and updates both so that both remain monotonic and that the local date always reflects the latest known global date. In order to support atomic updates to the offset, it's saved as a ullong which contains both the tv_sec and tv_usec parts in its high and low words. Note that a part of the patch comes from the inlining of the equivalent of tv_add applied to the offset to make sure that signed ints are permitted (otherwise it depends on how timeval is defined). This is significantly more reliable than the previous model as the global time should move in a much smoother way, and not according to what thread last updated it, and the thread-local time should always be very close to the global one. Note that (at least for debugging) a cheap way to measure processing lag would consist in measuring the difference between global_now_ms and now_ms, as long as other threads keep it up-to-date.	2021-04-11 23:59:37 +02:00
Willy Tarreau	7e4a557f64	MINOR: time: change the global timeval and the the global tick at once Instead of using two CAS loops, better compute the two units simultaneously and update them at once. There is no guarantee that the update will be synchronous, but we don't care, what matters is that both are monotonically updated and that global_now_ms always follows the last known value of global_now.	2021-04-11 23:47:54 +02:00
Willy Tarreau	70cb3026a8	MINOR: time: remove useless variable copies in tv_update_date() In the global_now loop, we used to set tmp_adj from adjusted, then set update it from tmp_now, then set adjusted back to tmp_adj, and finally set now from adjusted. This is a long and unneeded set of moves resulting from years of code changes. Let's just set now directly in the loop, stop using adjusted and remove tmp_adj.	2021-04-11 23:47:01 +02:00
Willy Tarreau	c4c80fb4ea	MINOR: time: move the time initialization out of tv_update_date() The time initialization was made a bit complex because we rely on a dummy negative argument to reset all fields, leaving no distinction between process-level initialization and thread-level initialization. This patch changes this by introducing two functions, one for the process and the second one for the threads. This removes ambigous test and makes sure that the relevant fields are always initialized exactly once. This also offers a better solution to the bug fixed in commit `b48e7c001` ("BUG/MEDIUM: time: make sure to always initialize the global tick") as there is no more special values for global_now_ms. It's simple enough to be backported if any other time-related issues are encountered in stable versions in the future.	2021-04-11 23:45:48 +02:00
Willy Tarreau	61c72c366e	CLEANUP: time: remove the now unused ms_left_scaled It was only used by freq_ctr and is not used anymore. In addition the local curr_sec_ms was removed, as well as the equivalent extern definitions which did not exist anymore either.	2021-04-11 14:01:53 +02:00
Willy Tarreau	fc6323ad82	MEDIUM: freq_ctr: replace the per-second counters with the generic ones It remains cumbersome to preserve two versions of the freq counters and two different internal clocks just for this. In addition, the savings from using two different mechanisms are not that important as the only saving is a divide that is replaced by a multiply, but now thanks to the freq_ctr_total() unificaiton the code could also be simplified to optimize it in case of constants. This patch turns all non-period freq_ctr functions to static inlines which call the period-based ones with a period of 1 second. A direct benefit is that a single internal clock is now needed for any counter and that they now all rely on ticks. These 1-second counters are essentially used to report request rates and to enforce a connection rate limitation in listeners. It was verified that these continue to work like before.	2021-04-11 11:12:55 +02:00
Willy Tarreau	fa1258f02c	MINOR: freq_ctr: unify freq_ctr and freq_ctr_period into freq_ctr Both structures are identical except the name of the field starting the period and its description. Let's call them all freq_ctr and the period's start "curr_tick" which is generic. This is only a temporary change and fields are expected to remain the same with no code change (verified).	2021-04-11 11:11:27 +02:00
Willy Tarreau	607be24a85	MEDIUM: freq_ctr: reimplement freq_ctr_remain_period() from freq_ctr_total() Now the function becomes an inline one and only contains a divide and a max. The divide will automatically go away with constant periods.	2021-04-11 11:11:03 +02:00
Willy Tarreau	a7a31b2602	MEDIUM: freq_ctr: make read_freq_ctr_period() use freq_ctr_total() This one is the easiest to implement, it just requires a call and a divide of the result. Anti-flapping correction for low-rates was preserved. Now calls using a constant period will be able to use a reciprocal multiply for the period instead of a divide.	2021-04-11 11:11:03 +02:00
Willy Tarreau	f3a9f8dc5a	MINOR: freq_ctr: add a generic function to report the total value Most of the functions designed to read a counter over a period go through the same complex loop and only differ in the way they use the returned values, so it was worth implementing all this into freq_ctr_total() which returns the total number of events over a period so that the caller can finish its operation using a divide or a remaining time calculation. As a special case, read_freq_ctr_period() doesn't take pending events but requires to enable an anti-flapping correction at very low frequencies. Thus the function implements it when pend<0. Thanks to this function it will be possible to reimplement the other ones as inline and merge the per-second ones with the arbitrary period ones without always adding the cost of a 64 bit divide.	2021-04-11 11:10:57 +02:00
Willy Tarreau	6eb3d37bf4	MINOR: trace: make trace sources read_mostly The trace sources are checked at plenty of places in the code and their contents only change when trace status changes, let's mark them read_mostly.	2021-04-10 19:29:26 +02:00
Willy Tarreau	295a89c029	MINOR: pattern: make the pat_lru_seed read_mostly This seed is created once at boot and is used in every LRU hash when caching results. Let's mark it read_mostly.	2021-04-10 19:27:41 +02:00
Willy Tarreau	ad6722ea3a	MINOR: protocol: move __protocol_by_family to read_mostly This one is used for each outgoing connection and never changes after boot, move it to read_mostly.	2021-04-10 19:27:41 +02:00
Willy Tarreau	14015b8880	MINOR: server: move idle_conn_task to read_mostly This pointer is used when adding connections to the idle list and is never changed, let's move it to the read_mostly section.	2021-04-10 19:27:41 +02:00
Willy Tarreau	56c3b8b4e8	MINOR: threads: mark all_threads_mask as read_mostly This variable almost never changes and is read a lot in time-critical sections. threads_want_rdv_mask is read very often as well in thread_harmless_end() and is almost never changed (only when someone uses thread_isolate()). Let's move both to read_mostly.	2021-04-10 19:27:41 +02:00
Willy Tarreau	ff88270ef9	MINOR: pool: move pool declarations to read_mostly All pool heads are accessed via a pointer and should not be shared with highly written variables. Move them to the read_mostly section.	2021-04-10 19:27:41 +02:00
Willy Tarreau	8209c9aa18	MINOR: kqueue: move kqueue_fd to read_mostly This one only contains the list of per-thread kqueue FDs, and is used a lot during updates. Let's mark it read_mostly to avoid false sharing of FDs placed at the extremities.	2021-04-10 19:27:41 +02:00
Willy Tarreau	26d212c744	MINOR: epoll: move epoll_fd to read_mostly This one only contains the list of per-thread epoll FDs, and is used a lot during updates. Let's mark it read_mostly to avoid false sharing of FDs placed at the extremities.	2021-04-10 19:27:41 +02:00
Willy Tarreau	a1090a5b61	MINOR: fd: move a few read-mostly variables to their own section Some pointer to arrays such as fdtab, fdinfo, polled_mask etc are never written to at run time but are used a lot. fdtab accesses appear a lot in perf top because ha_used_fds is in the same cache line and is modified all the time. This patch moves all these read-mostly variables to the read_mostly section when defined. This way their cache lines will be able to remain in shared state in all CPU caches.	2021-04-10 19:27:41 +02:00
Willy Tarreau	f459640ef6	MINOR: global: declare a read_mostly section Some variables are mostly read (mostly pointers) but they tend to be merged with other ones in the same cache line, slowing their access down in multi-thread setups. This patch declares an empty, aligned variable in a section called "read_mostly". This will force a cache-line alignment on this section so that any variable declared in it will be certain to avoid false sharing with other ones. The section will be eliminated at link time if not used. A __read_mostly attribute was added to compiler.h to ease use of this section.	2021-04-10 19:27:41 +02:00
Willy Tarreau	9057a0026e	CLEANUP: pattern: make all pattern tables read-only Interestingly, all arrays used to declare patterns were read-write while only hard-coded. Let's mark them const so that they move from data to rodata and don't risk to experience false sharing.	2021-04-10 17:49:41 +02:00
Christopher Faulet	e2c65ba344	BUG/MINOR: mux-pt: Fix a possible UAF because of traces in mux_pt_io_cb In mux_pt_io_cb(), if a connection error or a shutdown is detected, the mux is destroyed. Thus we must be careful to not use it in a trace message once destroyed. No backport needed. This patch should fix the issue #1220.	2021-04-10 09:02:36 +02:00
Christopher Faulet	c0ae097b95	MINOIR: mux-pt/trace: Register a new trace source with its events As for the other muxes, traces are now supported in the pt mux. All parts of the multiplexer is covered by these traces. Events are splitted by categories (connection, stream, rx and tx). In traces, the first argument is always a connection. So it is easy to get the mux context (conn->ctx). The second argument is always a conn-stream and mau be NUUL. The third one is a buffer and it may also be NULL. Depending on the context it is the request or the response. In all cases it is owned by a channel. Finally, the fourth argument is an integer value. Its meaning depends on the calling context.	2021-04-09 17:46:58 +02:00
Tim Duesterhus	403fd722ac	CLEANUP: Remove useless malloc() casts This is not C++.	2021-04-08 20:11:58 +02:00
Tim Duesterhus	b8ee894b66	CLEANUP: htx: Make http_get_stline take a `const struct` Nothing is being modified there, so this can be `const`.	2021-04-08 19:40:59 +02:00
Emeric Brun	c8f3e45c6a	MEDIUM: resolvers: add support of tcp address on nameserver line. This patch re-works configuration parsing, it removes the "server" lines from "resolvers" sections introduced in commit `56fc5d9eb`: MEDIUM: resolvers: add supports of TCP nameservers in resolvers. It also extends the nameserver lines to support stream server addresses such as: resolvers nameserver localhost tcp@127.0.0.1:53 Doing so, a part of nameserver's init code was factorized in function 'parse_resolvers' and removed from 'post_parse_resolvers'.	2021-04-08 14:20:40 +02:00
Willy Tarreau	4781b1521a	CLEANUP: atomic/tree-wide: replace single increments/decrements with inc/dec This patch replaces roughly all occurrences of an HA_ATOMIC_ADD(&foo, 1) or HA_ATOMIC_SUB(&foo, 1) with the equivalent HA_ATOMIC_INC(&foo) and HA_ATOMIC_DEC(&foo) respectively. These are 507 changes over 45 files.	2021-04-07 18:18:37 +02:00
Willy Tarreau	185157201c	CLEANUP: atomic: add a fetch-and-xxx variant for common operations The fetch_and_xxx variant is often missing for add/sub/and/or. In fact it was only provided for ADD under the name XADD which corresponds to the x86 instruction name. But for destructive operations like AND and OR it's missing even more as it's not possible to know the value before modifying it. This patch explicitly adds HA_ATOMIC_FETCH_{OR,AND,ADD,SUB} which cover these standard operations, and renames XADD to FETCH_ADD (there were only 6 call places). In the future, backport of fixes involving such operations could simply remap FETCH_ADD(x) to XADD(x), FETCH_SUB(x) to XADD(-x), and for the OR/AND if needed, these could possibly be done using BTS/BTR. It's worth noting that xchg could have been renamed to fetch_and_store() but xchg already has well understood semantics and it wasn't needed to go further.	2021-04-07 18:18:37 +02:00
Willy Tarreau	1db427399c	CLEANUP: atomic: add an explicit _FETCH variant for add/sub/and/or Currently our atomic ops return a value but it's never known whether the fetch is done before or after the operation, which causes some confusion each time the value is desired. Let's create an explicit variant of these operations suffixed with _FETCH to explicitly mention that the fetch occurs after the operation, and make use of it at the few call places.	2021-04-07 18:18:37 +02:00
Willy Tarreau	184b21259b	MINOR: cli/show-fd: slightly reorganize the FD status flags Slightly reorder the status flags to better match their order in the "state" field, and also decode the "shut" state which is particularly useful and already part of this field.	2021-04-07 18:18:37 +02:00
Willy Tarreau	1673c4a883	MINOR: fd: implement an exclusive syscall bit to remove the ugly "log" lock There is a function called fd_write_frag_line() that's essentially used by loggers and that is used to write an atomic message line over a file descriptor using writev(). However a lock is required around the writev() call to prevent messages from multiple threads from being interleaved. Till now a SPIN_TRYLOCK was used on a dedicated lock that was common to all FDs. This is quite not pretty as if there are multiple output pipes to collect logs, there will be quite some contention. Now that there are empty flags left in the FD state and that we can finally use atomic ops on them, let's add a flag to indicate the FD is locked for exclusive access by a syscall. At least the locking will now be on an FD basis and not the whole process, so we can remove the log_lock.	2021-04-07 18:18:37 +02:00
Willy Tarreau	9063a660cc	MINOR: fd: move .exported into fdtab[].state No need to keep this flag apart any more, let's merge it into the global state.	2021-04-07 18:10:36 +02:00
Willy Tarreau	5362bc9044	MINOR: fd: move .et_possible into fdtab[].state No need to keep this flag apart any more, let's merge it into the global state.	2021-04-07 18:09:43 +02:00
Willy Tarreau	0cc612818d	MINOR: fd: move .initialized into fdtab[].state No need to keep this flag apart any more, let's merge it into the global state. The bit was not cleared in fd_insert() because the only user is the function used to create and atomically send a log message to a pipe FD, which never registers the fd. Here we clear it nevertheless for the sake of clarity. Note that with an extra cleaning pass we could have a bit number here and simply use a BTS to test and set it.	2021-04-07 18:09:08 +02:00
Willy Tarreau	030dae13a0	MINOR: fd: move .cloned into fdtab[].state No need to keep this flag apart any more, let's merge it into the global state.	2021-04-07 18:08:29 +02:00
Willy Tarreau	b41a6e9101	MINOR: fd: move .linger_risk into fdtab[].state No need to keep this flag apart any more, let's merge it into the global state. The CLI's output state was extended to 6 digits and the linger/cloned flags moved inside the parenthesis.	2021-04-07 18:07:49 +02:00
Willy Tarreau	f509065191	MEDIUM: fd: merge fdtab[].ev and state for FD_EV_* and FD_POLL_* into state For a long time we've had fdtab[].ev and fdtab[].state which contain two arbitrary sets of information, one is mostly the configuration plus some shutdown reports and the other one is the latest polling status report which also contains some sticky error and shutdown reports. These ones used to be stored into distinct chars, complicating certain operations and not even allowing to clearly see concurrent accesses (e.g. fd_delete_orphan() would set the state to zero while fd_insert() would only set the event to zero). This patch creates a single uint with the two sets in it, still delimited at the byte level for better readability. The original FD_EV_* values remained at the lowest bit levels as they are also known by their bit value. The next step will consist in merging the remaining bits into it. The whole bits are now cleared both in fd_insert() and _fd_delete_orphan() because after a complete check, it is certain that in both cases these functions are the only ones touching these areas. Indeed, for _fd_delete_orphan(), the thread_mask has already been zeroed before a poller can call fd_update_event() which would touch the state, so it is certain that _fd_delete_orphan() is alone. Regarding fd_insert(), only one thread will get an FD at any moment, and it as this FD has already been released by _fd_delete_orphan() by definition it is certain that previous users have definitely stopped touching it. Strictly speaking there's no need for clearing the state again in fd_insert() but it's cheap and will remove some doubts during some troubleshooting sessions.	2021-04-07 18:04:39 +02:00
Willy Tarreau	8d27c203ed	MEDIUM: fd: prepare FD_POLL_* to move to bits 8-15 In preparation of merging FD_POLL* and FD_EV, this only changes the value of FD_POLL_ to use bits 8-15 (the second byte). The size of the field has been temporarily extended to 32 bits already, as well as the temporary variables that carry the new composite value inside fd_update_events(). The resulting fdtab entry becomes temporarily unaligned. All places making access to .ev or FD_POLL_* were carefully inspected to make sure they were safe regarding this change. Only one temporary update was needed for the "show fd" code. The code was only slightly inflated at this step.	2021-04-07 15:08:40 +02:00
Emeric Brun	26754901e9	BUG/MEDIUM: log: fix config parse error logging on stdout/stderr or any raw fd The regression was introduced by commit previous commit `94aab06`: MEDIUM: log: support tcp or stream addresses on log lines. This previous patch tries to retrieve the used protocol parsing the address using the str2sa_range function but forgets that the raw file descriptor adresses don't specify a protocol and str2sa_range probes an error. This patch re-work the str2sa_range function to stop probing error if an authorized RAW_FD address is parsed whereas the caller request also a protocol. It also modify the code of parse_logsrv to switch on stream logservers only if a protocol was detected.	2021-04-07 15:01:00 +02:00
Emeric Brun	94aab06e24	MEDIUM: log: support tcp or stream addresses on log lines. An explicit stream address prefix such as "tcp6@" "tcp4@" "stream+ipv6@" "stream+ipv4@" or "stream+unix@" will allocate an implicit ring buffer with a forward server targeting the given address. This is usefull to simply send logs to a log server in tcp and It doesn't need to declare a ring section in configuration.	2021-04-07 09:18:34 +02:00
Emeric Brun	9533a70381	MINOR: log: register config file and line number on log servers. This patch registers the parsed file and the line where a log server is declared to make those information available in configuration post check. Those new informations were added on error messages probed resolving ring names on post configuration check.	2021-04-07 09:18:34 +02:00
Emeric Brun	ce325c4360	MINOR: server/bind: add support of new prefixes for addresses. Since the internal function str2sa_range is used to addresses for different objects ('server', 'bind' but also 'log' or 'nameserver') we notice that some combinations are missing. "ip@" is introduced to authorize the prefix "dgram+ip@" or "stream+ip@" which dectects automatically IP version but specify dgram or stream. "tcp@" was introduced and is an alias for "stream+ip@". "tcp6" and "tcp4" are now aliases for "stream+ipv6@" and "stream+ipv4@". "uxst@" and "uxdg@" are now aliases for "stream+unix@" and "dgram+unix@". This patch also adds a complete section in documentation to describe adresses and their prefixes.	2021-04-07 09:18:32 +02:00
Thayne McCombs	a68380524b	BUG/MINOR: tools: fix parsing "us" unit for timers Commit `c20ad0d8db` (BUG/MINOR: tools: make parse_time_err() more strict on the timer validity) broke parsing the "us" unit in timers. It caused `parse_time_err()` to return the string "s", which indicates an error. Now if the "u" is followed by an "s" we properly continue processing the time instead of immediately failing. This fixes #1209. It must be backported to all stable versions.	2021-04-06 07:31:51 +02:00
Christopher Faulet	eccb31c939	BUG/MINOR: hlua: Detect end of request when reading data for an HTTP applet When a script retrieves request data from an HTTP applet, line per line or not, we must be sure to properly detect the end of the request by checking HTX_FL_EOM flag when everything was consumed. Otherwise, the script may hang. It is pretty easy to reproduce the bug by calling applet:receive() without specifying any length. If the request is not chunked, the function never returns. The bug was introduced when the EOM block was removed. Thus, it is specific to the 2.4. This patch should fix the issue #1207. No backport needed.	2021-04-06 07:31:51 +02:00
Christopher Faulet	8043e831d1	MINOR: acl: Add HTTP_2.0 predefined macro HTTP_2.0 predefined macro returns true for HTTP/2 requests. HTTP/2 doen't convey a version information, so this macro may seem a bit strange. But for compatiblity reasons, internally, the "HTTP/2.0" version is set. Thus, it is handy to rely on it to differenciate HTTP/1 and HTTP/2 requests.	2021-04-06 07:31:51 +02:00
Christopher Faulet	779184e35e	MINOR: No longer rely on deprecated sample fetches for predefined ACLs Some predefined ACLs were still based on deprecated sample fetches, like req_proto_http or req_ver. Now, they use non-deprecated sample fetches. In addition, the usage lines in the configuration manual have been updated to be more explicit.	2021-04-05 17:21:05 +02:00
Willy Tarreau	57610c694e	CONTRIB: move src/wurfl.c and contrib/wurfl to addons/wurfl Both the source file and the dummy library are now at the same place. Maybe the build howto could be moved there as well to make things even cleaner. The Makefile, MAINTAINERS, doc, and vtest matrix were updated.	2021-04-02 17:48:42 +02:00
Willy Tarreau	f8d9ec57f0	CONTRIB: move src/da.c and contrib/deviceatlas to addons/deviceatlas Both the source file and the dummy library are now at the same place. Maybe the build howto could be moved there as well to make things even cleaner. The Makefile, MAINTAINERS, doc, github build matrix, coverity checks and travis CI's build were updated.	2021-04-02 17:48:42 +02:00
Willy Tarreau	977209d1d8	CONTRIB: move 51Degrees to addons/51degrees Now it's much cleaner, both 51d.c and the dummy library live together and are easier to spot and maintain. The build howto probably ought to be moved there as well. Makefile, docs and MAINTAINERS were updated, as well as the github CI's build matrix, travis CI's, and coverity checks.	2021-04-02 17:48:42 +02:00
Willy Tarreau	074ebcde29	CONTRIB: move some dev-specific tools to dev/ The following directories were moved from contrib/ to dev/ to make their use case a bit clearer. In short, only developers are expected to ever go there. The makefile was updated to build and clean from these ones. base64/ flags/ hpack/ plug_qdisc/ poll/ tcploop/ trace/	2021-04-02 17:48:42 +02:00
Amaury Denoyelle	728be0f437	MINOR: config: diag if global section after non-global Detect if a global section is present after another section and reports a diagnostic about it.	2021-04-01 18:03:37 +02:00
Amaury Denoyelle	de2fab55aa	MINOR: diag: diag if servers use the same cookie value Add a diagnostic to check that two servers of the same backend does not use the same cookie value. Ignore backup servers as it is quite common for them to share a cookie value with a primary one.	2021-04-01 18:03:37 +02:00
Amaury Denoyelle	5a6926dcf0	MINOR: diag: create cfgdiag module This module is intended to serve as a placeholder for various diagnostics executed after the configuration file has been fully loaded.	2021-04-01 18:03:37 +02:00
Amaury Denoyelle	da0e7f61e0	MINOR: server: diag for 0 weight server Output a diagnostic report if a server has been configured with a null weight.	2021-04-01 18:03:37 +02:00
Amaury Denoyelle	c4d47d609a	MINOR: cfgparse: diag for multiple nbthread statements Output a diagnostic report if the nbthread statement is defined on several places in the configuration.	2021-04-01 18:03:37 +02:00
Amaury Denoyelle	7b01a8dbdd	MINOR: global: define diagnostic mode of execution Define MODE_DIAG which is used to run haproxy in diagnostic mode. This mode is used to output extra warnings about possible configuration blunder or sub-optimal usage. It can be activated with argument '-dD'. A new output function ha_diag_warning is implemented reserved for diagnostic output. It serves to standardize the format of diagnostic messages. A macro HA_DIAG_WARN_COND is also available to automatically check if diagnostic mode is on before executing the diagnostic check.	2021-04-01 18:03:37 +02:00
Willy Tarreau	374edc70ba	CLEANUP: vars: always pre-initialize smp in vars_parse_cli_get_var() In issue #1200 Coverity believes we may use an uninitialized field smp.sess here while it's not possible because the returned variable necessarily matches SCOPE_PROC hence smp.sess is not used. But it cannot see this and it could be confusing if the code later evolved into something more complex. That's not a critical path so let's first reset the sample.	2021-04-01 17:04:17 +02:00
Christopher Faulet	09f88364b7	BUG/MINOR: http-fetch: Fix test on message state to capture the version A bug was introduced when the legacy HTTP mode was removed. To capture the HTTP version of the request or the response, we rely on the message state to be sure the status line was received. However, the test is inverted. The version can be captured if message headers were received, not the opposite. This patch must be backported as far as 2.2.	2021-04-01 16:45:40 +02:00
Christopher Faulet	021a8e4d7b	MEDIUM: http-rules: Add wait-for-body action on request and response side Historically, an option was added to wait for the request payload (option http-buffer-request). This option has 2 drawbacks. First, it is an ON/OFF option for the whole proxy. It cannot be enabled on demand depending on the message. Then, as its name suggests, it only works on the request side. The only option to wait for the response payload was to write a dedicated filter. While it is an acceptable solution for complex applications, it is a bit overkill to simply match strings in the body. To make everyone happy, this patch adds a dedicated HTTP action to wait for the message payload, for the request or the response depending it is used in an http-request or an http-response ruleset. The time to wait is configurable and, optionally, the minimum payload size to have before stop to wait. Both the http action and the old http analyzer rely on the same internal function.	2021-04-01 16:27:40 +02:00
Christopher Faulet	581db2b829	MINOR: payload/config: Warn if a L6 sample fetch is used from an HTTP proxy L6 sample fetches are now ignored when called from an HTTP proxy. Thus, a warning is emitted during the startup if such usage is detected. It is true for most ACLs and for log-format strings. Unfortunately, it is a bit painful to do so for sample expressions. This patch relies on the commit "MINOR: action: Use a generic function to check validity of an action rule list".	2021-04-01 15:34:22 +02:00
Christopher Faulet	42c6cf9501	MINOR: action: Use a generic function to check validity of an action rule list The check_action_rules() function is now used to check the validity of an action rule list. It is used from check_config_validity() function to check L5/6/7 rulesets.	2021-04-01 15:34:22 +02:00
Christopher Faulet	2e96194d00	MINOR: htx: Make internal.strm.is_htx an internal sample fetch It is not really a context-less sample fetch, but it is internal. And it only fails if no stream is attached to the sample. This way, it is still possible to use it on an HTTP proxy (L6 sample fetches are ignored now for HTTP proxies). If the commit "BUG/MINOR: payload/htx: Ingore L6 sample fetches for HTX streams/checks" is backported, it may be a good idea to backport this one too. But only as far as 2.2.	2021-04-01 15:34:22 +02:00
Christopher Faulet	a434a00864	BUG/MINOR: payload/htx: Ingore L6 sample fetches for HTX streams/checks Use a L6 sample fetch on an HTX streams or a HTX health-check is meaningless because data are not raw but structured. So now, these sample fetches fail when called from an HTTP proxy. In addition, a warning has been added in the configuration manual, at the begining of the L6 sample fetches section. Note that req.len and res.len samples return the HTX data size instead of failing. It is not accurate because it does not reflect the buffer size nor the raw data length. But we keep it for backward compatibility purpose. However it remains a bit strange to use it on an HTTP proxy. This patch may be backported to all versions supporting the HTX, i.e as far as 2.0. But the part about the health-checks is only valid for the 2.2 and upper.	2021-04-01 15:31:55 +02:00
Christopher Faulet	5eef0189c7	MINOR: config/proxy: Warn if a TCP proxy without backend is upgradable to HTTP If a 'switch-mode http' tcp action is configured on a listener with no backend, a warning is displayed to remember HTTP connections cannot be routed to TCP servers. Indeed, backend connection is still established using the proxy mode.	2021-04-01 13:24:34 +02:00
Christopher Faulet	3b6446f4d9	MINOR: config/proxy: Don't warn for HTTP rules in TCP if 'switch-mode http' set Warnings about ignored HTTP directives in a TCP proxy are inhibited if at least one switch-mode tcp action is configured to perform HTTP upgraded.	2021-04-01 13:22:42 +02:00
Christopher Faulet	ae863c62e3	MEDIUM: Add tcp-request switch-mode action to perform HTTP upgrade It is now possible to perform HTTP upgrades on a TCP stream from the frontend side. To do so, a tcp-request content rule must be defined with the switch-mode action, specifying the mode (for now, only http is supported) and optionnaly the proto (h1 or h2). This way it could be possible to set HTTP directives on a TCP frontend which will only be evaluated if an upgrade is performed. This new way to perform HTTP upgrades should replace progressively the old way, consisting to route the request to an HTTP backend. And it should be also a good start to remove all HTTP processing from tcp-request content rules. This action is terminal, it stops the ruleset evaluation. It is only available on proxy with the frontend capability. The configuration manual has been updated accordingly.	2021-04-01 13:17:19 +02:00
Christopher Faulet	6c1fd987f6	MINOR: stream: Handle stream HTTP upgrade in a dedicated function The code responsible to perform an HTTP upgrade from a TCP stream is moved in a dedicated function, stream_set_http_mode(). The stream_set_backend() function is slightly updated, especially to correctly set the request analysers.	2021-04-01 11:06:48 +02:00
Christopher Faulet	75f619ad92	MINOR: http-ana: Simplify creation/destruction of HTTP transactions Now allocation and initialization of HTTP transactions are performed in a unique function. Historically, there were two functions because the same TXN was reset for K/A connections in the legacy HTTP mode. Now, in HTX, K/A connections are handled at the mux level. A new stream, and thus a new TXN, is created for each request. In addition, the function responsible to end the TXN is now also reponsible to release it. So, now, http_create_txn() and http_destroy_txn() must be used to create and destroy an HTTP transaction.	2021-04-01 11:06:48 +02:00
Christopher Faulet	c2ac5e4f27	MINOR: filters/http-ana: Decide to filter HTTP headers in HTTP analysers It is just a small cleanup. AN_REQ_FLT_HTTP_HDRS and AN_RES_FLT_HTTP_HDRS analysers are now set in HTTP analysers at the same place AN_REQ_HTTP_XFER_BODY and AN_RES_HTTP_XFER_BODY are set.	2021-04-01 11:06:48 +02:00
Christopher Faulet	1bb6afa35d	MINOR: stream: Use stream type instead of proxy mode when appropriate We now use the stream instead of the proxy to know if we are processing HTTP data or not. If the stream is an HTX stream, it means we are dealing with HTTP data. It is more accurate than the proxy mode because when an HTTP upgrade is performed, the proxy is not changed and only the stream may be used. Note that it was not a problem to rely on the proxy because HTTP upgrades may only happen when an HTTP backend was set. But, we will add the support of HTTP upgrades on the frontend side, after te tcp-request rules evaluation. In this context, we cannot rely on the proxy mode.	2021-04-01 11:06:48 +02:00
Christopher Faulet	28da3f5131	MEDIUM: mux-pt: Expose passthrough in the list of supported mux protocols Add "none" in the list of supported mux protocols. It relies on the passthrough multiplexer and use almost the same mux_ops structure. Only the flags differ because this "new" mux does not support the upgrades. "none" was chosen to explicitly stated there is not processing at the mux level. Thus it is now possible to set "proto none" or "check-proto none" on bind/server lines, depending on the context. However, when set, no upgrade to HTTP is performed. It may be a way to disable HTTP upgrades per bind line.	2021-04-01 11:06:48 +02:00
Christopher Faulet	3f612f7e4d	MEDIUM: mux-h1: Expose h1 in the list of supported mux protocols Add "h1" in the list of supported mux protocols. It relies on the H1 multiplexer and use the almost the same mux_ops structure. Only the flags differ because this "new" mux does not support the upgrades. Thus it is now possible to set "proto h1" or "check-proto h1" on bind/server lines, depending on the context. However, when set, no upgrade to HTTP/2 is performed. It may be a way to disable implicit HTTP/2 upgrades per bind line.	2021-04-01 11:06:47 +02:00
Christopher Faulet	7a9e362b90	MINOR: mux-pt: Don't perform implicit HTTP upgrade if not supported by mux For now this tests is useless, but if the PT muliplexer is flagged to explicitly not support the upgrades to HTTP, an error is returned.	2021-04-01 11:06:47 +02:00
Christopher Faulet	143e9e5888	MINOR: mux-h1: Don't perform implicit HTTP/2 upgrade if not supported by mux For now this tests is useless, but if the H1 muliplexer is flagged to explicitly not support the upgrades to HTTP/2, an error is returned.	2021-04-01 11:06:47 +02:00
Christopher Faulet	a460057f2e	MINOR: muxes: Add a flag to notify a mux does not support any upgrade MX_FL_NO_UPG flag may now be set on a multiplexer to explicitly disable upgrades from this mux. For now, it is set on the FCGI multiplexer because it is not supported and there is no upgrade on backend-only multiplexers. It is also set on the H2 multiplexer because it is clearly not supported.	2021-04-01 11:06:47 +02:00
Christopher Faulet	bb7abede93	BUG/MINOR: config: Add warning for http-after-response rules in TCP mode No warning is emitted if some http-after-response rules are configured on a TCP proxy while such warning messages are emitted for other HTTP ruleset in same condition. It is just an oversight. This patch may be backported as far as 2.2.	2021-04-01 11:06:47 +02:00
Christopher Faulet	97b3a61449	BUG/MINOR: stream: Properly handle TCP>H1>H2 upgrades in http_wait_for_request When a TCP stream is first upgraded to H1 and then to H2, we must be sure to inhibit any connect and to properly handle the TCP stream destruction. When the TCP stream is upgraded to H1, the HTTP analysers are set. Thus http_wait_for_request() is called. In this case, the server connection must be blocked, waiting for the request analysis. Otherwise, a server may be assigned to the stream too early. It is especially a problem if the stream is finally destroyed because of an implicit upgrade to H2. In this case, the stream processing must be properly aborted to not have a stalled stream. Thus, if a shutdown is detected in http_wait_for_request() when an HTTP upgrade is performed, the stream is aborted. It is a 2.4-specific bug. No backport is needed.	2021-04-01 11:06:47 +02:00
Christopher Faulet	57e4a1bf44	MINOR: stream: Be sure to set HTTP analysers when creating an HTX stream Always set frontend HTTP analysers when an HTX stream is created. It is only useful in case a destructive HTTP upgrades (TCP>H2) because the frontend is a TCP proxy. In fact, to be strict, we must only set these analysers when the upgrade is performed before setting the backend (it is not supported yet, but this patch is required to do so), in the frontend part. If the upgrade happens when the backend is set, it means the HTTP processing is just the backend buisness. But there is no way to make the difference when a stream is created, at least for now.	2021-04-01 11:06:47 +02:00
Christopher Faulet	e13ee703d2	MINOR: frontend: Create HTTP txn for HTX streams When an HTX stream is created, be sure to always create the HTTP txn object, regardless of the ".http_needed" value of the frontend. That happens when a destructive HTTP upgrades is performed (TCP>H2). The frontend is a TCP proxy. If there is no dependency on the HTTP part, the HTTP transaction is not created at this stage but only when the backend is set. For now, it is not a problem. But an HTTP txn will be mandatory to fully support TCP to HTTP upgrades after frontend tcp-request rules evaluation.	2021-04-01 11:06:47 +02:00
Christopher Faulet	f0d7eb2f4f	MINOR: stream: Don't trigger errors on destructive HTTP upgrades When a TCP stream is upgraded to H2 stream, a destructive upgrade is performed. It means the TCP stream is silently released while a new one is created. It is of course more complicated but it is what we observe from the stream point of view. That was performed by returning an error when the backend was set. It is neither really elegant nor accurate. So now, instead of returning an error from stream_set_backend() in case of destructive HTTP upgrades, the TCP stream processing is aborted and no error is reported. However, the result is more or less the same.	2021-04-01 11:06:39 +02:00
Christopher Faulet	ceab1ed86c	BUG/MINOR: mux-h2: Don't emit log twice if an error occurred on the preface sess_log() was called twice if an error occurred on the preface parsing, in h2c_frt_recv_preface() and in h2_process_demux(). This patch must be backported as far as 2.0.	2021-04-01 08:56:07 +02:00
Willy Tarreau	645dc08533	BUG/MINOR: http_fetch: make hdr_ip() resistant to empty fields The fix in commit `7b0e00d94` ("BUG/MINOR: http_fetch: make hdr_ip() reject trailing characters") made hdr_ip() more sensitive to empty fields, for example if a trusted proxy incorrectly sends the header with an empty value, we could return 0.0.0.0 which is not correct. Let's make sure we only assign an IPv4 type here when a non-empty address was found. This should be backported to all branches where the fix above was backported.	2021-03-31 11:45:42 +02:00
Willy Tarreau	4bfc6630ba	CLEANUP: socket: replace SOL_IP/IPV6/TCP with IPPROTO_IP/IPV6/TCP Historically we've used SOL_IP/SOL_IPV6/SOL_TCP everywhere as the socket level value in getsockopt() and setsockopt() but as we've seen over time it regularly broke the build and required to have them defined to their IPPROTO_* equivalent. The Linux ip(7) man page says: Using the SOL_IP socket options level isn't portable; BSD-based stacks use the IPPROTO_IP level. And it indeed looks like a pure linuxism inherited from old examples and documentation. strace also reports SOL_* instead of IPPROTO_, which does not help... A check to linux/in.h shows they have the same values. Only SOL_SOCKET and other non-IP values make sense since there is no IPPROTO equivalent. Let's get rid of this annoying confusion by removing all redefinitions of SOL_IP/IPV6/TCP and using IPPROTO_ instead, just like any other operating system. This also removes duplicated tests for the same value. Note that this should not result in exposing syscalls to other OSes as the only ones that were still conditionned to SOL_IPV6 were for IPV6_UNICAST_HOPS which already had an IPPROTO_IPV6 equivalent, and IPV6_TRANSPARENT which is Linux-specific.	2021-03-31 08:59:34 +02:00
Willy Tarreau	da23195785	BUILD: tcp: use IPPROTO_IPV6 instead of SOL_IPV6 on FreeBSD/MacOS Lukas reported in issue #1203 that the previous fix for silent-drop in commit `ab79ee8b1` ("BUG/MINOR: tcp: fix silent-drop workaround for IPv6") breaks the build on FreeBSD/MacOS due to SOL_IPV6 not being defined. On these platforms, IPPROTO_IPV6 must be used instead, so this should fix it. This needs to be backported to whatever version the fix above is backported to.	2021-03-31 08:29:27 +02:00
Willy Tarreau	ab79ee8b11	BUG/MINOR: tcp: fix silent-drop workaround for IPv6 As reported in github issue #1203 the TTL-based workaround that is used when permissions are insufficient for the TCP_REPAIR trick does not work for IPv6 because we're using only SOL_IP with IP_TTL. In IPv6 we have to use SOL_IPV6 and IPV6_UNICAST_HOPS. Let's pick the right one based on the source address's family. This may be backported to all versions.	2021-03-30 19:00:49 +02:00
Willy Tarreau	b48e7c0016	BUG/MEDIUM: time: make sure to always initialize the global tick The issue with non-rotating freq counters was addressed in commit `8cc586c73` ("BUG/MEDIUM: freq_ctr/threads: use the global_now_ms variable") using the global date. But an issue remained with the comparison of the most recent time. Since the initial time in the structure is zero, the tick_is_lt() works on half of the periods depending on the first date an entry is touched. And the wrapping happened last night: $ date --date=@$(((($(date +%s) * 1000) & -0x8000000) / 1000)) Mon Mar 29 23:59:46 CEST 2021 So users of the last fix (backported to 2.3.8) may experience again an always increasing rate for the next 24 days if they restart their process. Let's always update the time if the latest date was not updated yet. It will likely be simplified once the function is reorganized but this will do the job for now. Note that since this timer is only used by freq counters, no other sub-system is affected. The bug can easily be tested with this config during the right time period (i.e. today to today+24 days + N*49.7 days): global stats socket /tmp/sock1 frontend web bind :8080 mode http http-request track-sc0 src stick-table type ip size 1m expire 1h store http_req_rate(2s) Issuing 'socat - /tmp/sock1 <<< "show table web"' should show a stable rate after 2 seconds. The fix must be backported to 2.3 and any other version the fix above goes into. Thanks to Thomas SIMON and Sander Klein for quickly reporting this issue with a working reproducer.	2021-03-30 18:28:25 +02:00
Florian Apolloner	39272c28bf	BUG/MINOR: stats: Apply proper styles in HTML status page. When a backend is in status DOWN and going UP it is currently displayed as yellow ("active UP, going down") instead of orange ("active DOWN, going UP"). This patches restyles the table rows to actually match the legend. This may be backported to any version, the issue appeared in 1.7-dev2 with commit `0c378efe8` ("MEDIUM: stats: compute the color code only in the HTML form").	2021-03-30 16:57:22 +02:00
Christopher Faulet	50623029f8	BUG/MINOR: payload: Wait for more data if buffer is empty in payload/payload_lv In payload() and payload_lv() sample fetches, if the buffer is empty, we must wait for more data by setting SMP_F_MAY_CHANGE flag on the sample. Otherwise, when it happens in an ACL, nothing is returned (because the buffer is empty) and the ACL is considered as finished (success or failure depending on the test). As a workaround, the buffer length may be tested first. For instance : tcp-request inspect-delay 1s tcp-request content reject unless { req.len gt 0 } { req.payload(0,0),fix_is_valid } instead of : tcp-request inspect-delay 1s tcp-request content reject if ! { req.payload(0,0),fix_is_valid } This patch must be backported as far as 2.2.	2021-03-29 11:47:53 +02:00
Willy Tarreau	9b9f8477f8	MEDIUM: backend: use a trylock to grab a connection on high FD counts as well Commit `b1adf03df` ("MEDIUM: backend: use a trylock when trying to grab an idle connection") solved a contention issue on the backend under normal condition, but there is another one further, which only happens when the number of FDs in use is considered too high, and which obviously causes random crashes with just 16 threads once the number of FDs is about to be exhausted. Like the aforementioned patch, this one should be backported to 2.3.	2021-03-27 09:39:23 +01:00
Ilya Shipitsin	2c481d0105	BUILD: ssl: use EVP_CIPH_GCM_MODE macro instead of HA_OPENSSL_VERSION EVP_CIPH_GCM_MODE was introduced in `bdaa54155c` together with EVP support for AES-GCM.	2021-03-26 23:16:25 +01:00
Willy Tarreau	b8bd1ee893	MEDIUM: cli: add a new experimental "set var" command set var <name> <expression> Allows to set or overwrite the process-wide variable 'name' with the result of expression <expression>. Only process-wide variables may be used, so the name must begin with 'proc.' otherwise no variable will be set. The <expression> may only involve "internal" sample fetch keywords and converters even though the most likely useful ones will be str('something') or int(). Note that the command line parser doesn't know about quotes, so any space in the expression must be preceeded by a backslash. This command requires levels "operator" or "admin". This command is only supported on a CLI connection running in experimental mode (see "experimental-mode on"). Just like for "set-var" in the global section, the command uses a temporary dummy proxy to create a temporary "set-var(name)" rule to assign the value. The reg test was updated to verify that an updated global variable is properly reflected in subsequent HTTP responses.	2021-03-26 16:57:43 +01:00
Willy Tarreau	c35eb38f1d	MINOR: vars/cli: add a "get var" CLI command to retrieve global variables Process-wide variables can now be displayed from the CLI using "get var" followed by the variable name. They must all start with "proc." otherwise they will not be found. The output is very similar to the one of the debug converter, with a type and value being reported for the embedded sample. This command is limited to clients with the level "operator" or higher, since it can possibly expose traffic-related data.	2021-03-26 16:52:13 +01:00
Willy Tarreau	2f836de100	MINOR: action: add a new ACT_F_CLI_PARSER origin designation In order to process samples from the command line interface we'll need rules as well, and these rules will have to be marked as coming from the CLI parser. This new origin is used for this.	2021-03-26 16:34:53 +01:00
Willy Tarreau	db5e0dbea9	MINOR: sample: add a new CLI_PARSER context for samples In order to prepare for supporting calling sample expressions from the CLI, let's create a new CLI_PARSER parsing context. This one supports constants and internal samples only.	2021-03-26 16:34:53 +01:00
Willy Tarreau	13d2ba2a82	MEDIUM: vars: add support for a "set-var" global directive While we do support process-wide variables ("proc.<name>"), there was no way to preset them from the configuration. This was particularly limiting their usefulness since configs involving them always had to first check if the variable was set prior to performing an operation. This patch adds a new "set-var" directive in the global section that supports setting the proc.<name> variables from an expression, like other set-var actions do. The syntax however follows what is already being done for setenv, which consists in having one argument for the variable name and another one for the expression. Only "constant" expressions are allowed here, such as "int", "str" etc, combined with arithmetic or string converters, and variable lookups. A few extra sample fetch keywords like "date", "rand" and "uuid" are also part of the constant expressions and may make sense to allow to create a random key or differentiate processes. The way it was done consists in parsing a dummy rule an executing the expression in the CFG_PARSE context, then releasing the expression. This is safe because the sample that variables store does not hold a back pointer to expression that created them.	2021-03-26 16:34:53 +01:00
Willy Tarreau	01d580ae86	MINOR: action: add a new ACT_F_CFG_PARSER origin designation In order to process samples from the config file we'll need rules as well, and these rules will have to be marked as coming from the config parser. This new origin is used for this.	2021-03-26 16:23:45 +01:00
Willy Tarreau	f9a7a8fd8e	MINOR: sample: add a new CFG_PARSER context for samples We'd sometimes like to be able to process samples while parsing the configuration based on purely internal thing but that's not possible right now. Let's add a new CFG_PARSER context for samples which only permits constant samples (i.e. those which do not change in the process' life and which are stable during config parsing).	2021-03-26 16:23:45 +01:00
Willy Tarreau	0209c97038	MINOR: sample: mark the truly constant sample fetch keywords as such A number of keywords are really constant and safe to use at config time. This is the case for str(), int() etc but also env(), hostname(), nbproc() etc. By extension a few other ones which can be useful to preset values in a configuration were enabled as well, like data(), rand() or uuid(). At the moment this doesn't change anything as they are still only usable from runtime rules. The "var()" keyword was also marked as const as it can definitely return stable stuff at boot time.	2021-03-26 16:23:45 +01:00
Willy Tarreau	be2159b946	MINOR: sample: add a new SMP_SRC_CONST sample capability This level indicates that everything it constant in the expression during the whole process' life and that it may safely be used at config parsing time.	2021-03-26 16:23:45 +01:00
Willy Tarreau	77e6a4ef0f	MINOR: sample: make smp_resolve_args() return an allocate error message For now smp_resolve_args() complains on stderr via ha_alert(), but if we want to make it a bit more dynamic, we need it to return errors in an allocated message. Let's pass it an error pointer and have it fill it. On return we indent the output if it contains more than one line.	2021-03-26 16:23:45 +01:00
Willy Tarreau	e26cd0b46c	CLEANUP: sample: remove duplicate "stopping" sample fetch keyword The "stopping" sample fetch keyword was accidently duplicated in 1.9 by commit `70fe94419` ("MINOR: sample: add cpu_calls, cpu_ns_avg, cpu_ns_tot, lat_ns_avg, lat_ns_tot"). This has no effect so no backport is needed.	2021-03-26 16:23:45 +01:00
Willy Tarreau	f26db14dfb	MINOR: vars: make the var() sample fetch keyword depend on nothing This sample fetch doesn't require any L4 client session in practice, as get_var() now checks for the session. This is important to remove this dependency in order to support accessing variables in scope "proc" from anywhere.	2021-03-26 16:23:45 +01:00
Willy Tarreau	a07d61be4c	MINOR: vars: make get_vars() allow the session to be null In order to support manipulating variables from outside a session, let's make get_vars() not assume that the session is always set.	2021-03-26 16:23:45 +01:00
Amaury Denoyelle	704ba1d63e	MINOR: lua: properly allocate the lua Socket servers Instantiate both lua Socket servers tcp/ssl using standard function new_server. There is currently no need to tune their settings except to activate the ssl mode with noverify for the second one. Both servers are freed with the free_server function.	2021-03-26 15:28:33 +01:00
Amaury Denoyelle	239fdbf548	MINOR: lua: properly allocate the lua Socket proxy Replace static initialization of the lua Socket proxy with the standard function alloc_new_proxy. The settings proxy are properly applied thanks to PR_CAP_LUA. The proxy is freed with the free_proxy function.	2021-03-26 15:28:33 +01:00
Amaury Denoyelle	6f26faecd8	MINOR: proxy: define cap PR_CAP_LUA Define a new cap PR_CAP_LUA. It can be used to allocate the internal proxy for lua Socket class. This cap overrides default settings for preferable values in the lua context.	2021-03-26 15:28:33 +01:00
Amaury Denoyelle	27fefa1967	MINOR: proxy: implement a free_proxy function Move all liberation code related to a proxy in a dedicated function free_proxy in proxy.c. For now, this function is only called in haproxy.c. In the future, it will be used to free the lua proxy. This helps to clean up haproxy.c.	2021-03-26 15:28:33 +01:00
Amaury Denoyelle	476b9ad97a	REORG: split proxy allocation functions Create a new function parse_new_proxy specifically designed to allocate a new proxy from the configuration file and copy settings from the default proxy. The function alloc_new_proxy is reduced to a minimal allocation. It is used for default proxy allocation and could also be used for internal proxies such as the lua Socket proxy.	2021-03-26 15:28:33 +01:00
Amaury Denoyelle	68fd7e43d3	REORG: global: move free acl/action in their related source files Move deinit_acl_cond and deinit_act_rules from haproxy.c respectively in acl.c and action.c. The name of the functions has been slightly altered, replacing the prefix deinit_* by free_* to reflect their purpose more clearly. This change has been made in preparation to the implementation of a free proxy function. As a side-effect, it helps to clean up haproxy.c.	2021-03-26 15:28:33 +01:00
Amaury Denoyelle	ce44482fe5	REORG: global: move initcall register code in a dedicated file Create a new module init which contains code related to REGISTER_* macros for initcalls. init.h is included in api.h to make init code available to all modules. It's a step to clean up a bit haproxy.c/global.h.	2021-03-26 15:28:33 +01:00
Ilya Shipitsin	df627943a4	BUILD: ssl: introduce fine guard for ssl random extraction functions SSL_get_{client,server}_random are supported in OpenSSL-1.1.0, BoringSSL, LibreSSL-2.7.0 let us introduce HAVE_SSL_EXTRACT_RANDOM for that purpose	2021-03-26 15:19:07 +01:00
Remi Tricot-Le Breton	bc2c386992	BUG/MINOR: ssl: Prevent removal of crt-list line if the instance is a default one If the first active line of a crt-list file is also the first mentioned certificate of a frontend that does not have the strict-sni option enabled, then its certificate will be used as the default one. We then do not want this instance to be removable since it would make a frontend lose its default certificate. Considering that a crt-list file can be used by multiple frontends, and that its first mentioned certificate can be used as default certificate for only a subset of those frontends, we do not want the line to be removable for some frontends and not the others. So if any of the ckch instances corresponding to a crt-list line is a default instance, the removal of the crt-list line will be forbidden. It can be backported as far as 2.2.	2021-03-26 13:06:39 +01:00
Remi Tricot-Le Breton	8218aed90e	BUG/MINOR: ssl: Fix update of default certificate The default SSL_CTX used by a specific frontend is the one of the first ckch instance created for this frontend. If this instance has SNIs, then the SSL context is linked to the instance through the list of SNIs contained in it. If the instance does not have any SNIs though, then the SSL_CTX is only referenced by the bind_conf structure and the instance itself has no link to it. When trying to update a certificate used by the default instance through a cli command, a new version of the default instance was rebuilt but the default SSL context referenced in the bind_conf structure would not be changed, resulting in a buggy behavior in which depending on the SNI used by the client, he could either use the new version of the updated certificate or the original one. This patch adds a reference to the default SSL context in the default ckch instances so that it can be hot swapped during a certificate update. This should fix GitHub issue #1143. It can be backported as far as 2.2.	2021-03-26 13:06:29 +01:00
Willy Tarreau	62592ad967	BUG/MEDIUM: mux-h1: make h1_shutw_conn() idempotent In issue #1197, St�phane Graber reported a rare case of crash that results from an attempt to close an already closed H1 connection. It indeed looks like under some circumstances it should be possible to call the h1_shutw_conn() function more than once, though these conditions are not very clear. Without going through a deep analysis of all possibilities, one potential case seems to be a detach() called with pending output data, causing H1C_F_ST_SHUTDOWN to be set on the connection, then h1_process() being immediately called on I/O, causing h1_send() to flush these data and call h1_shutw_conn(), and finally the upper stream calling cs_shutw() hence h1_shutw(), which itself will call h1_shutw_conn() again while the transport and control layers have already been released. But the whole sequence is not certain as it's not very clear in which case it's possible to leave h1_send() without the connection anymore (at least the obuf is empty). However what is certain is that a shutdown function must be idempotent, so let's fix h1_shutw_conn() regarding this point. St�phane reported the issue as far back as 2.0, so this patch should be backported this far.	2021-03-26 09:29:38 +01:00
Willy Tarreau	7b0e00d943	BUG/MINOR: http_fetch: make hdr_ip() reject trailing characters The hdr_ip() sample fetch function will try to extract IP addresses from a header field. These IP addresses are parsed using url2ipv4() and if it fails it will fall back to inet_pton(AF_INET6), otherwise will fail. There is a small problem there which is that if a field starts with an IP address and is immediately followed by some garbage, the IP address part is still returned. This is a problem with fields such as x-forwarded-for because it prevents detection of accidental corruption or bug along the chain. For example, the following string: x-forwarded-for: 1.2.3.4; 5.6.7.8 or this one: x-forwarded-for: 1.2.3.4O ( the last one being the letter 'O') would still return "1.2.3.4" despite the trailing characters. This is bad because it will silently cover broken code running on intermediary proxies and may even in some cases allow haproxy to pass improperly formatted headers after they were apparently validated, for example, if someone extracts the address from this field to place it into another one. This issue would only affect the IPv4 parser, because the IPv6 parser already uses inet_pton() which fails at the first invalid character and rejects trailing port numbers. In strict compliance with RFC7239, let's make sure that if there are any characters left in the string, the parsing fails and makes hdr_ip() return nothing. However, a special case has to be handled to support IPv4 addresses followed by a colon and a valid port number, because till now the parser used to implicitly accept them and it appears that this practice, though rare, does exist at least in Azure: https://docs.microsoft.com/en-us/azure/application-gateway/how-application-gateway-works This issue has always been there so the fix may be backported to all versions. It will need the following commit in order to work as expected: MINOR: tools: make url2ipv4 return the exact number of bytes parsed Many thanks to https://twitter.com/melardev and the BitMEX Security Team for their detailed report.	2021-03-25 15:30:06 +01:00
Willy Tarreau	12e1027aa6	MINOR: tools: make url2ipv4 return the exact number of bytes parsed The function's return value is currently used as a boolean but we'll need it to return the number of bytes parsed. Right now it returns it minus one, unless the last char doesn't match what is permitted. Let's update this to make it more usable.	2021-03-25 15:18:47 +01:00
Christopher Faulet	a9a9e9aac9	BUG/MEDIUM: thread: Fix a deadlock if an isolated thread is marked as harmless If an isolated thread is marked as harmless, it will loop forever in thread_harmless_till_end() waiting no threads are isolated anymore. It never happens because the current thread is isolated. To fix the bug, we exclude the current thread for the test. We now wait for all other threads to leave the rendez-vous point. This bug only seems to occurr if HAProxy is compiled with DEBUG_UAF, when pool_gc() is called. pool_gc() isolates the current thread, while pool_free_area() set the thread as harmless when munmap is called. This patch must be backported as far as 2.0.	2021-03-25 14:31:50 +01:00
Amaury Denoyelle	65bf600cc3	BUG/MEDIUM: release lock on idle conn killing on reached pool high count Release the lock before calling mux destroy in connect_server when trying to kill an idle connection because the pool high count has been reached. The lock must be released because the mux destroy will call srv_release_conn which also takes the lock to remove the connection from the tree. As the connection was already deleted from the tree at this stage, it is safe to release the lock, and the removal in srv_release_conn will be a noop. It does not need to be backported because it is only present in the current release. It has been introduced by `5c7086f6b0` MEDIUM: connection: protect idle conn lists with locks	2021-03-25 11:55:35 +01:00
Olivier Houchard	c23b33764e	BUG/MEDIUM: fd: Take the fd_mig_lock when closing if no DWCAS is available. In fd_delete(), if we're running with no double-width cas, take the fd_mig_lock before setting thread_mask to 0 to make sure that another thread calling fd_set_running() won't miss the new value of thread_mask and set its bit in running_mask after we checked it. This should be backported to 2.2 as part of the series fixing fd_delete().	2021-03-25 07:34:35 +01:00
Willy Tarreau	2d4232901c	CLEANUP: fd: slightly simplify up _fd_delete_orphan() Let's release the port range earlier so that all zeroes are grouped together and that the compiler can slightly simplify the code.	2021-03-24 17:17:21 +01:00
Willy Tarreau	2c3f9818e8	BUG/MEDIUM: fd: do not wait on FD removal in fd_delete() Christopher discovered an issue mostly affecting 2.2 and to a less extent 2.3 and above, which is that it's possible to deadlock a soft-stop when several threads are using a same listener: thread1 thread2 unbind_listener() fd_set_running() lock(listener) listener_accept() fd_delete() lock(listener) while (running_mask); -----> deadlock unlock(listener) This simple case disappeared from 2.3 due to the removal of some locked operations at the end of listener_accept() on the regular path, but the architectural problem is still here and caused by a lock inversion built around the loop on running_mask in fd_clr_running_excl(), because there are situations where the caller of fd_delete() may hold a lock that is preventing other threads from dropping their bit in running_mask. The real need here is to make sure the last user deletes the FD. We have all we need to know the last one, it's the one calling fd_clr_running() last, or entering fd_delete() last, both of which can be summed up as the last one calling fd_clr_running() if fd_delete() calls fd_clr_running() at the end. And we can prevent new threads from appearing in running_mask by removing their bits in thread_mask. So what this patch does is that it sets the running_mask for the thread in fd_delete(), clears the thread_mask, thus marking the FD as orphaned, then clears the running mask again, and completes the deletion if it was the last one. If it was not, another thread will pass through fd_clr_running and will complete the deletion of the FD. The bug is easily reproducible in 2.2 under high connection rates during soft close. When the old process stops its listener, occasionally two threads will deadlock and the old process will then be killed by the watchdog. It's strongly believed that similar situations do exist in 2.3 and 2.4 (e.g. if the removal attempt happens during resume_listener() called from listener_accept()) but if so, they should be much harder to trigger. This should be backported to 2.2 as the issue appeared with the FD migration. It requires previous patches "fd: make fd_clr_running() return the remaining running mask" and "MINOR: fd: remove the unneeded running bit from fd_insert()". Notes for backport: in 2.2, the fd_dodelete() function requires an extra argument "do_close" indicating whether we want to remove and close the FD (fd_delete) or just delete it (fd_remove). While this information is not conveyed along the chain, we know that late calls always imply do_close=1 become do_close=0 exclusively results from fd_remove() which is only used by the config parser and the master, both of which are single-threaded, hence are always the last ones in the running_mask. Thus it is safe to assume that a postponed FD deletion always implies do_close=1. Thanks to Olivier for his help in designing this optimal solution.	2021-03-24 17:17:21 +01:00
Christopher Faulet	1e8433f594	BUG/MEDIUM: lua: Always init the lua stack before referencing the context When a lua context is allocated, its stack must be initialized to NULL before attaching it to its owner (task, stream or applet). Otherwise, if the watchdog is fired before the stack is really created, that may lead to a segfault because we try to dump the traceback of an uninitialized lua stack. It is easy to trigger this bug if a lua script do a blocking call while another thread try to initialize a new lua context. Because of the global lua lock, the init is blocked before the stack creation. Of course, it only happens if the script is executed in the shared global context. This patch must be backported as far as 2.0.	2021-03-24 16:36:36 +01:00
Christopher Faulet	cc2c4f8f4c	BUG/MEDIUM: debug/lua: Use internal hlua function to dump the lua traceback The commit reverts following commits: * `83926a04` BUG/MEDIUM: debug/lua: Don't dump the lua stack if not dumpable * `a61789a1` MEDIUM: lua: Use a per-thread counter to track some non-reentrant parts of lua Instead of relying on a Lua function to print the lua traceback into the debugger, we are now using our own internal function (hlua_traceback()). This one does not allocate memory and use a chunk instead. This avoids any issue with a possible deadlock in the memory allocator because the thread processing was interrupted during a memory allocation. This patch relies on the commit "BUG/MEDIUM: debug/lua: Use internal hlua function to dump the lua traceback". Both must be backported wherever the patches above are backported, thus as far as 2.0	2021-03-24 16:35:23 +01:00
Christopher Faulet	d09cc519bd	MINOR: lua: Slightly improve function dumping the lua traceback The separator string is now configurable, passing it as parameter when the function is called. In addition, the message have been slightly changed to be a bit more readable.	2021-03-24 16:33:26 +01:00
Ilya Shipitsin	a0fd35b054	BUILD: ssl: guard ecdh functions with SSL_CTX_set_tmp_ecdh macro let us use feature macro SSL_CTX_set_tmp_ecdh instead of comparing openssl version	2021-03-24 09:52:37 +01:00
Remi Tricot-Le Breton	fb00f31af4	BUG/MINOR: ssl: Prevent disk access when using "add ssl crt-list" If an unknown CA file was first mentioned in an "add ssl crt-list" CLI command, it would result in a call to X509_STORE_load_locations which performs a disk access which is forbidden during runtime. The same would happen if a "ca-verify-file" or "crl-file" was specified. This was due to the fact that the crt-list file parsing and the crt-list related CLI commands parsing use the same functions. The patch simply adds a new parameter to all the ssl_bind parsing functions so that they know if the call is made during init or by the CLI, and the ssl_store_load_locations function can then reject any new cafile_entry creation coming from a CLI call. It can be backported as far as 2.2.	2021-03-23 19:29:46 +01:00
Willy Tarreau	f23b1bc534	BUILD: tools: fix build error with new PA_O_DEFAULT_DGRAM Previous commit `69ba35146` ("MINOR: tools: introduce new option PA_O_DEFAULT_DGRAM on str2sa_range.") managed to introduce a parenthesis imbalance that broke the build. No backport is needed.	2021-03-23 18:38:13 +01:00
Emeric Brun	69ba35146f	MINOR: tools: introduce new option PA_O_DEFAULT_DGRAM on str2sa_range. str2sa_range function options PA_O_DGRAM and PA_O_STREAM are used to define the supported address types but also to set the default type if it is not explicit. If the used address support both STREAM and DGRAM, the default was always set to STREAM. This patch introduce a new option PA_O_DEFAULT_DGRAM to force the default to DGRAM type if it is not explicit in the address field and both STREAM and DGRAM are supported. If only DGRAM or only STREAM is supported, it continues to be considered as the default.	2021-03-23 15:32:22 +01:00
Willy Tarreau	8cc586c73f	BUG/MEDIUM: freq_ctr/threads: use the global_now_ms variable In commit `a1ecbca0a` ("BUG/MINOR: freq_ctr/threads: make use of the last updated global time"), for period-based counters, the millisecond part of the global_now variable was used as the date for the new period. But it's wrong, it only works with sub-second periods as it wraps every second, and for other periods the counters never rotate anymore. Let's make use of the newly introduced global_now_ms variable instead, which contains the global monotonic time expressed in milliseconds. This patch needs to be backported wherever the patch above is backported. It depends on previous commit "MINOR: time: also provide a global, monotonic global_now_ms timer".	2021-03-23 09:03:37 +01:00
Willy Tarreau	6064b34be0	MINOR: time: also provide a global, monotonic global_now_ms timer The period-based freq counters need the global date in milliseconds, so better calculate it and expose it rather than letting all call places incorrectly retrieve it. Here what we do is that we maintain a new globally monotonic timer, global_now_ms, which ought to be very close to the global_now one, but maintains the monotonic approach of now_ms between all threads in that global_now_ms is always ahead of any now_ms. This patch is made simple to ease backporting (it will be needed for a subsequent fix), but it also opens the way to some simplifications on the time handling: instead of computing the local time and trying to force it to the global one, we should soon be able to proceed in the opposite way, that is computing the new global time an making the local one just the latest snapshot of it. This will bring the benefit of making sure that the global time is always ahead of the local one.	2021-03-23 09:01:37 +01:00
Willy Tarreau	e44989369d	CLEANUP: quic: use pool_zalloc() instead of pool_alloc+memset Two places used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:20:21 +01:00
Willy Tarreau	6922e550eb	CLEANUP: tcpcheck: use pool_zalloc() instead of pool_alloc+memset Two places used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:20:03 +01:00
Willy Tarreau	f208ac0616	CLEANUP: ssl: use pool_zalloc() in ssl_init_keylog() This one used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:19:48 +01:00
Willy Tarreau	70490ebb12	CLEANUP: resolvers: use pool_zalloc() in resolv_link_resolution() This one used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:19:28 +01:00
Willy Tarreau	3ab0a0bc88	CLEANUP: mailers: use pool_zalloc() in enqueue_one_email_alert() This one used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:19:13 +01:00
Willy Tarreau	ec4cfc3835	CLEANUP: frontend: use pool_zalloc() in frontend_accept() The capture buffers were allocated then zeroed, let's have the allocator do it.	2021-03-22 23:18:54 +01:00
Willy Tarreau	c9ef9bc9a5	CLEANUP: spoe: use pool_zalloc() instead of pool_alloc+memset Two places used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:18:26 +01:00
Willy Tarreau	1bbec3883a	CLEANUP: filters: use pool_zalloc() in flt_stream_add_filter() This one used to alloc then zero the area, let's have the allocator do it.	2021-03-22 23:17:56 +01:00
Willy Tarreau	d68d4f1002	MEDIUM: dynbuf: remove last usages of b_alloc_margin() The function's purpose used to be to fail a buffer allocation if that allocation wouldn't result in leaving some buffers available. Thus, some allocations could succeed and others fail for the sole purpose of trying to provide 2 buffers at once to process_stream(). But things have changed a lot with 1.7 breaking the promise that process_stream() would always succeed with only two buffers, and later the thread-local pool caches that keep certain buffers available that are not accounted for in the global pool so that local allocators cannot guess anything from the number of currently available pools. Let's just replace all last uses of b_alloc_margin() with b_alloc() once for all.	2021-03-22 16:27:59 +01:00
Willy Tarreau	f499f50c8f	CLEANUP: l7-retries: do not test the buffer before calling b_alloc() The return value is enough now to know if the allocation succeeded or failed.	2021-03-22 16:17:37 +01:00
Willy Tarreau	862ad82f22	CLEANUP: compression: do not test for buffer before calling b_alloc() Now we know the function is idempotent, we don't need to run the preliminary test anymore.	2021-03-22 16:16:22 +01:00
Willy Tarreau	b454e908e5	MINOR: ssl: use pool_alloc(), not pool_alloc_dirty() pool_alloc_dirty() is the version below pool_alloc() that never performs the memory poisonning. It should only be called directly for very large unstructured areas for which enabling memory poisonning would not bring anything but could significantly hurt performance (e.g. buffers). Using this function here will not provide any benefit and will hurt the ability to debug. It would be desirable to backport this, although it does not cause any user-visible bug, it just complicates debugging.	2021-03-22 15:35:53 +01:00
Willy Tarreau	acc5b011e5	MINOR: cache: use pool_alloc(), not pool_alloc_dirty() pool_alloc_dirty() is the version below pool_alloc() that never performs the memory poisonning. It should only be called directly for very large unstructured areas for which enabling memory poisonning would not bring anything but could significantly hurt performance (e.g. buffers). Using this function here will not provide any benefit and will hurt the ability to debug. It would be desirable to backport this, although it does not cause any user-visible bug, it just complicates debugging.	2021-03-22 15:35:53 +01:00
Willy Tarreau	18f43d85a0	MINOR: fcgi-app: use pool_alloc(), not pool_alloc_dirty() pool_alloc_dirty() is the version below pool_alloc() that never performs the memory poisonning. It should only be called directly for very large unstructured areas for which enabling memory poisonning would not bring anything but could significantly hurt performance (e.g. buffers). Using this function here will not provide any benefit and will hurt the ability to debug. It would be desirable to backport this, although it does not cause any user-visible bug, it just complicates debugging.	2021-03-22 15:35:53 +01:00
Willy Tarreau	f1a91292dc	MINOR: spoe: use pool_alloc(), not pool_alloc_dirty() pool_alloc_dirty() is the version below pool_alloc() that never performs the memory poisonning. It should only be called directly for very large unstructured areas for which enabling memory poisonning would not bring anything but could significantly hurt performance (e.g. buffers). Using this function here will not provide any real benefit, it only avoids the area being poisonned before being zeroed. Ideally a pool_calloc() function should be provided for this.	2021-03-22 15:35:53 +01:00
Willy Tarreau	5bfeb2139b	MINOR: compression: use pool_alloc(), not pool_alloc_dirty() pool_alloc_dirty() is the version below pool_alloc() that never performs the memory poisonning. It should only be called directly for very large unstructured areas for which enabling memory poisonning would not bring anything but could significantly hurt performance (e.g. buffers). Using this function here will not provide any benefit and will hurt the ability to debug. It would be desirable to backport this, although it does not cause any user-visible bug, it just complicates debugging.	2021-03-22 15:35:53 +01:00
Amaury Denoyelle	3b1c9a39fd	CLEANUP: mark defproxy as const on parse tune.fail-alloc This fixes a gcc warning about a missing const on defproxy for mem_parse_global_fail_alloc. This is needed since the commit : `018251667e` CLEANUP: config: make the cfg_keyword parsers take a const for the defproxy	2021-03-22 11:50:31 +01:00
Ilya Shipitsin	ba13f16aa2	CLEANUP: assorted typo fixes in the code and comments This is 21st iteration of typo fixes	2021-03-20 09:28:58 +01:00
Olivier Houchard	26c51097d8	MEDIUM: quic: Fix build. Put the ) at the right place. This should fix github issue #1190.	2021-03-19 20:09:22 +01:00
Olivier Houchard	7ab6d8bdf3	MEDIUM: quic: Fix build. Spell conn_xprt_start() correctly. This should fix github issue #1189.	2021-03-19 19:48:53 +01:00
Christopher Faulet	83926a04fe	BUG/MEDIUM: debug/lua: Don't dump the lua stack if not dumpable When we try to dump the stack of a lua context, if it is not dumpable, nothing is performed and a message is emitted instead. This happens when a lua execution was interrupted inside a non-reentrant part. This patch depends on following commit : * MEDIUM: lua: Use a per-thread counter to track some non-reentrant parts of lua Thanks to this patch, we avoid a possible deadllock if the lua is interrupted by the watchdog in the lua memory allocator, because realloc() is not async-signal-safe. Both patches must be backported as far as 2.0.	2021-03-19 16:19:59 +01:00
Christopher Faulet	a61789a1d6	MEDIUM: lua: Use a per-thread counter to track some non-reentrant parts of lua Some parts of the Lua are non-reentrant. We must be sure to carefully track these parts to not dump the lua stack when it is interrupted inside such parts. For now, we only identified the custom lua allocator. If the thread is interrupted during the memory allocation, we must not try to print the lua stack wich also allocate memory. Indeed, realloc() is not async-signal-safe. In this patch we introduce a thread-local counter. It is incremented before entering in a non-reentrant part and decremented when exiting. It is only performed in hlua_alloc() for now.	2021-03-19 16:16:23 +01:00
Christopher Faulet	a561ffb978	CLEANUP: tcp-rules: Fix a typo in error messages about expect-netscaler-cip It was misspelled (expect-netscaler-ip instead of expect-netscaler-cip). 2 commits are concerned : * `db67b0ed7` MINOR: tcp-rules: suggest approaching action names on mismatch * `72d012fbd` CLEANUP: tcp-rules: add missing actions in the tcp-request error message The first one will not be backported, but the second one was backported as far as 1.8. Thus this one may also be backported, but only the 2nd part about the list of accepted keywords.	2021-03-19 15:41:16 +01:00
Olivier Houchard	dae6975498	MINOR: muxes: garbage collect the reset() method. Now that connections aren't being reused when they failed, remove the reset() method. It was unimplemented anywhere, except for H1 where it did nothing, anyway.	2021-03-19 15:33:04 +01:00
Olivier Houchard	bc5ce9201a	MEDIUM: connections: Implement a start() method in ssl_sock. Add a start() method to ssl_sock. It is responsible with initiating the SSL handshake, currently by just scheduling the tasklet, instead of doing it in the init() method, when all the XPRT may not have been initialized.	2021-03-19 15:33:04 +01:00
Olivier Houchard	d54ede7d08	MEDIUM: connections: Implement a start() method for xprt_handshake. Add a start_method to xprt_handshake. It schedules the tasklet that does the handshake. This used to be done in xprt_handshake_add_xprt(), but that's a much better place.	2021-03-19 15:33:04 +01:00
Olivier Houchard	1b3c931bff	MEDIUM: connections: Introduce a new XPRT method, start(). Introduce a new XPRT method, start(). The init() method will now only initialize whatever is needed for the XPRT to run, but any action the XPRT has to do before being ready, such as handshakes, will be done in the new start() method. That way, we will be sure the full stack of xprt will be initialized before attempting to do anything. The init() call is also moved to conn_prepare(). There's no longer any reason to wait for the ctrl to be ready, any action will be deferred until start(), anyway. This means conn_xprt_init() is no longer needed.	2021-03-19 15:33:04 +01:00
Olivier Houchard	ca1a57f022	MINOR: raw_sock: Add a close method. Add a close() method, that explicitely cancels any subscription on the connection, in preparation for future evolutions.	2021-03-19 15:33:04 +01:00
Emeric Brun	8af3bb0abf	BUG/MINOR: protocol: add missing support of dgram unix socket. The proto "uxdg" (UNIX DGRAM) was not declared, causing an error trying to put a socket unix on "dgram-bind" into a log-forward section. This patch introduces the missing "uxdg" protocol by adding proto_uxdg.c which was fully created based on the code available for the other protocols. This patch should be backported to version 2.3 and above.	2021-03-18 18:30:29 +01:00
Amaury Denoyelle	304672320e	MINOR: server: support keyword proto in 'add server' cli Allow to specify the mux proto for a dynamic server. It must be compatible with the backend mode to be accepted. The reg-tests has been extended for this error case.	2021-03-18 16:22:10 +01:00
Amaury Denoyelle	fc465a54fd	MINOR: server: enable standard options for dynamic servers Enable a subset of server options to be used as keywords on the CLI command 'add server'. These options are safe and can be applied flawlessly for a dynamic server.	2021-03-18 16:22:10 +01:00
Amaury Denoyelle	f99f77a500	MEDIUM: server: implement 'add server' cli command Add a new cli command 'add server'. This command is used to create a new server at runtime attached on an existing backend. The syntax is the following one : $ add server <be_name>/<sv_name> [<kws>...] This command is only available through experimental mode for the moment. Currently, no server keywords are supported. They will be activated individually when deemed properly functional and safe. Another limitation is put on the backend load-balancing algorithm. The algorithm must use consistent hashing to guarantee a minimal reallocation of existing connections on the new server insertion.	2021-03-18 15:52:07 +01:00
Amaury Denoyelle	216a1ce3b9	MINOR: stats: export function to allocate extra proxy counters Remove static qualifier on stats_allocate_proxy_counters_internal. This function will be used to allocate extra counters at runtime for dynamic servers.	2021-03-18 15:52:07 +01:00
Amaury Denoyelle	76e10e78bb	MINOR: server: prepare parsing for dynamic servers Prepare the server parsing API to support dynamic servers. - define a new parsing flag to be used for dynamic servers - each keyword contains a new field dynamic_ok to indicate if it can be used for a dynamic server. For now, no keyword are supported. - do not copy settings from the default server for a new dynamic server. - a dynamic server is created in a maintenance mode and requires an explicit 'enable server' command. - a new server flag named SRV_F_DYNAMIC is created. This flag is set for all servers created at runtime. It might be useful later, for example to know if a server can be purged.	2021-03-18 15:51:12 +01:00
Amaury Denoyelle	30c0537f5a	REORG: server: use flags for parse_server Modify the API of parse_server function. Use flags to describe the type of the parsed server instead of discrete arguments. These flags can be used to specify if a server/default-server/server-template is parsed. Additional parameters are also specified (parsing of the address required, resolve of a name must be done immediately). It is now unneeded to use strcmp on args[0] in parse_server. Also, the calls to parse_server are more explicit thanks to the flags.	2021-03-18 15:37:05 +01:00
Amaury Denoyelle	cf58dd79e3	REORG: server: attach servers in parse_server Move server linked into proxy backend list outside of _srv_parse_init to parse_server. This is groundwork for dynamic servers support. There will be two differences in case of a dynamic server : - the server will be attached to the proxy list only at the very end of the operations when everything is ok - the server will be directly attached to the end of the server proxy list	2021-03-18 15:37:05 +01:00
Amaury Denoyelle	7d27efef23	REORG: server: rename internal functions from parse_server Use a standard convention for the functions used through parse_server. Use the prefix _srv_parse and specify their private scope in a comment.	2021-03-18 15:37:05 +01:00
Amaury Denoyelle	9394a9444e	REORG: server: move alert traces in parse_server Move every ha_alert calls in parsing functions into parse_server. Parsing functions now support a pointer-to-string argument which will be allocated with an error message if needed via memprintf. parse_server has then the responsibility to display errors with ha_alert. This is groundwork for dynamic server. No traces should be printed on stderr as a response to a cli command. cli_err will replace ha_alert in this case.	2021-03-18 15:37:05 +01:00
Amaury Denoyelle	a8f442e078	REORG: server: split parse_server The huge parse_server function is splitted into two smaller ones. * _srv_parse_init allocates a new server instance and parses the address parameter * _srv_parse_kw parse the current server keyword This simplify a bit the parse_server function. Besides, it will be useful for dynamic server creation.	2021-03-18 15:37:05 +01:00
Amaury Denoyelle	3b89c11d4d	MINOR: server: remove fastinter from mistyped kw list This keyword is already present in server kw list from checks.c.	2021-03-18 15:37:05 +01:00
Amaury Denoyelle	587b71e402	REORG: server: move keywords in srv_kws Move server-keyword hardcoded in parse_server into the srv_kws list of server.c. Now every server keywords is checked through srv_find_kw. This has the effect to reduce the size of parse_server. As a side-effect, common kw list can be reduced. This change has been made to be able to quickly discard these keywords in case of a dynamic server.	2021-03-18 15:37:05 +01:00
Amaury Denoyelle	3efee6572f	MINOR: cfgparse: always alloc idle conns task The idle conn task is is a global task used to cleanup backend connections marked for deletion. Previously, it was only only allocated if at least one server in the configuration has idle connections. This assumption won't be valid anymore when new servers can be created at runtime with idle connections. Always allocate the global idle conn task.	2021-03-18 15:37:05 +01:00
Amaury Denoyelle	828adf0121	REORG: server: add a free server function Create a new server function named free_server. It can be used to deallocate a server and its member.	2021-03-18 15:37:05 +01:00
Amaury Denoyelle	18487fb532	MINOR: cli: implement experimental-mode Experimental mode is similar to expert-mode. It can be used to access to features still in development.	2021-03-18 15:37:05 +01:00
Eric Salama	5ba8335186	MINOR: mworker/cli: alert the user if we enabled a master CLI but not the master-worker mode Declaring a master CLI socket without activating the master-worker mode is likely a user error, so we issue a warning. This patch can be backported as far as 1.8.	2021-03-18 09:08:33 +01:00
Eric Salama	1b8dacc858	MINOR/BUG: mworker/cli: do not use the unix_bind prefix for the master CLI socket If the configuration file contains a 'unix-bind prefix' directive, and if we use the -S option and specify a UNIX socket path, the path of the socket will be prepended with the value of the unix-bind prefix. For instance, if we have 'unix-bind prefix /tmp/sockets/' and we use '-S /tmp/master-socket' on the command line, we will get this error: Starting proxy MASTER: cannot bind UNIX socket (No such file or directory) [/tmp/sockets/tmp/master-socket] So this patch adds an exception, and will ignore the unix-bind prefix for the master CLI socket. This patch can be backported as far as 1.9.	2021-03-18 09:08:19 +01:00
Willy Tarreau	a1ecbca0a5	BUG/MINOR: freq_ctr/threads: make use of the last updated global time The freq counters were using the thread's own time as the start of the current period. The problem is that in case of contention, it was occasionally possible to perform non-monotonic updates on the edge of the next second, because if the upfront thread updates a counter first, it causes a rotation, then the second thread loses the race from its older time, and tries again, and detects a different time again, but in the past so it only updates the counter, then a third thread on the new date would detect a change again, thus provoking a rotation again. The effect was triple: - rare loss of stored values during certain transitions from one period to the next one, causing counters to report 0 - half of the threads forced to go through the slow path every second - difficult convergence when using many threads where the CAS can fail a lot and we can observe N(N-1) attempts for N threads to complete This patch fixes this issue in two ways: - first, it now makes use og the monotonic global_now value which also happens to be volatile and to carry the latest known time; this way time will never jump backwards anymore and only the first thread updates it on transition, the other ones do not need to. - second, re-read the time in the loop after each failure, because if the date changed in the counter, it means that one thread knows a more recent one and we need to update. In this case if it matches the new current second, the fast path is usable. This patch relies on previous patch "MINOR: time: export the global_now variable" and must be backported as far as 1.8.	2021-03-17 19:36:15 +01:00
Willy Tarreau	650f374f24	MINOR: time: export the global_now variable This is the process-wide monotonic time that is used to update each thread's own time. It may be required at a few places where a strictly monotonic clock is required such as freq_ctr. It will be have to be backported as a dependency of a forthcoming fix.	2021-03-17 19:25:47 +01:00
Christopher Faulet	59b2925733	BUG/MINOR: resolvers: Add missing case-insensitive comparisons of DNS hostnames DNS hostname comparisons were fixed to be case-insensitive (see `b17b88487` "BUG/MEDIUM: dns: Consider the fact that dns answers are case-insensitive"). However 2 comparisons are still case-sensitive. This patch must be backported as far as 1.8.	2021-03-16 11:25:04 +01:00
Willy Tarreau	31a3cea84f	MINOR: cfgparse/proxy: also support spelling fixes on options Some are not always easy to spot with "chk" vs "check" or hyphens at some places and not at others. Now entering "option http-close" properly suggests "httpclose" and "option tcp-chk" suggests "tcp-check". There's no need to consider the proxy's capabilities, what matters is to figure what related word the user tried to spell, and there are not that many options anyway.	2021-03-15 11:14:57 +01:00
Willy Tarreau	ec197e83cd	MINOR: cli: sort the suggestions by order of relevance Now the suggested keywords are sorted with the most relevant ones first instead of scanning them all in registration order and only dumping the proposed ones: - "tra" trace <module> [cmd [args...]] : manage live tracing operator : lower the level of the current CLI session to operator user : lower the level of the current CLI session to user show trace [<module>] : show live tracing state - "pool" show pools : report information about the memory pools usage add acl : add acl entry del map : delete map entry user : lower the level of the current CLI session to user del acl : delete acl entry - "sh ta" show stat : report counters for each proxy and server [desc\|json\|no-maint\|typed\|up]* show tasks : show running tasks set table [id] : update or create a table entry's data show table [id]: report table usage stats or dump this table's contents trace <module> [cmd [args...]] : manage live tracing - "sh state" show stat : report counters for each proxy and server [desc\|json\|no-maint\|typed\|up]* set table [id] : update or create a table entry's data show table [id]: report table usage stats or dump this table's contents show servers state [id]: dump volatile server information (for backend <id>) show sess [id] : report the list of current sessions or dump this session	2021-03-15 10:39:45 +01:00
Willy Tarreau	a9aa628703	MINOR: cli: improve fuzzy matching to work on all remaining words at once Till now the fuzzy matching would only work on the same number of words, but this doesn't account for commands like "show servers conn" which involve 3 words and were not proposed when entering only "show conn". Let's improve the situation by building the two fingerprints separately for the correct keyword sequence and the entered one, then compare them. This can result in slightly larger variations due to the different string lengths but is easily compensated for. Thanks to this, we can now see "show servers conn" when entering "show conn", and the following choices are relevant to correct typos: - "show foo" show sess [id] : report the list of current sessions or dump this session show info : report information about the running process [desc\|json\|typed]* show env [var] : dump environment variables known to the process show fd [num] : dump list of file descriptors in use show pools : report information about the memory pools usage - "show stuff" show sess [id] : report the list of current sessions or dump this session show info : report information about the running process [desc\|json\|typed]* show stat : report counters for each proxy and server [desc\|json\|no-maint\|typed\|up]* show fd [num] : dump list of file descriptors in use show tasks : show running tasks - "show stafe" show sess [id] : report the list of current sessions or dump this session show stat : report counters for each proxy and server [desc\|json\|no-maint\|typed\|up]* show fd [num] : dump list of file descriptors in use show table [id]: report table usage stats or dump this table's contents show tasks : show running tasks - "show state" show stat : report counters for each proxy and server [desc\|json\|no-maint\|typed\|up]* show servers state [id]: dump volatile server information (for backend <id>) It's still visible that the shorter ones continue to easily match, such as "show sess" not having much in common with "show foo" but what matters is that the best candidates are definitely relevant. Probably that listing them in match order would further help.	2021-03-15 10:33:45 +01:00
Willy Tarreau	714c4c14d1	MINOR: tools: do not sum squares of differences for word fingerprints While sums of squares usually give excellent results in fixed-sise patterns, they don't work well to compare different sized ones such as when some sub-words are missing, because a word such as "server" contains "er" twice, which will rsult in an extra distance of at least 4 for just this e->r transition compared to another one missing it. This is one of the main reasons why "show conn" only proposes "show info" on the CLI. Maybe an improved approach consisting in using squares only for exact same lengths would work, but it would still make it difficult to spot reversed characters.	2021-03-15 09:44:53 +01:00
Willy Tarreau	9294e8822f	MINOR: tools: improve word fingerprinting by counting presence The distance between two words can be high due to a sub-word being missing and in this case it happens that other totally unrealted words are proposed because their average score looks lower thanks to being shorter. Here we're introducing the notion of presence of each character so that word sequences that contain existing sub-words are favored against the shorter ones having nothing in common. In addition we do not distinguish being/end from a regular delimitor anymore. That made it harder to spot inverted words.	2021-03-15 09:38:42 +01:00
Willy Tarreau	101df31503	BUG/MINOR: cfgparse: use the GLOBAL not LISTEN keywords list for spell checking In commit `a0e8eb8ca` ("MINOR: cfgparse: suggest correct spelling for unknown words in global section") we got the ability to locate a better matching word in case of error. But it mistakenly used the CFG_LISTEN class of words instead of CFG_GLOBAL, resulting in proposing unsuitable matches in addition to the long hard-coded list. Now, "tune.dh-param" correctly proposes "tune.ssl.default-dh-param". No backport is needed.	2021-03-15 09:15:18 +01:00
Willy Tarreau	9c18747823	BUG/MEDIUM: cli: fix "help" crashing since recent spelling fixes I somehow managed to re-break the "help" command in `b736458bf` ("MEDIUM: cli: apply spelling fixes for known commands before listing them") after fixing it once. A null-deref happens when checking the args early in the processing. No backport is needed as this was introduced in 2.4-dev12.	2021-03-13 12:25:43 +01:00
Willy Tarreau	7416314145	CLEANUP: task: make sure tasklet handlers always indicate their statuses When tasklets were derived from tasks, there was no immediate need for the scheduler to know their status after execution, and in a spirit of simplicity they just started to always return NULL. The problem is that it simply prevents the scheduler from 1) accounting their execution time, and 2) keeping track of their current execution status. Indeed, a remote wake-up could very well end up manipulating a tasklet that's currently being executed. And this is the reason why those handlers have to take the idle lock before checking their context. In 2.5 we'll take care of making tasklets and tasks work more similarly, but trouble is to be expected if we continue to propagate the trend of returning NULL everywhere, especially if some fixes relying on a stricter model later need to be backported. For this reason this patch updates all known tasklet handlers to make them return NULL only when the tasklet was freed. It has no effect for now and isn't even guaranteed to always be 100% safe but it puts the code into the right direction for this.	2021-03-13 11:30:19 +01:00
Willy Tarreau	4975d1482f	CLEANUP: cli: rename the last few "stats_" to "cli_" There were still a very small list of functions, variables and fields called "stats_" while they were really purely CLI-centric. There's the frontend called "stats_fe" in the global section, which instantiates a "cli_applet" called "<CLI>" so it was renamed "cli_fe". The "alloc_stats_fe" function cas renamed to "cli_alloc_fe" which also better matches the naming convention of all cli-specific functions. Finally the "stats_permission_denied_msg" used to return an error on the CLI was renamed "cli_permission_denied_msg". Now there's no more "stats_something" that designates the CLI.	2021-03-13 11:04:35 +01:00
Willy Tarreau	f14c7570d6	CLEANUP: cli: rename MAX_STATS_ARGS to MAX_CLI_ARGS This is the number of args accepted on a command received on the CLI, is has long been totally independent of stats and should not carry this misleading "stats" name anymore.	2021-03-13 10:59:23 +01:00
Willy Tarreau	c57dcfe787	MINOR: cli: apply the fuzzy matching on the whole command instead of words Now instead of comparing words at an exact position, we build a fingerprint made of all of them, so that we can check for them in any position. For example, "show conn serv" finds "show servers conn" and that "set servers maxconn" proposes both "set server" and "set maxconn servers".	2021-03-12 19:09:19 +01:00
Willy Tarreau	e33c4b3c11	MINOR: tools: add the ability to update a word fingerprint Instead of making a new one from scratch, let's support not wiping the existing fingerprint and updating it, and to do the same char by char. The word-by-word one will still result in multiple beginnings and ends, but that will accurately translate word boundaries. The char-based one has more flexibility and requires that the caller maintains the previous char to indicate the transition, which also allows to insert delimiters for example.	2021-03-12 19:09:19 +01:00
Willy Tarreau	b736458bfa	MEDIUM: cli: apply spelling fixes for known commands before listing them Entering "show tls" would still emit 35 entries. By measuring the distance between all unknown words and the candidates, we can sort them and pick the 10 most likely candidates. This works reasonably well, as now "show tls" only proposes "show tls-keys", "show threads", "show pools" and "show tasks". If the distance is still too high or if a word is missing, the whole prefix list continues to be dumped, thus "show" alone will still report the entire list of commands beginning with "show". It's still impossible to skip a word, for example "show conn" will not propose "show servers conn" because the distance is calculated for each word individually. Some changes to the distance calculation to support updating an existing map could easily address this. But this is already a great improvement.	2021-03-12 19:09:19 +01:00
Willy Tarreau	b96a74cbfd	MINOR: cli: filter the list of commands to the matching part The error message on the CLI has become unreadable due to the long list and it's not even sorted, making it even harder to figure the right command. This patch starts by looking if some of the words match something known, and if so, will limit the listing only to those commands that start like the current one. The "help", "prompt" and "quit" commands are always shown to help the user try something else. Now thanks to this, typing "add" or "del" will only list "add acl", "add map" and not 50 lines anymore. As a small bonus, we won't print "Unknown command" anymore in response to the "help" command.	2021-03-12 19:09:19 +01:00
Willy Tarreau	f3697dde2b	MINOR: cli: print the error message in the parser function itself By doing so we can report more accurate information about what's wrong. As a first step, we already distinguish the case of expert-only commands from other ones.	2021-03-12 19:09:19 +01:00
Willy Tarreau	91bc359571	MINOR: cli: test the appctx level for master access instead of comparing pointers Now that the appctx contains the master level, it greatly simplifies all the tests, as we can simply verify that keyword levels match the effective level without having to cheat with applet pointers. This also allows to fold the expert test in them.	2021-03-12 19:09:19 +01:00
Willy Tarreau	e283ee6265	MINOR: cli: set the ACCESS_MASTER* bits on the master bind_conf Right now the code is a bit hackish, it tests for the keyword's level flags but checks the applet's origin to compare the bits. Let's start by properly setting the ACCESS_MASTER_ONLY and ACCESS_MASTER flags on the master CLI's bind_conf so that they are automatically present all the time.	2021-03-12 19:09:19 +01:00
Willy Tarreau	0609c9bde9	BUG/MINOR: cli: make sure "help", "prompt", "quit" are enabled at master level These 3 commands are functionally valid both in master and worker CLIs. However, while they do have a valid handler, they are not permitted by the code and work partially by chance in the master: - "prompt" and "quit" are intercepted by the request analyser - "help" triggers an error, which results in displaying the error message Let's make sure they are permitted so that we don't count errors there and that we can report appropriate help. This bug has always been there but it doesn't have any functional effect at the moment since "help" can only show the error message. As such, there is no need to backport it.	2021-03-12 19:09:19 +01:00
Christopher Faulet	db31b4486c	CLEANUP: resolvers: Perform unsafe loop on requester list when possible When answer list of a response is checked, it is useless to perform a safe loop on the requester list.	2021-03-12 17:42:47 +01:00
Christopher Faulet	c392d461d6	CLEANUP: resolvers: Use ha_free() in srvrq_resolution_error_cb() Two occurrences to "free(A);A=NULL;" may be replaced by a call to ha_free() in the srvrq_resolution_error_cb() function.	2021-03-12 17:42:47 +01:00
Christopher Faulet	e8674c7184	MINOR: resolvers: Don't try to match immediatly renewed ADD items The loop looking for existing ADD items to renew their last_seen must ignore the items already renewed in the same loop. To do so, we rely on the last_seen time. because it is now based on now_ms, it is safe. Doing so avoid to match several time the same ADD item when the same IP address is found in several ADD item. This reduces the number of extra DNS resolutions. This patch depends on "MINOR: resolvers: Use milliseconds for cached items in resolver responses". Both may be backported as far as 2.2 if necessary.	2021-03-12 17:42:45 +01:00
Christopher Faulet	55c1c4053f	MINOR: resolvers: Use milliseconds for cached items in resolver responses The last time when an item was seen in a resolver responses is now stored in milliseconds instead of seconds. This avoid some corner-cases at the edges. This also simplifies time comparisons.	2021-03-12 17:41:28 +01:00
Christopher Faulet	d83a6df5cd	BUG/MEDIUM: resolvers: Skip DNS resolution at startup if SRV resolution is set At startup, if a SRV resolution is set for a server, no DNS resolution is created. We must wait the first SRV resolution to know if it must be triggered. It is important to do so for two reasons. First, during a "classical" startup, a server based on a SRV resolution has no hostname. Thus the created DNS resolution is useless. Best waiting the first SRV resolution. It is not really a bug at this stage, it is just useless. Second, in the same situation, if the server state is loaded from a file, its hosname will be set a bit later. Thus, if there is no additionnal record for this server, because there is already a DNS resolution, it inhibits any new DNS resolution. But there is no hostname attached to the existing DNS resolution. So no resolution is performed at all for this server. To avoid any problem, it is fairly easier to handle this special case during startup. But this means we must be prepared to have no "resolv_requester" field for a server at runtime. This patch must be backported as far as 2.2.	2021-03-12 17:41:28 +01:00
Christopher Faulet	0efc0993ec	BUG/MEDIUM: resolvers: Don't release resolution from a requester callbacks Another way to say it: "Safely unlink requester from a requester callbacks". Requester callbacks must never try to unlink a requester from a resolution, for the current requester or another one. First, these callback functions are called in a loop on a request list, not necessarily safe. Thus unlink resolution at this place, may be unsafe. And it is useless to try to make these loops safe because, all this stuff is placed in a loop on a resolution list. Unlink a requester may lead to release a resolution if it is the last requester. However, the unkink is necessary because we cannot reset the server state (hostname and IP) with some pending DNS resolution on it. So, to workaround this issue, we introduce the "safe" unlink. It is only performed from a requester callback. In this case, the unlink function never releases the resolution, it only reset it if necessary. And when a resolution is found with an empty requester list, it is released. This patch depends on the following commits : * MINOR: resolvers: Purge answer items when a SRV resolution triggers an error * MINOR: resolvers: Use a function to remove answers attached to a resolution * MINOR: resolvers: Directly call srvrq_update_srv_state() when possible * MINOR: resolvers: Add function to change the srv status based on SRV resolution All the series must be backported as far as 2.2. It fixes a regression introduced by the commit `b4badf720` ("BUG/MINOR: resolvers: new callback to properly handle SRV record errors"). don't release resolution from requester cb	2021-03-12 17:41:28 +01:00
Christopher Faulet	6b117aed49	MINOR: resolvers: Directly call srvrq_update_srv_state() when possible When the server status must be updated from the result of a SRV resolution, we can directly call srvrq_update_srv_state(). It is simpler and this avoid a test on the server DNS resolution. This patch is mandatory for the next commit. It also rely on "MINOR: resolvers: Directly call srvrq_update_srv_state() when possible".	2021-03-12 17:41:28 +01:00
Christopher Faulet	5efdef24c1	MINOR: resolvers: Add function to change the srv status based on SRV resolution srvrq_update_srv_status() update the server status based on result of SRV resolution. For now, it is only used from snr_update_srv_status() when appropriate.	2021-03-12 17:41:28 +01:00
Christopher Faulet	51d5e3bda7	MINOR: resolvers: Purge answer items when a SRV resolution triggers an error When a SRV request trigger an error, if we decide to handle the error because last_valid duration is expired, the answer list may be purged. All items are considered as obsolete.	2021-03-12 17:41:28 +01:00
Christopher Faulet	1dec5c7934	MINOR: resolvers: Use a function to remove answers attached to a resolution resolv_purge_resolution_answer_records() must be used to removed all answers attached to a resolution. For now, it is only used when a resolution is released.	2021-03-12 17:41:28 +01:00
Christopher Faulet	3e0600fbbf	BUG/MEDIUM: resolvers: Trigger a DNS resolution if an ADD item is obsolete When a ADD item attached to a SRV item is removed because it is obsolete, we must trigger a DNS resolution to be sure the hostname still resolves or not. There is no other way to be the entry is still valid. And we cannot set the server in RMAINT immediatly, because a DNS server may be inconsitent and may stop to add some additionnal records. The opposite is also true. If a valid ADD item is still attached to a SRV item, any DNS resolution must be stopped. There is no reason to perform extra resolution in this case. This patch must be backported as far as 2.2.	2021-03-12 17:41:28 +01:00
Christopher Faulet	49531e8471	BUG/MINOR; resolvers: Ignore DNS resolution for expired SRV item If no ADD item is found for a SRV item in a SRV response, a DNS resolution is triggered. When it succeeds, we must be sure the SRV item is still alive. Otherwise the DNS resolution must be ignored. This patch depends on the commit "MINOR: resolvers: Move last_seen time of an ADD into its corresponding SRV item". Both must be backported as far as 2.2.	2021-03-12 17:41:28 +01:00
Baptiste Assmann	6a8d11dc80	MINOR: resolvers: new function find_srvrq_answer_record() This function search for a SRV answer item associated to a requester whose type is server. This is mainly useful to "link" a server to its SRV record when no additional record were found to configure the IP address. This patch is required by a bug fix.	2021-03-12 17:41:28 +01:00
Christopher Faulet	77f860699c	BUG/MEDIUM: resolvers: Fix the loop looking for an existing ADD item For each ADD item found in a SRV response, we try to find a corresponding ADD item already attached to an existing SRV item. If found, the ADD last_seen time is updated, otherwise we try to find a SRV item with no ADD to attached the new one. However, the loop is buggy. Instead of comparing 2 ADD items, it compares the new ADD item with the SRV item. Because of this bug, we are unable to renew last_seen time of existing ADD. This patch must be backported as far as 2.2.	2021-03-12 17:41:24 +01:00
Christopher Faulet	ab177ac1f3	BUG/MEDIUM: resolvers: Don't set an address-less server as UP when a server status is updated based on a SRV item, it is always set to UP, regardless it has an IP address defined or not. For instance, if only a SRV item is received, with no additional record, only the server hostname is defined. We must wait to have an IP address to set the server as UP. This patch must be backported as far as 2.2.	2021-03-12 16:43:37 +01:00
Christopher Faulet	bca680ba90	BUG/MINOR: resolvers: Unlink DNS resolution to set RMAINT on SRV resolution When a server is set in RMAINT becaues of a SRV resolution failure, the server DNS resolution, if any, must be unlink first. It is mandatory to handle the change in the context of a SRV resolution. This patch must be backported as far as 2.2.	2021-03-12 16:43:37 +01:00
Christopher Faulet	5130c21fbb	BUG/MINOR: resolvers: Reset server address on DNS error only on status change When a DNS resolution error is detected, in snr_resolution_error_cb(), the server address must be reset only if the server status has changed. It this case, it means the server is set to RMAINT. Thus the server address may by reset. This patch fixes a bug introduced by commit `d127ffa9f` ("BUG/MEDIUM: resolvers: Reset address for unresolved servers"). It must be backported as far as 2.0.	2021-03-12 16:43:37 +01:00
Christopher Faulet	bd0227c109	BUG/MINOR: resolvers: Consider server to have no IP on DNS resolution error When an error is received for a DNS resolution, for instance a NXDOMAIN error, the server must be considered to have no address when its status is updated, not the opposite. Concretly, because this parameter is not used on error path in snr_update_srv_status(), there is no impact. This patch must be backported as far as 1.8.	2021-03-12 16:43:37 +01:00
Christopher Faulet	5037c06d91	Revert "BUG/MINOR: resolvers: Only renew TTL for SRV records with an additional record" This reverts commit `a331a1e8eb`. This commit fixes a real bug, but it also reveals some hidden bugs, mostly because of some design issues. Thus, in itself, it create more problem than it solves. So revert it for now. All known bugs will be addressed in next commits. This patch should be backported as far as 2.2.	2021-03-12 16:43:37 +01:00
Willy Tarreau	736adef511	BUG/MINOR: cfgparse/server: increment the extra keyword counter one at a time This was introduced in previous commit `49c2b45c1` ("MINOR: cfgparse/server: try to fix spelling mistakes on server lines"), the loop was changed but the increment left. No backport is needed.	2021-03-12 14:47:10 +01:00
Willy Tarreau	db67b0ed79	MINOR: tcp-rules: suggest approaching action names on mismatch This adds support for action_suggest() in tcp-request and tcp-response rules so as to propose the closest match in case of misspelling.	2021-03-12 14:13:21 +01:00
Willy Tarreau	49bf7beb14	MINOR: http-rules: suggest approaching action names on mismatch This adds support for action_suggest() in http-request, http-response and http-after-response rulesets. For example: parsing [/dev/stdin:2]: 'http-request' expects (...), but got 'del-hdr'. Did you mean 'del-header' maybe ?	2021-03-12 14:13:21 +01:00
Willy Tarreau	99eb2cc1cc	MINOR: actions: add a function to suggest an action ressembling a given word action_suggest() will return a pointer to an action whose keyword more or less ressembles the passed argument. It also accepts to be more tolerant against prefixes (since actions taking arguments are handled as prefixes). This will be used to suggest approaching words.	2021-03-12 14:13:21 +01:00
Willy Tarreau	433b05fa64	MINOR: cfgparse/bind: suggest correct spelling for unknown bind keywords Just like with the server keywords, now's the turn of "bind" keywords. The difference is that 100% of the bind keywords are registered, thus we do not need the list of extra keywords. There are multiple bind line parsers today, all were updated: - peers - log - dgram-bind - cli $ printf "listen f\nbind :8000 tcut\n" \| ./haproxy -c -f /dev/stdin [NOTICE] 070/101358 (25146) : haproxy version is 2.4-dev11-7b8787-26 [NOTICE] 070/101358 (25146) : path to executable is ./haproxy [ALERT] 070/101358 (25146) : parsing [/dev/stdin:2] : 'bind :8000' unknown keyword 'tcut'; did you mean 'tcp-ut' maybe ? [ALERT] 070/101358 (25146) : Error(s) found in configuration file : /dev/stdin [ALERT] 070/101358 (25146) : Fatal errors found in configuration.	2021-03-12 14:13:21 +01:00
Willy Tarreau	49c2b45c1d	MINOR: cfgparse/server: try to fix spelling mistakes on server lines Let's apply the fuzzy match to server keywords so that we can avoid dumping the huge list of supported keywords each time there is a spelling mistake, and suggest proper spelling instead: $ printf "listen f\nserver s 0 sendpx-v2\n" \| ./haproxy -c -f /dev/stdin [NOTICE] 070/095718 (24152) : haproxy version is 2.4-dev11-caa6e3-25 [NOTICE] 070/095718 (24152) : path to executable is ./haproxy [ALERT] 070/095718 (24152) : parsing [/dev/stdin:2] : 'server s' unknown keyword 'sendpx-v2'; did you mean 'send-proxy-v2' maybe ? [ALERT] 070/095718 (24152) : Error(s) found in configuration file : /dev/stdin [ALERT] 070/095718 (24152) : Fatal errors found in configuration.	2021-03-12 14:13:21 +01:00
Willy Tarreau	a0e8eb8caa	MINOR: cfgparse: suggest correct spelling for unknown words in global section The global section also knows a large number of keywords that are not referenced in any list, so this needed them to be specifically listed. It becomes particularly handy now because some tunables are never easy to remember, but now it works remarkably well: $ printf "global\nsched.queue_depth\n" \| ./haproxy -c -f /dev/stdin [NOTICE] 070/093007 (23457) : haproxy version is 2.4-dev11-dd8ee5-24 [NOTICE] 070/093007 (23457) : path to executable is ./haproxy [ALERT] 070/093007 (23457) : parsing [/dev/stdin:2] : unknown keyword 'sched.queue_depth' in 'global' section; did you mean 'tune.runqueue-depth' maybe ? [ALERT] 070/093007 (23457) : Error(s) found in configuration file : /dev/stdin [ALERT] 070/093007 (23457) : Fatal errors found in configuration.	2021-03-12 14:13:21 +01:00
Willy Tarreau	c0ff679481	MINOR: cfgparse: suggest correct spelling for unknown words in proxy sections Let's start by the largest keyword list, the listeners. Many keywords were still not part of a list, so a common_kw_list array was added to list the not enumerated ones. Now for example, typing "tmout" properly suggests "timeout": $ printf "frontend f\ntmout client 10s\n" \| ./haproxy -c -f /dev/stdin [NOTICE] 070/091355 (22545) : haproxy version is 2.4-dev11-3b728a-21 [NOTICE] 070/091355 (22545) : path to executable is ./haproxy [ALERT] 070/091355 (22545) : parsing [/dev/stdin:2] : unknown keyword 'tmout' in 'frontend' section; did you mean 'timeout' maybe ? [ALERT] 070/091355 (22545) : Error(s) found in configuration file : /dev/stdin [ALERT] 070/091355 (22545) : Fatal errors found in configuration.	2021-03-12 14:13:21 +01:00
Willy Tarreau	e2afcc4509	MINOR: cfgparse: add cfg_find_best_match() to suggest an existing word Instead of just reporting "unknown keyword", let's provide a function which will look through a list of registered keywords for a similar-looking word to the one that wasn't matched. This will help callers suggest correct spelling. Also, given that a large part of the config parser still relies on a long chain of strcmp(), we'll need to be able to pass extra candidates. Thus the function supports an optional extra list for this purpose.	2021-03-12 14:13:21 +01:00
Willy Tarreau	ba2c4459a5	MINOR: tools: add simple word fingerprinting to find similar-looking words This introduces two functions, one which creates a fingerprint of a word, and one which computes a distance between two words fingerprints. The fingerprint is made by counting the transitions between one character and another one. Here we consider the 26 alphabetic letters regardless of their case, then any digit as a digit, and anything else as "other". We also consider the first and last locations as transitions from begin to first char, and last char to end. The distance is simply the sum of the squares of the differences between two fingerprints. This way, doubling/ missing a letter has the same cost, however some repeated transitions such as "e"->"r" like in "server" are very unlikely to match against situations where they do not exist. This is a naive approach but it seems to work sufficiently well for now. It may be refined in the future if needed.	2021-03-12 14:13:21 +01:00
Willy Tarreau	25809999fe	CLEANUP: http-rules: remove the unexpected comma before the list of action keywords The error message for http-request and http-response starts with a comma that very likely is a leftover from a previous list construct. Let's remove it: "'http-request' expects , 'wait-for-handshake', 'use-service' ...".	2021-03-12 14:13:20 +01:00
Willy Tarreau	3d1d178933	CLEANUP: vars: make the error message clearer on missing arguments for set-var The error message after "http-response set-var" isn't very clear: [ALERT] 070/115043 (30526) : parsing [/dev/stdin:2] : error detected in proxy 'f' while parsing 'http-response set-var' rule : invalid variable 'set-var'. Expects 'set-var(<var-name>)' or 'unset-var(<var-name>)'. Let's change it to this instead: [ALERT] 070/115608 (30799) : parsing [/dev/stdin:2] : error detected in proxy 'f' while parsing 'http-response set-var' rule : invalid or incomplete action 'set-var'. Expects 'set-var(<var-name>)' or 'unset-var(<var-name>)'. With a wrong action name, it also works better (it's handled as a prefix due to the opening parenthesis): [ALERT] 070/115608 (30799) : parsing [/dev/stdin:2] : error detected in proxy 'f' while parsing 'http-response set-varxxx' rule : invalid or incomplete action 'set-varxxx'. Expects 'set-var(<var-name>)' or 'unset-var(<var-name>)'.	2021-03-12 14:13:20 +01:00
Willy Tarreau	72d012fbd9	CLEANUP: tcp-rules: add missing actions in the tcp-request error message The tcp-request error message only mentions "accept", "reject" and track-sc*, but there are a few other ones that were missing, so let's add them. This could be backported, though it's not likely that it will help anyone with an existing config.	2021-03-12 14:13:20 +01:00
Willy Tarreau	47a30c456c	BUG/MINOR: server-state: use the argument, not the global state The refactoring in commit `131b07be3` ("MEDIUM: server: Refactor apply_server_state() to make it more readable") also had a copy-paste error resulting in using global.server_state_file instead of the function's argument, which easily crashes with a conf having a state file in a backend and no global state file. In addition, let's simplify the code and get rid of strcpy() which almost certainly will break the build on OpenBSD. This was introduced in 2.4-dev10, no backport is needed.	2021-03-12 14:13:07 +01:00
Willy Tarreau	6d4173e622	BUG/MINOR: server-state: properly handle the case where the base is not set The refactoring in commit `131b07be3` ("MEDIUM: server: Refactor apply_server_state() to make it more readable") made the global server_state_base be dereferenced before being checked, resulting in a crash on certain files. This happened in 2.4-dev10, no backport is needed.	2021-03-12 13:57:19 +01:00
Christopher Faulet	cd03be73d5	BUG/MINOR: tcpcheck: Fix double free on error path when parsing tcp/http-check When a "tcp-check" or a "http-check" rule is parsed, we try to get the previous rule in the ruleset to get its index. We must take care to reset the pointer on this rule in case an error is triggered later on the parsing. Otherwise, the same rule may be released twice. For instance, it happens with such line : http-check meth GET uri / ## note there is no "send" parameter This patch must be backported as far as 2.2.	2021-03-12 13:17:46 +01:00
Christopher Faulet	24ec943427	BUG/MINOR: tcpcheck: Update .health threshold of agent inside an agent-check If an agent-check is configured for a server, When the response is parsed, the .health threshold of the agent must be updated on up/down/stopped/fail command and not the threshold of the health-check. Otherwise, the agent-check will compete with the health-check and may mark a DOWN server as UP. This patch should fix the issue #1176. It must be backported as far as 2.2.	2021-03-12 09:25:45 +01:00
Christopher Faulet	5647fbacdf	BUG/MEDIUM: filters: Set CF_FL_ANALYZE on channels when filters are attached CF_FL_ANALYZE flag is used to know a channel is filtered. It is important to synchronize request and response channels when the filtering ends. However, it is possible to call all request analyzers before starting the filtering on the response channel. This means flt_end_analyze() may be called for the request channel before flt_start_analyze() on the response channel. Thus because CF_FL_ANALYZE flag is not set on the response channel, we consider the filtering is finished on both sides. The consequence is that flt_end_analyze() is not called for the response and backend filters are unregistered before their execution on the response channel. It is possible to encounter this bug on TCP frontend or CONNECT request on HTTP frontend if the client shutdown is reveiced with the first read. To fix this bug, CF_FL_ANALYZE is set when filters are attached to the stream. It means, on the request channel when the stream is created, in flt_stream_start(). And on both channels when the backend is set, in flt_set_stream_backend(). This patch must be backported as far as 1.7.	2021-03-12 09:25:45 +01:00
Emeric Brun	362d25e507	BUG/MEDIUM: stick-tables: fix ref counter in table entry using multiple http tracksc. Setting multiple http-request track-scX rules generates entries which never expires. If there was already an entry registered by a previous http rule 'stream_track_stkctr(&s->stkctr[rule->action], t, ts)' didn't register the new 'ts' into the stkctr. And function is left with no reference on 'ts' whereas refcount had been increased by the '_get_entry' The patch applies the same policy as the one showed on tcp track rules and if there is successive rules the track counter keep the first entry registered in the counter and nothing more is computed. After validation this should be backported in all versions.	2021-03-11 14:14:44 +01:00
Willy Tarreau	060a761248	OPTIM: task: automatically adjust the default runqueue-depth to the threads The recent default runqueue size reduction appeared to have significantly lowered performance on low-thread count configs. Testing various values runqueue values on different workloads under thread counts ranging from 1 to 64, it appeared that lower values are more optimal for high thread counts and conversely. It could even be drawn that the optimal value for various workloads sits around 280/sqrt(nbthread), and probably has to do with both the L3 cache usage and how to optimally interlace the threads' activity to minimize contention. This is much easier to optimally configure, so let's do this by default now.	2021-03-10 11:15:34 +01:00
Willy Tarreau	1691ba3693	MINOR: task: give the scheduler a bit more flexibility in the runqueue size Instead of setting a hard-limit on runqueue-depth and keeping it short to maintain fairness, let's allow the scheduler to automatically cut the existing one in two equal halves if its size is between the configured size and its double. This will allow to increase the default value while keeping a low latency.	2021-03-10 11:15:34 +01:00
Willy Tarreau	4c48edba4f	BUG/MEDIUM: ssl: properly remove the TASK_HEAVY flag at end of handshake Emeric found that SSL+keepalive traffic had dropped quite a bit in the recent changes, which could be bisected to recent commit `9205ab31d` ("MINOR: ssl: mark the SSL handshake tasklet as heavy"). Indeed, a first incarnation of this commit made use of the TASK_SELF_WAKING flag but the last version directly used TASK_HEAVY, but it would still continue to remove the already absent TASK_SELF_WAKING one instead of TASK_HEAVY. As such, the SSL traffic remained processed with low granularity. No backport is needed as this is only 2.4.	2021-03-09 17:58:02 +01:00
Willy Tarreau	5a1c7280a9	CLEANUP: config: also address the cfg_keyword API change in the compression code The tests were made on slz and the zlib parsers for memlevel and windowsize managed to escape the change made by commit `018251667` ("CLEANUP: config: make the cfg_keyword parsers take a const for the defproxy"). This is now fixed.	2021-03-09 16:57:08 +01:00
Willy Tarreau	e89fae3a4e	CLEANUP: stream: rename a few remaining occurrences of "stream *sess" These are some leftovers from the ancient code where they were still called sessions, but these areas in the code remain confusing due to this naming. They were now called "strm" which will not even affect indenting nor alignment.	2021-03-09 15:44:33 +01:00
William Lallemand	36119de182	BUG/MEDIUM: session: NULL dereference possible when accessing the listener When implementing a client applet, a NULL dereference was encountered on the error path which increment the counters. Indeed, the counters incremented are the one in the listener which does not exist in the case of client applets, so in sess->listener->counters, listener is NULL. This patch fixes the access to the listener structure when accessing from a sesssion, most of the access are the counters in error paths. Must be backported as far as 1.8.	2021-03-09 12:51:42 +01:00
Willy Tarreau	018251667e	CLEANUP: config: make the cfg_keyword parsers take a const for the defproxy The default proxy was passed as a variable to all parsers instead of a const, which is not without risk, especially when some timeout parsers used to make some int pointers point to the default values for comparisons. We want to be certain that none of these parsers will modify the defaults sections by accident, so it's important to mark this proxy as const. This patch touches all occurrences found (89).	2021-03-09 10:09:43 +01:00
Willy Tarreau	b7e0c633e8	BUILD: task: fix build at -O0 with threads disabled grq_total was incremented when picking tasks from the global run queue, but this variable was not defined with threads disabled, and the code was optimized away at -O2. No backport is needed.	2021-03-09 10:01:01 +01:00
Tim Duesterhus	56c176a780	CLEANUP: connection: Consistently use `struct ist` to process all TLV types Instead of directly poking around within the `struct tlv tlv_packet` the actual value will be consumed using a `struct ist`.	2021-03-09 09:24:32 +01:00
Tim Duesterhus	615f81eb5a	MINOR: connection: Use a `struct ist` to store proxy_authority This makes the code cleaner, because proxy_authority can be handled like proxy_unique_id.	2021-03-09 09:24:32 +01:00
Tim Duesterhus	002bd77a6e	CLEANUP: connection: Use istptr / istlen for proxy_unique_id Don't access the ist's fields directly, use the helper functions instead.	2021-03-09 09:24:32 +01:00
Ilya Shipitsin	d7a988c14a	CLEANUP: assorted typo fixes in the code and comments This is 19th iteration of typo fixes	2021-03-05 21:22:47 +01:00
Amaury Denoyelle	249f0562cf	BUG/MINOR: backend: fix condition for reuse on mode HTTP This commit is a fix/complement to the following one : `08d87b3f49` BUG/MEDIUM: backend: never reuse a connection for tcp mode It fixes the check for the early insertion of backend connections in the reuse lists if the backend mode is HTTP. The impact of this bug seems limited because : - in tcp mode, no insertion is done in the avail list as mux_pt does not support multiple streams. - in http mode, muxes are also responsible to insert backend connections in lists in their detach functions. Prior to this fix the reuse rate could be slightly inferior. It can be backported to 2.3.	2021-03-05 15:44:51 +01:00
Amaury Denoyelle	d7faa3d6e9	MINOR: backend: add a BUG_ON if conn mux NULL in connect_server Currently, there seems to be no way to have the transport layer ready but not the mux in the function connect_server. Add a BUG_ON to report if this implicit condition is not true anymore. This should fix coverity report from github issue #1120.	2021-03-05 15:27:41 +01:00
Willy Tarreau	d4e78d873c	MINOR: server: move actconns to the per-thread structure The actconns list creates massive contention on low server counts because it's in fact a list of streams using a server, all threads compete on the list's head and it's still possible to see some watchdog panics on 48 threads under extreme contention with 47 threads trying to add and one thread trying to delete. Moving this list per thread is trivial because it's only used by srv_shutdown_streams(), which simply required to iterate over the list. The field was renamed to "streams" as it's really a list of streams rather than a list of connections.	2021-03-05 15:00:24 +01:00
Willy Tarreau	430bf4a483	MINOR: server: allocate a per-thread struct for the per-thread connections stuff There are multiple per-thread lists in the listeners, which isn't the most efficient in terms of cache, and doesn't easily allow to store all the per-thread stuff. Now we introduce an srv_per_thread structure which the servers will have an array of, and place the idle/safe/avail conns tree heads into. Overall this was a fairly mechanical change, and the array is now always initialized for all servers since we'll put more stuff there. It's worth noting that the Lua code still has to deal with its own deinit by itself despite being in a global list, because its server is not dynamically allocated.	2021-03-05 15:00:24 +01:00
Willy Tarreau	4cdac166e0	MINOR: cfgparse: finish to set up servers outside of the proxy setup loop Till now servers were only initialized as part of the proxy setup loop, which doesn't cover peers, tcp log, dns, lua etc. Let's move this part out of this loop and instead iterate over all registered servers. This way we're certain to visit them all. The patch looks big but it's just a move of a large block with the corresponding reindent (as can be checked with diff -b). It relies on the two previous ones ("MINOR: server: add a global list of all known servers and" and "CLEANUP: lua: set a dummy file name and line number on the dummy servers").	2021-03-05 15:00:24 +01:00
Willy Tarreau	198e92a8e5	MINOR: server: add a global list of all known servers It's a real pain not to have access to the list of all registered servers, because whenever there is a need to late adjust their configuration, only those attached to regular proxies are seen, but not the peers, lua, logs nor DNS. What this patch does is that new_server() will automatically add the newly created server to a global list, and it does so as well for the 1 or 2 statically allocated servers created for Lua. This way it will be possible to iterate over all of them.	2021-03-05 15:00:24 +01:00
Willy Tarreau	0f143afe1b	CLEANUP: lua: set a dummy file name and line number on the dummy servers The "socket_tcp" and "socket_ssl" servers had no config file name nor line number, but this is sometimes annoying during debugging or later in error messages, while all other places using new_server() or parse_server() make sure to have a valid file:line set. Let's set something to address this.	2021-03-05 15:00:24 +01:00
Willy Tarreau	5b5974104f	CLEANUP: sockpair: silence a coverity check about fcntl() This is about coverity complaining that we didn't check the fcntl call which can't fail, let's consume it. This is issue #1158.	2021-03-05 14:33:13 +01:00
Willy Tarreau	4149168255	MEDIUM: ssl: implement xprt_set_used and xprt_set_idle to relax context checks Currently the SSL layer checks the validity of its tasklet's context just in case it would have been stolen, had the connection been idle. Now it will be able to be notified by the mux when this situation happens so as not to have to grab the idle connection lock on each pass. This reuses the TASK_F_USR1 flag just as the muxes do.	2021-03-05 08:30:08 +01:00
Willy Tarreau	4f8cd4397f	MINOR: xprt: add new xprt_set_idle and xprt_set_used methods These functions are used on the mux layer to indicate that the connection is becoming idle and that the xprt ought to be careful before checking the context or that it's not idle anymore and that the context is safe. The purpose is to allow a mux which is going to release a connection to tell the xprt to be careful when touching it. At the moment, the xprt are always careful and that's costly so we want to have the ability to relax this a bit. No xprt layer uses this yet.	2021-03-05 08:30:08 +01:00
Willy Tarreau	e388f2fbca	MEDIUM: muxes: mark idle conns tasklets with TASK_F_USR1 The muxes are touching the idle_conns_lock all the time now because they need to be careful that no other thread has stolen their tasklet's context. This patch changes this a little bit by setting the TASK_F_USR1 flag on the tasklet before marking a connection idle, and removing it once it's not idle anymore. Thanks to this we have the guarantee that a tasklet without this flag cannot be present in an idle list and does not need to go through this costly lock. This is especially true for front connections.	2021-03-05 08:30:08 +01:00
Willy Tarreau	6fa8bcdc78	MINOR: task: add an application specific flag to the state: TASK_F_USR1 This flag will be usable by any application. It will be preserved across wakeups so the application can use it to do various stuff. Some I/O handlers will soon benefit from this.	2021-03-05 08:30:08 +01:00
Willy Tarreau	144f84a09d	MEDIUM: task: extend the state field to 32 bits It's been too short for quite a while now and is now full. It's still time to extend it to 32-bits since we have room for this without wasting any space, so we now gained 16 new bits for future flags. The values were not reassigned just in case there would be a few hidden u16 or short somewhere in which these flags are placed (as it used to be the case with stream->pending_events). The patch is tagged MEDIUM because this required to update the task's process() prototype to use an int instead of a short, that's quite a bunch of places.	2021-03-05 08:30:08 +01:00
Willy Tarreau	db4e238938	MINOR: task: stop abusing the nice field to detect a tasklet It's cleaner to use a flag from the task's state to detect a tasklet and it's even cheaper. One of the best benefits is that this will allow to get the nice field out of the common part since the tasklet doesn't need it anymore. This commit uses the last task bit available but that's temporary as the purpose of the change is to extend this.	2021-03-05 08:30:08 +01:00
Ubuntu	1adaddb494	OPTIM: lb-random: use a cheaper PRNG to pick a server The PRNG used by the "random" LB algorithm was the central one which tries hard to produce "correct" (i.e. hardly predictable) values suitable for use in UUIDs or cookies. It's much too expensive for pure load balancing where a cheaper thread-local PRNG is sufficient, and the current PRNG is part of the hot places when running with many threads. Let's switch to the stastistical PRNG instead, it's thread-local, very fast, and with a period of (2^32)-1 which is more than enough to decide on a server.	2021-03-05 08:30:08 +01:00
Willy Tarreau	06e69b556c	REORG: tools: promote the debug PRNG to more general use as a statistical one We frequently need to access a simple and fast PRNG for statistical purposes. The debug_prng() function did exactly this using a xorshift generator but its use was limited to debug only. Let's move this to tools.h and tools.c to make it accessible everywhere. Since it needs to be fast, its state is thread-local. An initialization function starts a different initial value for each thread for better distribution.	2021-03-05 08:30:08 +01:00
Ubuntu	b1adf03df9	MEDIUM: backend: use a trylock when trying to grab an idle connection In conn_backend_get() we can cause some extreme contention due to the idle_conns_lock. Indeed, even though it's per-thread, it still causes high contention when running with many threads. The reason is that all threads which do not have any idle connections are quickly skipped, till the point where there are still some, so the first reaching that point will grab the lock and the other ones wait behind. From this point, all threads are synchronized waiting on the same lock, and will follow the leader in small jumps, all hindering each other. Here instead of doing this we're using a trylock. This way when a thread is already checking a list, other ones will continue to next thread. In the worst case, a high contention will lead to a few new connections to be set up, but this may actually be what is required to avoid contention in the first place. With this change, the contention has mostly disappeared on this lock (it's still present in muxes and transport layers due to the takeover). Surprisingly, checking for emptiness of the tree root before taking the lock didn't address any contention. A few improvements are still possible and desirable here. The first one would be to avoid seeing all threads jump to the next one. We could have each thread use a different prime number as the increment so as to spread them across the entire table instead of keeping them synchronized. The second one is that the lock in the muck layers shouldn't be needed to check for the tasklet's context availability.	2021-03-05 08:30:08 +01:00
Willy Tarreau	2f67e54dca	MINOR: stream: use ABORT_NOW() and not abort() in stream_dump_and_crash() Using abort() occasionally results in unexploitable core due to issues rewinding the stack. Let's use ABORT_NOW() which in addition to crashing much closer to the call point also has the benefit of showing the call trace.	2021-03-05 08:30:08 +01:00
Willy Tarreau	0bae075928	MEDIUM: pools: add CONFIG_HAP_NO_GLOBAL_POOLS and CONFIG_HAP_GLOBAL_POOLS We've reached a point where the global pools represent a significant bottleneck with threads. On a 64-core machine, the performance was divided by 8 between 32 and 64 H2 connections only because there were not enough entries in the local caches to avoid picking from the global pools, and the contention on the list there was very high. It becomes obvious that we need to have an array of lists, but that will require more changes. In parallel, standard memory allocators have improved, with tcmalloc and jemalloc finding their ways through mainstream systems, and glibc having upgraded to a thread-aware ptmalloc variant, keeping this level of contention here isn't justified anymore when we have both the local per-thread pool caches and a fast process-wide allocator. For these reasons, this patch introduces a new compile time setting CONFIG_HAP_NO_GLOBAL_POOLS which is set by default when threads are enabled with thread local pool caches, and we know we have a fast thread-aware memory allocator (currently set for glibc>=2.26). In this case we entirely bypass the global pool and directly use the standard memory allocator when missing objects from the local pools. It is also possible to force it at compile time when a good allocator is used with another setup. It is still possible to re-enable the global pools using CONFIG_HAP_GLOBAL_POOLS, if a corner case is discovered regarding the operating system's default allocator, or when building with a recent libc but a different allocator which provides other benefits but does not scale well with threads.	2021-03-05 08:30:08 +01:00
Willy Tarreau	566cebc1fc	BUG/MINOR: ssl: don't truncate the file descriptor to 16 bits in debug mode Errors reported by ssl_sock_dump_errors() to stderr would only report the 16 lower bits of the file descriptor because it used to be casted to ushort. This can be backported to all versions but has really no importance in practice since this is never seen.	2021-03-05 08:30:08 +01:00
Tim Duesterhus	1568355afd	CLEANUP: Replace for loop with only a condition by while Refactoring performed with the following Coccinelle patch: @@ expression e; statement S; @@ - for (;e;) + while (e) S	2021-03-05 08:28:53 +01:00
Tim Duesterhus	dcf753aabe	CLEANUP: Use the ist() macro whenever possible Refactoring performed with the following Coccinelle patch: @@ char *s; @@ ( - ist2(s, strlen(s)) + ist(s) \| - ist2(strdup(s), strlen(s)) + ist(strdup(s)) ) Note that this replacement is safe even in the strdup() case, because `ist()` will not call `strlen()` on a `NULL` pointer. Instead is inserts a length of `0`, effectively resulting in `IST_NULL`.	2021-03-05 08:28:53 +01:00
Christopher Faulet	1e711beb51	CLEANUP: dns: Remove useless test on ns->dgram in dns_connect_nameserver() When dns_connect_nameserver() is called, the nameserver has always a dgram field properly defined. The caller, dns_send_nameserver(), already performed the appropriate verification.	2021-03-04 16:58:36 +01:00
Christopher Faulet	1a1b674c2c	CLEANUP: dns: Use DISGUISE() on a never-failing ring_attach() call When a DNS session is created, the call to ring_attach() never fails. The ring is freshly initialized and there is other watcher on it. Thus, the call always succeeds. Instead of catching an error that must never happen, we use the DISGUISE() macro to make static analyzers happy.	2021-03-04 16:53:28 +01:00
Christopher Faulet	6f69110191	BUG/MINOR: server-state: Don't load server-state file for disabled backends Recent changes on the server-state file loading have introduced a regression. HAproxy crashes if a backend with no server-state file is disabled in the configuration. Indeed, configuration of such backends is not finalized. Thus many fields are not defined. To fix the bug, disabled backends must be ignored. In addition a BUG_ON() has been added to verify the proxy mode regarding the server-state file. It must be specified (none, global or local) for enabled backends. No backport needed.	2021-03-04 16:49:10 +01:00
Christopher Faulet	2ec4e3c1ac	BUG/MINOR: hlua: Don't strip last non-LWS char in hlua_pushstrippedstring() hlua_pushstrippedstring() function strips leading and trailing LWS characters. But the result length it too short by 1 byte. Thus the last non-LWS character is stripped. Note that a string containing only LWS characters resulting to a stipped string with an invalid length (-1). This leads to a lua runtime error. This bug was reported in the issue #1155. It must be backported as far as 1.7.	2021-03-03 19:48:12 +01:00
Amaury Denoyelle	8ede3db080	MINOR: backend: handle reuse for conns with no server as target If dispatch mode or transparent backend is used, the backend connection target is a proxy instead of a server. In these cases, the reuse of backend connections is not consistent. With the default behavior, no reuse is done and every new request uses a new connection. However, if http-reuse is set to never, the connection are stored by the mux in the session and can be reused for future requests in the same session. As no server is used for these connections, no reuse can be made outside of the session, similarly to http-reuse never mode. A different http-reuse config value should not have an impact. To achieve this, mark these connections as private to have a defined behavior. For this feature to properly work, the connection hash has been slightly adjusted. The server pointer as an input as been replaced by a generic target pointer to refer to the server or proxy instance. The hash is always calculated on connect_server even if the connection target is not a server. This also requires to allocate the connection hash node for every backend connections, not just the one with a server target.	2021-03-03 11:31:19 +01:00
Amaury Denoyelle	68967e595b	BUG/MINOR: backend: free allocated bind_addr if reuse conn Fix a leak in connect_server which happens when a connection is reused and a bind_addr was allocated because transparent mode is active. The connection has already an allocated bind_addr so free the newly allocated one. No backport needed.	2021-03-03 11:28:02 +01:00
Amaury Denoyelle	603657835f	CLEANUP: backend: fix a wrong comment missing 'not' when skipping reuse if proxy mode not HTTP	2021-03-03 11:28:02 +01:00
Tim Duesterhus	7b5777d9b4	CLEANUP: Use isttest(const struct ist) whenever possible Refactoring performed with the following Coccinelle patch: @@ struct ist i; @@ - i.ptr != NULL + isttest(i)	2021-03-03 05:07:10 +01:00
Tim Duesterhus	154374cbc8	CLEANUP: Use istadv(const struct ist, const size_t) whenever possible Refactoring performed with the following Coccinelle patch: @@ struct ist i; expression e; @@ - i.ptr += e; - i.len -= e; + i = istadv(i, e);	2021-03-03 05:07:10 +01:00
Tim Duesterhus	9f75ed114f	CLEANUP: Reapply the ist2() replacement patch One location was not matched due to a typo. Reapply the patch for consistency. see `92c696e663` see `a3298023b0`	2021-03-03 05:07:10 +01:00
Tim Duesterhus	a3298023b0	BUG/MINOR: mux-h2: Fix typo in scheme adjustment That comma should've been a semicolon. Fortunately, as it is now there is no impact thanks to operators precedence, and all expressions are properly evaluated. But this is troubling and the risk is high to turn it into an effective bug with a minor change. Introduced in `b8ce8905cf` which first appeared in 2.1-dev3. This fix must be backported to 2.1+.	2021-03-02 14:13:57 +01:00
Frédéric Lécaille	f57c64fc06	BUILD: proxy: Missing header inclusion for quic_transport_params_init() Since this commit: `144289b45` ("REORG: move init_default_instance() to proxy.c and pass it the defproxy pointer") as quic_transport_params_init() has been moved from cfgparse.c to proxy.c this latter source file must include xprt_quic.h header. Should fix #1153 issue.	2021-03-02 09:45:49 +01:00
Tim Duesterhus	68a088d851	CLEANUP: Use IST_NULL whenever possible Refactoring performed with the following Coccinelle patch: @@ @@ - ist2(NULL, 0) + IST_NULL	2021-03-01 15:44:28 +01:00
Tim Duesterhus	92c696e663	CLEANUP: Use ist2(const void*, size_t) whenever possible Refactoring performed with the following Coccinelle patch: @@ struct ist i; expression p, l; @@ - i.ptr = p; - i.len = l; + i = ist2(p, l);	2021-03-01 15:44:20 +01:00
Christopher Faulet	9e647e5af7	BUG/MEDIUM: spoe: Kill applets if there are pending connections and nbthread > 1 When the processing stage is finished for a SPOE applet, before returning it into the idle list, we check if the assigned server appears as full or if there are some pending connections on the backend or the assigned server. If yes, it means we reach a maxconn and we close the applet to free a slot. Otherwise, the applet can be reused. This test is only performed if there are more than one thread. It is important to close SPOE applets when there are pending connections for multithreaded instances because connections with the SPOE agents are persistent and local to a thread (applets are local to a thread). If a maxconn is configured, some threads may take all available slots for a while, leaving remaining threads without any free slot to process SPOE messages. It is especially true if the maxconn is low. This patch should fix the issue #705. It must be backported as far as 1.8. However, the code in 1.8 is quite different, a test must be performed to be sure it works well.	2021-03-01 15:10:19 +01:00
Christopher Faulet	ae3056157c	BUG/MINOR: connection: Use the client's dst family for adressless servers When the selected server has no address, the destination address of the client is used. However, for now, only the address is set, not the family. Thus depending on how the server is configured and the client's destination address, the server address family may be wrong. For instance, with such server : server srv 0.0.0.0:0 The server address family is AF_INET. The server connection will fail if a client is asking for an IPv6 destination. To fix the bug, we take care to set the rigth family, the family of the client destination address. This patch should fix the issue #202. It must be backported to all stable versions.	2021-03-01 11:34:00 +01:00
Christopher Faulet	e01ca0fbc9	BUG/MINOR: tcp-act: Don't forget to set the original port for IPv4 set-dst rule If an IPv4 is set via a TCP/HTTP set-dst rule, the original port must be preserved or set to 0 if the previous family was neither AF_INET nor AF_INET6. The first case is not an issue because the port remains the same. But if the previous family was, for instance, AF_UNIX, the port is not set to 0 and have an undefined value. This patch must be backported as far as 1.7.	2021-03-01 11:28:54 +01:00
Ilya Shipitsin	0de36adb5c	CLEANUP: assorted typo fixes in the code and comments This is 18th iteration of typo fixes	2021-02-27 09:01:43 +01:00
Willy Tarreau	3bda3f422e	CLEANUP: ssl: use realloc() instead of free()+malloc() There was a free(ptr) followed by ptr=malloc(ptr, len), which is the equivalent of ptr = realloc(ptr, len) but slower and less clean. Let's replace this.	2021-02-26 21:27:33 +01:00
Willy Tarreau	e709e82173	CLEANUP: ssl: make ssl_sock_free_srv_ctx() zero the pointers after free In ssl_sock_free_srv_ctx() there are some calls to free() which are not followed by a zeroing of the pointers. For now this function is only used during deinit but it could be used at run time in the near future, so better secure this.	2021-02-26 21:23:06 +01:00
Willy Tarreau	01acf563a7	CLEANUP: ssl: remove a useless "if" before freeing an error message Just an old "if (err) free(err)" that managed to escape cleanups.	2021-02-26 21:22:20 +01:00
Willy Tarreau	5b52b00393	CLEANUP: vars: always zero the pointers after a free() In sample_store(), depending on the new sample types, the area pointer was not always zeroed after being freed. Let's make sure it's always the case to avoid the risk of dangling pointers being misused.	2021-02-26 21:21:21 +01:00
Willy Tarreau	35cd734356	CLEANUP: config: replace a few free() with ha_free() A few occurrences of calls to free() to free a section name, peers name or server name were using casts and didn't include the trailing free, let's switch them to ha_free().	2021-02-26 21:21:21 +01:00
Willy Tarreau	61cfdf4fd8	CLEANUP: tree-wide: replace free(x);x=NULL with ha_free(&x) This makes the code more readable and less prone to copy-paste errors. In addition, it allows to place some __builtin_constant_p() predicates to trigger a link-time error in case the compiler knows that the freed area is constant. It will also produce compile-time error if trying to free something that is not a regular pointer (e.g. a function). The DEBUG_MEM_STATS macro now also defines an instance for ha_free() so that all these calls can be checked. 178 occurrences were converted. The vast majority of them were handled by the following Coccinelle script, some slightly refined to better deal with "&*x" or with long lines: @ rule @ expression E; @@ - free(E); - E = NULL; + ha_free(&E); It was verified that the resulting code is the same, more or less a handful of cases where the compiler optimized slightly differently the temporary variable that holds the copy of the pointer. A non-negligible amount of {free(str);str=NULL;str_len=0;} are still present in the config part (mostly header names in proxies). These ones should also be cleaned for the same reasons, and probably be turned into ist strings.	2021-02-26 21:21:09 +01:00
Christopher Faulet	29e9326f2f	CLEANUP: hlua: Use net_addr structure internally to parse and compare addresses hlua_addr structure may be replaced by net_addr structure to parse and compare addresses. Both structures are similar.	2021-02-26 13:53:26 +01:00
Christopher Faulet	5d1def623a	MEDIUM: http-ana: Add IPv6 support for forwardfor and orignialto options A network may be specified to avoid header addition for "forwardfor" and "orignialto" option via the "except" parameter. However, only IPv4 networks/addresses are supported. This patch adds the support of IPv6. To do so, the net_addr structure is used to store the parameter value in the proxy structure. And ipcmp2net() function is used to perform the comparison. This patch should fix the issue #1145. It depends on the following commit: * c6ce0ab MINOR: tools: Add function to compare an address to a network address * 5587287 MINOR: tools: Add net_addr structure describing a network addess	2021-02-26 13:52:48 +01:00
Christopher Faulet	9553de7fec	MINOR: tools: Add function to compare an address to a network address ipcmp2net() function may be used to compare an addres (struct sockaddr_storage) to a network address (struct net_addr). Among other things, this function will be used to add support of IPv6 for "except" parameter of "forwardfor" and "originalto" options.	2021-02-26 13:52:06 +01:00
Christopher Faulet	cccded98c7	BUG/MINOR: http-ana: Only consider dst address to process originalto option When an except parameter is used for originalto option, only the destination address must be evaluated. Especially, the address family of the destination must be tested and not the source one. This patch must be backported to all stable versions. However be careful, depending the versions the code may be slightly different.	2021-02-26 13:32:14 +01:00
Willy Tarreau	76390dac06	MINOR: task: only limit TL_HEAVY tasks but not others The preliminary approach to dealing with heavy tasks forced us to quit the poller after meeting one. Now instead we process at most one per poll loop and ignore the next ones, so that we get more bandwidth to process all other classes. Doing so further reduced the induced HTTP request latency at 100k req/s under the stress of 1000 concurrent SSL handshakes in the following proportions: \| default \| low-latency ---------+------------+-------------- before \| 2.75 ms \| 2.0 ms after \| 1.38 ms \| 0.98 ms In both cases, the latency is roughly halved. It's worth noting that both values are now exactly 10 times better than in 2.4-dev9. Even the percentiles have much improved. For 16 HTTP connections (1 per thread) competing with 1000 SSL handshakes, we're seeing these long-tail latencies (in milliseconds) : \| 99.5% \| 99.9% \| 100% -----------+---------+---------+-------- 2.4-dev9 \| 48.4 \| 58.1 \| 78.5 previous \| 6.2 \| 11.4 \| 67.8 this patch \| 2.8 \| 2.9 \| 6.1 The task latency profiling report now shows this in default mode: $ socat - /tmp/sock1 <<< "show profiling" Per-task CPU profiling : on # set profiling tasks {on\|auto\|off} Tasks activity: function calls cpu_tot cpu_avg lat_tot lat_avg si_cs_io_cb 3061966 2.224s 726.0ns 42.03s 13.72us h1_io_cb 3061960 6.418s 2.096us 18.76m 367.6us process_stream 3059982 9.137s 2.985us 15.52m 304.3us ssl_sock_io_cb 602657 4.265m 424.7us 4.736h 28.29ms h1_timeout_task 202973 - - 6.254s 30.81us accept_queue_process 135547 1.179s 8.699us 16.29s 120.1us srv_cleanup_toremove_conns 81 15.64ms 193.1us 30.87ms 381.1us task_run_applet 10 758.7us 75.87us 51.77us 5.176us srv_cleanup_idle_conns 4 375.3us 93.83us 54.52us 13.63us And this in low-latency mode, showing that both si_cs_io_cb() and process_stream() have significantly benefitted from the improvement, with values 50 to 200 times smaller than 2.4-dev9: $ socat - /tmp/sock1 <<< "show profiling" Per-task CPU profiling : on # set profiling tasks {on\|auto\|off} Tasks activity: function calls cpu_tot cpu_avg lat_tot lat_avg h1_io_cb 6407006 11.86s 1.851us 31.14m 291.6us process_stream 6403890 18.40s 2.873us 2.134m 20.00us si_cs_io_cb 6403866 4.139s 646.0ns 1.773m 16.61us ssl_sock_io_cb 894326 6.407m 429.9us 7.326h 29.49ms h1_timeout_task 301189 - - 8.440s 28.02us accept_queue_process 211989 1.691s 7.977us 21.48s 101.3us srv_cleanup_toremove_conns 220 23.46ms 106.7us 65.61ms 298.2us task_run_applet 16 1.219ms 76.17us 181.7us 11.36us srv_cleanup_idle_conns 12 713.3us 59.44us 168.4us 14.03us The changes are slightly more invasive than previous ones and depend on recent patches so they are not likely well suited for backporting.	2021-02-26 12:00:53 +01:00
Willy Tarreau	826fa87246	MINOR: task: place the heavy elements in TL_HEAVY Instead of placing heavy tasklets into the TL_BULK queue, we now place them into the TL_HEAVY one, which is assigned a default weight of ~1% load at once. This way heavy tasks will not block TL_BULK anymore.	2021-02-26 12:00:53 +01:00
Willy Tarreau	401135cee6	MINOR: task: add one extra tasklet class: TL_HEAVY This class will be used exclusively for heavy processing tasklets. It will be cleaner than mixing them with the bulk ones. For now it's allocated ~1% of the CPU bandwidth. The largest part of the patch consists in re-arranging the fields in the task_per_thread structure to preserve a clean alignment with one more list head. Since we're now forced to increase the struct past a second cache line, it now uses 4 cache lines (for easy multiplying) with the first two ones being exclusively used by local operations and the third one mostly by atomic operations. Interestingly, this better arrangement causes less stress and reduced the response time by 8 microseconds at 1 million requests per second.	2021-02-26 12:00:53 +01:00
Eric Salama	6ac61e39c4	BUG/MINOR: ssl: potential null pointer dereference in ckchs_dup() A potential null pointer dereference was reported with an old gcc version (6.5) src/ssl_ckch.c: In function 'cli_parse_set_cert': src/ssl_ckch.c:844:7: error: potential null pointer dereference [-Werror=null-dereference] if (!ssl_sock_copy_cert_key_and_chain(src->ckch, dst->ckch)) ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ src/ssl_ckch.c:844:7: error: potential null pointer dereference [-Werror=null-dereference] src/ssl_ckch.c: In function 'ckchs_dup': src/ssl_ckch.c:844:7: error: potential null pointer dereference [-Werror=null-dereference] if (!ssl_sock_copy_cert_key_and_chain(src->ckch, dst->ckch)) ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ src/ssl_ckch.c:844:7: error: potential null pointer dereference [-Werror=null-dereference] This could happen if ckch_store_new() fails to allocate memory and returns NULL. This patch must be backported with 8f71298 since it was wrongly fixed and the bug could happen. Must be backported as far as 2.2.	2021-02-26 09:49:35 +01:00
Willy Tarreau	d8aa21a611	CLEANUP: server: rename srv_cleanup_{idle,toremove}_connections() These function names are unbearably long, they don't even fit into the screen in "show profiling", let's trim the "_connections" to "_conns", which happens to match the name of the lists there.	2021-02-26 00:30:22 +01:00
Willy Tarreau	9205ab31d2	MINOR: ssl: mark the SSL handshake tasklet as heavy There's a fairness issue between SSL and clear text. A full end-to-end cleartext connection can require up to ~7.7 wakeups on average, plus 3.3 for the SSL tasklet, one of which is particularly expensive. So if we accept to process many handshakes taking 1ms each, we significantly increase the processing time of regular tasks just by adding an extra delay between their calls. Ideally in order to be fair we should have a 1:18 call ratio, but this requires a bit more accounting. With very little effort we can mark the SSL handshake tasklet as TASK_HEAVY until the handshake completes, and remove it once done. Doing so reduces from 14 to 3.0 ms the total response time experienced by HTTP clients running in parallel to 1000 SSL clients doing full handshakes in loops. Better, when tune.sched.low-latency is set to "on", the latency further drops to 1.8 ms. The tasks latency distribution explain pretty well what is happening: Without the patch: $ socat - /tmp/sock1 <<< "show profiling" Per-task CPU profiling : on # set profiling tasks {on\|auto\|off} Tasks activity: function calls cpu_tot cpu_avg lat_tot lat_avg ssl_sock_io_cb 2785375 19.35m 416.9us 5.401h 6.980ms h1_io_cb 1868949 9.853s 5.271us 4.829h 9.302ms process_stream 1864066 7.582s 4.067us 2.058h 3.974ms si_cs_io_cb 1733808 1.932s 1.114us 26.83m 928.5us h1_timeout_task 935760 - - 1.033h 3.975ms accept_queue_process 303606 4.627s 15.24us 16.65m 3.291ms srv_cleanup_toremove_connections452 64.31ms 142.3us 2.447s 5.415ms task_run_applet 47 5.149ms 109.6us 57.09ms 1.215ms srv_cleanup_idle_connections 34 2.210ms 65.00us 87.49ms 2.573ms With the patch: $ socat - /tmp/sock1 <<< "show profiling" Per-task CPU profiling : on # set profiling tasks {on\|auto\|off} Tasks activity: function calls cpu_tot cpu_avg lat_tot lat_avg ssl_sock_io_cb 3000365 21.08m 421.6us 20.30h 24.36ms h1_io_cb 2031932 9.278s 4.565us 46.70m 1.379ms process_stream 2010682 7.391s 3.675us 22.83m 681.2us si_cs_io_cb 1702070 1.571s 922.0ns 8.732m 307.8us h1_timeout_task 1009594 - - 17.63m 1.048ms accept_queue_process 339595 4.792s 14.11us 3.714m 656.2us srv_cleanup_toremove_connections779 75.42ms 96.81us 438.3ms 562.6us srv_cleanup_idle_connections 48 2.498ms 52.05us 178.1us 3.709us task_run_applet 17 1.738ms 102.3us 11.29ms 663.9us other 1 947.8us 947.8us 202.6us 202.6us => h1_io_cb() and process_stream() are divided by 6 while ssl_sock_io_cb() is multipled by 4 And with low-latency on: $ socat - /tmp/sock1 <<< "show profiling" Per-task CPU profiling : on # set profiling tasks {on\|auto\|off} Tasks activity: function calls cpu_tot cpu_avg lat_tot lat_avg ssl_sock_io_cb 3000565 20.96m 419.1us 20.74h 24.89ms h1_io_cb 2019702 9.294s 4.601us 49.22m 1.462ms process_stream 2009755 6.570s 3.269us 1.493m 44.57us si_cs_io_cb 1997820 1.566s 783.0ns 2.985m 89.66us h1_timeout_task 1009742 - - 1.647m 97.86us accept_queue_process 494509 4.697s 9.498us 1.240m 150.4us srv_cleanup_toremove_connections1120 92.32ms 82.43us 463.0ms 413.4us srv_cleanup_idle_connections 70 2.703ms 38.61us 204.5us 2.921us task_run_applet 13 1.303ms 100.3us 85.12us 6.548us => process_stream() is divided by 100 while ssl_sock_io_cb() is multipled by 4 Interestingly, the total HTTPS response time doesn't increase and even very slightly decreases, with an overall ~1% higher request rate. The net effect here is a redistribution of the CPU resources between internal tasks, and in the case of SSL, handshakes wait bit more but everything after completes faster. This was made simple enough to be backportable if it helps some users suffering from high latencies in mixed traffic.	2021-02-26 00:26:03 +01:00
Willy Tarreau	74dea8caea	MINOR: task: limit the number of subsequent heavy tasks with flag TASK_HEAVY While the scheduler is priority-aware and class-aware, and consistently tries to maintain fairness between all classes, it doesn't make use of a fine execution budget to compensate for high-latency tasks such as TLS handshakes. This can result in many subsequent calls adding multiple milliseconds of latency between the various steps of other tasklets that don't even depend on this. An ideal solution would be to add a 4th queue, have all tasks announce their estimated cost upfront and let the scheduler maintain an auto- refilling budget to pick from the most suitable queue. But it turns out that a very simplified version of this already provides impressive gains with very tiny changes and could easily be backported. The principle is to reserve a new task flag "TASK_HEAVY" that indicates that a task is expected to take a lot of time without yielding (e.g. an SSL handshake typically takes 700 microseconds of crypto computation). When the scheduler sees this flag when queuing a tasklet, it will place it into the bulk queue. And during dequeuing, we accept only one of these in a full round. This means that the first one will be accepted, will not prevent other lower priority tasks from running, but if a new one arrives, then the queue stops here and goes back to the polling. This will allow to collect more important updates for other tasks that will be batched before the next call of a heavy task. Preliminary tests consisting in placing this flag on the SSL handshake tasklet show that response times under SSL stress fell from 14 ms before the patch to 3.0 ms with the patch, and even 1.8 ms if tune.sched.low-latency is set to "on".	2021-02-26 00:25:51 +01:00
Amaury Denoyelle	91e55ea3f3	BUG/MINOR: stats: fix compare of no-maint url suffix Only the first 3 characters are compared for ';no-maint' suffix in http_handle_stats. Fix it by doing a full match over the entire suffix. As a side effect, the ';norefresh' suffix matched the inaccurate comparison, so the maintenance servers were always hidden on the stats page in this case. no-maint suffix is present since commit `3e32036701` MINOR: stats: also support a "no-maint" show stat modifier It should be backported up to 2.3. This fixes github issue #1147.	2021-02-25 14:59:17 +01:00
Christopher Faulet	6c93c4ef08	CLEANUP: muxes: Remove useless if condition in show_fd function In H1, H2 and FCGI muxes, in the show_fd function, there is duplicated test on the stream's subs field. This patch fixes the issue #1142. It may be backported as far as 2.2.	2021-02-25 10:07:24 +01:00
Christopher Faulet	456f45f301	MINOR: server-state: Don't load server-state file for serverless proxies Just a minor improvement. Proxies with no server are now ignored early. It may happens for listeners for instance.	2021-02-25 10:02:39 +01:00
Christopher Faulet	3e3d3be708	REORG: server-state: Move functions to deal with server-state in its own file All functions dealing with the server-state files are moved to server_state.c. srv_update_state() function was renammed to srv_state_srv_update().	2021-02-25 10:02:39 +01:00
Christopher Faulet	69beaa91d5	REORG: server: Export and rename some functions updating server info Some static functions are now exported and renamed to follow the same pattern of other exported functions. Here is the list : * update_server_fqdn: Renamed to srv_update_fqdn and exported * update_server_check_addr_port: renamed to srv_update_check_addr_port and exported * update_server_agent_addr_port: renamed to srv_update_agent_addr_port and exported * update_server_addr: renamed to srv_update_addr * update_server_addr_potr: renamed to srv_update_addr_port * srv_prepare_for_resolution: exported This change is mandatory to move all functions dealing with the server-state files in a separate file.	2021-02-25 10:02:39 +01:00
Christopher Faulet	a67c6bf333	MEDIUM: server: Don't load server-state file if a line is corrupted This change is not huge but may have a visible impact for users. Now, if a line of a server-state file is corrupted, the whole file is ignored. A warning is emitted with the corrupted line number. In fact, there is no way to recover from a corrupted line. A line is considered as corrupted if it is too long (truncated line) or if it contains the wrong number of arguments. In both cases, it means the file was forged (or at least manually edited). It is safer to ignore it. Note for now, memory allocation errors are not reported and the corresponding line is silently ignored.	2021-02-25 10:02:39 +01:00
Christopher Faulet	d0a5e84c8d	MINOR: server: Parse and store server-state lines in a dedicated function Now, srv_state_parse_and_store_line() function is used to parse and store a line in a tree. It is used for global and local server-state files. This significatly simplies the apply_server_state() function.	2021-02-25 10:02:39 +01:00
Christopher Faulet	5c37985149	MEDIUM: server: Use a tree to store local server-state lines Just like for the global server-state file, the line of a local server-state file are now stored in a tree. This way, the file is fully parsed before loading the servers state. And with this change, global and local server-state files are now handled the same way. This will be the opportunity to factorize the code. It is also a good way to validate the file before loading any server state.	2021-02-25 10:02:39 +01:00
Christopher Faulet	2c1db104fb	MINOR: server: Move loading state of servers in a dedicated function The loop on the servers of a proxy to load the server states was moved in the function srv_state_px_update(). This simplify a bit the apply_server_state() function. It is aslo mandatory to simplify the loading of local server-state file.	2021-02-25 10:02:39 +01:00
Christopher Faulet	f4d1da90c2	MINOR: server: Remove cached line from global server-state tree when found When a server for a given backend is found in the tree containing all lines of the global server-state file, the node is removed from the tree. It is useless to keep it longer. It is a small improvement, but it may also be usefull to track the orphan lines (not used for now).	2021-02-25 10:02:39 +01:00
Christopher Faulet	ecfb9b9109	MEDIUM: server: Store parsed params of a server-state line in the tree Parsed parameters are now stored in the tree of server-state lines. This way, a line from the global server-state file is only parsed once. Before, it was parsed a first time to store it in the tree and one more time to load the server state. To do so, the server-state line object must be allocated before parsing a line. This means its size must no longer depend on the length of first parsed parameters (backend and server names). Thus the node type was changed to use a hashed key instead of a string.	2021-02-25 10:02:39 +01:00
Christopher Faulet	8a14b73ecf	MINOR: server: Be more strict when reading the version of a server-state file Now, we read a full line and expects to found an integer only on it. And if the line is empty or truncated, an error is returned. If the version is not valid, an error is also returned. This way, the first line is no longer partially read.	2021-02-25 10:02:39 +01:00
Christopher Faulet	8b4b6a0d63	CLEANUP: server: Use a local eb-tree to store lines of the global server-state file There is no reason to use a global variable to store the lines of the global server-state file. This tree is only used during the file parsing, as a line cache. Now the eb-tree is declared as a local variable in the apply_server_state() function.	2021-02-25 10:02:39 +01:00
Christopher Faulet	6d87c58fb4	CLEANUP: server: Rename state_line structure into server_state_line The structure used to store a server-state line in an eb-tree has a too generic name. Instead of state_line, the structure is renamed as server_state_line.	2021-02-25 10:02:39 +01:00
Christopher Faulet	fcb53fbb58	CLEANUP: server: Rename state_line node to node instead of name_name <state_line.name_name> field is a node in an eb-tree. Thus, instead of "name_name", we now use "node" to name this field. If is a more explicit name and not too strange.	2021-02-25 10:02:39 +01:00
Christopher Faulet	131b07be3c	MEDIUM: server: Refactor apply_server_state() to make it more readable The apply_server_state() function is really hard to read. Thus it was refactored to be more maintainable. First, an helper function is used to get the server-state file path. Some useless variables were removed and most of other variables were renamed to be more readable. The error messages are now prefixed to know the context (global vs per-proxy). Finally, the loop on the proxies list was simplified. This patch may seem a bit huge, but the changes are not so important.	2021-02-25 10:02:39 +01:00
Christopher Faulet	2a031ecd96	MINOR: server: Only fill one array when parsing a server-state line There is no reason to fill two parameter arrays in srv_state_parse_line() function. Now, only one array is used. The 4th first entries are just skipped when srv_update_state() is called.	2021-02-25 10:02:39 +01:00
Christopher Faulet	0bf268e184	MINOR: server: Be more strict on the server-state line parsing The srv_state_parse_line() function was rewritten to be more strict. First of all, it is possible to make the difference between an ignored line and an malformed one. Then, only blank characters (spaces and tabs) are now allowed as field separator. An error is reported for truncated lines or for lines with an unexpected number of arguments regarding the provided version. However, for now, errors are ignored by the caller, invalid lines are just skipped.	2021-02-25 10:02:39 +01:00
Willy Tarreau	2a54ffbf43	MINOR: task: make tasklet wakeup latency measurements more accurate First, we don't want to measure wakeup times if the call date had not been set before profiling was enabled at run time. And second, we may only collect the value before clearing the TASK_IN_LIST bit, otherwise another wakeup might happen on another thread and replace the call date we're about to use, hence artificially lower the wakeup times.	2021-02-25 09:44:16 +01:00
Willy Tarreau	b2285de049	MINOR: tasks: also compute the tasklet latency when DEBUG_TASK is set It is extremely useful to be able to observe the wakeup latency of some important I/O operations, so let's accept to inflate the tasklet struct by 8 extra bytes when DEBUG_TASK is set. With just this we have enough to get live reports like this: $ socat - /tmp/sock1 <<< "show profiling" Per-task CPU profiling : on # set profiling tasks {on\|auto\|off} Tasks activity: function calls cpu_tot cpu_avg lat_tot lat_avg si_cs_io_cb 8099492 4.833s 596.0ns 8.974m 66.48us h1_io_cb 7460365 11.55s 1.548us 2.477m 19.92us process_stream 7383828 22.79s 3.086us 18.39m 149.5us h1_timeout_task 4157 - - 348.4ms 83.81us srv_cleanup_toremove_connections751 39.70ms 52.86us 10.54ms 14.04us srv_cleanup_idle_connections 21 1.405ms 66.89us 30.82us 1.467us task_run_applet 16 1.058ms 66.13us 446.2us 27.89us accept_queue_process 7 34.53us 4.933us 333.1us 47.58us	2021-02-25 09:44:16 +01:00
Willy Tarreau	45499c56d3	MINOR: task: make grq_total atomic to move it outside of the grq_lock Instead of decrementing grq_total once per task picked from the global run queue, let's do it at once after the loop like we do for other counters. This simplifies the code everywhere. It is not expected to bring noticeable improvements however, since global tasks tend to be less common nowadays.	2021-02-25 09:44:16 +01:00
Willy Tarreau	c9afbb10f5	MINOR: task: don't decrement then increment the local run queue Now we don't need to decrement rq_total when we pick a tack in the tree to immediately increment it again after installing it into the local list. Instead, we simply add to the local queue count the number of globally picked tasks. Avoiding this shows ~0.5% performance gains at 1Mreq/s (2M task switches/s).	2021-02-25 09:44:16 +01:00
Willy Tarreau	2b363ac092	MINOR: task: do not use __task_unlink_rq() from process_runnable_tasks() As indicated in previous commit, this function tries to guess which tree the task is in to figure what counters to update, while we already have that info in the caller. Let's just pick the relevant parts to place them in the caller.	2021-02-25 09:44:16 +01:00
Willy Tarreau	e7923c1d22	MINOR: task: split the counts of local and global tasks picked In process_runnable_tasks() we're still calling __task_unlink_rq() to pick a task, and this function tries to guess where to pick the task from and which counter to update while the caller's context already has everything. Worse, the number of local tasks is decremented then recredited, doubling the operations. In order to avoid this we first need to keep separate counters for local and global tasks that were picked. This is what this patch does.	2021-02-25 09:44:16 +01:00
Christopher Faulet	e071f0e6a4	MINOR: htx: Add function to reserve the max possible size for an HTX DATA block The function htx_reserve_max_data() should be used to get an HTX DATA block with the max possible size. A current block may be extended or a new one created, depending on the HTX message state. But the idea is to let the caller to copy a bunch of data without requesting many new blocks. It is its responsibility to resize the block at the end, to set the final block size. This function will be used to parse messages with small chunks. Indeed, we can have more than 2700 1-byte chunks in a 16Kb of input data. So it is easy to understand how this function may help to improve the parsing of chunk messages.	2021-02-24 22:10:01 +01:00
Christopher Faulet	d127ffa9f4	BUG/MEDIUM: resolvers: Reset address for unresolved servers If the DNS resolution failed for a server, its ip address must be removed. Otherwise, the server is stopped but keeps its ip. This may be confusing when the servers state are retrieved on the CLI and it may lead to undefined behavior if HAproxy is configured to load its servers state from a file. This patch should be backported as far as 2.0.	2021-02-24 21:58:46 +01:00
Christopher Faulet	52d4d30109	BUG/MEDIUM: resolvers: Reset server address and port for obselete SRV records When a SRV record expires, the ip/port assigned to the associated server are now removed. Otherwise, the server is stopped but keeps its ip/port while the server hostname is removed. It is confusing when the servers state are retrieve on the CLI and may be a problem if saved in a server-state file. Because the reload may fail because of this inconsistency. Here is an example: * Declare a server template in a backend, using the resolver <dns> server-template test 2 _http._tcp.example.com resolvers dns check * 2 SRV records are announced with the corresponding additional records. Thus, 2 servers are filled. Here is the "show servers state" output : 2 frt 1 test1 192.168.1.1 2 64 0 1 2 15 3 4 6 0 0 0 http1.example.com 8001 _http._tcp.example.com 0 0 - - 0 2 frt 2 test2 192.168.1.2 2 64 0 1 1 15 3 4 6 0 0 0 http2.example.com 8002 _http._tcp.example.com 0 0 - - 0 * Then, one additional record is removed (or a SRV record is removed, the result is the same). Here is the new "show servers state" output : 2 frt 1 test1 192.168.1.1 2 64 0 1 38 15 3 4 6 0 0 0 http1.example.com 8001 _http._tcp.example.com 0 0 - - 0 2 frt 2 test2 192.168.1.2 0 96 0 1 19 15 3 0 14 0 0 0 - 8002 _http._tcp.example.com 0 0 - - 0 On reload, if a server-state file is used, this leads to undefined behaviors depending on the configuration. This patch should be backported as far as 2.0.	2021-02-24 21:58:45 +01:00
Baptiste Assmann	b4badf720c	BUG/MINOR: resolvers: new callback to properly handle SRV record errors When a SRV record was created, it used to register the regular server name resolution callbacks. That said, SRV records and regular server name resolution don't work the same way, furthermore on error management. This patch introduces a new call back to manage DNS errors related to the SRV queries. this fixes github issue #50. Backport status: 2.3, 2.2, 2.1, 2.0	2021-02-24 21:58:45 +01:00
Christopher Faulet	a331a1e8eb	BUG/MINOR: resolvers: Only renew TTL for SRV records with an additional record If no additional record is associated to a SRV record, its TTL must not be renewed. Otherwise the entry never expires. Thus once announced a first time, the entry remains blocked on the same IP/port except if a new announce replaces the old one. Now, the TTL is updated if a SRV record is received while a matching existing one is found with an additional record or when an new additional record is assigned to an existing SRV record. This patch should be backported as far as 2.2.	2021-02-24 21:58:45 +01:00
Christopher Faulet	9c246a4b6c	BUG/MINOR: resolvers: Fix condition to release received ARs if not assigned At the end of resolv_validate_dns_response(), if a received additionnal record is not assigned to an existing server record, it is released. But the condition to do so is buggy. If "answer_record" (the received AR) is not assigned, "tmp_record" is not a valid record object. It is just a dummy record "representing" the head of the record list. Now, the condition is far cleaner. This patch must be backported as far as 2.2.	2021-02-24 21:58:45 +01:00
Willy Tarreau	9c6dbf0eea	CLEANUP: task: split the large tasklet_wakeup_on() function in two This function has become large with the multi-queue scheduler. We need to keep the fast path and the debugging parts inlined, but the rest now moves to task.c just like was done for task_wakeup(). This has reduced the code size by 6kB due to less inlining of large parts that are always context-dependent, and as a side effect, has increased the overall performance by 1%.	2021-02-24 17:55:58 +01:00
Willy Tarreau	955a11ebfa	MINOR: task: move the allocated tasks counter to the per-thread struct The nb_tasks counter was still global and gets incremented and decremented for each task_new()/task_free(), and was read in process_runnable_tasks(). But it's only used for stats reporting, so doing this this often is pointless and expensive. Let's move it to the task_per_thread struct and have the stats sum it when needed.	2021-02-24 17:42:04 +01:00
Willy Tarreau	eeffb3df41	MINOR: task: limit the remote thread wakeup to the global runqueue only The test in __task_wakeup() to figure if the remote threads are sleeping doesn't make sense outside of the global runqueue test, since there are only two possibilities here: local runqueue or global runqueue, hence a sleeping thread is another one and can only happen when sending to the global run queue. Let's move the test inside the "if" block.	2021-02-24 17:42:04 +01:00
Willy Tarreau	018564eaa2	CLEANUP: task: move the tree root detection from __task_wakeup() to task_wakeup() Historically we used to call __task_wakeup() with a known tree root but this is not the case and the code has remained needlessly complicated with the root calculation in task_wakeup() passed in argument to __task_wakeup() which compares it again. Let's get rid of this and just move the detection code there. This eliminates some ifdefs and allows to simplify the test conditions quite a bit.	2021-02-24 17:42:04 +01:00
Willy Tarreau	1f3b1417b8	CLEANUP: tasks: use a less confusing name for task_list_size This one is systematically misunderstood due to its unclear name. It is in fact the number of tasks in the local tasklet list. Let's call it "tasks_in_list" to remove some of the confusion.	2021-02-24 17:42:04 +01:00
Willy Tarreau	2c41d77ebc	MINOR: tasks: do not maintain the rqueue_size counter anymore This one is exclusively used as a boolean nowadays and is non-zero only when the thread-local run queue is not empty. Better check the root tree's pointer and avoid updating this counter all the time.	2021-02-24 17:42:04 +01:00
Willy Tarreau	9c7b8085f4	MEDIUM: task: remove the tasks_run_queue counter and have one per thread This counter is solely used for reporting in the stats and is the hottest thread contention point to date. Moving it to the scheduler and having a separate one for the global run queue dramatically improves the performance, showing a 12% boost on the request rate on 16 threads! In addition, the thread debugging output which used to rely on rqueue_size was not totally accurate as it would only report task counts. Now we can return the exact thread's run queue length. It is also interesting to note that there are still a few other task/tasklet counters in the scheduler that are not efficiently updated because some cover a single area and others cover multiple areas. It looks like having a distinct counter for each of the following entries would help and would keep the code a bit cleaner: - global run queue (tree) - per-thread run queue (tree) - per-thread shared tasklets list - per-thread local lists Maybe even splitting the shared tasklets lists between pure tasklets and tasks instead of having the whole and tasks would simplify the code because there remain a number of places where several counters have to be updated.	2021-02-24 17:42:04 +01:00
Willy Tarreau	e3e648c92f	BUILD: dns: avoid a build warning when threads are disabled (dss unused) dns_session_release() only uses its struct dns_stream_server to access the lock, so a warning is emitted when threads are disabled. Let's mark it __maybe_unused.	2021-02-24 17:42:04 +01:00
Willy Tarreau	49de68520e	MEDIUM: streams: do not use the streams lock anymore The lock was still used exclusively to deal with the concurrency between the "show sess" release handler and a stream_new() or stream_free() on another thread. All other accesses made by "show sess" are already done under thread isolation. The release handler only requires to unlink its node when stopping in the middle of a dump (error, timeout etc). Let's just isolate the thread to deal with this case so that it's compatible with the dump conditions, and remove all remaining locking on the streams. This effectively kills the streams lock. The measured gain here is around 1.6% with 4 threads (374krps -> 380k).	2021-02-24 13:54:50 +01:00
Willy Tarreau	a698eb6739	MINOR: streams: use one list per stream instead of a global one The global streams list is exclusively used for "show sess", to look up a stream to shut down, and for the hard-stop. Having all of them in a single list is extremely expensive in terms of locking when using threads, with performance losses as high as 7% having been observed just due to this. This patch makes the list per-thread, since there's no need to have a global one in this situation. All call places just iterate over all threads. The most "invasive" changes was in "show sess" where the end of list needs to go back to the beginning of next thread's list until the last thread is seen. For now the lock was maintained to keep the code auditable but a next commit should get rid of it. The observed performance gain here with only 4 threads is already 7% (350krps -> 374krps).	2021-02-24 13:53:20 +01:00
Willy Tarreau	5d533e2bad	MINOR: cli/streams: make "show sess" dump all streams till the new epoch Instead of placing the current stream at the end of the stream list when issuing a "show sess" on the CLI as was done in 2.2 with commit `c6e7a1b8e` ("MINOR: cli: make "show sess" stop at the last known session"), now we compare the listed stream's epoch with the dumping stream's and stop on more recent ones. This way we're certain to always only dump known streams at the moment we issue the dump command without having to modify the list. In theory we could miss some streams if more than 2^31 "show sess" requests are issued while an old stream remains present, but that's 68 years at 1 "show sess" per second and it's unlikely we'll keep a process, let alone a stream, that long. It could be verified that the count of dumped streams still matches the one before this change.	2021-02-24 12:12:51 +01:00
Willy Tarreau	b981318c11	MINOR: stream: add an "epoch" to figure which streams appeared when The "show sess" CLI command currently lists all streams and needs to stop at a given position to avoid dumping forever. Since 2.2 with commit `c6e7a1b8e` ("MINOR: cli: make "show sess" stop at the last known session"), a hack consists in unlinking the stream running the applet and linking it again at the current end of the list, in order to serve as a delimiter. But this forces the stream list to be global, which affects scalability. This patch introduces an epoch, which is a global 32-bit counter that is incremented by the "show sess" command, and which is copied by newly created streams. This way any stream can know whether any other one is newer or older than itself. For now it's only stored and not exploited.	2021-02-24 12:12:51 +01:00
Willy Tarreau	0d03825b93	BUG/MINOR: proxy: wake up all threads when sending the hard-stop signal The hard-stop event didn't wake threads up. In the past it wasn't an issue as the poll timeout was limited to 1 second, but since commit `4f59d3861` ("MINOR: time: increase the minimum wakeup interval to 60s") it has become a problem because old processes can remain live for up to one minute after the hard-stop-after delay. Let's just wake them up. This may be backported to older releases, though before 2.4 the extra delay was only one second.	2021-02-24 12:12:46 +01:00

... 6 7 8 9 10 ...

11649 commits