icinga2

mirror of https://github.com/Icinga/icinga2.git synced 2026-06-09 00:32:12 -04:00

Author	SHA1	Message	Date
Richard Mortimer	63926c6e0d	Process: Clean up process table entry even when `kill(2)` fails with `ESRCH` (#10375 ) * Icinga daemon leaves zombie processes on very busy system On a very heavily loaded system the process group kill can be delayed until after the regular TERM signal has caused the process to exit. In this situation the waitpid call is valid and reaps the zombie process that would otherwise be left behind. * Update AUTHORS file	2025-03-18 11:29:00 +01:00
Alexander A. Klimov	a9e9e14fce	Remove unused Registry#Clear()	2025-03-18 11:22:56 +01:00
Alexander A. Klimov	4d7361527c	Remove unused Registry#RegisterIfNew()	2025-03-18 11:22:56 +01:00
Alexander A. Klimov	07b274ec45	Remove unused Registry#Unregister()	2025-03-18 11:22:56 +01:00
Alexander A. Klimov	402a6bbf40	Remove unused EventQueue::Unregister()	2025-03-18 11:22:56 +01:00
Alexander A. Klimov	d19c0637ee	Remove unused EventQueue::UnregisterIfUnused()	2025-03-18 11:22:56 +01:00
Alexander A. Klimov	41f61ccba4	Remove unused ApiFunction::Unregister()	2025-03-18 11:22:56 +01:00
Alexander A. Klimov	cce03c5903	Remove unused ApiAction::Unregister()	2025-03-18 11:22:56 +01:00
Yonas Habteab	5e902fe4a7	Merge pull request #10380 from Icinga/sync-notified-problem-users-correctly ClusterEvents: Sync & process notification `notified_problem_users`	2025-03-18 10:27:28 +01:00
Yonas Habteab	66cc6a4d8a	ClusterEvents: Sync & process notification `notified_problem_users`	2025-03-14 14:13:55 +01:00
Yonas Habteab	3d761c0296	ApiActions: Remove child downtimes recursively Services downtimes scheduled via the `all_services` flag get already removed automatically when removing their parent downtimes (introduced with #8913). Now, this commit makes it possible to perform the same actions for all child downtimes, i.e. not only for those of service objects, but for all child objects represented in the dependency tree.	2025-03-13 12:13:45 +01:00
Yonas Habteab	fa63fda75b	ApiListener: Simplify deferred SSL shutdown in `NewClientHandlerInternal()`	2025-03-13 12:12:28 +01:00
Yonas Habteab	4bfaefadfa	IcingaDB: Bump expected redis version to `6`	2025-03-12 16:32:01 +01:00
Yonas Habteab	d094581b4b	Checkable: Use redundancy groups state in `IsReachable`	2025-03-12 16:32:01 +01:00
Yonas Habteab	27f11a0955	Checkable: Introduce `HasAnyDependencies()` method	2025-03-12 16:32:01 +01:00
Yonas Habteab	ff0dabe287	Checkable: Store dependencies grouped by their redundancy group	2025-03-12 16:31:59 +01:00
Yonas Habteab	1820955993	Add `DependencyGroup::GetState()` helper method	2025-03-12 16:31:14 +01:00
Yonas Habteab	d7c9e6687e	Introduce `DependencyGroup` helper class	2025-03-12 16:31:12 +01:00
Yonas Habteab	93d9fad565	Checkable: Drop unused `failedDependency` argument from `IsReachable()`	2025-03-12 16:19:22 +01:00
Julian Brost	67664ad7b7	Checkable::GetAllChildrenInternal: remove redundant emplace call `checkable` is already added to the set by the insert call above, so calling emplace for the same checkable doesn't do anything useful and can be removed.	2025-03-12 16:19:22 +01:00
Yonas Habteab	c465f45200	Rewrite `Checkable::GetAllChildrenInternal()` method The previous wasn't per-se wrong, but it was way too inefficient. With this commit each and every Checkable is going to be visited only once, and we won't traverse the same Checkable's children multiple times somewhere in the dependency chain.	2025-03-12 16:19:22 +01:00
Yonas Habteab	e0ce0ccff6	Activate `Dependency` objects before their parent objects	2025-03-12 16:19:22 +01:00
Yonas Habteab	c02b9d74a9	IcingaDB: Send reachablity state updates for all children	2025-03-12 16:19:22 +01:00
Yonas Habteab	772420a438	Checkable: Don't always trigger reachablity changed signal But only when the current check result being processed affects the child Checkables in any way.	2025-03-12 16:19:22 +01:00
Yonas Habteab	c64ae1af0f	Dependency: Don't allow to change `redundancy_group` at runtime Otherwise, it would require too much code changes to properly handle redundancy group runtime modification in Icinga DB for no real benefit.	2025-03-12 16:19:22 +01:00
Yonas Habteab	6321606671	IcingaDB: Sync `affects_children` as part of runtime state updates	2025-03-12 16:19:22 +01:00
Yonas Habteab	297b62d841	IcingaDB: Add `affected_children` to `Host/Service` Redis updates	2025-03-12 16:19:22 +01:00
Yonas Habteab	d6b289e1cd	Checkable: Introduce `GetAllChildrenCount()` method The previous limit (32) doesn't seem to make sense, and appears to be some random number. So, this limit is set to 256 to match the limit in IsReachable().	2025-03-12 16:19:22 +01:00
Alvar Penning	ef93f945a2	IcingaDB: Start keeping track of Host/Service to Dependency relationship This does not work in this state! Trying to refresh Dependency if a Host or Service being member of this Dependency has a state change.	2025-03-12 16:19:22 +01:00
Julian Brost	e6ad2199fc	Merge pull request #10360 from Icinga/dependency-cycle-detection Rework dependency cycle detection	2025-03-12 15:58:44 +01:00
Julian Brost	8e7e687b96	Unify depependency cycle check code. This commit removes a distinction in how dependency objects are checked for cycles in the resulting graph depending on whether they are part of the initially loaded configuration during process startup or as part of a runtime update. The DependencyCycleChecker helper class is extended with a mechanism that allows additional dependencies to be considered during the cycle search. This allows using it to check for cycles before actually registering the dependencies with the checkables. The aforementioned case-distinction for initial/runtime-update config is removed by making use of the newly added BeforeOnAllConfigLoaded signal to perform the cycle check at once for each batch of dependencies inside ConfigItem::CommitNewItems() for both cases now. During the initial config loading, there can be multiple batches of dependencies as objects from apply rules are created separately, so parts of the dependency graph might be visited multiple times now, however that is limited to a minimum as only parts of the graph that are reachable from the newly added dependencies are searched.	2025-03-12 11:53:30 +01:00
Julian Brost	c1b270f39f	Rework dependency cycle check This commit groups a bunch of structs and static functions inside dependency.cpp into a new DependencyCycleChecker helper class. In the process, the implementation was changed a bit, the behavior should be unchanged except for a more user-friendly error message in the exception.	2025-03-12 11:53:30 +01:00
Julian Brost	500ad70b8c	Implement std::hash<boost::intrusive_ptr<T>> for old Boost versions Boost only implements it iself starting from version 1.74, but a specialization of std::hash<> can be added trivially to allow the use of std::unordered_set<boost::intrusive_ptr<T>> and std::unordered_map<boost::intrusive_ptr<K>, V>. Being unable to use such types already came up a few types in the past, often resulting in the use of raw pointer instead which always involves an additional "is this safe?"/"could the object go out of scope?" discussion. This commit simply solves this for the future by simply allowing the use of intrusive_ptr in unordered containers.	2025-03-12 11:53:30 +01:00
Julian Brost	4b18f62a11	Add ConfigType::BeforeOnAllConfigLoaded signal Allows to hook into the config loading process just before OnAllConfigLoaded() is called on a bunch of individual config objects. Allows doing some operations more efficiently at once for all objects. Intended use: when adding a number of dependencies, it has to be checked whether this uses any cycles. This can be done more efficiently if all dependencies are checked at once. So far, this is with a case-distinction for initially loaded files in DaemonUtility::LoadConfigFiles() and for dependencies created by runtime updates in Dependency::OnAllConfigLoaded(). The mechanism added by this commit allows to unify the handling of both cases (done in a following commit).	2025-03-12 11:53:30 +01:00
Yonas Habteab	206d7cda1b	Merge pull request #10359 from Icinga/do-not-publish-useless-stats IcingaDB: Don't publish useless data to Redis	2025-03-07 12:51:10 +01:00
Yonas Habteab	3e9292a349	Value: Add a specialized rvalue reference of `Get()` The move `String(Value&&)` constructor tries to partially move `String` values from a `Value` type. However, since there was no an appropriate `Value::Get<T>()` implementation that binds to the requested move operation, the compiler will actually not move the value but copy it instead as the only available implementation of `Value::Get<T>()` returns a const reference `const T&`. This commit adds a new overload that returns a non-const reference and allows to optionally move the string value of a Value type.	2025-03-07 10:16:31 +01:00
Yonas Habteab	6a888e1494	String: Mark move constructor & assignment op as `noexcept` The Icinga DB code performs intensive operations on certain STL containers, primarily on `std::vector<String>`. Specifically, it inserts 2-3 new elements at the beginning of a vector containing thousands of elements. Without this commit, all the existing elements would be unnecessarily copied just to accommodate the new elements at the front. By making this change, the compiler is able to optimize STL operations like `push_back`, `emplace_back`, and `insert`, enabling it to prefer the move constructor over copy operations, provided it is guaranteed that no exceptions will be thrown.	2025-03-06 13:02:40 +01:00
Yonas Habteab	6ca0611f3d	IcingaDB: Don't publish useless data to Redis The Icinga DB daemon processes the data from the `IcingaApplication` type only and Icinga DB Web also uses only those stats. However, before this commit, Icinga DB published all kinds of useless stats to Redis each second, like the number of (un)reachable hosts, services, and so on, which is waste of CPU and some other resources. This commit reduces the published data drastically to only those simple stats coming from the `IcingaApplication` type.	2025-03-04 17:34:38 +01:00
Julian Brost	21c9ad5323	Merge pull request #10332 from Icinga/do-not-close-connection-in-request-cert-handler Don't abruptly close anonymous connections	2025-02-04 10:58:17 +01:00
Alexander Aleksandrovič Klimov	065dfe4c40	Merge pull request #9928 from Icinga/no-data-received-on-new-api-connection API: also log error behind "No data received on new API connection"	2025-02-03 15:39:26 +01:00
Yonas Habteab	25bbac1677	Don't abruptly close anonymous connections This was mistakenly introduced with PR #7686 due to too many open connections (#7680). This was wrong in the sense that closing the connection is simply out of place here and should have been handled differently. After we revised the RPC connection disconnect procedure with `v2.14.4`, it becomes clear why it is wrong, because the connection is closed abruptly before the corresponding response (`result`) has even been written. Now if you remove the disconnect here, shouldn't the issue #7680 occur again, you ask? The answer is no, because we now also have a maximum timeout of `10s` for anonymous connections, after which they are automatically closed. Thanks to the introduction of this timeout by @julianbrost in #8479, this `Disconnect()` call has become superfluous.	2025-01-30 17:45:27 +01:00
Julian Brost	51c6a58657	Merge pull request #9943 from Icinga/renegotiation-openbsd Disable TLS renegotiation and fix compile error on OpenBSD	2025-01-30 15:50:07 +01:00
Alexander A. Klimov	e1a4390b9c	Fix compile error on OpenBSD which has no SSL_OP_NO_RENEGOTIATION	2025-01-29 17:42:10 +01:00
Alexander A. Klimov	411c57aac5	API: also log error behind "No data received on new API connection"	2025-01-24 11:28:16 +01:00
Julian Brost	78883669d3	Merge pull request #8169 from Icinga/bugfix/object-query-all-attrs-8167 GET /v1/objects/*: handle "attrs":[] as expected	2025-01-24 09:14:17 +01:00
Sebastian Grund	7d12c1a524	Add tags functionality to `ElasticSearchWriter`	2025-01-24 08:51:53 +01:00
Alexander A. Klimov	e18c923abb	GET /v1/objects/*: handle "attrs":[] as expected ... i.e. yield no attrs and not all. refs #8167	2025-01-21 11:36:55 +01:00
Alexander Aleksandrovič Klimov	866db3ba3c	Merge pull request #10137 from Icinga/win-progfiles-icinga2-var On Windows, don't create C:\Program Files\Icinga2\var during MSI build	2025-01-16 12:02:33 +01:00
Julian Brost	4ffe88e263	Merge pull request #9732 from Icinga/silence-compiler-warnings-in-code-we-don-t-maintain Silence compiler warnings in code we don't maintain	2025-01-15 16:33:24 +01:00
Alexander A. Klimov	6195a457a7	Silence compiler warnings in code we don't maintain	2025-01-14 11:48:33 +01:00
Julian Brost	1f047ebbf5	Merge pull request #10058 from Icinga/error-timestamp-out-of-range-53323 Ido*sqlConnection#FieldToEscapedString(): don't write out of range time	2025-01-14 09:43:37 +01:00
Julian Brost	55829c4f55	Merge pull request #10077 from RincewindsHat/reject_invalid_perfdata Reject infinite performance data values	2025-01-13 12:00:12 +01:00
Julian Brost	fb50e4b1f1	Merge pull request #10188 from Icinga/icingadb-heartbeat-both-responsible IcingaDB Check: Multiple Responsible Instances	2025-01-13 11:56:19 +01:00
Lorenz Kästle	e7381193c8	Reject infinite performance data values Some fault monitoring plugins may return "inf" or "-inf" as values due to a failure to initialize or other errors. This patch introduces a check on whether the parse value is infinite (or negative infinite) and rejects the data point if that is the case. The reasoning here is: There is no possible way a value of "inf" is ever a true measuring or even useful. Furthermore, when passed to the performance data writers, it may be rejected by the backend and lead to further complications.	2025-01-09 11:46:34 +01:00
Yonas Habteab	1425641931	Don't endlessly wait on writer coroutine on disconnect	2025-01-08 16:30:36 +01:00
Yonas Habteab	41373ad0e5	Log before & after an RPC client is disconnected	2025-01-08 16:30:36 +01:00
Yonas Habteab	3af7cfe2ec	JsonRpcConnection: Don't drop client from cache prematurely PR #7445 incorrectly assumed that a peer that had already disconnected and never reconnected was due to the endpoint client being dropped after a successful socket shutdown. However, the issue at that time was that there was not a single timeout guards that could cancel the `async_shutdown` call, petentially blocking indefinetely. Although removing the client from cache early might have allowed the endpoint to reconnect, it did not resolve the underlying problem. Now that we have a proper cancellation timeout, we can wait until the currently used socket is fully closed before dropping the client from our cache. When our socket termination works reliably, the `ApiListener` reconnect timer should attempt to reconnect this endpoint after the next tick. Additionally, we now have logs both for before and after socket termination, which may help identify if it is hanging somewhere in between.	2025-01-08 16:30:36 +01:00
Alexander A. Klimov	8f72891228	Document Timeout	2025-01-07 18:20:54 +01:00
Alexander A. Klimov	3ca7ff7bf4	Timeout: explicitly delete #Timeout(const Timeout&), #Timeout(Timeout&&), #operator=(const Timeout&), #operator=(Timeout&&)	2025-01-07 18:20:52 +01:00
Alexander A. Klimov	27e0e236cb	Move Timeout instances from heap to stack	2025-01-07 18:20:50 +01:00
Alexander A. Klimov	d77d7506f1	Don't call Timeout#Cancel() where Timeout#~Timeout() is called	2025-01-07 18:20:14 +01:00
Alexander A. Klimov	959b162913	Timeout#~Timeout(), #Cancel(): support boost::asio::io_context running on multiple threads	2025-01-07 18:19:42 +01:00
Alexander A. Klimov	cb51649363	Timeout#Timeout(): drop unnecessary template parameters	2025-01-07 18:19:39 +01:00
Alexander A. Klimov	d2285bcf0e	While using Timeout, don't unnecessarily keep the strand alive via smart pointer	2025-01-07 18:18:46 +01:00
Alexander A. Klimov	faaeb4eb2e	Timeout: use a plain callback, not an unnecessary coroutine	2025-01-07 18:18:24 +01:00
Alexander A. Klimov	92ab913226	Timeout#Timeout(): don't pass yield_context to callback It's not used. Also, the callback shall run completely at once. This ensures that it won't (continue to) run once another coroutine on the strand calls Timeout#Cancel().	2025-01-07 18:18:18 +01:00
Julian Brost	880632b93a	Merge pull request #9861 from ymartin-ovh/issue-9752 icinga2: address comment loading where host reference is not found	2025-01-07 14:12:03 +01:00
Julian Brost	cf125dd8d5	Simplify `DependencyGraph:RemoveDependency()` method	2025-01-07 11:07:46 +01:00
Yonas Habteab	ff0e12e6ac	ApiListener: Sync runtime configs in order	2025-01-07 11:07:46 +01:00
Yonas Habteab	015374e69d	DependencyGraph: Allow lookups by parent & child dependencies	2025-01-07 11:07:46 +01:00
Alexander Aleksandrovič Klimov	383773eb2b	Merge pull request #10264 from Icinga/DependencyGraph-ConfigObject DependencyGraph: use ConfigObject, not Object	2024-12-18 13:36:56 +01:00
Alexander A. Klimov	3a09cf72d6	DependencyGraph: use ConfigObject, not Object This saves dynamic_cast<ConfigObject*> + if() on every item of GetChildren().	2024-12-17 18:33:05 +01:00
Julian Brost	452386cdb6	Merge pull request #10005 from Icinga/graceful-tls-disconnect Add a dedicated method for disconnecting TLS connections	2024-12-12 16:20:14 +01:00
Julian Brost	3642ca3369	Merge pull request #10263 from Icinga/DependencyGraph-parent-child DependencyGraph: switch "parent" and "child" terminology	2024-12-12 15:13:08 +01:00
Julian Brost	a506d562ae	Add comment for remaining uses of async_shutdown() why it's safe The reason for introducing AsioTlsStream::GracefulDisconnect() was to handle the TLS shutdown properly with a timeout since it involves a timeout. However, the implementation of this timeout involves spwaning coroutines which are redundant in some cases. This commit adds comments to the remaining calls of async_shutdown() stating why calling it is safe in these places.	2024-12-12 12:10:59 +01:00
Julian Brost	e6d103d0dd	HttpServerConnection: use AsioTlsStream::GracefulDisconnect() This new helper function has proper timeout handling which was missing here.	2024-12-12 12:10:59 +01:00
Julian Brost	007e3fbe7e	JsonRpcConnection: use AsioTlsStream::GracefulDisconnect() This new helper functions allows deduplicating the timeout handling for `async_shutdown()`.	2024-12-12 12:10:59 +01:00
Julian Brost	56d5811283	AsioTlsStream: add GracefulDisconnect() and ForceDisconnect() Calling `AsioTlsStream::async_shutdown()` performs a TLS shutdown which exchanges messages (that's why it takes a `yield_context`) and thus has the potential to block the coroutine. Therefore, it should be protected with a timeout. As `async_shutdown()` doesn't simply take a timeout, this has to be implemented using a timer. So far, these timers are scattered throughout the codebase with some places missing them entirely. This commit adds helper functions to properly shutdown a TLS connection with a single function call.	2024-12-12 12:10:59 +01:00
Alexander A. Klimov	188ba53b74	DependencyGraph: switch "parent" and "child" terminology The .ti files call `DependencyGraph::AddDependency(this, service.get())`. Obviously, `service.get()` is the parent and `this` (Downtime, Notification, ...) is the child. The DependencyGraph terminology should reflect this not to confuse its future users.	2024-12-04 10:57:30 +01:00
Alexander Aleksandrovič Klimov	8f51f54f19	Merge pull request #10221 from Icinga/Al2Klimov-patch-7 JsonRpcConnection: don't write new messages on shutdown	2024-11-29 09:24:10 +01:00
Yonas Habteab	4564c068fe	JsonRpcConnection: Log message processing time stats Co-Authored-By: Julian Brost <julian.brost@icinga.com>	2024-11-27 09:57:38 +01:00
Yonas Habteab	e0b053cbe1	HttpServerConnection: Log noticable CPU semaphore wait time	2024-11-27 09:57:38 +01:00
Yonas Habteab	3218908595	Merge pull request #10214 from Icinga/useless-http-coroutines HttpServerConnection: Don't spawn useless coroutines	2024-11-19 15:53:54 +01:00
Yonas Habteab	2931aea9bb	Merge pull request #7818 from Icinga/bugfix/no_more_notifications-7758 Don't set Notification#no_more_notifications on custom notifications	2024-11-15 14:43:12 +01:00
Alexander A. Klimov	35a705752f	Don't set Notification#no_more_notifications on custom notifications	2024-11-15 13:03:22 +01:00
Alvar Penning	0bbe7a9b2f	IcingaDB Check: Multiple Responsible Instances By design, only one Icinga 2 instance should be responsible in the HA context. If this promise is broken, the Icinga 2 IcingaDB check should report it. The code did not check for invalid data in icingadb:telemetry:heartbeat. With this change, it will go CRITICAL with a descriptive message and report the actual number of icingadb_responsible_instances in the performance data.	2024-11-15 12:56:45 +01:00
Yonas Habteab	5c0f9bfdaa	HttpServerConnection: Don't spawn useless coroutines Currently, for each `Disconnect()` call, we spawn a coroutine, but every one of them is just usesless, except the first one. However, since all `Disconnect()` usages share the same asio strand and cannot interfere with each other, spawning another coroutine within `Disconnect()` isn't even necessary. When a coroutine calls `Disconnect()` now, it will immediately initiate an async shutdown of the socket, potentially causing the coroutine to yield and allowing the others to resume. Therefore, the `m_ShuttingDown` flag is still required by the coroutines to be checked regularly.	2024-11-14 16:47:01 +01:00
Yonas Habteab	d68ee3fcf8	Merge pull request #10224 from Icinga/Empty-constant Make icinga::Empty constant to prevent accidental changes	2024-11-14 10:35:36 +01:00
Julian Brost	67175c43c0	Merge pull request #10102 from Icinga/icingadb-redis-username Icinga DB: Config no_user_modify and Support Redis username authentication	2024-11-12 17:04:20 +01:00
Julian Brost	5817e7666b	Merge commit from fork Security: fix TLS certificate validation bypass	2024-11-12 15:01:57 +01:00
Alexander A. Klimov	09160ea9eb	Make icinga::Empty constant to prevent accidental changes	2024-11-11 16:31:04 +01:00
Alexander Aleksandrovič Klimov	aa7f159a0f	JsonRpcConnection: don't write new messages on shutdown In fact, this is already done for the outer loop (for each bulk), just not yet for the inner one (for each message of a bulk). So once the remote signals EOF, don't try to process the remaining queue until write error (which can't be associated with a particular message anyway, due to buffering), but just let the peer go. Flush already half-written messages, though, if possible.	2024-11-07 17:32:12 +01:00
Alexander Aleksandrovič Klimov	9a8620d923	Merge pull request #10213 from Icinga/do-not-read-data-on-disconnect JsonRpcConnection: Don't read any data on shutdown	2024-11-07 12:32:02 +01:00
Alexander Aleksandrovič Klimov	fb64c4f057	Atomic#Atomic(): remove superfluous atomic write	2024-11-06 11:37:02 +01:00
Alexander Aleksandrovič Klimov	a77259adc1	Atomic<T>#Atomic(T): fix C++ compliance by not calling `std::atomic<T>::atomic(void)`. After the latter the instance "does not contain a T object, and its only valid uses are destruction and initialization by std::atomic_init" which we don't call. So the only safe option is `std::atomic<T>::atomic(T)`. https://en.cppreference.com/w/cpp/atomic/atomic/atomic	2024-11-05 13:15:22 +01:00
Yonas Habteab	1c34610a78	JsonRpcConnection: Don't read any data on shutdown When the `Desconnect()` method is called, clients are not disconnected immediately. Instead, a new coroutine is spawned using the same strand as the other coroutines. This coroutine calls `async_shutdown` on the TCP socket, which might be blocking. However, in order not to block indefintely, the `Timeout` class cancels all operations on the socket after `10` seconds. Though, the timeout does not trigger the handler immediately; it creates spawns another coroutine using the same strand as in the `JsonRpcConnection` class. This can cause unexpected delays if e.g. `HandleIncomingMessages` gets resumed before the coroutine from the timeout class. Apart from that, the coroutine for writing messages uses the same condition, making the two symmetrical.	2024-10-31 17:09:13 +01:00
Yonas Habteab	d894792c36	Merge pull request #10209 from Icinga/log-error-context-only-once ApiListener: Log error context only once	2024-10-31 13:14:42 +01:00
Alexander Aleksandrovič Klimov	5f487aff1b	Merge pull request #10201 from Icinga/Validation-failed Remove redundant "Validation failed" prefix from ValidationError exceptions	2024-10-31 12:30:39 +01:00
Yonas Habteab	8574357443	ApiListener: Log error context only once When logging at the warning level, the logger will automatically look up for registered context and append them to the log entry accordingly.	2024-10-30 16:55:13 +01:00
Yonas Habteab	e8b7baa298	JsonRpcConnection: Drop unused `m_NextHeartbeat` variable	2024-10-30 14:31:48 +01:00
Yonas Habteab	10775f4481	Merge pull request #10207 from Icinga/log-connected-endpoint-connection-attempts ApiListener: Log connection attempts from an already connected client prominently	2024-10-30 13:31:44 +01:00
Yonas Habteab	9d4625e1ec	ApiListener: Log connection attempts from an already connected client Something is definitely going wrong if a client tries to reconnect to this endpoint while it still has an active connection to that client. So we shouldn't hide this, but at least log it at info level. Apart from that, I've added some additional information about the currently active client, such as when the last message was sent and received.	2024-10-30 11:26:21 +01:00
Alexander Aleksandrovič Klimov	4ca68e444e	Merge pull request #10204 from Icinga/an-HA doc/: fix "a HA" -> "an HA"	2024-10-24 11:30:24 +02:00
Alexander Aleksandrovič Klimov	fb8badfd2e	Merge pull request #10187 from Icinga/state-before-suppression Fix lost recovery notifications after recovery outside of notification time period	2024-10-24 10:07:59 +02:00
Alexander Aleksandrovič Klimov	7df6baf146	Merge pull request #10176 from Icinga/ICINGA2_UNITY_BUILD=OFF-ICINGA2_WITH_LIVESTATUS=ON Fix build on Mac with -DICINGA2_UNITY_BUILD=OFF -DICINGA2_WITH_LIVESTATUS=ON	2024-10-24 10:03:57 +02:00
Alexander A. Klimov	095e5982f4	doc/: fix "a HA" -> "an HA"	2024-10-24 09:44:36 +02:00
Alvar Penning	98f60fd78e	Icinga DB: Support Redis username authentication The Redis ACL system was introduced with Redis 6.0. It introduced users with precisely granular permissions. This change allows Icinga 2 to use the Icinga DB feature against a Redis with an ACL user. This was reflected in the documentation, next to the already implemented, but undocumented Redis database. Closes #9536.	2024-10-24 09:18:19 +02:00
Alvar Penning	57fab7f39e	Icinga DB: Config no_user_modify Each configuration field of an IcingaDB Object was marked with no_user_modify as modifications via the API would not result in an actual change. While the Object would be updated, the internal Redis connection would not be restarted, resulting in an unexpected behavior. The missing db_index was added to the documentation.	2024-10-24 09:18:09 +02:00
Alexander A. Klimov	7a4ba59961	Remove redundant "Validation failed" prefix from ValidationError exceptions ValidationError#ValidationError() already prefixes #m_What, which #what() returns, with "Validation failed for object".	2024-10-23 13:06:12 +02:00
Julian Brost	869a7d6f0f	Security: fix TLS certificate validation bypass The previous validation in set_verify_callback() could be bypassed, tricking Icinga 2 into treating invalid certificates as valid. To fix this, the validation checks were moved into the IsVerifyOK() function. This is tracked as CVE-2024-49369, more details will be published at a later time.	2024-10-22 10:36:58 +02:00
Yonas Habteab	f4e61ef9bd	Merge pull request #10177 from Icinga/log-noop-fix Log: fix some parts of messages not being discarded early	2024-10-21 09:31:19 +02:00
Julian Brost	7d0a43f926	Use `Checkable::GetStateBeforeSuppression()` only where relevant This fixes an issue where recovery notifications get lost if they happen outside of a notification time period. Not all calls to `Checkable::NotificationReasonApplies()` need `GetStateBeforeSuppression()` to be checked. In fact, for one caller, `FireSuppressedNotifications()` in `lib/notification/notificationcomponent.cpp`, the state before suppression may not even be initialized properly, so that the default value of OK is used which can lead to incorrect return values. Note the difference between suppressions happening on the level of the `Checkable` object level and the `Notification` object level. Only the first sets the state before suppression in the `Checkable` object, but so far, also the latter used that value incorrectly. This commit moves the check of `GetStateBeforeSuppression()` from `Checkable::NotificationReasonApplies()` to the one place where it's actually relevant: `Checkable::FireSuppressedNotifications()`. This made the existing call to `NotificationReasonApplies()` unneccessary as it would always return true: the `type` argument is computed based on the current check result, so there's no need to check it against the current check result.	2024-10-11 13:21:10 +02:00
Alexander A. Klimov	c6f9de5933	Ido*sqlConnection#FieldToEscapedString(): don't write out of range time MySQL's FROM_UNIXTIME() NULLs ts <1970, errors for >2038. Postgres' TO_TIMESTAMP() errors for all ts not between 4713BC - 294276AD.	2024-10-02 11:52:25 +02:00
Julian Brost	5e9e0bbcdf	Merge pull request #10059 from Icinga/IcingaDB-TimestampToMilliseconds-limit IcingaDB::TimestampToMilliseconds(): limit output to four year digits	2024-10-02 09:19:03 +02:00
Alexander A. Klimov	ad6fcda6df	Ido*sqlConnection#FieldToEscapedString(): don't overflow timestamps > long	2024-10-01 17:38:52 +02:00
Alexander A. Klimov	dc4869c3aa	IcingaDB::TimestampToMilliseconds(): limit output to four year digits Too high timestamps may overflow uint64_t (and the YYYY format) and negative ones don't fit into uint64_t. Those may crash our Go daemon.	2024-09-30 16:54:40 +02:00
Julian Brost	f0e084d530	Log: fix some parts of messages not being discarded early `m_IsNoOp` was introduced to avoid building up log messages that will later be discarded, like debug messages if no debug logging is configured. However, it looks like the template operator<< implemented in the header file was forgotten when adding this feature, all other places writing into `m_Buffer` already have an if guard like added by this commit.	2024-09-27 14:23:05 +02:00
Alexander A. Klimov	2bbeaec916	Fix build on Mac with -DICINGA2_UNITY_BUILD=OFF -DICINGA2_WITH_LIVESTATUS=ON error: no matching function for call to 'intrusive_ptr_release' ... candidate function not viable: cannot convert argument of incomplete type 'icinga::Notification ' to 'Object ' for 1st argument void intrusive_ptr_release(Object *object);	2024-09-27 12:41:11 +02:00
Julian Brost	b6b1506bda	Merge pull request #10140 from Icinga/drop-cpu-bound-work-usage-from-ifwapi Don't use thread-local var in coroutine & drop superfluous `CpuBoundWork` usage	2024-09-27 11:31:58 +02:00
Yonas Habteab	92df9ef8c3	Merge pull request #10148 from Icinga/enhanced-sort-types-by-load-dependencies Sort config types by their load dependencies once	2024-09-26 15:27:41 +02:00
Sebastian Grund	8c68c6e9d8	Add closing quotationmarks in Validator for influxdb writer config	2024-09-25 13:03:00 +02:00
Yonas Habteab	467e8b18e7	Type: Simplify sort by load dependencies algorithm	2024-09-20 16:18:12 +02:00
Alexander A. Klimov	31f3acaa13	ConfigItem::CommitNewItems(): pre-sort types by their load dependencies once to avoid complicated nested loops, iterating over the same types and checking dependencies over and over, skipping already completed ones.	2024-09-20 16:18:12 +02:00
Alexander A. Klimov	b848934d57	Introduce Type::GetConfigTypesSortedByLoadDependencies()	2024-09-20 16:18:12 +02:00
Yonas Habteab	26f43b0b48	IcingaDB: Don't sync partially initialised objects	2024-09-11 14:08:27 +02:00
Yonas Habteab	74009f0fcb	Don't use thread-local variable in coroutine & process final `cr` in global thread pool	2024-09-05 17:36:03 +02:00
Yonas Habteab	c9159494c0	HttpServerConnection: Drop yet another superfluous `CpuBoundWork` usage	2024-09-05 15:10:14 +02:00
Alexander Aleksandrovič Klimov	79e3cb2a95	Utility::ReleaseHelper(): remove detection of EOL distros We only support /etc/os-release owners.	2024-09-04 10:26:50 +02:00
Alexander Aleksandrovič Klimov	0951230ce1	Merge pull request #9991 from Icinga/JsonRpcConnection-9985 JsonRpcConnection#Send*(): discard messages ASAP once shutting down	2024-09-03 15:13:30 +02:00
Julian Brost	4c6b93d617	Merge pull request #10011 from Icinga/next-check-cluster-sync-issue Checkable: Don't recalculate `next_check` for remotely generated `cr`	2024-08-30 13:37:41 +02:00
Yonas Habteab	9f84c1516e	ApiListener: Reorder logging in `ApiTimerHandler()`	2024-08-28 16:53:53 +02:00
Yonas Habteab	e062ceb901	ApiListener: Catch & supress clients runtime errors	2024-08-28 16:53:53 +02:00
Julian Brost	88e79ea41a	Merge pull request #10111 from Icinga/unregister-invalid-objects-properly Unregister invalid config objects properly	2024-08-27 14:30:38 +02:00
Yonas Habteab	932a53449d	JsonRpcConnection: Raise an exception when trying to send to disconnected clients	2024-08-27 14:23:41 +02:00
Julian Brost	9222a63ff7	Make sure log file is reopened when `ApiListener::ReplayLog()` returns	2024-08-27 14:23:41 +02:00
Yonas Habteab	a5a83e311a	Defer: Allow empty initialization & add `SetFunc()` method	2024-08-27 14:23:41 +02:00
Yonas Habteab	73db30c08b	Use `Defer` class for cleanup in `ApiListener::ReplayLog()`	2024-08-27 14:23:41 +02:00
Alexander A. Klimov	f074e24d2a	ApiListener#ReplayLog(): stop reading files ASAP on send error	2024-08-27 14:23:41 +02:00
Alexander A. Klimov	b538ad2528	JsonRpcConnection#Send*(): discard messages ASAP once shutting down Especially ApiListener#ReplayLog() enqueued lots of messages into JsonRpcConnection#{m_IoStrand,m_OutgoingMessagesQueue} (RAM) even if the connection was shut(ting) down. Now #Disconnect() takes effect ASAP.	2024-08-27 14:23:41 +02:00
Alexander A. Klimov	33f8ea6dcc	JsonRpcConnection#Disconnect(): spawn coroutine only if necessary by checking the now atomic #m_ShuttingDown outside of it.	2024-08-27 14:23:41 +02:00
Alexander A. Klimov	f96e7c67ee	On Windows, don't create C:\Program Files\Icinga2\var during MSI build	2024-08-23 12:49:09 +02:00
Julian Brost	39ae2e8ca4	Utility::FormatDateTime(): provide an overload for tm* This allows the function to be used both with a double timestamp or a pointer to a tm struct. With this, a similar implementation inside the tests can simply use our regular function.	2024-08-23 12:48:50 +02:00
Julian Brost	d5b3ffaa6d	Utility::FormatDateTime(): handle invalid format strings on Windows On Windows, the strftime() function family invokes an invalid parameter handler when the format string is invalid (see the "Remarks" section in their documentation). std::put_time() shows the same behavior as it uses _wcsftime_l() internally. The default invalid parameter handler may terminate the process, which can be a problem given that the format string can be specified by the user from the Icinga DSL. Thus, temporarily set a thread-local no-op handler to disable the default one allowing the program to continue. This then simply results in the function returning an error which then results in an exception as we ask the stream to throw one. See also: https://learn.microsoft.com/en-us/cpp/c-runtime-library/reference/strftime-wcsftime-strftime-l-wcsftime-l?view=msvc-170 https://learn.microsoft.com/en-us/cpp/c-runtime-library/parameter-validation?view=msvc-170 https://learn.microsoft.com/en-us/cpp/c-runtime-library/reference/set-invalid-parameter-handler-set-thread-local-invalid-parameter-handler?view=msvc-170	2024-08-23 12:48:50 +02:00
Julian Brost	0285028689	Utility::FormatDateTime(): handle errors from strftime() So far, the return value of strftime() was simply ignored and the output buffer passed to the icinga::String constructor. However, there are error conditions where strftime() returns 0 to signal an error, like if the buffer was too small for the output. In that case, there's no guarantee on the buffer contents and reading it can result in undefined behavior. Unfortunately, returning 0 can also indicate success and strftime() doesn't set errno, so there's no reliable way to distinguish both situations. Thus, the implementation now returns the empty string in both cases. I attempted to use std::put_time() at first as that allows for better error handling, however, there were problems with the implementation on Windows (see inline comment), so I put that plan on hold at left strftime() there for the time being.	2024-08-23 12:42:54 +02:00
Julian Brost	c2c66908f6	Utility::FormatDateTime(): use localtime_s() on Windows localtime() is not thread-safe as it returns a pointer to a shared tm struct. Everywhere except on Windows, localtime_r() is used already which avoids the problem by using a struct allocated by the caller for the output. Windows actually has a similar function called localtime_s() which has the same properties, just with a different name and order of arguments.	2024-08-23 12:42:32 +02:00
Julian Brost	704acdc698	Utility::FormatDateTime(): use boost::numeric_cast<>() The previous implementation actually had undefined behavior when called with a double that can't be represented as time_t. With boost::numeric_cast, there's a convenient cast available that avoids this and throws an exceptions on overflow. It's undefined behavior ([0], where the implicit conversion rule comes into play because the C-style cast uses static_cast [1] which in turn uses the imlicit conversion as per rule 5 of [2]): > A prvalue of floating-point type can be converted to a prvalue of any integer > type. The fractional part is truncated, that is, the fractional part is > discarded. > > * If the truncated value cannot fit into the destination type, the behavior > is undefined (even when the destination type is unsigned, modulo arithmetic > does not apply). Note that on Linux amd64, the undefined behavior typically manifests itself in the result being the minimal value of time_t which then results in localtime_r failing with EOVERFLOW. [0]: https://en.cppreference.com/w/cpp/language/implicit_conversion#Floating.E2.80.93integral_conversions [1]: https://en.cppreference.com/w/cpp/language/explicit_cast [2]: https://en.cppreference.com/w/cpp/language/static_cast	2024-08-23 12:42:30 +02:00
Julian Brost	4c83d793a6	Merge pull request #9983 from Icinga/broken-timeperiod Fix broken `TimePeriod/ScheduledDowntime`s	2024-08-20 10:05:59 +02:00
Yonas Habteab	ca7cc54438	Checkable: Don't recalculate `next_check` while processing remotely genrated check Currently, when processing a `CheckResult`, it will first trigger an `OnNextCheckChanged` event, which is sent to all connected endpoints. Then, when `Checkable::ProcessCheckResult()` returns, an `OnCheckResult` event is fired, which is of course also sent to all connected endpoints. Next, the other endpoints receive the `event::SetNextCheck` cluster event followed by `event::CheckResult`and invoke `checkable#SetNextCheck()` and `Checkable#CheckResult()` with the newly received check. So they also try to recalculate the next check themselves and invalidate the previously received next check timestamp from the source endpoint. Since each endpoint randomly initialises its own scheduling offset, the recalculated next check will always differ by a split second/millisecond on each of them. As a consequence, two Icinga DB HA instances will generate two different checksums for the same state and causes the state histories to be fully resynchronised after a takeover/Icinga 2 reload.	2024-08-16 16:15:56 +02:00
Alexander Aleksandrovič Klimov	02ba5e4101	Merge pull request #10015 from Icinga/malloc_info /v1/debug/malloc_info: call malloc_info(3) if available	2024-08-12 14:41:09 +02:00
Alexander A. Klimov	f3c7ac11e9	/v1/debug/malloc_info: call malloc_info(3) if available The GNU libc function malloc_info(3) provides memory allocation and usage statistics of Icinga 2 itself.	2024-08-09 12:59:25 +02:00
Julian Brost	2bfa1f1649	Merge pull request #10107 from Icinga/timeperiod-nth-day-of-month-off-by-one Timeperiods: fix off by one when calculating n-th last weekday of the month	2024-08-08 14:40:18 +02:00
Julian Brost	c45829b59f	Timeperiods: fix off by one when calculating n-th last weekday of the month A day specification like "monday -1" refers to the last Monday of the month. However, there was an off by one if the first day of the next month is the same day of the week, i.e. a Monday in this example. LegacyTimePeriod::FindNthWeekday() picks a day to start the search for the day in question. When given a negative n to search for the n-th last day, it wrongly used the first day of the following month as the start and counted it as if it was within the current month. This resulted in a 1/7 chance that the result was one week too late. This is fixed by using the last day of the current month instead.	2024-08-07 12:06:05 +02:00
Yonas Habteab	c4edecc1fb	Unregister invalid config objects properly	2024-08-06 16:59:30 +02:00
Julian Brost	07d253009a	Merge pull request #10013 from Icinga/broken-runtime-config-sync Fix broken runtime config sync	2024-08-06 11:57:24 +02:00
Yonas Habteab	86347013a6	Check segemnt start date inclusively in `TimePeriod::IsInside()`	2024-08-01 16:16:48 +02:00
Yonas Habteab	4daa03dc02	Fix broken timeperiods/scheduleddowntimes	2024-08-01 15:14:34 +02:00
Yonas Habteab	546dea95a2	Don't allow to modify/create/delete an object concurrently	2024-06-13 11:26:19 +02:00
Yonas Habteab	099f664ce6	`ConfigObjectUtility#CreateObject()`: Use `Defer` for config path cleanup	2024-06-13 11:26:19 +02:00
Yonas Habteab	433e2de13a	ApiListener: Process cluster config updates sequentially	2024-06-13 11:26:19 +02:00
Yonas Habteab	1a55b68541	Introduce RAII style `ObjectNameLock` class	2024-06-13 11:26:19 +02:00
Yonas Habteab	2218ebd6b0	`ConfigObjectUtility`: Use `AtomicFile` to store object config files	2024-06-13 11:26:19 +02:00
Alexander Aleksandrovič Klimov	f1be9b73ab	Merge pull request #10060 from Icinga/IcingaDB-SerializeState-execution_time-latency IcingaDB#SerializeState(): limit execution_time and latency to 2^32-1	2024-06-13 09:55:45 +02:00
Yonas Habteab	81a94a0759	Don't fail to remove obsolete downtimes	2024-05-23 10:09:41 +02:00
Yonas Habteab	4eeccce36c	Don't loose args in recursive `Downtime::RemoveDowntime()` call	2024-05-23 10:09:41 +02:00
Yonas Habteab	e0fd0d3df4	Introduce & use enum `DowntimeRemovalReason`	2024-05-23 09:34:15 +02:00
Alexander Aleksandrovič Klimov	cc3965c3ce	Merge pull request #10065 from Icinga/heavy-update-missing-table-relations Update `object#config_hash` after all relations queries	2024-05-22 15:38:31 +02:00
Yonas Habteab	1019398d55	Update object#config_hash after all relations queries	2024-05-22 13:39:30 +02:00
Yonas Habteab	3d64240ee3	Merge pull request #10066 from Icinga/Checkable-RemoveAllDowntimes Remove unused Checkable#RemoveAllDowntimes()	2024-05-21 17:13:16 +02:00
Alexander A. Klimov	e2bdb8a2f1	Remove unused Checkable#RemoveAllDowntimes()	2024-05-21 14:28:39 +02:00
Alexander A. Klimov	f9adf18111	IcingaDB#SerializeState(): limit execution_time and latency to 2^32-1 not to write higher values into Redis than the Icinga DB schema can hold. This fixes yet another potential Go daemon crash.	2024-05-15 12:55:41 +02:00
Alexander Aleksandrovič Klimov	8c2eb3c1ed	Merge pull request #10049 from Icinga/AddDowntime-trigger_name Downtime::AddDowntime(): NULL-check pointer before deref not to crash	2024-05-06 10:26:26 +02:00
Alexander Aleksandrovič Klimov	d8f8d64f1a	Merge pull request #10027 from macdems/master Fix missing values in PerfData normalization	2024-04-25 19:38:21 +02:00
Maciej Dems	2bb5cc62e2	Fix missing values in PerfData normalization	2024-04-25 17:41:12 +02:00
Alexander A. Klimov	5f80ac17aa	l_LegacyDowntimesCache: delete removed objects not to leak memory	2024-04-25 12:13:52 +02:00
Alexander A. Klimov	c0f87dd4c9	/v1/actions/schedule-downtime: reject request on invalid trigger_name For this purpose lookup the specified Downtime. Also pass Downtime objects, not just names, to Downtime::AddDowntime() not to lookup it twice.	2024-04-25 12:13:52 +02:00
Alexander A. Klimov	f0b5239a15	[Refactor] Downtime::GetDowntimeIDFromLegacyID(): return the Downtime itself not just its name.	2024-04-25 12:13:52 +02:00
Alexander A. Klimov	28b0f7a48c	[Refactor] l_LegacyDowntimesCache: store Downtime objects, not just their names to avoid names of vanished objects.	2024-04-24 12:33:56 +02:00
Alexander A. Klimov	bb13e98ca5	PluginCheckTask::ProcessFinishedHandler(): warn about exit codes outside 0..3 in the plugin output as well, in addition to the warning log.	2024-04-23 17:45:31 +02:00
Alexander A. Klimov	e33befabfb	Make ProcessResult#ExitStatus and CheckResult#exit_status 64-bit ints so that they can hold Windows exit codes like 3221225477 (>2147483647).	2024-04-23 17:45:31 +02:00
Alexander A. Klimov	5c17465a19	OpenTsdbWriter#CheckResultHandler(): skip custom tags with empty values refs #7724	2024-04-18 11:36:21 +02:00
Lorenz Kästle	7afda4dc0d	Add cli option to disable the default global zones When setting up Icinga 2 agents, in most cases, the default global zones are not needed, but have to be removed manually or automatically whith tools outside of Icinga 2 from the configuration. This seems like unnecessary work, since the node setup command does everything else. This commit introduces a new option for the node setup command ("--no-default-global-zones") to exclude the default global zones.	2024-04-03 08:10:45 +02:00
Yannick Martin	5e92450877	icinga2: address comment loading where host reference is not found address #9752: check if host reference is valid	2024-03-11 12:42:23 +01:00
Julian Brost	31be43ff6c	Merge pull request #10018 from Icinga/revert-9980-config-sync-conflicts Revert "Process `config::update/delete` cluster events gracefully"	2024-03-08 16:58:28 +01:00
Julian Brost	af97431bfb	Merge pull request #10006 from Icinga/http-error-handling HttpServerConnection: use exceptions for error handling	2024-03-08 15:06:51 +01:00
Yonas Habteab	a924a49cd8	Revert "Process `config::update/delete` cluster events gracefully"	2024-03-07 17:17:17 +01:00
Julian Brost	097ba00a9c	Merge pull request #10008 from Icinga/Al2Klimov-patch-12 Don't unnecessarily shuffle items before config validation	2024-03-07 16:44:38 +01:00
Alexander Aleksandrovič Klimov	629038344b	OpenTsdbWriter#CheckResultHandler(): clarify log messages Clarify which "host or service" an "Unable to resolve macro" debug log message refers to.	2024-02-22 10:34:35 +01:00
Julian Brost	abea2f270c	Merge pull request #9997 from Icinga/ListenerCoroutineProc-remote_endpoint ApiListener#ListenerCoroutineProc(): get remote endpoint ASAP for logging	2024-02-20 13:46:02 +01:00
Alexander Aleksandrovič Klimov	51cdd593da	Don't unnecessarily shuffle items before config validation Before `ae693cb7e1` (#9577) we've repeatedly looped over all items in parallel like this: while not types.done: for t in types: if not t.done and t.dependencies.done: with parallel(all_items, CONCURRENCY) as some_items: for i in some_items: if i.type is t: i.commit() I.e. all items got distributed over CONCURRENCY threads, but not always equally. E.g. it was the hosts' turn, but only two threads got hosts and did all the work. The others didn't do actual work (due to the lack of hosts in their queue) which reduced the performance. `c721c302cd` (#6581) fixed it by shuffling all_items first. `ae693cb7e1` (#9577) made the latter unnecessary by replacing the above algorithm with this: while not types.done: for t in types: if not t.done and t.dependencies.done: with parallel(all_items[t], CONCURRENCY) as some_items: for i in some_items: if i.type is t: i.commit() I.e. parallel() gets only items of type t, so all threads get e.g. hosts.	2024-02-19 14:26:06 +01:00
Julian Brost	700c5a13d7	HttpServerConnection: use exceptions for error handling When a HTTP connection dies prematurely while the response is sent, `http::async_write()` sets the error code to something like broken pipe for example. When calling `async_flush()` afterwards, it sometimes happens that this never returns. This results in a resource leak as the coroutine isn't cleaned up. This commit makes the individual functions throw exceptions instead of silently ignoring the errors, resulting in the function terminating early and also resulting in an error being logged as well.	2024-02-19 14:12:41 +01:00
Julian Brost	04ef105caa	Merge pull request #9980 from Icinga/config-sync-conflicts Process `config::update/delete` cluster events gracefully	2024-02-19 13:49:41 +01:00
Julian Brost	7d1c887a32	Merge pull request #9999 from Icinga/reset-log-message-count-correctly ApiListener: Reset `m_LogMessageCount` when rotating	2024-02-15 17:06:16 +01:00
Alexander Aleksandrovič Klimov	9db1c4aca3	Merge pull request #8011 from Icinga/bugfix/reset-sigpipe-6912 Reset all signal handlers of child processes	2024-02-15 12:22:36 +01:00
Yonas Habteab	456144c1dc	ApiListener: Process cluster config updates sequentially	2024-02-14 14:25:53 +01:00
Yonas Habteab	40011b0584	Introduce `ObjectNamesMutex` helper class	2024-02-14 14:25:53 +01:00
Alexander Aleksandrovič Klimov	1a8ce5a90e	Merge pull request #9575 from Icinga/WorkQueue-ParallelFor WorkQueue#ParallelFor(): allocate lambda once per thread, not once per item	2024-02-14 12:59:50 +01:00
Julian Brost	2be08aa2e0	Merge pull request #9992 from Icinga/remove-redundat-cpu-bound-work Drop redundant `CpuBoundWork` usage in `JsonRpcConnection::Disconnect()`	2024-02-13 15:51:34 +01:00
Julian Brost	fc6a106345	Merge pull request #9994 from Icinga/redundant-cpu-bound-work-usages Drop redundant `CpuBoundWork` usages in `lib/remote`	2024-02-13 14:53:59 +01:00
Alexander Aleksandrovič Klimov	48eb563ca0	Merge pull request #9736 from Icinga/stream-read-allow_partial Stream#Read(): remove de facto unused param allow_partial	2024-02-13 13:04:15 +01:00
Yonas Habteab	008fcd1744	Preserve runtime objects in a tmp file for the entire validation process Given that the internal `config::Update` cluster events are using this as well to create received runtime objects, we don't want to persist first the conf file and the load and validate it with `CompileFile`. Otherwise, we are forced to remove the newly created file whenever we can't validate, commit or activate it. This also would also have the downside that two cluster events for the same object arriving at the same moment from two different endpoints would result in two different threads simultaneously creating and loading the same config file - whereby only one of the surpasses the validation, while the other is facing an object `re-definition` error and tries to remove that config file it mistakenly thinks it has created. As a consequence, an object successfully created by the former is implicitly deleted by the latter thread, causing the objects to mysteriously disappear.	2024-02-12 15:18:32 +01:00
Yonas Habteab	6e66cd9aff	ApiListener: Reset `m_LogMessageCount` when rotating Closing and re-opening that very same log file shouldn't reset the counter, otherwise some log files may exceed the max limit per file as their offset indicator is reset each time they are re-opened.	2024-02-09 18:04:20 +01:00
Yonas Habteab	eb813cfb99	HttpServerConnection: Drop superfluous `CpuBoundWork` usage	2024-02-09 15:17:26 +01:00
Alexander A. Klimov	62e1d7650d	ApiListener#ListenerCoroutineProc(): get remote endpoint ASAP for logging On incoming connection timeout we log the remote endpoint which isn't available if it was already disconnected - an exception is thrown. Get it as long as we're still connected not to lose it, nor to get an exception.	2024-02-09 12:27:25 +01:00
Yonas Habteab	32531fe909	EventsHandler: Drop superfluous `CpuBoundWork` usage	2024-02-09 12:00:50 +01:00
Eric Lippmann	c7293de91d	IoEngine: Always log coroutine exception diagnostics While analyzing a possible memory leak, we encountered several coroutine exception messages, which unfortunately do not provide any information about what exactly went wrong, as exception diagnostics were previously only logged at the notice level.	2024-02-08 12:09:06 +01:00
Yonas Habteab	72266434df	Drop redundant `CpuBoundWork` usages in `lib/remote`	2024-02-08 11:30:23 +01:00
Yonas Habteab	e2793f1d88	Drop redundant `CpuBoundWork` usage in `JsonRpcConnection::Disconnect()` Although there is locking involved here, it shoudln't take too long for the thread to actually acquire it, since there aren't that many threads dealing with endpoint clients concurrently. It's just wasting pointless time trying to obtain a CPU slot.	2024-02-08 11:24:55 +01:00
Alexander Aleksandrovič Klimov	e9fcbf400f	Merge pull request #9966 from Icinga/Al2Klimov-patch-3 HttpServerConnection: remove duplicate ")" from a log message	2024-01-18 10:46:51 +01:00
Alexander A. Klimov	d48b369554	Reset all signal handlers of child processes ... not to disturb check plugins. refs #6912	2024-01-17 12:25:59 +01:00
Alexander Aleksandrovič Klimov	966b46e808	Merge pull request #9965 from Icinga/http-request-time HttpServerConnection: log request processing time as well	2024-01-17 11:30:33 +01:00
Julian Brost	b1fe15f694	Merge pull request #9962 from Icinga/influx-disk-9948 Influx DB: truncate timestamps to whole seconds to save disk space	2024-01-17 08:50:16 +01:00
Alexander A. Klimov	b6874cc8d4	HttpServerConnection: log request processing time as well	2024-01-16 17:52:07 +01:00
Alexander Aleksandrovič Klimov	6a4cb5c12c	HttpServerConnection: remove duplicate ")" from a log message The commit `5c32a5a7dc`, which introduced it, clearly shows that the other ")" already existed legitimately.	2024-01-16 16:31:00 +01:00
Alexander A. Klimov	cc9db3756f	Revert "Influx DB: don't unneccessarily truncate timestamps to whole seconds" This reverts commit `eaa3cd83ad`.	2024-01-16 12:19:48 +01:00
Alexander A. Klimov	fc5b1178c6	Revert "Remove no-op InfluxDB URL param" This reverts commit `21f548d3c0`.	2024-01-16 12:19:47 +01:00
Alexander Aleksandrovič Klimov	28b2db8446	Merge pull request #9851 from Icinga/Al2Klimov-patch-3 Make ObjectImpl<Logger>#GetSeverity() non-virtual	2023-12-22 12:44:51 +01:00
Alexander Aleksandrovič Klimov	6c03598678	Merge pull request #9896 from Icinga/provide-cancel_time-where-has_been_cancelled-may-be-1 Disallow triggering a cancelled downtime, but provide cancel_time in Icinga DB downtime history where has_been_cancelled may be 1	2023-12-20 10:03:09 +01:00
Alexander Aleksandrovič Klimov	949d983a76	Merge pull request #9895 from Icinga/targeted-api-filter FilterUtility::GetFilterTargets(): don't run filter for specific object(s) for all objects	2023-12-19 15:18:41 +01:00
Alexander Aleksandrovič Klimov	8b2e28a869	Merge pull request #9891 from Icinga/renew-the-ca-9890 ApiListener#Start(): auto-renew CA on its owner	2023-12-19 14:57:47 +01:00
Alexander Aleksandrovič Klimov	96cfc4abe8	Merge pull request #9887 from Icinga/argument-list-too-long-9340 PluginNotificationTask::ScriptFunc(): on Linux truncate output and comment	2023-12-19 14:36:57 +01:00
Alexander A. Klimov	175153ce6a	PluginNotificationTask::ScriptFunc(): on Linux truncate output and comment not to run into an exec(3) error E2BIG due to a too long argument. This sends a notification with truncated output instead of not sending.	2023-12-19 12:21:03 +01:00
Alexander A. Klimov	966216f4ba	RequestCertificateHandler(): also renew if CA needs a renewal and a newer one is available.	2023-12-18 15:28:11 +01:00
Alexander A. Klimov	551c3afa60	CertificateToString(): allow raw pointer input	2023-12-18 15:28:11 +01:00
Alexander A. Klimov	bc778116e9	ApiListener#Start(): auto-renew CA on its owner otherwise it would expire.	2023-12-18 15:28:11 +01:00
Alexander A. Klimov	36a08b0497	ApiListener#RenewCert(): enable optional CA creation	2023-12-18 15:28:11 +01:00
Alexander A. Klimov	7b55df6f11	CreateCertIcingaCA(EVP_PKEY, X509_NAME): enable optional CA creation	2023-12-18 15:28:11 +01:00
Alexander Aleksandrovič Klimov	953eeba061	Merge pull request #9893 from Icinga/do-not-re-notify-if-filtered-states-don-t-change-4503 Discard likely duplicate problem notifications via Notification#last_notified_state_per_user	2023-12-13 16:13:28 +01:00
Alexander A. Klimov	ecfc9033b0	FilterUtility::GetFilterTargets(): don't run filter for specific object(s) for all objects	2023-12-13 16:02:50 +01:00
Alexander A. Klimov	15191bcd74	ApplyRule::GetTarget*s(): support constant strings from variables in addition to literal strings. This is for sandboxed filters with some variables pre-set by the caller. They're "constant" in that scope, too.	2023-12-13 16:02:50 +01:00
Alexander A. Klimov	a04cef1890	Introduce DictExpression#GetExpressions()	2023-12-13 16:02:50 +01:00
Alexander A. Klimov	8bcae97ecc	Introduce Dictionary#GetRef()	2023-12-13 16:02:50 +01:00
Alexander A. Klimov	97cd05db7a	Notification#BeginExecuteNotification(): on recovery clear last_notified_state_per_user	2023-12-13 13:21:22 +01:00
Alexander A. Klimov	44e9c6f40d	Notification#BeginExecuteNotification(): discard likely duplicate problem notifications	2023-12-13 13:21:19 +01:00
Alexander A. Klimov	74f52c6fcd	Introduce IsCaUptodate() by splitting IsCertUptodate()	2023-12-13 12:08:34 +01:00
Julian Brost	871fa67b52	Merge pull request #9885 from Icinga/renegotiation	2023-12-12 17:38:09 +01:00
Alexander A. Klimov	2cff763295	Cluster-sync Notification#last_notified_state_per_user	2023-12-12 15:29:50 +01:00
Alexander A. Klimov	b25ba7a316	Notification#BeginExecuteNotification(): track state change notifications	2023-12-07 12:43:30 +01:00
Julian Brost	d2a7117007	Merge pull request #9899 from Icinga/icinga2-crashes-silently-9897 IcingaDB#SendConfigDelete(): fix missing nullptr check before deref	2023-11-21 11:03:28 +01:00
Alexander Aleksandrovič Klimov	7fc7d054af	Merge pull request #9841 from WuerthPhoenix/fix-9840-lock-console-api-during-reload	2023-11-21 10:36:26 +01:00
Alexander A. Klimov	7174dc864d	IcingaDB#SendConfigDelete(): fix missing nullptr check before deref	2023-11-10 17:43:33 +01:00
Alexander A. Klimov	9aaa9901bd	Icinga DB downtime history: provide cancel_time where has_been_cancelled may be 1 The table sla_history_downtime requires a downtime_end. The Go daemon takes the cancel_time if has_been_cancelled is 1. So we must supply a cancel_time whereever has_been_cancelled is 1. Otherwise the Go daemon can't process some entries.	2023-11-08 15:22:39 +01:00
Alexander A. Klimov	7ce9457a4a	Disable TLS renegotiation The API doesn't need it and a customer's security scanner is afraid of a potential DoS attack vector.	2023-11-06 18:46:37 +01:00
Theo Buehler	1f06589f7a	Remove dead code in GetSignatureAlgorithm() This code was added in commit `548eb93` and never did anything useful. Using X509_get_signature_nid() or its expanded version in the pre-1.1 branch is the correct way of retrieving the signature algorithm of a certificate. CLA: trivial	2023-10-20 18:55:44 +02:00
Julian Brost	bba6a76f4a	Merge pull request #9853 from Icinga/GelfWriter-m_StreamMutex GelfWriter: protect m_Stream via m_WorkQueue, not ObjectLock(this)	2023-09-07 11:46:38 +02:00
Alexander Aleksandrovič Klimov	e5d988a2fe	Merge pull request #7799 from Icinga/bugfix/file-end Fix file endings	2023-08-25 11:06:19 +02:00
Alexander A. Klimov	4ee10a6c20	GelfWriter: protect m_Stream via m_WorkQueue, not ObjectLock(this) On shutdown or HA re-connect ConfigObject#SetAuthority(false) is called which does ObjectLock(this) and ConfigObject#Pause(). GelfWriter#Pause(), with the above ObjectLock, calls m_WorkQueue.Join(). But items inside that also doing ObjectLock(this) cause a deadlock.	2023-08-24 17:48:09 +02:00
Alexander Aleksandrovič Klimov	993c9b742d	Make ObjectImpl<Logger>#GetSeverity() non-virtual After all it's not overridden.	2023-08-15 13:03:31 +02:00
Mattia Codato	41e21cb8cf	Prevent calls to command API while the configuration is reloading. Fixes #9840	2023-08-09 08:45:04 +02:00
Alexander A. Klimov	1308ad62af	Stream#Read(): remove de facto unused param allow_partial The only caller passes true, so no one forbids partial reads (even implicitly). All usages in the implementation just assert it being true (allowed).	2023-07-13 16:55:48 +02:00
Alexander Aleksandrovič Klimov	1af5109ad3	Merge pull request #9734 from Icinga/remove-unused-stream-peek- Remove unused Stream#Peek()	2023-07-13 16:52:29 +02:00
Alexander A. Klimov	8f8a6ee2a0	Application::m_LastReloadFailed: if double isn't always lock free, use uint32_t which will overflow in 2106, not 2038. This fixes a compile failure on 32-bit Raspbian.	2023-07-10 10:51:02 +02:00
Alexander Aleksandrovič Klimov	000a776dfb	Built-in check command: ifw-api (#9062 )	2023-07-06 14:18:21 +02:00
Julian Brost	26a75f8a6f	Merge pull request #9812 from Icinga/support-elasticsearch-8-0-9251 ElasticsearchWriter: switch to v7+ URL schema to support v8	2023-07-05 10:15:10 +02:00
Julian Brost	fe13b96226	Merge pull request #9809 from Icinga/reevaluate-and-update-default-tls-cipher-list-9808 Copy and paste global default TLS cipher set from ssl-config.mozilla.org	2023-07-03 19:13:10 +02:00
Alexander A. Klimov	617dda61fb	Re-order global default TLS cipher list to prefer AES256 over AES128	2023-07-03 15:36:11 +02:00
Alexander A. Klimov	4c2e59a690	ElasticsearchWriter: switch to v7+ URL schema to support v8 and OpenSearch 2. This breaks the EOL v5 and v6.	2023-07-03 14:43:45 +02:00
Julian Brost	70d6b6e424	Merge pull request #9810 from Icinga/Al2Klimov-patch-8 ElasticsearchWriter#Pause(): call Flush() only once	2023-06-30 17:21:16 +02:00
Alexander Aleksandrovič Klimov	076eb59443	ElasticsearchWriter#Pause(): lock m_DataBufferMutex during Flush() just to be sure regarding race conditions.	2023-06-30 14:57:18 +02:00
Julian Brost	a2e05f89e8	Enable built-in OpenSSL DH parameters to allow DHE TLS ciphers Non-ECC DHE ciphers in the `cipher_list` attribute of `ApiListener` (the default value includes these) had no effect as no DH parameters were available and therefore the server wouldn't offer these ciphers. OpenSSL provides built-in DH parameters starting from version 1.1.0, however, these have to be enables explicitly using the `SSL_CTX_set_dh_auto()` function. This commit does so and thereby makes it possible to establish a connection to an Icinga 2 server using a DHE cipher.	2023-06-29 12:06:26 +02:00
Alexander Aleksandrovič Klimov	d5e6ecec8a	ElasticsearchWriter#Pause(): call Flush() only once The first Flush() is redundant and may access m_DataBuffer at the same time as some Flush() in m_WorkQueue (race condition) which isn't joined, yet.	2023-06-29 10:42:12 +02:00
Alexander A. Klimov	2e053b0e06	Copy and paste global default TLS cipher set from ssl-config.mozilla.org which got more secure by now, but still overlaps with v2.13.x' set.	2023-06-28 14:49:08 +02:00
Julian Brost	a2926b8604	Merge pull request #9794 from Icinga/round-notification-times-begin-end-not-to-crash-go-daemon IcingaDB::PrepareObject(): round Notification#times.{begin,end} not to crash Go daemon	2023-06-27 17:08:41 +02:00
Alexander A. Klimov	dccb678882	IcingaDB::PrepareObject(): cut off (null) negative Notification#times.{begin,end} not to crash Go daemon At least our PostgreSQL schema enforces positive values.	2023-06-27 12:58:08 +02:00
Alexander A. Klimov	415b810abf	IcingaDB::PrepareObject(): round Notification#times.{begin,end} not to crash Go daemon The latter expects ints, not floats - not to mention strings. Luckily Icinga already enforces numeric strings so that we can cast it to number.	2023-06-27 12:53:08 +02:00
Julian Brost	9cf519316e	Merge pull request #9805 from Icinga/checkcommand-timeout-0-crashes-icinga-db-daemon-9804 IcingaDB::PrepareObject(): cut off (0) negative Command#timeout for Redis	2023-06-27 10:45:02 +02:00
Julian Brost	c08d3beeb1	Merge pull request #9785 from Icinga/Al2Klimov-patch-8 Icinga DB: also write ConfigObject#original_attributes into Redis	2023-06-27 10:24:41 +02:00
Julian Brost	bd11bc2eb4	Merge pull request #9793 from Icinga/unmarshal-number-42-5-into-go-struct-field-notification-notification_interval IcingaDB::PrepareObject(): round Notification#interval and limit it to >=0	2023-06-27 10:12:13 +02:00
Alexander A. Klimov	d641a3c799	IcingaDB::PrepareObject(): cut off (0) negative Command#timeout for Redis not to crash the Go daemon which expects positive values there.	2023-06-26 15:36:47 +02:00
Julian Brost	5350aa3c72	Merge pull request #9792 from Icinga/icingadb-conversion-of-strings-to-number-types-to-avoid-crashes-9791 IcingaDB::PrepareObject(): convert non-null Checkable#check_timeout to number	2023-06-26 15:03:21 +02:00
Alexander A. Klimov	273aa6f997	IcingaDB::PrepareObject(): round Notification#interval and limit it to >=0 otherwise, e.g. with -42.5, the Go daemon crashes. It expects uints there.	2023-06-19 12:46:40 +02:00
Alexander A. Klimov	9f08bad395	IcingaDB::PrepareObject(): convert non-null Checkable#check_timeout to number and, in case of null, fall back to Checkable#check_command.timeout, just like IcingaDB#SerializeState(). Otherwise the Go daemon crashes. It expects a number.	2023-06-15 12:29:42 +02:00
Alexander A. Klimov	1587431945	POST /v1/objects: allow array of attrs to undo modifications of	2023-06-13 16:40:33 +02:00
Alexander A. Klimov	385fe2fd76	Icinga DB: also write ConfigObject#original_attributes into Redis for the case the Go daemon decides to sync them into DB.	2023-06-12 12:53:25 +02:00
Julian Brost	7c381ae12f	Merge pull request #9779 from Icinga/macroprocessor-resolvemacro-quasi-cv-object-icingaapplication MacroProcessor::ResolveMacro(): treat quasi-CV-object IcingaApplication as real CV-object	2023-05-31 20:41:31 +02:00
Alexander A. Klimov	a9c80ffb2e	MacroProcessor::ResolveMacro(): treat quasi-CV-object IcingaApplication as real CV-object As MacroProcessor checked just for CustomVarObject base class, but IcingaApplication provided the vars attribute by itself, it had to also resolve CV macros by itself. That logic diverged from MacroProcessor so that macros inside IcingaApplication CVs weren't resolved. Until now.	2023-05-31 16:35:09 +02:00
Julian Brost	8a42c3bf18	Merge pull request #9775 from Icinga/icingadb-service-crashes-on-negative-downtime-duration-or-end-before-start-9774 Icinga DB: don't write negative Downtime durations into Redis	2023-05-31 11:37:42 +02:00
Alexander A. Klimov	75eaa81c06	Icinga DB: don't write negative Downtime durations into Redis via `std::max(0, x)` not to crash the Go daemon which can't handle such.	2023-05-30 17:56:03 +02:00
Julian Brost	b0899d9ab4	Merge pull request #8429 from Icinga/bugfix/last-reload-attempt-failed-8428 Share "Last reload attempt failed" time across Icinga process tree on *nix	2023-05-30 11:42:21 +02:00
Julian Brost	d871c5c837	Merge pull request #9772 from Icinga/icinga-db-feature-should-normalize-command-arguments-required-skip_key-repeat_key-to-boolean-9576 Icinga DB feature: normalize Command.arguments[].{required,skip_key…	2023-05-25 11:54:01 +02:00
Alexander A. Klimov	ad618e9716	Icinga DB feature: normalize Command.arguments[].{required,skip_key,repeat_key} to boolean At the moment, the Icinga DB feature will use that value as-is and serialize it to JSON, resulting in a crash in Icinga DB down the road because it expects a boolean.	2023-05-24 16:04:14 +02:00
Julian Brost	2470e930eb	Merge pull request #9643 from Icinga/hardware_concurrency Always use Configuration#Concurrency, not `std:🧵:hardware_concurrency()`	2023-05-23 19:23:14 +02:00
Alexander A. Klimov	3fae41ef22	Restart thread pool after freezing Configuration The user (-D) or we could have changed Configuration.Concurrency, so correct the thread pool's thread amount.	2023-05-23 14:41:35 +02:00
Julian Brost	0e25644151	Merge pull request #8969 from Icinga/bugfix/perfdata-dont-get-parsed-correctly-8912 PluginUtility: Fix PerfData parsing for values separated with multiple spaces	2023-05-22 17:16:31 +02:00
Alexander A. Klimov	9376a311ea	Fix file endings git ls-files -z \ \|grep -zEe '^lib/' \ \|grep -zEe '\.[ch]pp$' \ \|xargs -0 perl -p0i -e 's/\n*(?!(?:.\|\n))/\n/'	2023-05-17 18:05:13 +02:00
Alexander A. Klimov	32eb1680f7	Configuration.Concurrency: default to 1 until Configuration freeze not to start many threads before the user could override their amount (-D).	2023-05-11 16:59:47 +02:00
Alexander A. Klimov	8fb5d53118	Track Configuration.Concurrency modifications	2023-05-11 15:41:35 +02:00
Alexander A. Klimov	5c330e9d4f	Share "Last reload attempt failed" time across Icinga process tree on *nix ... as only the umbrella process knows that time, but the icinga check running in the main process also needs to know it. refs #8428	2023-05-08 14:42:21 +02:00
Julian Brost	eca8890d49	Merge pull request #9718 from Icinga/acknowledgement-sync-between-masters-are-not-working-9652 Checkable#ProcessCheckResult(): only clean up ack comments older than check result	2023-05-05 15:29:38 +02:00
Julian Brost	af9d67b262	Merge pull request #9726 from Icinga/43624b Remove -and notify- expired downtimes immediately, not every 60s II	2023-05-02 11:25:03 +02:00
Alexander A. Klimov	58b788cd51	Downtime#Start(): trigger flexible downtimes not earlier than fixed ones the last state change could be a long time ago. If it's longer than the new downtime's duration, the downtime expires immediately. trigger time + duration < now	2023-04-18 16:55:32 +02:00
Julian Brost	8238ec0d96	Merge pull request #9725 from Icinga/operation_aborted-shutDownIfNeeded.Cancel ApiListener#NewClientHandlerInternal(): on basic_socket#cancel() (due to timeout) don't ssl::stream#async_shutdown()	2023-04-17 12:21:21 +02:00
Alexander A. Klimov	0ac1cd1ecb	Rename Downtime::DowntimesExpireTimerHandler() to actually reflect its purpose.	2023-04-14 14:52:05 +02:00
Alexander A. Klimov	6adf2d19e4	Remove -and notify- expired downtimes immediately, not every 60s Don't look for expired downtimes in a timer fired every 60s, but fire one timer per downtime once at expire time.	2023-04-14 14:52:05 +02:00
Alexander A. Klimov	ba7102cae3	Explicitly stop started timers and wait for them before permitting their parent objects' destruction. For the cases where the handlers have raw pointers to these objects.	2023-04-14 14:52:04 +02:00
Julian Brost	8228fae740	Merge pull request #8627 from WuerthPhoenix/bug/agent-cannot-update-executions-8616 Fix update execution message discarded. refs #8616	2023-04-13 19:29:49 +02:00
Julian Brost	f505325ff9	Merge pull request #9445 from Icinga/9365 Disallow config modifications via API during reload	2023-04-13 17:11:58 +02:00
Mattia Codato	c5c17928a6	Allow to exec command on endpoint where the checkable is not present but checkable has command_endpoint specified	2023-04-13 14:44:07 +02:00
Alexander A. Klimov	2ee776b5ab	Disallow config modifications via API during reload Once the new main process has read the config, it misses subsequent modifications from the old process otherwise.	2023-04-12 14:45:40 +02:00
Alexander A. Klimov	64e000df56	Introduce ConfigObjects*Lock	2023-04-12 13:36:48 +02:00
Julian Brost	50018c1d2b	Merge pull request #8218 from efuss/redundancy_group Introduce redundancy groups for Dependency Objects	2023-04-05 18:49:58 +02:00
Yonas Habteab	24d95e1178	PluginUtility: Fix PerfData don't get parsed correctly The problem was that some PerfData labels contained several whitespace characters, not just one, and therefore it was parsed incorrectly in `SplitPerfdata()`. I.e. the condition in line 144 checks whether the first and last character is a normal quote, but since the label can contain spaces at the beginning and at the end respectively, this caused the problems. This PR fixes the problem by removing all occurring whitespace from the beginning and end, before starting to parse the actual label.	2023-04-05 15:37:54 +02:00
Alexander A. Klimov	a66ace7245	Introduce SharedMemory	2023-04-04 13:40:27 +02:00
Alexander A. Klimov	c41e5fd05d	Support multiple redundant Timer#Start() calls so that only the first one changes l_AliveTimers (as in Timer#Stop()).	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	298f3b1973	Timer: actually support non-periodic timers	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	3933502739	Timer#Start(): don't unnecessarily unlock/lock l_TimerMutex via new Timer#InternalRescheduleUnlocked()	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	13b9cfda41	Timer::TimerThreadProc(): don't unnecessarily unlock and lock l_TimerMutex	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	1badbab002	Timer::TimerThreadProc(): keep a Timer alive while it's running to prevent the case: Timer callback destroys parent object -> destroys Timer -> ~Timer() -> Stop(true) -> waits for the Timer callback to finish -> deadlock.	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	9b00c1c4dd	Timer: drop unnecessary base class	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	24681b30f6	Make Timer::Ptr a std::shared_ptr	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	9ee4d08722	Make Timer#Timer() private to enforce Timer::Create() usage	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	21b68455ce	Use Timer::Create() instead of new Timer() git ls-files -z \|xargs -0 perl -pi -e 's/\bnew Timer\b/Timer::Create/g' ex. in Timer::Create() itself.	2023-04-04 10:35:20 +02:00
Alexander A. Klimov	bb1f574b69	Introduce factory method Timer::Create()	2023-04-04 10:35:10 +02:00
Alexander A. Klimov	35248b1b63	Code style	2023-04-03 13:39:08 +02:00
Alexander A. Klimov	cc872dac1f	Remove CheckResultReader which has been deprecated for 5 major versions	2023-04-03 11:39:21 +02:00
Julian Brost	7a7902cea7	Merge pull request #9715 from Icinga/StatusDataWriter Remove StatusDataWriter which has been deprecated for 5 major versions	2023-03-31 12:32:43 +02:00
Julian Brost	e87e1ea73f	Freeze globals namespace during config load This allows for a faster config load due to less locking required. The change is slightly backwards-incompatible. Before, you could manipulate the globals namespace at a later stage, but disallowing this feels reasonable for the performance benefit alone (which especially shows on many-core machines). Apart from that, it's doubtful if doing so is even useful at all as the DSL provides no mechanism for you to synchronize your operations that may run in parallel. The data structures itself are protected from race conditions, but anything implemented on top of this may still be subject to race conditions. And even if some user has a good reason for doing this, there's a feasible workaround by creating your own namespace like globals.mutable and using that instead.	2023-03-30 18:07:51 +02:00
Alexander A. Klimov	335688909b	Document why Timer::TimerThreadProc() can use Timer members during Timer#~Timer() call	2023-03-29 18:04:19 +02:00
Alexander A. Klimov	78b4dc6509	Remove unused Stream#Peek()	2023-03-24 18:18:13 +01:00
Alexander A. Klimov	4c154f93dc	ApiListener#NewClientHandlerInternal(): on basic_socket#cancel() (due to timeout) don't ssl::stream#async_shutdown() If a connection hangs for too long in ApiListener#NewClientHandler(), ApiListener#AddConnection()'s Timeout calls boost::asio::basic_socket#cancel() on that connection to trigger an exception which unwinds ApiListener#NewClientHandler(). Previously that unwind could trigger a Defer which called boost::asio::ssl::stream#async_shutdown() which extended the hang.	2023-03-21 10:57:40 +01:00
Julian Brost	66b039df9c	Merge pull request #9497 from Icinga/9249 Application::Exit(): don't exit(), but _exit(), even in debug build mode	2023-03-10 16:04:54 +01:00
Alexander A. Klimov	6414fd19f5	Checkable#ProcessCheckResult(): only clean up ack comments older than check result Normally if for some reason an ack comment still exists on a checkable not acked anymore, still clean it up. But while replaying log config objects incl. ack comments come before check results and acks. I.e. 1) ack comment, 2) DOWN check result and 3) ack. Not 1) DOWN check result, 2) ack and 3) ack comment. So the checkable is temporarily not acked, but already has the ack comment. In this case the DOWN check result which is older than the ack comment shall not clean up the latter.	2023-03-03 15:48:34 +01:00
Alexander A. Klimov	4662d4477b	Checkable#RemoveAckComments(): add optional comment entry time filter	2023-03-03 15:48:11 +01:00
Alexander A. Klimov	dceb29c742	Checkable#RemoveCommentsByType(): remove redundant parameter	2023-03-03 11:53:02 +01:00
Mattia Codato	912fdb9700	Fix update execution message discarded refs Icinga#8616	2023-03-02 17:50:39 +01:00
Alexander Aleksandrovič Klimov	55930c8042	ProcessSpawnImpl(): remove redundant _exit(128); Now this if doesn’t _exit(128) by itself, but "return" to the outer if which immediately _exit(128)s.	2023-03-02 12:45:15 +01:00
Alexander A. Klimov	bbf2e80002	Remove StatusDataWriter which has been deprecated for 5 major versions	2023-03-01 17:16:28 +01:00
Julian Brost	cf517050bc	Merge pull request #9711 from Icinga/connect-cancel Connect(): don't try next DNS record if operation is canceled	2023-03-01 15:49:53 +01:00
Alexander A. Klimov	79f1e0666a	Connect(): don't try next DNS record if operation is canceled Instead return immediately to meet the caller's expectations.	2023-02-28 10:57:54 +01:00
Edgar Fuß	20d7e1b5e6	Fix use of std::unordered_map::insert() as pointed out by Nathaniel Wesley Filardo in GitHup Pull Request #8999	2023-02-21 16:23:40 +01:00
Edgar Fuß	5bba609e60	Add missing #include	2023-02-21 16:23:40 +01:00
Edgar Fuß	cfef9fdadc	Introduce redundancy groups for Dependency Objects Traditional behaviour was to regard all dependecies as cumulative (e.g., the parent considered unreachable if any one dependency is violated), commit `ed58922389` made all dependencies regarded redundant (e.g., the parent considered unreachable only if all dependency are violated). This may lead to unrelated services (or even hosts vs. services) inadvertantly regarded to be redundant to each other. Most importantly, applying the explicit "disable-host-service-checks" dependency described in the "Monitoring Basics" chapter will defeat all other dependencies. This commit introduces a new "redundancy_group" attribute for dependencies. Specifying a redundancy_group causes a dependency to be regarded as redundant only inside that redundancy group. Dependencies lacking a redundancy_group attribute are regarded as essential for the parent. This allows for both cumulative and redundant dependencies and even a combination (cumulation of redundancies, like SSH depeding on both LDAP and DNS to function, while operating redundant LDAP servers as well as redundant DNS resolvers). This commit lacks changes to the tests.	2023-02-21 16:23:36 +01:00
Julian Brost	bda8be343b	Merge pull request #9662 from Icinga/Repair#9627 Repair DSL Namespace values being constant broken in #9627	2023-02-20 16:35:36 +01:00
Julian Brost	d9767cff3f	Merge pull request #9675 from Icinga/third-party/nlohmann_json Update third-party/nlohmann_json to v3.9.1	2023-02-20 15:31:32 +01:00
Julian Brost	a84a0a3cee	Merge pull request #8302 from Icinga/bugfix/windows-systemroot-aliases-6259 Macros: support $env.ENV_VAR_NAME$	2023-02-20 13:09:15 +01:00
Alexander A. Klimov	f2974c07cf	Centralise default icinga.* and env.* macros	2023-02-17 15:33:36 +01:00
Julian Brost	3023009804	Merge pull request #9653 from Icinga/9631 Setup all signal handlers with SA_RESTART flag	2023-02-14 17:55:09 +01:00
Alexander A. Klimov	34d0b942b9	Update third-party/nlohmann_json to v3.9.1 the latest version w/o Apache 2.0 licensed code which conflicts with GPL 2.	2023-02-14 16:19:44 +01:00
Alexander Aleksandrovič Klimov	fd5350d588	Fix typo	2023-02-13 13:00:28 +01:00
Julian Brost	e074e892ce	Merge pull request #9658 from Icinga/unfreeze Dictionary#*(): remove bool overrideFrozen if unused	2023-02-10 19:42:00 +01:00
Julian Brost	213f3f9444	Merge pull request #8389 from Icinga/feature/forbid-dep-cycles Forbid dependency cycles	2023-02-10 17:26:04 +01:00
Alexander A. Klimov	b2b49caf61	Macros: support $env.ENV_VAR_NAME$ refs #6259	2023-02-10 17:21:29 +01:00
Alexander A. Klimov	f3f2c943c7	ScriptGlobal::Set(): don't explicitly give Namespace#Set() its default values	2023-02-10 15:55:10 +01:00
Alexander A. Klimov	e61b380808	Call Namespace#Set(), not #SetFieldByName() Namespace#SetFieldByName() calls #Set() anyway.	2023-02-10 15:53:30 +01:00
Alexander A. Klimov	683095a165	Make globals.Internal values non-const by default That namespace is internal anyway. Previous commit, icinga2 console: Error: Constants must not be removed. This commit fixes it.	2023-02-10 15:47:25 +01:00
Alexander A. Klimov	02df94a46a	Repair DSL Namespace values being constant broken in #9627 master before #9627 (`a0286e9c6`): <1> => namespace n { x = 42; x = 42 } ^^^^^^ Constant must not be modified. <2> => HEAD of #9627 (`24b57f0d3`): <1> => namespace n { x = 42; x = 42 } null <2> =>	2023-02-10 15:43:01 +01:00
Julian Brost	0dd35bb960	Merge pull request #9657 from Icinga/shared_mutex-Dictionary Use a shared_mutex for read `Dictionary` operations	2023-02-10 15:15:52 +01:00
Alexander A. Klimov	e9846f1827	ScriptGlobal::Set(): remove unused bool overrideFrozen	2023-02-10 11:33:46 +01:00
Alexander A. Klimov	cd78da13d3	Dictionary#Clear(): remove unused bool overrideFrozen	2023-02-10 11:33:46 +01:00
Alexander A. Klimov	270c6392d4	Dictionary#Remove(): remove unused bool overrideFrozen	2023-02-10 11:33:46 +01:00
Alexander A. Klimov	ca547d0292	Use a shared_mutex for read `Dictionary` operations This allows multiple parallel read operations resulting in a overall speedup on systems with many cores.	2023-02-10 11:31:51 +01:00
Alexander A. Klimov	a309b4a415	ResolverSpec: add option not to resolve "$name$" but only "$host.name$".	2023-02-06 16:39:17 +01:00
Alexander A. Klimov	5b63407d15	Forbid dependency cycles	2023-02-06 12:33:48 +01:00
Alexander A. Klimov	91901eafd8	Introduce EnvResolver refs #6259	2023-02-06 11:25:25 +01:00
Alexander A. Klimov	a9341eb4a0	Setup all signal handlers with SA_RESTART flag so interrupted syscalls get auto-restarted and callers don't get or have to handle the EINTR error.	2023-02-03 14:46:45 +01:00
Julian Brost	c51037725a	Merge pull request #9466 from Icinga/flush-temp-files Deduplicate and stabilize fragile filesystem transactions	2023-02-02 16:29:11 +01:00
Julian Brost	3eb85797ce	Merge pull request #9622 from Icinga/9563 Main process: ignore SIGHUP	2023-02-02 11:36:13 +01:00
Julian Brost	a0239e44f7	Merge pull request #9598 from Icinga/9596 CheckerComponent#CheckThreadProc(): also propagate next check update …	2023-02-01 20:09:06 +01:00
Alexander Aleksandrovič Klimov	4e021e0105	Merge pull request #9648 from Icinga/frozen-namespace-config-validation Fix config sync after freezing namespaces	2023-02-01 17:07:57 +01:00
Alexander A. Klimov	e9b8c67975	CheckerComponent#CheckThreadProc(): also propagate next check update to Icinga DB if caused by dependency or check period. Now as long as any of the above causes check skips next check and next update will be up-to-date in Icinga DB, so the checkable won't slide into false positive overdue.	2023-02-01 16:25:56 +01:00
Julian Brost	2b43354080	Merge pull request #8744 from Icinga/bugfix/unnecessary-chown-8743 NodeUtility::WriteNodeConfigObjects(): avoid unneccessary Utility::SetFileOwnership()	2023-02-01 14:27:46 +01:00
Julian Brost	fd1aa73d25	Fix config sync after freezing namespaces This was accidentally broken by #9627 because during config sync, a config validation happens that uses `--define System.ZonesStageVarDir=...` which fails on the now frozen namespace. This commit changes this to use `Internal.ZonesStageVarDir` instead. After all, this is used for internal functionality, users should not directly interact with this flag. Additionally, it no longer freezes the `Internal` namespace which actually allows using `Internal.ZonesStageVarDir` in the first place. This also fixes `--define Internal.Debug*` which was also broken by said PR. Freezing of the `Internal` namespace is not necessary for performance reasons as it's not searched implicitly (for example when accessing `globals.x`) and should users actually interact with it, they should know by that name that they are on their own.	2023-02-01 12:29:47 +01:00
Alexander A. Klimov	c953ba1206	Remove redundant ThreadPool#m_Threads	2023-01-27 16:34:11 +01:00
Alexander A. Klimov	288ad68649	ThreadPool#ThreadPool(): remove unused parameter	2023-01-27 16:32:29 +01:00
Alexander A. Klimov	fd93feaec7	Include Utility::SetFileOwnership() inside FS transactions to make them even more atomic.	2023-01-27 12:03:59 +01:00
Alexander A. Klimov	d22fdf2a7a	Introduce AtomicFile#GetTempFilename()	2023-01-27 12:03:59 +01:00
Alexander A. Klimov	0367c9e099	Remove unused Utility::CreateTempFile()	2023-01-27 12:03:59 +01:00
Alexander A. Klimov	b92fe23469	Deduplicate and stabilize fragile filesystem transactions by using AtomicFile so they ensure all or nothing of a file gets replaced.	2023-01-27 12:03:56 +01:00
Alexander A. Klimov	a3e205b990	Introduce AtomicFile::Write()	2023-01-27 11:36:09 +01:00
Julian Brost	2d860a0f5e	Merge pull request #8118 from Icinga/feature/speed-object-registry-8112 Speed up config object lookup	2023-01-26 19:03:40 +01:00
Alexander Aleksandrovič Klimov	421ac1735c	Merge pull request #9608 from Icinga/move-types-namespace Move Types namespace into type.cpp and simplify Type::GetByName()	2023-01-26 18:32:41 +01:00
Julian Brost	ad8868cab7	Merge pull request #9599 from Icinga/influx-ns Influx DB: don't unneccessarily truncate timestamps to whole seconds	2023-01-26 17:44:50 +01:00
Alexander A. Klimov	b2fc49569c	Make ConfigType#m_Mutex a std::shared_timed_mutex refs #8112	2023-01-26 15:04:02 +01:00
Alexander A. Klimov	21759f015d	ConfigType: store config objects in a hash map refs #8112	2023-01-26 15:03:54 +01:00
Julian Brost	3dab46623b	Move Types namespace into type.cpp and simplify Type::GetByName() This commit moves the initialization of the globals.Types namespace to type.cpp in order to keep a pointer to the Namespace object in Type::m_Namespace and simplify Type::GetByName() using it. The dynamic type check is moved into an assertion after freezing the namespace.	2023-01-26 14:26:41 +01:00
Yonas Habteab	5a67ddea76	Don't post-increment stl iterators	2023-01-26 09:10:49 +01:00
Yonas Habteab	8bb0b857d8	ApiListener: Fix memory leak & group `a \|\| b && c` correctly	2023-01-26 09:10:49 +01:00
Yonas Habteab	95cec9cba2	Don't mark a method as `virtual` in a `final` class	2023-01-26 09:10:38 +01:00
Yonas Habteab	7b91b200f5	Use simplified if conditions where applicable	2023-01-26 09:06:20 +01:00
Yonas Habteab	38313434d2	Avoid calling `GetDeferredInitializers()` repeatedly	2023-01-26 09:05:19 +01:00
Alexander Aleksandrovič Klimov	bb99106926	Merge pull request #7863 from Icinga/bugfix/disallow-receiving-ticket-salt-via-api Disallow fetching the ticket salt via REST API	2023-01-25 16:39:30 +01:00
Julian Brost	5fea15e090	Merge pull request #7958 from Icinga/bugfix/api-500-404-7956 /v1/actions/*: return 404 if no objects found	2023-01-24 15:08:17 +01:00
Michael Friedrich	4d57de2a1a	Hide TicketSalt in /v1/variables	2023-01-20 12:38:18 +01:00
Julian Brost	24b57f0d3a	Namespace: don't acquire shared locks on frozen namespaces This makes freezing a namespace an irrevocable operation but in return allows omitting further lock operations. This results in a performance improvement as reading an atomic bool is faster than acquiring and releasing a shared lock. ObjectLocks on namespaces remain untouched as these mostly affect write operations which there should be none of after freezing (if there are some, they will throw exceptions anyways).	2023-01-19 17:56:44 +01:00
Julian Brost	cc0e2ec181	Use a shared_mutex for read `Namespace` operations This allows multiple parallel read operations resulting in a overall speedup on systems with many cores.	2023-01-19 17:55:29 +01:00
Julian Brost	1c066fc02e	Simplify NamespaceValue class hierarchy to one struct without member functions This commit removes EmbeddedNamespaceValue and ConstEmbeddedNamespaceValue and reduces NamespaceValue down to a simple struct without inheritance or member functions. The code from these clases is inlined into the Namespace class. The class hierarchy determining whether a value is const is moved to an attribute of NamespaceValue. This is done in preparation for changes to the locking in the Namespace class. Currently, it relies on a recursive mutex. In the future, a shared mutex (read/write lock) should be used instead, which cannot allow recursive locking (without failing or risk deadlocking on lock upgrades). With this change, all operations requiring a lock for one operation are within one function, no recursive locking is not needed any more.	2023-01-19 17:55:11 +01:00
Julian Brost	0503ca1379	Initialize namespaces without using `overrideFrozen` This commit adds a new initialization priority `FreezeNamespaces` that is run last and moves all calls to `Namespace::Freeze()` there. This allows all other initialization functions to still update namespaces without the use of the `overrideFrozen` flag. It also moves the initialization of `System.Platform` and `System.Build` to an initialize function so that these can also be set without setting `overrideFrozen`. This is preparation for a following commit that will make the frozen flag in namespaces finial, no longer allowing it to be overriden (freezing the namespace will disable locking, so performing further updates would be unsafe).	2023-01-19 09:53:36 +01:00
Julian Brost	6229f4d9bf	InitializePriority: don't explicitly specify values Now that all values are in one place, there is no reason for this numbering with gaps anymore. If you need to insert a new value in between, you can just do so in the enum. This reverses the sort order of the enum, thereby requiring a change to the sort order of the std::priority_queue containing the elements.	2023-01-18 15:57:32 +01:00
Julian Brost	99bb687350	INITIALIZE_ONCE_WITH_PRIORITY: use enum for priority values Change the type of the priority values from int to a new enum. By replacing the magic int values throughout the code base with named values, there is now a single place where all priority values are defined and you get an overview over the initialization order.	2023-01-18 15:57:27 +01:00
Julian Brost	61285adcae	InitializeOnceHelper: use std::function instead of C function pointer InitializeOnceHelper calls Loader::AddDeferredInitializer which takes a std::function, so it's eventually converted to that anyways. This commit just does this a bit earlier, and by saving the step of the intermediate C function pointer, this would now also work for capturing lambdas (which there are none of at the moment).	2023-01-18 15:52:42 +01:00
Julian Brost	c019f8c04a	Merge pull request #9603 from Icinga/remove-namespace-behavior Namespace: replace behavior classes with a bool	2023-01-18 15:48:34 +01:00
Julian Brost	a259650bea	Merge pull request #8595 from Icinga/bugfix/cluster-zone-own-zone-8570 cluster-zone: consider own zone connected if there's only one endpoint	2023-01-17 17:26:14 +01:00
Alexander A. Klimov	21f548d3c0	Remove no-op InfluxDB URL param precision=ns is the default.	2023-01-16 12:03:08 +01:00
Julian Brost	9590c176e3	Merge pull request #9491 from Icinga/9488 Fix compile error on Solaris 11.4	2023-01-12 14:22:52 +01:00
Julian Brost	0294c174a4	Merge pull request #9594 from Icinga/8834 ConfigObjectUtility::GetObjectConfigPath(): just return paths of existing objects	2023-01-09 13:49:58 +01:00
Alexander A. Klimov	e1bb085b0f	ConfigObjectUtility::DeleteObjectHelper(): only delete _api files to restore the behavior before the previous commit. Otherwise we'd delete all API object's child objects' files including applied child object rules in /etc.	2023-01-05 18:05:31 +01:00
Julian Brost	dd51997c73	Merge pull request #9624 from Icinga/9618 Make compilable on Boost v1.81	2023-01-05 15:32:22 +01:00
Alexander A. Klimov	99c2d69dc8	Handle boost::beast::http::basic_fields#operator[]() signature change (v1.81) Use always working std::string(x), not broken x.to_string(). (x is a return value.)	2023-01-05 11:18:20 +01:00
Alexander A. Klimov	5bcbc96e22	Handle boost::beast::http::basic_fields#set() signature change (v1.81) Make String convertible to boost::beast::string_view (always working), not boost::string_view (broken).	2023-01-05 11:18:20 +01:00
Alexander A. Klimov	d059885d9b	Main process: ignore SIGHUP On OpenBSD rcctl reload icinga2 SIGHUPs all "icinga2" processes, not just our umbrella. We must handle this.	2023-01-03 18:29:31 +01:00
Julian Brost	fbb68dbcd0	Namespace: replace behavior classes with a bool In essence, namespace behaviors acted as hooks for update operations on namespaces. Two behaviors were implemented: - `NamespaceBehavior`: allows the update operation unless it acts on a value that itself was explicitly marked as constant. - `ConstNamespaceBehavior`: initially allows insert operations but marks the individual values as const. Additionally provides a `Freeze()` member function. After this was called, updates are rejected unless a special `overrideFrozen` flag is set explicitly. This marvel of object-oriented programming can be replaced with a simple bool. This commit basically replaces `Namespace::m_Behavior` with `Namespace::m_ConstValues` and inlines the behavior functions where they were called. While doing so, the code was slightly simplified by assuming that `m_ConstValues` is true if `m_Frozen` is true. This is similar to what the API allowed in the old code as you could only freeze a `ConstNamespaceBehavior`. However, this PR moves the `Freeze()` member function and the related `m_Freeze` member variable to the `Namespace` class. So now the API allows any namespace to be frozen. The new code also makes sense with the previously mentioned simplification: a `Namespace` with `m_ConstValues = false` can be modified without restrictions until `Freeze()` is called. When this is done, it becomes read-only. The changes outside of `namespace.*` just adapt the code to the slightly changed API.	2022-12-09 09:25:46 +01:00

... 6 7 8 9 10 ...

6874 commits