icinga2

mirror of https://github.com/Icinga/icinga2.git synced 2026-06-09 08:42:59 -04:00

Author	SHA1	Message	Date
Yonas Habteab	10775f4481	Merge pull request #10207 from Icinga/log-connected-endpoint-connection-attempts ApiListener: Log connection attempts from an already connected client prominently	2024-10-30 13:31:44 +01:00
Yonas Habteab	9d4625e1ec	ApiListener: Log connection attempts from an already connected client Something is definitely going wrong if a client tries to reconnect to this endpoint while it still has an active connection to that client. So we shouldn't hide this, but at least log it at info level. Apart from that, I've added some additional information about the currently active client, such as when the last message was sent and received.	2024-10-30 11:26:21 +01:00
Alexander Aleksandrovič Klimov	4ca68e444e	Merge pull request #10204 from Icinga/an-HA doc/: fix "a HA" -> "an HA"	2024-10-24 11:30:24 +02:00
Alexander Aleksandrovič Klimov	fb8badfd2e	Merge pull request #10187 from Icinga/state-before-suppression Fix lost recovery notifications after recovery outside of notification time period	2024-10-24 10:07:59 +02:00
Alexander Aleksandrovič Klimov	7df6baf146	Merge pull request #10176 from Icinga/ICINGA2_UNITY_BUILD=OFF-ICINGA2_WITH_LIVESTATUS=ON Fix build on Mac with -DICINGA2_UNITY_BUILD=OFF -DICINGA2_WITH_LIVESTATUS=ON	2024-10-24 10:03:57 +02:00
Alexander A. Klimov	095e5982f4	doc/: fix "a HA" -> "an HA"	2024-10-24 09:44:36 +02:00
Alvar Penning	98f60fd78e	Icinga DB: Support Redis username authentication The Redis ACL system was introduced with Redis 6.0. It introduced users with precisely granular permissions. This change allows Icinga 2 to use the Icinga DB feature against a Redis with an ACL user. This was reflected in the documentation, next to the already implemented, but undocumented Redis database. Closes #9536.	2024-10-24 09:18:19 +02:00
Alvar Penning	57fab7f39e	Icinga DB: Config no_user_modify Each configuration field of an IcingaDB Object was marked with no_user_modify as modifications via the API would not result in an actual change. While the Object would be updated, the internal Redis connection would not be restarted, resulting in an unexpected behavior. The missing db_index was added to the documentation.	2024-10-24 09:18:09 +02:00
Alexander A. Klimov	7a4ba59961	Remove redundant "Validation failed" prefix from ValidationError exceptions ValidationError#ValidationError() already prefixes #m_What, which #what() returns, with "Validation failed for object".	2024-10-23 13:06:12 +02:00
Julian Brost	869a7d6f0f	Security: fix TLS certificate validation bypass The previous validation in set_verify_callback() could be bypassed, tricking Icinga 2 into treating invalid certificates as valid. To fix this, the validation checks were moved into the IsVerifyOK() function. This is tracked as CVE-2024-49369, more details will be published at a later time.	2024-10-22 10:36:58 +02:00
Yonas Habteab	f4e61ef9bd	Merge pull request #10177 from Icinga/log-noop-fix Log: fix some parts of messages not being discarded early	2024-10-21 09:31:19 +02:00
Julian Brost	7d0a43f926	Use `Checkable::GetStateBeforeSuppression()` only where relevant This fixes an issue where recovery notifications get lost if they happen outside of a notification time period. Not all calls to `Checkable::NotificationReasonApplies()` need `GetStateBeforeSuppression()` to be checked. In fact, for one caller, `FireSuppressedNotifications()` in `lib/notification/notificationcomponent.cpp`, the state before suppression may not even be initialized properly, so that the default value of OK is used which can lead to incorrect return values. Note the difference between suppressions happening on the level of the `Checkable` object level and the `Notification` object level. Only the first sets the state before suppression in the `Checkable` object, but so far, also the latter used that value incorrectly. This commit moves the check of `GetStateBeforeSuppression()` from `Checkable::NotificationReasonApplies()` to the one place where it's actually relevant: `Checkable::FireSuppressedNotifications()`. This made the existing call to `NotificationReasonApplies()` unneccessary as it would always return true: the `type` argument is computed based on the current check result, so there's no need to check it against the current check result.	2024-10-11 13:21:10 +02:00
Alexander A. Klimov	c6f9de5933	Ido*sqlConnection#FieldToEscapedString(): don't write out of range time MySQL's FROM_UNIXTIME() NULLs ts <1970, errors for >2038. Postgres' TO_TIMESTAMP() errors for all ts not between 4713BC - 294276AD.	2024-10-02 11:52:25 +02:00
Julian Brost	5e9e0bbcdf	Merge pull request #10059 from Icinga/IcingaDB-TimestampToMilliseconds-limit IcingaDB::TimestampToMilliseconds(): limit output to four year digits	2024-10-02 09:19:03 +02:00
Alexander A. Klimov	ad6fcda6df	Ido*sqlConnection#FieldToEscapedString(): don't overflow timestamps > long	2024-10-01 17:38:52 +02:00
Alexander A. Klimov	dc4869c3aa	IcingaDB::TimestampToMilliseconds(): limit output to four year digits Too high timestamps may overflow uint64_t (and the YYYY format) and negative ones don't fit into uint64_t. Those may crash our Go daemon.	2024-09-30 16:54:40 +02:00
Julian Brost	f0e084d530	Log: fix some parts of messages not being discarded early `m_IsNoOp` was introduced to avoid building up log messages that will later be discarded, like debug messages if no debug logging is configured. However, it looks like the template operator<< implemented in the header file was forgotten when adding this feature, all other places writing into `m_Buffer` already have an if guard like added by this commit.	2024-09-27 14:23:05 +02:00
Alexander A. Klimov	2bbeaec916	Fix build on Mac with -DICINGA2_UNITY_BUILD=OFF -DICINGA2_WITH_LIVESTATUS=ON error: no matching function for call to 'intrusive_ptr_release' ... candidate function not viable: cannot convert argument of incomplete type 'icinga::Notification ' to 'Object ' for 1st argument void intrusive_ptr_release(Object *object);	2024-09-27 12:41:11 +02:00
Julian Brost	b6b1506bda	Merge pull request #10140 from Icinga/drop-cpu-bound-work-usage-from-ifwapi Don't use thread-local var in coroutine & drop superfluous `CpuBoundWork` usage	2024-09-27 11:31:58 +02:00
Yonas Habteab	92df9ef8c3	Merge pull request #10148 from Icinga/enhanced-sort-types-by-load-dependencies Sort config types by their load dependencies once	2024-09-26 15:27:41 +02:00
Sebastian Grund	8c68c6e9d8	Add closing quotationmarks in Validator for influxdb writer config	2024-09-25 13:03:00 +02:00
Yonas Habteab	467e8b18e7	Type: Simplify sort by load dependencies algorithm	2024-09-20 16:18:12 +02:00
Alexander A. Klimov	31f3acaa13	ConfigItem::CommitNewItems(): pre-sort types by their load dependencies once to avoid complicated nested loops, iterating over the same types and checking dependencies over and over, skipping already completed ones.	2024-09-20 16:18:12 +02:00
Alexander A. Klimov	b848934d57	Introduce Type::GetConfigTypesSortedByLoadDependencies()	2024-09-20 16:18:12 +02:00
Yonas Habteab	26f43b0b48	IcingaDB: Don't sync partially initialised objects	2024-09-11 14:08:27 +02:00
Yonas Habteab	74009f0fcb	Don't use thread-local variable in coroutine & process final `cr` in global thread pool	2024-09-05 17:36:03 +02:00
Yonas Habteab	c9159494c0	HttpServerConnection: Drop yet another superfluous `CpuBoundWork` usage	2024-09-05 15:10:14 +02:00
Alexander Aleksandrovič Klimov	79e3cb2a95	Utility::ReleaseHelper(): remove detection of EOL distros We only support /etc/os-release owners.	2024-09-04 10:26:50 +02:00
Alexander Aleksandrovič Klimov	0951230ce1	Merge pull request #9991 from Icinga/JsonRpcConnection-9985 JsonRpcConnection#Send*(): discard messages ASAP once shutting down	2024-09-03 15:13:30 +02:00
Julian Brost	4c6b93d617	Merge pull request #10011 from Icinga/next-check-cluster-sync-issue Checkable: Don't recalculate `next_check` for remotely generated `cr`	2024-08-30 13:37:41 +02:00
Yonas Habteab	9f84c1516e	ApiListener: Reorder logging in `ApiTimerHandler()`	2024-08-28 16:53:53 +02:00
Yonas Habteab	e062ceb901	ApiListener: Catch & supress clients runtime errors	2024-08-28 16:53:53 +02:00
Julian Brost	88e79ea41a	Merge pull request #10111 from Icinga/unregister-invalid-objects-properly Unregister invalid config objects properly	2024-08-27 14:30:38 +02:00
Yonas Habteab	932a53449d	JsonRpcConnection: Raise an exception when trying to send to disconnected clients	2024-08-27 14:23:41 +02:00
Julian Brost	9222a63ff7	Make sure log file is reopened when `ApiListener::ReplayLog()` returns	2024-08-27 14:23:41 +02:00
Yonas Habteab	a5a83e311a	Defer: Allow empty initialization & add `SetFunc()` method	2024-08-27 14:23:41 +02:00
Yonas Habteab	73db30c08b	Use `Defer` class for cleanup in `ApiListener::ReplayLog()`	2024-08-27 14:23:41 +02:00
Alexander A. Klimov	f074e24d2a	ApiListener#ReplayLog(): stop reading files ASAP on send error	2024-08-27 14:23:41 +02:00
Alexander A. Klimov	b538ad2528	JsonRpcConnection#Send*(): discard messages ASAP once shutting down Especially ApiListener#ReplayLog() enqueued lots of messages into JsonRpcConnection#{m_IoStrand,m_OutgoingMessagesQueue} (RAM) even if the connection was shut(ting) down. Now #Disconnect() takes effect ASAP.	2024-08-27 14:23:41 +02:00
Alexander A. Klimov	33f8ea6dcc	JsonRpcConnection#Disconnect(): spawn coroutine only if necessary by checking the now atomic #m_ShuttingDown outside of it.	2024-08-27 14:23:41 +02:00
Alexander A. Klimov	f96e7c67ee	On Windows, don't create C:\Program Files\Icinga2\var during MSI build	2024-08-23 12:49:09 +02:00
Julian Brost	39ae2e8ca4	Utility::FormatDateTime(): provide an overload for tm* This allows the function to be used both with a double timestamp or a pointer to a tm struct. With this, a similar implementation inside the tests can simply use our regular function.	2024-08-23 12:48:50 +02:00
Julian Brost	d5b3ffaa6d	Utility::FormatDateTime(): handle invalid format strings on Windows On Windows, the strftime() function family invokes an invalid parameter handler when the format string is invalid (see the "Remarks" section in their documentation). std::put_time() shows the same behavior as it uses _wcsftime_l() internally. The default invalid parameter handler may terminate the process, which can be a problem given that the format string can be specified by the user from the Icinga DSL. Thus, temporarily set a thread-local no-op handler to disable the default one allowing the program to continue. This then simply results in the function returning an error which then results in an exception as we ask the stream to throw one. See also: https://learn.microsoft.com/en-us/cpp/c-runtime-library/reference/strftime-wcsftime-strftime-l-wcsftime-l?view=msvc-170 https://learn.microsoft.com/en-us/cpp/c-runtime-library/parameter-validation?view=msvc-170 https://learn.microsoft.com/en-us/cpp/c-runtime-library/reference/set-invalid-parameter-handler-set-thread-local-invalid-parameter-handler?view=msvc-170	2024-08-23 12:48:50 +02:00
Julian Brost	0285028689	Utility::FormatDateTime(): handle errors from strftime() So far, the return value of strftime() was simply ignored and the output buffer passed to the icinga::String constructor. However, there are error conditions where strftime() returns 0 to signal an error, like if the buffer was too small for the output. In that case, there's no guarantee on the buffer contents and reading it can result in undefined behavior. Unfortunately, returning 0 can also indicate success and strftime() doesn't set errno, so there's no reliable way to distinguish both situations. Thus, the implementation now returns the empty string in both cases. I attempted to use std::put_time() at first as that allows for better error handling, however, there were problems with the implementation on Windows (see inline comment), so I put that plan on hold at left strftime() there for the time being.	2024-08-23 12:42:54 +02:00
Julian Brost	c2c66908f6	Utility::FormatDateTime(): use localtime_s() on Windows localtime() is not thread-safe as it returns a pointer to a shared tm struct. Everywhere except on Windows, localtime_r() is used already which avoids the problem by using a struct allocated by the caller for the output. Windows actually has a similar function called localtime_s() which has the same properties, just with a different name and order of arguments.	2024-08-23 12:42:32 +02:00
Julian Brost	704acdc698	Utility::FormatDateTime(): use boost::numeric_cast<>() The previous implementation actually had undefined behavior when called with a double that can't be represented as time_t. With boost::numeric_cast, there's a convenient cast available that avoids this and throws an exceptions on overflow. It's undefined behavior ([0], where the implicit conversion rule comes into play because the C-style cast uses static_cast [1] which in turn uses the imlicit conversion as per rule 5 of [2]): > A prvalue of floating-point type can be converted to a prvalue of any integer > type. The fractional part is truncated, that is, the fractional part is > discarded. > > * If the truncated value cannot fit into the destination type, the behavior > is undefined (even when the destination type is unsigned, modulo arithmetic > does not apply). Note that on Linux amd64, the undefined behavior typically manifests itself in the result being the minimal value of time_t which then results in localtime_r failing with EOVERFLOW. [0]: https://en.cppreference.com/w/cpp/language/implicit_conversion#Floating.E2.80.93integral_conversions [1]: https://en.cppreference.com/w/cpp/language/explicit_cast [2]: https://en.cppreference.com/w/cpp/language/static_cast	2024-08-23 12:42:30 +02:00
Julian Brost	4c83d793a6	Merge pull request #9983 from Icinga/broken-timeperiod Fix broken `TimePeriod/ScheduledDowntime`s	2024-08-20 10:05:59 +02:00
Yonas Habteab	ca7cc54438	Checkable: Don't recalculate `next_check` while processing remotely genrated check Currently, when processing a `CheckResult`, it will first trigger an `OnNextCheckChanged` event, which is sent to all connected endpoints. Then, when `Checkable::ProcessCheckResult()` returns, an `OnCheckResult` event is fired, which is of course also sent to all connected endpoints. Next, the other endpoints receive the `event::SetNextCheck` cluster event followed by `event::CheckResult`and invoke `checkable#SetNextCheck()` and `Checkable#CheckResult()` with the newly received check. So they also try to recalculate the next check themselves and invalidate the previously received next check timestamp from the source endpoint. Since each endpoint randomly initialises its own scheduling offset, the recalculated next check will always differ by a split second/millisecond on each of them. As a consequence, two Icinga DB HA instances will generate two different checksums for the same state and causes the state histories to be fully resynchronised after a takeover/Icinga 2 reload.	2024-08-16 16:15:56 +02:00
Alexander Aleksandrovič Klimov	02ba5e4101	Merge pull request #10015 from Icinga/malloc_info /v1/debug/malloc_info: call malloc_info(3) if available	2024-08-12 14:41:09 +02:00
Alexander A. Klimov	f3c7ac11e9	/v1/debug/malloc_info: call malloc_info(3) if available The GNU libc function malloc_info(3) provides memory allocation and usage statistics of Icinga 2 itself.	2024-08-09 12:59:25 +02:00
Julian Brost	2bfa1f1649	Merge pull request #10107 from Icinga/timeperiod-nth-day-of-month-off-by-one Timeperiods: fix off by one when calculating n-th last weekday of the month	2024-08-08 14:40:18 +02:00
Julian Brost	c45829b59f	Timeperiods: fix off by one when calculating n-th last weekday of the month A day specification like "monday -1" refers to the last Monday of the month. However, there was an off by one if the first day of the next month is the same day of the week, i.e. a Monday in this example. LegacyTimePeriod::FindNthWeekday() picks a day to start the search for the day in question. When given a negative n to search for the n-th last day, it wrongly used the first day of the following month as the start and counted it as if it was within the current month. This resulted in a 1/7 chance that the result was one week too late. This is fixed by using the last day of the current month instead.	2024-08-07 12:06:05 +02:00
Yonas Habteab	c4edecc1fb	Unregister invalid config objects properly	2024-08-06 16:59:30 +02:00
Julian Brost	07d253009a	Merge pull request #10013 from Icinga/broken-runtime-config-sync Fix broken runtime config sync	2024-08-06 11:57:24 +02:00
Yonas Habteab	86347013a6	Check segemnt start date inclusively in `TimePeriod::IsInside()`	2024-08-01 16:16:48 +02:00
Yonas Habteab	4daa03dc02	Fix broken timeperiods/scheduleddowntimes	2024-08-01 15:14:34 +02:00
Yonas Habteab	546dea95a2	Don't allow to modify/create/delete an object concurrently	2024-06-13 11:26:19 +02:00
Yonas Habteab	099f664ce6	`ConfigObjectUtility#CreateObject()`: Use `Defer` for config path cleanup	2024-06-13 11:26:19 +02:00
Yonas Habteab	433e2de13a	ApiListener: Process cluster config updates sequentially	2024-06-13 11:26:19 +02:00
Yonas Habteab	1a55b68541	Introduce RAII style `ObjectNameLock` class	2024-06-13 11:26:19 +02:00
Yonas Habteab	2218ebd6b0	`ConfigObjectUtility`: Use `AtomicFile` to store object config files	2024-06-13 11:26:19 +02:00
Alexander Aleksandrovič Klimov	f1be9b73ab	Merge pull request #10060 from Icinga/IcingaDB-SerializeState-execution_time-latency IcingaDB#SerializeState(): limit execution_time and latency to 2^32-1	2024-06-13 09:55:45 +02:00
Yonas Habteab	81a94a0759	Don't fail to remove obsolete downtimes	2024-05-23 10:09:41 +02:00
Yonas Habteab	4eeccce36c	Don't loose args in recursive `Downtime::RemoveDowntime()` call	2024-05-23 10:09:41 +02:00
Yonas Habteab	e0fd0d3df4	Introduce & use enum `DowntimeRemovalReason`	2024-05-23 09:34:15 +02:00
Alexander Aleksandrovič Klimov	cc3965c3ce	Merge pull request #10065 from Icinga/heavy-update-missing-table-relations Update `object#config_hash` after all relations queries	2024-05-22 15:38:31 +02:00
Yonas Habteab	1019398d55	Update object#config_hash after all relations queries	2024-05-22 13:39:30 +02:00
Yonas Habteab	3d64240ee3	Merge pull request #10066 from Icinga/Checkable-RemoveAllDowntimes Remove unused Checkable#RemoveAllDowntimes()	2024-05-21 17:13:16 +02:00
Alexander A. Klimov	e2bdb8a2f1	Remove unused Checkable#RemoveAllDowntimes()	2024-05-21 14:28:39 +02:00
Alexander A. Klimov	f9adf18111	IcingaDB#SerializeState(): limit execution_time and latency to 2^32-1 not to write higher values into Redis than the Icinga DB schema can hold. This fixes yet another potential Go daemon crash.	2024-05-15 12:55:41 +02:00
Alexander Aleksandrovič Klimov	8c2eb3c1ed	Merge pull request #10049 from Icinga/AddDowntime-trigger_name Downtime::AddDowntime(): NULL-check pointer before deref not to crash	2024-05-06 10:26:26 +02:00
Alexander Aleksandrovič Klimov	d8f8d64f1a	Merge pull request #10027 from macdems/master Fix missing values in PerfData normalization	2024-04-25 19:38:21 +02:00
Maciej Dems	2bb5cc62e2	Fix missing values in PerfData normalization	2024-04-25 17:41:12 +02:00
Alexander A. Klimov	5f80ac17aa	l_LegacyDowntimesCache: delete removed objects not to leak memory	2024-04-25 12:13:52 +02:00
Alexander A. Klimov	c0f87dd4c9	/v1/actions/schedule-downtime: reject request on invalid trigger_name For this purpose lookup the specified Downtime. Also pass Downtime objects, not just names, to Downtime::AddDowntime() not to lookup it twice.	2024-04-25 12:13:52 +02:00
Alexander A. Klimov	f0b5239a15	[Refactor] Downtime::GetDowntimeIDFromLegacyID(): return the Downtime itself not just its name.	2024-04-25 12:13:52 +02:00
Alexander A. Klimov	28b0f7a48c	[Refactor] l_LegacyDowntimesCache: store Downtime objects, not just their names to avoid names of vanished objects.	2024-04-24 12:33:56 +02:00
Alexander A. Klimov	bb13e98ca5	PluginCheckTask::ProcessFinishedHandler(): warn about exit codes outside 0..3 in the plugin output as well, in addition to the warning log.	2024-04-23 17:45:31 +02:00
Alexander A. Klimov	e33befabfb	Make ProcessResult#ExitStatus and CheckResult#exit_status 64-bit ints so that they can hold Windows exit codes like 3221225477 (>2147483647).	2024-04-23 17:45:31 +02:00
Alexander A. Klimov	5c17465a19	OpenTsdbWriter#CheckResultHandler(): skip custom tags with empty values refs #7724	2024-04-18 11:36:21 +02:00
Lorenz Kästle	7afda4dc0d	Add cli option to disable the default global zones When setting up Icinga 2 agents, in most cases, the default global zones are not needed, but have to be removed manually or automatically whith tools outside of Icinga 2 from the configuration. This seems like unnecessary work, since the node setup command does everything else. This commit introduces a new option for the node setup command ("--no-default-global-zones") to exclude the default global zones.	2024-04-03 08:10:45 +02:00
Yannick Martin	5e92450877	icinga2: address comment loading where host reference is not found address #9752: check if host reference is valid	2024-03-11 12:42:23 +01:00
Julian Brost	31be43ff6c	Merge pull request #10018 from Icinga/revert-9980-config-sync-conflicts Revert "Process `config::update/delete` cluster events gracefully"	2024-03-08 16:58:28 +01:00
Julian Brost	af97431bfb	Merge pull request #10006 from Icinga/http-error-handling HttpServerConnection: use exceptions for error handling	2024-03-08 15:06:51 +01:00
Yonas Habteab	a924a49cd8	Revert "Process `config::update/delete` cluster events gracefully"	2024-03-07 17:17:17 +01:00
Julian Brost	097ba00a9c	Merge pull request #10008 from Icinga/Al2Klimov-patch-12 Don't unnecessarily shuffle items before config validation	2024-03-07 16:44:38 +01:00
Alexander Aleksandrovič Klimov	629038344b	OpenTsdbWriter#CheckResultHandler(): clarify log messages Clarify which "host or service" an "Unable to resolve macro" debug log message refers to.	2024-02-22 10:34:35 +01:00
Julian Brost	abea2f270c	Merge pull request #9997 from Icinga/ListenerCoroutineProc-remote_endpoint ApiListener#ListenerCoroutineProc(): get remote endpoint ASAP for logging	2024-02-20 13:46:02 +01:00
Alexander Aleksandrovič Klimov	51cdd593da	Don't unnecessarily shuffle items before config validation Before `ae693cb7e1` (#9577) we've repeatedly looped over all items in parallel like this: while not types.done: for t in types: if not t.done and t.dependencies.done: with parallel(all_items, CONCURRENCY) as some_items: for i in some_items: if i.type is t: i.commit() I.e. all items got distributed over CONCURRENCY threads, but not always equally. E.g. it was the hosts' turn, but only two threads got hosts and did all the work. The others didn't do actual work (due to the lack of hosts in their queue) which reduced the performance. `c721c302cd` (#6581) fixed it by shuffling all_items first. `ae693cb7e1` (#9577) made the latter unnecessary by replacing the above algorithm with this: while not types.done: for t in types: if not t.done and t.dependencies.done: with parallel(all_items[t], CONCURRENCY) as some_items: for i in some_items: if i.type is t: i.commit() I.e. parallel() gets only items of type t, so all threads get e.g. hosts.	2024-02-19 14:26:06 +01:00
Julian Brost	700c5a13d7	HttpServerConnection: use exceptions for error handling When a HTTP connection dies prematurely while the response is sent, `http::async_write()` sets the error code to something like broken pipe for example. When calling `async_flush()` afterwards, it sometimes happens that this never returns. This results in a resource leak as the coroutine isn't cleaned up. This commit makes the individual functions throw exceptions instead of silently ignoring the errors, resulting in the function terminating early and also resulting in an error being logged as well.	2024-02-19 14:12:41 +01:00
Julian Brost	04ef105caa	Merge pull request #9980 from Icinga/config-sync-conflicts Process `config::update/delete` cluster events gracefully	2024-02-19 13:49:41 +01:00
Julian Brost	7d1c887a32	Merge pull request #9999 from Icinga/reset-log-message-count-correctly ApiListener: Reset `m_LogMessageCount` when rotating	2024-02-15 17:06:16 +01:00
Alexander Aleksandrovič Klimov	9db1c4aca3	Merge pull request #8011 from Icinga/bugfix/reset-sigpipe-6912 Reset all signal handlers of child processes	2024-02-15 12:22:36 +01:00
Yonas Habteab	456144c1dc	ApiListener: Process cluster config updates sequentially	2024-02-14 14:25:53 +01:00
Yonas Habteab	40011b0584	Introduce `ObjectNamesMutex` helper class	2024-02-14 14:25:53 +01:00
Alexander Aleksandrovič Klimov	1a8ce5a90e	Merge pull request #9575 from Icinga/WorkQueue-ParallelFor WorkQueue#ParallelFor(): allocate lambda once per thread, not once per item	2024-02-14 12:59:50 +01:00
Julian Brost	2be08aa2e0	Merge pull request #9992 from Icinga/remove-redundat-cpu-bound-work Drop redundant `CpuBoundWork` usage in `JsonRpcConnection::Disconnect()`	2024-02-13 15:51:34 +01:00
Julian Brost	fc6a106345	Merge pull request #9994 from Icinga/redundant-cpu-bound-work-usages Drop redundant `CpuBoundWork` usages in `lib/remote`	2024-02-13 14:53:59 +01:00
Alexander Aleksandrovič Klimov	48eb563ca0	Merge pull request #9736 from Icinga/stream-read-allow_partial Stream#Read(): remove de facto unused param allow_partial	2024-02-13 13:04:15 +01:00
Yonas Habteab	008fcd1744	Preserve runtime objects in a tmp file for the entire validation process Given that the internal `config::Update` cluster events are using this as well to create received runtime objects, we don't want to persist first the conf file and the load and validate it with `CompileFile`. Otherwise, we are forced to remove the newly created file whenever we can't validate, commit or activate it. This also would also have the downside that two cluster events for the same object arriving at the same moment from two different endpoints would result in two different threads simultaneously creating and loading the same config file - whereby only one of the surpasses the validation, while the other is facing an object `re-definition` error and tries to remove that config file it mistakenly thinks it has created. As a consequence, an object successfully created by the former is implicitly deleted by the latter thread, causing the objects to mysteriously disappear.	2024-02-12 15:18:32 +01:00
Yonas Habteab	6e66cd9aff	ApiListener: Reset `m_LogMessageCount` when rotating Closing and re-opening that very same log file shouldn't reset the counter, otherwise some log files may exceed the max limit per file as their offset indicator is reset each time they are re-opened.	2024-02-09 18:04:20 +01:00
Yonas Habteab	eb813cfb99	HttpServerConnection: Drop superfluous `CpuBoundWork` usage	2024-02-09 15:17:26 +01:00
Alexander A. Klimov	62e1d7650d	ApiListener#ListenerCoroutineProc(): get remote endpoint ASAP for logging On incoming connection timeout we log the remote endpoint which isn't available if it was already disconnected - an exception is thrown. Get it as long as we're still connected not to lose it, nor to get an exception.	2024-02-09 12:27:25 +01:00
Yonas Habteab	32531fe909	EventsHandler: Drop superfluous `CpuBoundWork` usage	2024-02-09 12:00:50 +01:00
Eric Lippmann	c7293de91d	IoEngine: Always log coroutine exception diagnostics While analyzing a possible memory leak, we encountered several coroutine exception messages, which unfortunately do not provide any information about what exactly went wrong, as exception diagnostics were previously only logged at the notice level.	2024-02-08 12:09:06 +01:00
Yonas Habteab	72266434df	Drop redundant `CpuBoundWork` usages in `lib/remote`	2024-02-08 11:30:23 +01:00
Yonas Habteab	e2793f1d88	Drop redundant `CpuBoundWork` usage in `JsonRpcConnection::Disconnect()` Although there is locking involved here, it shoudln't take too long for the thread to actually acquire it, since there aren't that many threads dealing with endpoint clients concurrently. It's just wasting pointless time trying to obtain a CPU slot.	2024-02-08 11:24:55 +01:00
Alexander Aleksandrovič Klimov	e9fcbf400f	Merge pull request #9966 from Icinga/Al2Klimov-patch-3 HttpServerConnection: remove duplicate ")" from a log message	2024-01-18 10:46:51 +01:00
Alexander A. Klimov	d48b369554	Reset all signal handlers of child processes ... not to disturb check plugins. refs #6912	2024-01-17 12:25:59 +01:00
Alexander Aleksandrovič Klimov	966b46e808	Merge pull request #9965 from Icinga/http-request-time HttpServerConnection: log request processing time as well	2024-01-17 11:30:33 +01:00
Julian Brost	b1fe15f694	Merge pull request #9962 from Icinga/influx-disk-9948 Influx DB: truncate timestamps to whole seconds to save disk space	2024-01-17 08:50:16 +01:00
Alexander A. Klimov	b6874cc8d4	HttpServerConnection: log request processing time as well	2024-01-16 17:52:07 +01:00
Alexander Aleksandrovič Klimov	6a4cb5c12c	HttpServerConnection: remove duplicate ")" from a log message The commit `5c32a5a7dc`, which introduced it, clearly shows that the other ")" already existed legitimately.	2024-01-16 16:31:00 +01:00
Alexander A. Klimov	cc9db3756f	Revert "Influx DB: don't unneccessarily truncate timestamps to whole seconds" This reverts commit `eaa3cd83ad`.	2024-01-16 12:19:48 +01:00
Alexander A. Klimov	fc5b1178c6	Revert "Remove no-op InfluxDB URL param" This reverts commit `21f548d3c0`.	2024-01-16 12:19:47 +01:00
Alexander Aleksandrovič Klimov	28b2db8446	Merge pull request #9851 from Icinga/Al2Klimov-patch-3 Make ObjectImpl<Logger>#GetSeverity() non-virtual	2023-12-22 12:44:51 +01:00
Alexander Aleksandrovič Klimov	6c03598678	Merge pull request #9896 from Icinga/provide-cancel_time-where-has_been_cancelled-may-be-1 Disallow triggering a cancelled downtime, but provide cancel_time in Icinga DB downtime history where has_been_cancelled may be 1	2023-12-20 10:03:09 +01:00
Alexander Aleksandrovič Klimov	949d983a76	Merge pull request #9895 from Icinga/targeted-api-filter FilterUtility::GetFilterTargets(): don't run filter for specific object(s) for all objects	2023-12-19 15:18:41 +01:00
Alexander Aleksandrovič Klimov	8b2e28a869	Merge pull request #9891 from Icinga/renew-the-ca-9890 ApiListener#Start(): auto-renew CA on its owner	2023-12-19 14:57:47 +01:00
Alexander Aleksandrovič Klimov	96cfc4abe8	Merge pull request #9887 from Icinga/argument-list-too-long-9340 PluginNotificationTask::ScriptFunc(): on Linux truncate output and comment	2023-12-19 14:36:57 +01:00
Alexander A. Klimov	175153ce6a	PluginNotificationTask::ScriptFunc(): on Linux truncate output and comment not to run into an exec(3) error E2BIG due to a too long argument. This sends a notification with truncated output instead of not sending.	2023-12-19 12:21:03 +01:00
Alexander A. Klimov	966216f4ba	RequestCertificateHandler(): also renew if CA needs a renewal and a newer one is available.	2023-12-18 15:28:11 +01:00
Alexander A. Klimov	551c3afa60	CertificateToString(): allow raw pointer input	2023-12-18 15:28:11 +01:00
Alexander A. Klimov	bc778116e9	ApiListener#Start(): auto-renew CA on its owner otherwise it would expire.	2023-12-18 15:28:11 +01:00
Alexander A. Klimov	36a08b0497	ApiListener#RenewCert(): enable optional CA creation	2023-12-18 15:28:11 +01:00
Alexander A. Klimov	7b55df6f11	CreateCertIcingaCA(EVP_PKEY, X509_NAME): enable optional CA creation	2023-12-18 15:28:11 +01:00
Alexander Aleksandrovič Klimov	953eeba061	Merge pull request #9893 from Icinga/do-not-re-notify-if-filtered-states-don-t-change-4503 Discard likely duplicate problem notifications via Notification#last_notified_state_per_user	2023-12-13 16:13:28 +01:00
Alexander A. Klimov	ecfc9033b0	FilterUtility::GetFilterTargets(): don't run filter for specific object(s) for all objects	2023-12-13 16:02:50 +01:00
Alexander A. Klimov	15191bcd74	ApplyRule::GetTarget*s(): support constant strings from variables in addition to literal strings. This is for sandboxed filters with some variables pre-set by the caller. They're "constant" in that scope, too.	2023-12-13 16:02:50 +01:00
Alexander A. Klimov	a04cef1890	Introduce DictExpression#GetExpressions()	2023-12-13 16:02:50 +01:00
Alexander A. Klimov	8bcae97ecc	Introduce Dictionary#GetRef()	2023-12-13 16:02:50 +01:00
Alexander A. Klimov	97cd05db7a	Notification#BeginExecuteNotification(): on recovery clear last_notified_state_per_user	2023-12-13 13:21:22 +01:00
Alexander A. Klimov	44e9c6f40d	Notification#BeginExecuteNotification(): discard likely duplicate problem notifications	2023-12-13 13:21:19 +01:00
Alexander A. Klimov	74f52c6fcd	Introduce IsCaUptodate() by splitting IsCertUptodate()	2023-12-13 12:08:34 +01:00
Julian Brost	871fa67b52	Merge pull request #9885 from Icinga/renegotiation	2023-12-12 17:38:09 +01:00
Alexander A. Klimov	2cff763295	Cluster-sync Notification#last_notified_state_per_user	2023-12-12 15:29:50 +01:00
Alexander A. Klimov	b25ba7a316	Notification#BeginExecuteNotification(): track state change notifications	2023-12-07 12:43:30 +01:00
Julian Brost	d2a7117007	Merge pull request #9899 from Icinga/icinga2-crashes-silently-9897 IcingaDB#SendConfigDelete(): fix missing nullptr check before deref	2023-11-21 11:03:28 +01:00
Alexander Aleksandrovič Klimov	7fc7d054af	Merge pull request #9841 from WuerthPhoenix/fix-9840-lock-console-api-during-reload	2023-11-21 10:36:26 +01:00
Alexander A. Klimov	7174dc864d	IcingaDB#SendConfigDelete(): fix missing nullptr check before deref	2023-11-10 17:43:33 +01:00
Alexander A. Klimov	9aaa9901bd	Icinga DB downtime history: provide cancel_time where has_been_cancelled may be 1 The table sla_history_downtime requires a downtime_end. The Go daemon takes the cancel_time if has_been_cancelled is 1. So we must supply a cancel_time whereever has_been_cancelled is 1. Otherwise the Go daemon can't process some entries.	2023-11-08 15:22:39 +01:00
Alexander A. Klimov	7ce9457a4a	Disable TLS renegotiation The API doesn't need it and a customer's security scanner is afraid of a potential DoS attack vector.	2023-11-06 18:46:37 +01:00
Theo Buehler	1f06589f7a	Remove dead code in GetSignatureAlgorithm() This code was added in commit `548eb93` and never did anything useful. Using X509_get_signature_nid() or its expanded version in the pre-1.1 branch is the correct way of retrieving the signature algorithm of a certificate. CLA: trivial	2023-10-20 18:55:44 +02:00
Julian Brost	bba6a76f4a	Merge pull request #9853 from Icinga/GelfWriter-m_StreamMutex GelfWriter: protect m_Stream via m_WorkQueue, not ObjectLock(this)	2023-09-07 11:46:38 +02:00
Alexander Aleksandrovič Klimov	e5d988a2fe	Merge pull request #7799 from Icinga/bugfix/file-end Fix file endings	2023-08-25 11:06:19 +02:00
Alexander A. Klimov	4ee10a6c20	GelfWriter: protect m_Stream via m_WorkQueue, not ObjectLock(this) On shutdown or HA re-connect ConfigObject#SetAuthority(false) is called which does ObjectLock(this) and ConfigObject#Pause(). GelfWriter#Pause(), with the above ObjectLock, calls m_WorkQueue.Join(). But items inside that also doing ObjectLock(this) cause a deadlock.	2023-08-24 17:48:09 +02:00
Alexander Aleksandrovič Klimov	993c9b742d	Make ObjectImpl<Logger>#GetSeverity() non-virtual After all it's not overridden.	2023-08-15 13:03:31 +02:00
Mattia Codato	41e21cb8cf	Prevent calls to command API while the configuration is reloading. Fixes #9840	2023-08-09 08:45:04 +02:00
Alexander A. Klimov	1308ad62af	Stream#Read(): remove de facto unused param allow_partial The only caller passes true, so no one forbids partial reads (even implicitly). All usages in the implementation just assert it being true (allowed).	2023-07-13 16:55:48 +02:00
Alexander Aleksandrovič Klimov	1af5109ad3	Merge pull request #9734 from Icinga/remove-unused-stream-peek- Remove unused Stream#Peek()	2023-07-13 16:52:29 +02:00
Alexander A. Klimov	8f8a6ee2a0	Application::m_LastReloadFailed: if double isn't always lock free, use uint32_t which will overflow in 2106, not 2038. This fixes a compile failure on 32-bit Raspbian.	2023-07-10 10:51:02 +02:00
Alexander Aleksandrovič Klimov	000a776dfb	Built-in check command: ifw-api (#9062 )	2023-07-06 14:18:21 +02:00
Julian Brost	26a75f8a6f	Merge pull request #9812 from Icinga/support-elasticsearch-8-0-9251 ElasticsearchWriter: switch to v7+ URL schema to support v8	2023-07-05 10:15:10 +02:00
Julian Brost	fe13b96226	Merge pull request #9809 from Icinga/reevaluate-and-update-default-tls-cipher-list-9808 Copy and paste global default TLS cipher set from ssl-config.mozilla.org	2023-07-03 19:13:10 +02:00
Alexander A. Klimov	617dda61fb	Re-order global default TLS cipher list to prefer AES256 over AES128	2023-07-03 15:36:11 +02:00
Alexander A. Klimov	4c2e59a690	ElasticsearchWriter: switch to v7+ URL schema to support v8 and OpenSearch 2. This breaks the EOL v5 and v6.	2023-07-03 14:43:45 +02:00
Julian Brost	70d6b6e424	Merge pull request #9810 from Icinga/Al2Klimov-patch-8 ElasticsearchWriter#Pause(): call Flush() only once	2023-06-30 17:21:16 +02:00
Alexander Aleksandrovič Klimov	076eb59443	ElasticsearchWriter#Pause(): lock m_DataBufferMutex during Flush() just to be sure regarding race conditions.	2023-06-30 14:57:18 +02:00
Julian Brost	a2e05f89e8	Enable built-in OpenSSL DH parameters to allow DHE TLS ciphers Non-ECC DHE ciphers in the `cipher_list` attribute of `ApiListener` (the default value includes these) had no effect as no DH parameters were available and therefore the server wouldn't offer these ciphers. OpenSSL provides built-in DH parameters starting from version 1.1.0, however, these have to be enables explicitly using the `SSL_CTX_set_dh_auto()` function. This commit does so and thereby makes it possible to establish a connection to an Icinga 2 server using a DHE cipher.	2023-06-29 12:06:26 +02:00
Alexander Aleksandrovič Klimov	d5e6ecec8a	ElasticsearchWriter#Pause(): call Flush() only once The first Flush() is redundant and may access m_DataBuffer at the same time as some Flush() in m_WorkQueue (race condition) which isn't joined, yet.	2023-06-29 10:42:12 +02:00
Alexander A. Klimov	2e053b0e06	Copy and paste global default TLS cipher set from ssl-config.mozilla.org which got more secure by now, but still overlaps with v2.13.x' set.	2023-06-28 14:49:08 +02:00
Julian Brost	a2926b8604	Merge pull request #9794 from Icinga/round-notification-times-begin-end-not-to-crash-go-daemon IcingaDB::PrepareObject(): round Notification#times.{begin,end} not to crash Go daemon	2023-06-27 17:08:41 +02:00
Alexander A. Klimov	dccb678882	IcingaDB::PrepareObject(): cut off (null) negative Notification#times.{begin,end} not to crash Go daemon At least our PostgreSQL schema enforces positive values.	2023-06-27 12:58:08 +02:00
Alexander A. Klimov	415b810abf	IcingaDB::PrepareObject(): round Notification#times.{begin,end} not to crash Go daemon The latter expects ints, not floats - not to mention strings. Luckily Icinga already enforces numeric strings so that we can cast it to number.	2023-06-27 12:53:08 +02:00
Julian Brost	9cf519316e	Merge pull request #9805 from Icinga/checkcommand-timeout-0-crashes-icinga-db-daemon-9804 IcingaDB::PrepareObject(): cut off (0) negative Command#timeout for Redis	2023-06-27 10:45:02 +02:00
Julian Brost	c08d3beeb1	Merge pull request #9785 from Icinga/Al2Klimov-patch-8 Icinga DB: also write ConfigObject#original_attributes into Redis	2023-06-27 10:24:41 +02:00
Julian Brost	bd11bc2eb4	Merge pull request #9793 from Icinga/unmarshal-number-42-5-into-go-struct-field-notification-notification_interval IcingaDB::PrepareObject(): round Notification#interval and limit it to >=0	2023-06-27 10:12:13 +02:00
Alexander A. Klimov	d641a3c799	IcingaDB::PrepareObject(): cut off (0) negative Command#timeout for Redis not to crash the Go daemon which expects positive values there.	2023-06-26 15:36:47 +02:00
Julian Brost	5350aa3c72	Merge pull request #9792 from Icinga/icingadb-conversion-of-strings-to-number-types-to-avoid-crashes-9791 IcingaDB::PrepareObject(): convert non-null Checkable#check_timeout to number	2023-06-26 15:03:21 +02:00
Alexander A. Klimov	273aa6f997	IcingaDB::PrepareObject(): round Notification#interval and limit it to >=0 otherwise, e.g. with -42.5, the Go daemon crashes. It expects uints there.	2023-06-19 12:46:40 +02:00
Alexander A. Klimov	9f08bad395	IcingaDB::PrepareObject(): convert non-null Checkable#check_timeout to number and, in case of null, fall back to Checkable#check_command.timeout, just like IcingaDB#SerializeState(). Otherwise the Go daemon crashes. It expects a number.	2023-06-15 12:29:42 +02:00
Alexander A. Klimov	1587431945	POST /v1/objects: allow array of attrs to undo modifications of	2023-06-13 16:40:33 +02:00
Alexander A. Klimov	385fe2fd76	Icinga DB: also write ConfigObject#original_attributes into Redis for the case the Go daemon decides to sync them into DB.	2023-06-12 12:53:25 +02:00
Julian Brost	7c381ae12f	Merge pull request #9779 from Icinga/macroprocessor-resolvemacro-quasi-cv-object-icingaapplication MacroProcessor::ResolveMacro(): treat quasi-CV-object IcingaApplication as real CV-object	2023-05-31 20:41:31 +02:00
Alexander A. Klimov	a9c80ffb2e	MacroProcessor::ResolveMacro(): treat quasi-CV-object IcingaApplication as real CV-object As MacroProcessor checked just for CustomVarObject base class, but IcingaApplication provided the vars attribute by itself, it had to also resolve CV macros by itself. That logic diverged from MacroProcessor so that macros inside IcingaApplication CVs weren't resolved. Until now.	2023-05-31 16:35:09 +02:00
Julian Brost	8a42c3bf18	Merge pull request #9775 from Icinga/icingadb-service-crashes-on-negative-downtime-duration-or-end-before-start-9774 Icinga DB: don't write negative Downtime durations into Redis	2023-05-31 11:37:42 +02:00
Alexander A. Klimov	75eaa81c06	Icinga DB: don't write negative Downtime durations into Redis via `std::max(0, x)` not to crash the Go daemon which can't handle such.	2023-05-30 17:56:03 +02:00
Julian Brost	b0899d9ab4	Merge pull request #8429 from Icinga/bugfix/last-reload-attempt-failed-8428 Share "Last reload attempt failed" time across Icinga process tree on *nix	2023-05-30 11:42:21 +02:00
Julian Brost	d871c5c837	Merge pull request #9772 from Icinga/icinga-db-feature-should-normalize-command-arguments-required-skip_key-repeat_key-to-boolean-9576 Icinga DB feature: normalize Command.arguments[].{required,skip_key…	2023-05-25 11:54:01 +02:00
Alexander A. Klimov	ad618e9716	Icinga DB feature: normalize Command.arguments[].{required,skip_key,repeat_key} to boolean At the moment, the Icinga DB feature will use that value as-is and serialize it to JSON, resulting in a crash in Icinga DB down the road because it expects a boolean.	2023-05-24 16:04:14 +02:00
Julian Brost	2470e930eb	Merge pull request #9643 from Icinga/hardware_concurrency Always use Configuration#Concurrency, not `std:🧵:hardware_concurrency()`	2023-05-23 19:23:14 +02:00
Alexander A. Klimov	3fae41ef22	Restart thread pool after freezing Configuration The user (-D) or we could have changed Configuration.Concurrency, so correct the thread pool's thread amount.	2023-05-23 14:41:35 +02:00
Julian Brost	0e25644151	Merge pull request #8969 from Icinga/bugfix/perfdata-dont-get-parsed-correctly-8912 PluginUtility: Fix PerfData parsing for values separated with multiple spaces	2023-05-22 17:16:31 +02:00
Alexander A. Klimov	9376a311ea	Fix file endings git ls-files -z \ \|grep -zEe '^lib/' \ \|grep -zEe '\.[ch]pp$' \ \|xargs -0 perl -p0i -e 's/\n*(?!(?:.\|\n))/\n/'	2023-05-17 18:05:13 +02:00
Alexander A. Klimov	32eb1680f7	Configuration.Concurrency: default to 1 until Configuration freeze not to start many threads before the user could override their amount (-D).	2023-05-11 16:59:47 +02:00
Alexander A. Klimov	8fb5d53118	Track Configuration.Concurrency modifications	2023-05-11 15:41:35 +02:00
Alexander A. Klimov	5c330e9d4f	Share "Last reload attempt failed" time across Icinga process tree on *nix ... as only the umbrella process knows that time, but the icinga check running in the main process also needs to know it. refs #8428	2023-05-08 14:42:21 +02:00
Julian Brost	eca8890d49	Merge pull request #9718 from Icinga/acknowledgement-sync-between-masters-are-not-working-9652 Checkable#ProcessCheckResult(): only clean up ack comments older than check result	2023-05-05 15:29:38 +02:00
Julian Brost	af9d67b262	Merge pull request #9726 from Icinga/43624b Remove -and notify- expired downtimes immediately, not every 60s II	2023-05-02 11:25:03 +02:00
Alexander A. Klimov	58b788cd51	Downtime#Start(): trigger flexible downtimes not earlier than fixed ones the last state change could be a long time ago. If it's longer than the new downtime's duration, the downtime expires immediately. trigger time + duration < now	2023-04-18 16:55:32 +02:00
Julian Brost	8238ec0d96	Merge pull request #9725 from Icinga/operation_aborted-shutDownIfNeeded.Cancel ApiListener#NewClientHandlerInternal(): on basic_socket#cancel() (due to timeout) don't ssl::stream#async_shutdown()	2023-04-17 12:21:21 +02:00
Alexander A. Klimov	0ac1cd1ecb	Rename Downtime::DowntimesExpireTimerHandler() to actually reflect its purpose.	2023-04-14 14:52:05 +02:00
Alexander A. Klimov	6adf2d19e4	Remove -and notify- expired downtimes immediately, not every 60s Don't look for expired downtimes in a timer fired every 60s, but fire one timer per downtime once at expire time.	2023-04-14 14:52:05 +02:00
Alexander A. Klimov	ba7102cae3	Explicitly stop started timers and wait for them before permitting their parent objects' destruction. For the cases where the handlers have raw pointers to these objects.	2023-04-14 14:52:04 +02:00
Julian Brost	8228fae740	Merge pull request #8627 from WuerthPhoenix/bug/agent-cannot-update-executions-8616 Fix update execution message discarded. refs #8616	2023-04-13 19:29:49 +02:00
Julian Brost	f505325ff9	Merge pull request #9445 from Icinga/9365 Disallow config modifications via API during reload	2023-04-13 17:11:58 +02:00
Mattia Codato	c5c17928a6	Allow to exec command on endpoint where the checkable is not present but checkable has command_endpoint specified	2023-04-13 14:44:07 +02:00
Alexander A. Klimov	2ee776b5ab	Disallow config modifications via API during reload Once the new main process has read the config, it misses subsequent modifications from the old process otherwise.	2023-04-12 14:45:40 +02:00
Alexander A. Klimov	64e000df56	Introduce ConfigObjects*Lock	2023-04-12 13:36:48 +02:00
Julian Brost	50018c1d2b	Merge pull request #8218 from efuss/redundancy_group Introduce redundancy groups for Dependency Objects	2023-04-05 18:49:58 +02:00
Yonas Habteab	24d95e1178	PluginUtility: Fix PerfData don't get parsed correctly The problem was that some PerfData labels contained several whitespace characters, not just one, and therefore it was parsed incorrectly in `SplitPerfdata()`. I.e. the condition in line 144 checks whether the first and last character is a normal quote, but since the label can contain spaces at the beginning and at the end respectively, this caused the problems. This PR fixes the problem by removing all occurring whitespace from the beginning and end, before starting to parse the actual label.	2023-04-05 15:37:54 +02:00
Alexander A. Klimov	a66ace7245	Introduce SharedMemory	2023-04-04 13:40:27 +02:00
Alexander A. Klimov	c41e5fd05d	Support multiple redundant Timer#Start() calls so that only the first one changes l_AliveTimers (as in Timer#Stop()).	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	298f3b1973	Timer: actually support non-periodic timers	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	3933502739	Timer#Start(): don't unnecessarily unlock/lock l_TimerMutex via new Timer#InternalRescheduleUnlocked()	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	13b9cfda41	Timer::TimerThreadProc(): don't unnecessarily unlock and lock l_TimerMutex	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	1badbab002	Timer::TimerThreadProc(): keep a Timer alive while it's running to prevent the case: Timer callback destroys parent object -> destroys Timer -> ~Timer() -> Stop(true) -> waits for the Timer callback to finish -> deadlock.	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	9b00c1c4dd	Timer: drop unnecessary base class	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	24681b30f6	Make Timer::Ptr a std::shared_ptr	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	9ee4d08722	Make Timer#Timer() private to enforce Timer::Create() usage	2023-04-04 10:35:22 +02:00
Alexander A. Klimov	21b68455ce	Use Timer::Create() instead of new Timer() git ls-files -z \|xargs -0 perl -pi -e 's/\bnew Timer\b/Timer::Create/g' ex. in Timer::Create() itself.	2023-04-04 10:35:20 +02:00
Alexander A. Klimov	bb1f574b69	Introduce factory method Timer::Create()	2023-04-04 10:35:10 +02:00
Alexander A. Klimov	35248b1b63	Code style	2023-04-03 13:39:08 +02:00
Alexander A. Klimov	cc872dac1f	Remove CheckResultReader which has been deprecated for 5 major versions	2023-04-03 11:39:21 +02:00
Julian Brost	7a7902cea7	Merge pull request #9715 from Icinga/StatusDataWriter Remove StatusDataWriter which has been deprecated for 5 major versions	2023-03-31 12:32:43 +02:00
Julian Brost	e87e1ea73f	Freeze globals namespace during config load This allows for a faster config load due to less locking required. The change is slightly backwards-incompatible. Before, you could manipulate the globals namespace at a later stage, but disallowing this feels reasonable for the performance benefit alone (which especially shows on many-core machines). Apart from that, it's doubtful if doing so is even useful at all as the DSL provides no mechanism for you to synchronize your operations that may run in parallel. The data structures itself are protected from race conditions, but anything implemented on top of this may still be subject to race conditions. And even if some user has a good reason for doing this, there's a feasible workaround by creating your own namespace like globals.mutable and using that instead.	2023-03-30 18:07:51 +02:00
Alexander A. Klimov	335688909b	Document why Timer::TimerThreadProc() can use Timer members during Timer#~Timer() call	2023-03-29 18:04:19 +02:00
Alexander A. Klimov	78b4dc6509	Remove unused Stream#Peek()	2023-03-24 18:18:13 +01:00
Alexander A. Klimov	4c154f93dc	ApiListener#NewClientHandlerInternal(): on basic_socket#cancel() (due to timeout) don't ssl::stream#async_shutdown() If a connection hangs for too long in ApiListener#NewClientHandler(), ApiListener#AddConnection()'s Timeout calls boost::asio::basic_socket#cancel() on that connection to trigger an exception which unwinds ApiListener#NewClientHandler(). Previously that unwind could trigger a Defer which called boost::asio::ssl::stream#async_shutdown() which extended the hang.	2023-03-21 10:57:40 +01:00
Julian Brost	66b039df9c	Merge pull request #9497 from Icinga/9249 Application::Exit(): don't exit(), but _exit(), even in debug build mode	2023-03-10 16:04:54 +01:00
Alexander A. Klimov	6414fd19f5	Checkable#ProcessCheckResult(): only clean up ack comments older than check result Normally if for some reason an ack comment still exists on a checkable not acked anymore, still clean it up. But while replaying log config objects incl. ack comments come before check results and acks. I.e. 1) ack comment, 2) DOWN check result and 3) ack. Not 1) DOWN check result, 2) ack and 3) ack comment. So the checkable is temporarily not acked, but already has the ack comment. In this case the DOWN check result which is older than the ack comment shall not clean up the latter.	2023-03-03 15:48:34 +01:00
Alexander A. Klimov	4662d4477b	Checkable#RemoveAckComments(): add optional comment entry time filter	2023-03-03 15:48:11 +01:00
Alexander A. Klimov	dceb29c742	Checkable#RemoveCommentsByType(): remove redundant parameter	2023-03-03 11:53:02 +01:00
Mattia Codato	912fdb9700	Fix update execution message discarded refs Icinga#8616	2023-03-02 17:50:39 +01:00
Alexander Aleksandrovič Klimov	55930c8042	ProcessSpawnImpl(): remove redundant _exit(128); Now this if doesn’t _exit(128) by itself, but "return" to the outer if which immediately _exit(128)s.	2023-03-02 12:45:15 +01:00
Alexander A. Klimov	bbf2e80002	Remove StatusDataWriter which has been deprecated for 5 major versions	2023-03-01 17:16:28 +01:00
Julian Brost	cf517050bc	Merge pull request #9711 from Icinga/connect-cancel Connect(): don't try next DNS record if operation is canceled	2023-03-01 15:49:53 +01:00
Alexander A. Klimov	79f1e0666a	Connect(): don't try next DNS record if operation is canceled Instead return immediately to meet the caller's expectations.	2023-02-28 10:57:54 +01:00
Edgar Fuß	20d7e1b5e6	Fix use of std::unordered_map::insert() as pointed out by Nathaniel Wesley Filardo in GitHup Pull Request #8999	2023-02-21 16:23:40 +01:00
Edgar Fuß	5bba609e60	Add missing #include	2023-02-21 16:23:40 +01:00
Edgar Fuß	cfef9fdadc	Introduce redundancy groups for Dependency Objects Traditional behaviour was to regard all dependecies as cumulative (e.g., the parent considered unreachable if any one dependency is violated), commit `ed58922389` made all dependencies regarded redundant (e.g., the parent considered unreachable only if all dependency are violated). This may lead to unrelated services (or even hosts vs. services) inadvertantly regarded to be redundant to each other. Most importantly, applying the explicit "disable-host-service-checks" dependency described in the "Monitoring Basics" chapter will defeat all other dependencies. This commit introduces a new "redundancy_group" attribute for dependencies. Specifying a redundancy_group causes a dependency to be regarded as redundant only inside that redundancy group. Dependencies lacking a redundancy_group attribute are regarded as essential for the parent. This allows for both cumulative and redundant dependencies and even a combination (cumulation of redundancies, like SSH depeding on both LDAP and DNS to function, while operating redundant LDAP servers as well as redundant DNS resolvers). This commit lacks changes to the tests.	2023-02-21 16:23:36 +01:00
Julian Brost	bda8be343b	Merge pull request #9662 from Icinga/Repair#9627 Repair DSL Namespace values being constant broken in #9627	2023-02-20 16:35:36 +01:00
Julian Brost	d9767cff3f	Merge pull request #9675 from Icinga/third-party/nlohmann_json Update third-party/nlohmann_json to v3.9.1	2023-02-20 15:31:32 +01:00
Julian Brost	a84a0a3cee	Merge pull request #8302 from Icinga/bugfix/windows-systemroot-aliases-6259 Macros: support $env.ENV_VAR_NAME$	2023-02-20 13:09:15 +01:00
Alexander A. Klimov	f2974c07cf	Centralise default icinga.* and env.* macros	2023-02-17 15:33:36 +01:00
Julian Brost	3023009804	Merge pull request #9653 from Icinga/9631 Setup all signal handlers with SA_RESTART flag	2023-02-14 17:55:09 +01:00
Alexander A. Klimov	34d0b942b9	Update third-party/nlohmann_json to v3.9.1 the latest version w/o Apache 2.0 licensed code which conflicts with GPL 2.	2023-02-14 16:19:44 +01:00
Alexander Aleksandrovič Klimov	fd5350d588	Fix typo	2023-02-13 13:00:28 +01:00
Julian Brost	e074e892ce	Merge pull request #9658 from Icinga/unfreeze Dictionary#*(): remove bool overrideFrozen if unused	2023-02-10 19:42:00 +01:00
Julian Brost	213f3f9444	Merge pull request #8389 from Icinga/feature/forbid-dep-cycles Forbid dependency cycles	2023-02-10 17:26:04 +01:00
Alexander A. Klimov	b2b49caf61	Macros: support $env.ENV_VAR_NAME$ refs #6259	2023-02-10 17:21:29 +01:00
Alexander A. Klimov	f3f2c943c7	ScriptGlobal::Set(): don't explicitly give Namespace#Set() its default values	2023-02-10 15:55:10 +01:00
Alexander A. Klimov	e61b380808	Call Namespace#Set(), not #SetFieldByName() Namespace#SetFieldByName() calls #Set() anyway.	2023-02-10 15:53:30 +01:00
Alexander A. Klimov	683095a165	Make globals.Internal values non-const by default That namespace is internal anyway. Previous commit, icinga2 console: Error: Constants must not be removed. This commit fixes it.	2023-02-10 15:47:25 +01:00
Alexander A. Klimov	02df94a46a	Repair DSL Namespace values being constant broken in #9627 master before #9627 (`a0286e9c6`): <1> => namespace n { x = 42; x = 42 } ^^^^^^ Constant must not be modified. <2> => HEAD of #9627 (`24b57f0d3`): <1> => namespace n { x = 42; x = 42 } null <2> =>	2023-02-10 15:43:01 +01:00
Julian Brost	0dd35bb960	Merge pull request #9657 from Icinga/shared_mutex-Dictionary Use a shared_mutex for read `Dictionary` operations	2023-02-10 15:15:52 +01:00
Alexander A. Klimov	e9846f1827	ScriptGlobal::Set(): remove unused bool overrideFrozen	2023-02-10 11:33:46 +01:00
Alexander A. Klimov	cd78da13d3	Dictionary#Clear(): remove unused bool overrideFrozen	2023-02-10 11:33:46 +01:00
Alexander A. Klimov	270c6392d4	Dictionary#Remove(): remove unused bool overrideFrozen	2023-02-10 11:33:46 +01:00
Alexander A. Klimov	ca547d0292	Use a shared_mutex for read `Dictionary` operations This allows multiple parallel read operations resulting in a overall speedup on systems with many cores.	2023-02-10 11:31:51 +01:00
Alexander A. Klimov	a309b4a415	ResolverSpec: add option not to resolve "$name$" but only "$host.name$".	2023-02-06 16:39:17 +01:00
Alexander A. Klimov	5b63407d15	Forbid dependency cycles	2023-02-06 12:33:48 +01:00
Alexander A. Klimov	91901eafd8	Introduce EnvResolver refs #6259	2023-02-06 11:25:25 +01:00
Alexander A. Klimov	a9341eb4a0	Setup all signal handlers with SA_RESTART flag so interrupted syscalls get auto-restarted and callers don't get or have to handle the EINTR error.	2023-02-03 14:46:45 +01:00
Julian Brost	c51037725a	Merge pull request #9466 from Icinga/flush-temp-files Deduplicate and stabilize fragile filesystem transactions	2023-02-02 16:29:11 +01:00
Julian Brost	3eb85797ce	Merge pull request #9622 from Icinga/9563 Main process: ignore SIGHUP	2023-02-02 11:36:13 +01:00
Julian Brost	a0239e44f7	Merge pull request #9598 from Icinga/9596 CheckerComponent#CheckThreadProc(): also propagate next check update …	2023-02-01 20:09:06 +01:00
Alexander Aleksandrovič Klimov	4e021e0105	Merge pull request #9648 from Icinga/frozen-namespace-config-validation Fix config sync after freezing namespaces	2023-02-01 17:07:57 +01:00
Alexander A. Klimov	e9b8c67975	CheckerComponent#CheckThreadProc(): also propagate next check update to Icinga DB if caused by dependency or check period. Now as long as any of the above causes check skips next check and next update will be up-to-date in Icinga DB, so the checkable won't slide into false positive overdue.	2023-02-01 16:25:56 +01:00
Julian Brost	2b43354080	Merge pull request #8744 from Icinga/bugfix/unnecessary-chown-8743 NodeUtility::WriteNodeConfigObjects(): avoid unneccessary Utility::SetFileOwnership()	2023-02-01 14:27:46 +01:00
Julian Brost	fd1aa73d25	Fix config sync after freezing namespaces This was accidentally broken by #9627 because during config sync, a config validation happens that uses `--define System.ZonesStageVarDir=...` which fails on the now frozen namespace. This commit changes this to use `Internal.ZonesStageVarDir` instead. After all, this is used for internal functionality, users should not directly interact with this flag. Additionally, it no longer freezes the `Internal` namespace which actually allows using `Internal.ZonesStageVarDir` in the first place. This also fixes `--define Internal.Debug*` which was also broken by said PR. Freezing of the `Internal` namespace is not necessary for performance reasons as it's not searched implicitly (for example when accessing `globals.x`) and should users actually interact with it, they should know by that name that they are on their own.	2023-02-01 12:29:47 +01:00
Alexander A. Klimov	c953ba1206	Remove redundant ThreadPool#m_Threads	2023-01-27 16:34:11 +01:00
Alexander A. Klimov	288ad68649	ThreadPool#ThreadPool(): remove unused parameter	2023-01-27 16:32:29 +01:00
Alexander A. Klimov	fd93feaec7	Include Utility::SetFileOwnership() inside FS transactions to make them even more atomic.	2023-01-27 12:03:59 +01:00
Alexander A. Klimov	d22fdf2a7a	Introduce AtomicFile#GetTempFilename()	2023-01-27 12:03:59 +01:00
Alexander A. Klimov	0367c9e099	Remove unused Utility::CreateTempFile()	2023-01-27 12:03:59 +01:00
Alexander A. Klimov	b92fe23469	Deduplicate and stabilize fragile filesystem transactions by using AtomicFile so they ensure all or nothing of a file gets replaced.	2023-01-27 12:03:56 +01:00
Alexander A. Klimov	a3e205b990	Introduce AtomicFile::Write()	2023-01-27 11:36:09 +01:00
Julian Brost	2d860a0f5e	Merge pull request #8118 from Icinga/feature/speed-object-registry-8112 Speed up config object lookup	2023-01-26 19:03:40 +01:00
Alexander Aleksandrovič Klimov	421ac1735c	Merge pull request #9608 from Icinga/move-types-namespace Move Types namespace into type.cpp and simplify Type::GetByName()	2023-01-26 18:32:41 +01:00
Julian Brost	ad8868cab7	Merge pull request #9599 from Icinga/influx-ns Influx DB: don't unneccessarily truncate timestamps to whole seconds	2023-01-26 17:44:50 +01:00
Alexander A. Klimov	b2fc49569c	Make ConfigType#m_Mutex a std::shared_timed_mutex refs #8112	2023-01-26 15:04:02 +01:00
Alexander A. Klimov	21759f015d	ConfigType: store config objects in a hash map refs #8112	2023-01-26 15:03:54 +01:00
Julian Brost	3dab46623b	Move Types namespace into type.cpp and simplify Type::GetByName() This commit moves the initialization of the globals.Types namespace to type.cpp in order to keep a pointer to the Namespace object in Type::m_Namespace and simplify Type::GetByName() using it. The dynamic type check is moved into an assertion after freezing the namespace.	2023-01-26 14:26:41 +01:00
Yonas Habteab	5a67ddea76	Don't post-increment stl iterators	2023-01-26 09:10:49 +01:00
Yonas Habteab	8bb0b857d8	ApiListener: Fix memory leak & group `a \|\| b && c` correctly	2023-01-26 09:10:49 +01:00
Yonas Habteab	95cec9cba2	Don't mark a method as `virtual` in a `final` class	2023-01-26 09:10:38 +01:00
Yonas Habteab	7b91b200f5	Use simplified if conditions where applicable	2023-01-26 09:06:20 +01:00
Yonas Habteab	38313434d2	Avoid calling `GetDeferredInitializers()` repeatedly	2023-01-26 09:05:19 +01:00
Alexander Aleksandrovič Klimov	bb99106926	Merge pull request #7863 from Icinga/bugfix/disallow-receiving-ticket-salt-via-api Disallow fetching the ticket salt via REST API	2023-01-25 16:39:30 +01:00
Julian Brost	5fea15e090	Merge pull request #7958 from Icinga/bugfix/api-500-404-7956 /v1/actions/*: return 404 if no objects found	2023-01-24 15:08:17 +01:00
Michael Friedrich	4d57de2a1a	Hide TicketSalt in /v1/variables	2023-01-20 12:38:18 +01:00
Julian Brost	24b57f0d3a	Namespace: don't acquire shared locks on frozen namespaces This makes freezing a namespace an irrevocable operation but in return allows omitting further lock operations. This results in a performance improvement as reading an atomic bool is faster than acquiring and releasing a shared lock. ObjectLocks on namespaces remain untouched as these mostly affect write operations which there should be none of after freezing (if there are some, they will throw exceptions anyways).	2023-01-19 17:56:44 +01:00
Julian Brost	cc0e2ec181	Use a shared_mutex for read `Namespace` operations This allows multiple parallel read operations resulting in a overall speedup on systems with many cores.	2023-01-19 17:55:29 +01:00
Julian Brost	1c066fc02e	Simplify NamespaceValue class hierarchy to one struct without member functions This commit removes EmbeddedNamespaceValue and ConstEmbeddedNamespaceValue and reduces NamespaceValue down to a simple struct without inheritance or member functions. The code from these clases is inlined into the Namespace class. The class hierarchy determining whether a value is const is moved to an attribute of NamespaceValue. This is done in preparation for changes to the locking in the Namespace class. Currently, it relies on a recursive mutex. In the future, a shared mutex (read/write lock) should be used instead, which cannot allow recursive locking (without failing or risk deadlocking on lock upgrades). With this change, all operations requiring a lock for one operation are within one function, no recursive locking is not needed any more.	2023-01-19 17:55:11 +01:00
Julian Brost	0503ca1379	Initialize namespaces without using `overrideFrozen` This commit adds a new initialization priority `FreezeNamespaces` that is run last and moves all calls to `Namespace::Freeze()` there. This allows all other initialization functions to still update namespaces without the use of the `overrideFrozen` flag. It also moves the initialization of `System.Platform` and `System.Build` to an initialize function so that these can also be set without setting `overrideFrozen`. This is preparation for a following commit that will make the frozen flag in namespaces finial, no longer allowing it to be overriden (freezing the namespace will disable locking, so performing further updates would be unsafe).	2023-01-19 09:53:36 +01:00
Julian Brost	6229f4d9bf	InitializePriority: don't explicitly specify values Now that all values are in one place, there is no reason for this numbering with gaps anymore. If you need to insert a new value in between, you can just do so in the enum. This reverses the sort order of the enum, thereby requiring a change to the sort order of the std::priority_queue containing the elements.	2023-01-18 15:57:32 +01:00
Julian Brost	99bb687350	INITIALIZE_ONCE_WITH_PRIORITY: use enum for priority values Change the type of the priority values from int to a new enum. By replacing the magic int values throughout the code base with named values, there is now a single place where all priority values are defined and you get an overview over the initialization order.	2023-01-18 15:57:27 +01:00
Julian Brost	61285adcae	InitializeOnceHelper: use std::function instead of C function pointer InitializeOnceHelper calls Loader::AddDeferredInitializer which takes a std::function, so it's eventually converted to that anyways. This commit just does this a bit earlier, and by saving the step of the intermediate C function pointer, this would now also work for capturing lambdas (which there are none of at the moment).	2023-01-18 15:52:42 +01:00
Julian Brost	c019f8c04a	Merge pull request #9603 from Icinga/remove-namespace-behavior Namespace: replace behavior classes with a bool	2023-01-18 15:48:34 +01:00
Julian Brost	a259650bea	Merge pull request #8595 from Icinga/bugfix/cluster-zone-own-zone-8570 cluster-zone: consider own zone connected if there's only one endpoint	2023-01-17 17:26:14 +01:00
Alexander A. Klimov	21f548d3c0	Remove no-op InfluxDB URL param precision=ns is the default.	2023-01-16 12:03:08 +01:00
Julian Brost	9590c176e3	Merge pull request #9491 from Icinga/9488 Fix compile error on Solaris 11.4	2023-01-12 14:22:52 +01:00
Julian Brost	0294c174a4	Merge pull request #9594 from Icinga/8834 ConfigObjectUtility::GetObjectConfigPath(): just return paths of existing objects	2023-01-09 13:49:58 +01:00
Alexander A. Klimov	e1bb085b0f	ConfigObjectUtility::DeleteObjectHelper(): only delete _api files to restore the behavior before the previous commit. Otherwise we'd delete all API object's child objects' files including applied child object rules in /etc.	2023-01-05 18:05:31 +01:00
Julian Brost	dd51997c73	Merge pull request #9624 from Icinga/9618 Make compilable on Boost v1.81	2023-01-05 15:32:22 +01:00
Alexander A. Klimov	99c2d69dc8	Handle boost::beast::http::basic_fields#operator[]() signature change (v1.81) Use always working std::string(x), not broken x.to_string(). (x is a return value.)	2023-01-05 11:18:20 +01:00
Alexander A. Klimov	5bcbc96e22	Handle boost::beast::http::basic_fields#set() signature change (v1.81) Make String convertible to boost::beast::string_view (always working), not boost::string_view (broken).	2023-01-05 11:18:20 +01:00
Alexander A. Klimov	d059885d9b	Main process: ignore SIGHUP On OpenBSD rcctl reload icinga2 SIGHUPs all "icinga2" processes, not just our umbrella. We must handle this.	2023-01-03 18:29:31 +01:00
Julian Brost	fbb68dbcd0	Namespace: replace behavior classes with a bool In essence, namespace behaviors acted as hooks for update operations on namespaces. Two behaviors were implemented: - `NamespaceBehavior`: allows the update operation unless it acts on a value that itself was explicitly marked as constant. - `ConstNamespaceBehavior`: initially allows insert operations but marks the individual values as const. Additionally provides a `Freeze()` member function. After this was called, updates are rejected unless a special `overrideFrozen` flag is set explicitly. This marvel of object-oriented programming can be replaced with a simple bool. This commit basically replaces `Namespace::m_Behavior` with `Namespace::m_ConstValues` and inlines the behavior functions where they were called. While doing so, the code was slightly simplified by assuming that `m_ConstValues` is true if `m_Frozen` is true. This is similar to what the API allowed in the old code as you could only freeze a `ConstNamespaceBehavior`. However, this PR moves the `Freeze()` member function and the related `m_Freeze` member variable to the `Namespace` class. So now the API allows any namespace to be frozen. The new code also makes sense with the previously mentioned simplification: a `Namespace` with `m_ConstValues = false` can be modified without restrictions until `Freeze()` is called. When this is done, it becomes read-only. The changes outside of `namespace.*` just adapt the code to the slightly changed API.	2022-12-09 09:25:46 +01:00
Julian Brost	a8cc5dff89	Prevent ObjectLock from being copied Copying an ObjectLock results in the underlying mutex being unlocked too often. There's also no good reason for copying a scoped locking class (if at all, it should be moved).	2022-12-08 15:48:01 +01:00
Alexander Aleksandrovič Klimov	ca328627cd	Merge pull request #9537 from Icinga/replace-some-raw-pointer-with-intrusive-ptr FilterUtility: Replace some nested raw pointers by `unique_ptr<>*`	2022-12-06 13:07:24 +01:00
Alexander Aleksandrovič Klimov	b585e20a4c	Merge pull request #9591 from Icinga/circular-refs icinga2 daemon: w/o --dump-objects just check for circular refs	2022-11-30 21:41:21 +01:00
Alexander A. Klimov	ba62c665aa	WorkQueue#ParallelFor(): allocate lambda once per thread, not once per item	2022-11-30 11:10:24 +01:00
Alexander A. Klimov	83021f8231	CONTEXT: use << everywhere to unify usages	2022-11-30 11:06:51 +01:00
Alexander A. Klimov	b82814fb29	CONTEXT: lazily evaluate frames to only actually assemble when needed	2022-11-30 11:06:45 +01:00
Alexander A. Klimov	0b46e0aeab	CONTEXT: use l_Frames as stack to reduce modification complexity	2022-11-30 10:56:24 +01:00
Alexander A. Klimov	70df0e298e	CONTEXT: reduce malloc()s by replacing linked list with vector	2022-11-30 10:56:24 +01:00
Alexander A. Klimov	7c481742f4	icinga2 daemon: w/o --dump-objects just check for circular refs and don't malloc() anything.	2022-11-30 10:45:50 +01:00
Alexander A. Klimov	e53ec2a50f	SerializeInternal(): allow to optionally not malloc() anything This effectively just checks for circular refs.	2022-11-30 10:45:50 +01:00
Alexander A. Klimov	145ee890df	Just get paths from existing objects for modification and deletion instead of computing from scratch if they're in the _api package. For now this changes literally nothing as paths of existing objects still follow the scheme of paths of new objects which didn't change. Now Icinga only doesn't expect existing objects at particular paths. However, with the latter in v2.14+ (agent, satellite) we can just change the path scheme of new objects in v2.16+ (master) as we wish. The child nodes will just follow the new scheme of paths of new objects.	2022-11-28 16:39:16 +01:00
Yonas Habteab	c1f73fbc1d	FilterUtility: Replace some nested raw pointers by our `unique_ptr<X>*`	2022-11-28 14:50:54 +01:00
Yonas Habteab	834709543a	ApplyRule: Make `m_HasMatches` atomic This prevents the `m_HasMatches` property from being altered simultaneously. This might seem harmless (since this property can only be set to true by any calling thread), however, from a technical (C++) point of view, this constitutes a data race.	2022-11-28 14:13:58 +01:00
Alexander A. Klimov	eaa3cd83ad	Influx DB: don't unneccessarily truncate timestamps to whole seconds Instead send timestamps with the highest possible precision (ns). Useful for check intervals <1s.	2022-11-28 12:27:01 +01:00
Julian Brost	ae32b3cbbd	Merge pull request #9586 from Icinga/9363 icinga2 daemon: write icinga2.debug only if --dump-objects given	2022-11-24 16:03:42 +01:00
Alexander A. Klimov	f71612d8f3	icinga2 object list: warn on possibly outdated config	2022-11-24 10:50:17 +01:00
Alexander A. Klimov	0767c6ef87	icinga2 daemon -C: write icinga2.debug only if --dump-objects given to save config (re)load time.	2022-11-23 12:54:33 +01:00
Julian Brost	dd99a5ace9	Merge pull request #9577 from Icinga/ConfigItem-CommitNewItems ConfigItem::CommitNewItems(): allow fast search of pending items by type	2022-11-23 12:34:51 +01:00
Alexander A. Klimov	ae693cb7e1	ConfigItem::CommitNewItems(): allow fast search of pending items by type	2022-11-21 15:07:39 +01:00
Alexander A. Klimov	33e609d791	Type#GetLoadDependencies(): avoid malloc() - cache result - return it by const ref - do Type::GetByName() for the callers	2022-11-21 15:07:39 +01:00
Julian Brost	a958a735d7	Merge pull request #9555 from Icinga/ApplyRule-GetDebugInfo ApplyRule#GetDebugInfo(): return by const ref to avoid malloc()	2022-11-16 13:35:04 +01:00
Alexander A. Klimov	e97a5d59e0	ApplyRule#GetFVVar(): return by const ref to avoid malloc().	2022-11-08 12:48:13 +01:00
Alexander A. Klimov	738662338f	ApplyRule#GetFKVar(): return by const ref to avoid malloc().	2022-11-08 12:45:21 +01:00
Julian Brost	98902b2ff0	Merge pull request #9545 from Icinga/targeted-apply-rules Separately handle apply rules targetting only specific parent objects	2022-11-04 14:06:15 +01:00
Yonas Habteab	a8d46e6d47	Use service short name for evaluating targeted service rules	2022-11-04 10:19:26 +01:00
Yonas Habteab	2610fb1285	Avoid evaluating the same filter twice for the same target	2022-11-04 10:15:22 +01:00
Alexander A. Klimov	27a559c5fe	ApplyRule#GetDebugInfo(): return by const ref to avoid malloc()	2022-10-28 15:33:44 +02:00
Alexander A. Klimov	a698b9c3da	ApplyRule::RuleMap: reduce complexity, save unnecessary lookups	2022-10-28 14:27:53 +02:00
Alexander A. Klimov	a907c2ac9a	Targeted apply rules: don't unnecessarily eval filter	2022-10-28 14:27:53 +02:00
Alexander A. Klimov	dacd6a206d	VariableExpression#GetVariable(): return by const ref not to unnecessarily malloc()	2022-10-28 14:27:53 +02:00
Alexander A. Klimov	038a5e8ef6	Unify storages of regular/targeted apply rules: std::vector<ApplyRule::Ptr>	2022-10-28 14:27:53 +02:00
Alexander A. Klimov	a56ad38ad3	Separately handle apply rules targetting only specific parent objects not to unnecessarily run e.g. the filter assign where host.name=="example.com" for all hosts being not example.com.	2022-10-28 14:27:53 +02:00
Alexander A. Klimov	fd7ac4e5ca	Allow hashmaps of String	2022-10-21 10:28:41 +02:00
Alexander A. Klimov	449a3c14cf	Allow intrusive pointers to ApplyRule	2022-10-21 10:28:41 +02:00
Julian Brost	987bb22397	Merge pull request #9543 from Icinga/apply-rules-lookup Lookup apply rules faster by Type*, not String and by map instead of ==/!=	2022-10-21 09:53:35 +02:00
Alexander A. Klimov	c7d656716f	Remove unused ApplyRule#m_TargetType	2022-10-19 13:43:51 +02:00
Alexander A. Klimov	d468d7993c	Lookup apply rules faster by Type, not String and by map instead of ==/!= 1. The lookup of apply rules per source type now implies no String(const char) (no malloc()) and just pointer (uint64) comparisions 2. Apply rules are now also grouped by target type via a nested map, that obsoletes checking the target type while iterating over all rules per source type	2022-10-19 13:43:51 +02:00
Alexander A. Klimov	90fe4e5bea	ApplyRule::GetTargetTypes(): return by const ref not to malloc()	2022-10-19 13:43:51 +02:00
Julian Brost	f2563cc890	Merge pull request #9542 from Icinga/context-evaluating-apply-rules-for-host Construct string once, not unnecessarily N times	2022-10-17 19:57:09 +02:00
Alexander A. Klimov	ce1a122618	Construct string once, not unnecessarily N times	2022-10-17 15:54:02 +02:00
Yonas Habteab	400117e2f6	ConfigItem: Don't add items to the new items vector before committing This also improves the performance a bit, as we longer have to iterate over the items and copy them into the new items vector.	2022-10-12 13:27:41 +02:00
Yonas Habteab	f7298e85d2	ConfigItem: Fix infinite recursion caused by `ignore_on_error` when committing an item When committing an item with `ignore_on_error` flag set fails, the `Commit()` method only returns `nullptr` and the current item is not being dropped from `m_Items`. `CommittNewItems()` also doesn't check the return value of `Commit()` but just continues and tries to commit all items from `m_Items` in recursive call. Since this corrupt item is never removed from `m_Items`, it ends up in an endless recursion till it finally crashes.	2022-10-12 13:15:09 +02:00
Julian Brost	91cbb856fe	Merge pull request #9521 from Icinga/noop-log-msgs Logger: don't render log messages which will be disposed anyway	2022-10-11 19:05:03 +02:00
Alexander Aleksandrovič Klimov	363f4d3fde	Merge pull request #9408 from Icinga/bugfix/match-api-permissions-against-join-relations ObjectQueryHandler: Check user permissions on joined relations	2022-10-11 13:42:27 +02:00
Yonas Habteab	a656444d78	RedisConnection: Don't log queries that are going to be discarded	2022-10-11 13:28:08 +02:00
Alexander A. Klimov	0fbb0332a6	Logger: don't render log messages which will be disposed anyway by caching the total minimum log severity of all loggers in a "global variable" and whether a message's severity is large enough for any of the loggers in a per-message no-op flag.	2022-10-11 13:28:08 +02:00
Julian Brost	9be02e3f04	Merge pull request #9518 from Icinga/9481 StartUnixWorker(): watch forked child via waitpid(), not SIGCHLD handler	2022-10-10 14:36:52 +02:00
Yonas Habteab	72e6894bbb	Evaluate permission filters also on all joined relations	2022-10-10 12:33:33 +02:00
Yonas Habteab	607f7ab5ca	ObjectQueryHandler: Check user permissions on joined relations	2022-10-10 12:33:33 +02:00
Yonas Habteab	1bb2d65a8d	FilterUtility: Outsource permission matching from CheckPermission() to a separate method	2022-10-10 12:33:33 +02:00
Julian Brost	465da17060	Merge pull request #9407 from Icinga/bugfix/don-not-allow-changing-object-relations-at-runtime Don't allow to change object navigation fields at runtime	2022-10-10 12:27:57 +02:00
Alexander A. Klimov	61f7e029cb	Replace two-variants enum with bool	2022-10-07 15:14:33 +02:00
Julian Brost	87a4925997	Merge pull request #9519 from Icinga/utf8cp Utility::ValidateUTF8(): move a string instead of copying a vector	2022-10-07 10:21:37 +02:00
Julian Brost	2a4dc083ae	Merge pull request #9524 from Icinga/introduce-object-idx Introduce object identifier attr	2022-10-07 10:19:59 +02:00
Julian Brost	0ed9c09a1d	Merge pull request #9513 from Icinga/9501 Icinga DB: on every check result update state only 1x, not 3x in a row	2022-10-07 10:18:56 +02:00
Yonas Habteab	85c77bd878	IcingaDB: Cache generated object hash	2022-09-12 17:23:06 +02:00
Yonas Habteab	07e60c1961	ConfigObject: Introduce new `icingadb_identifier` attr	2022-09-12 17:22:57 +02:00
Yonas Habteab	28c29c1fbc	Don't allow to change object parent,host/service_name at runtime	2022-09-09 18:26:28 +02:00
Alexander A. Klimov	a6b36a2d7b	Utility::ValidateUTF8(): move a string instead of copying a vector less malloc() = more speed Especially as JsonEncode() validates every single input string.	2022-09-09 10:50:42 +02:00
Alexander A. Klimov	22bfcf9ac5	icinga2 daemon: remove no-op SIGCHLD handling 1. Don't set a custom handler for SIGCHLD (in the umbrella process) as that handler doesn't actually handle SIGCHLD anymore 2. Don't reset the SIGCHLD handler (in the worker process) as there's nothing to reset anymore due to the above change 3. Don't block SIGCHLD across fork(2) as its handler doesn't change anymore due to the above changes	2022-09-07 12:12:09 +02:00
Alexander A. Klimov	3de714489c	Remove unused UnixWorkerState::Failed	2022-09-07 12:08:33 +02:00
Alexander A. Klimov	df9008bfc4	StartUnixWorker(): watch forked child via waitpid(), not SIGCHLD handler Before: On SIGCHLD from the forked worker the umbrella process sets a failure flag. StartUnixWorker() recognises that and does waitpid(), failure message, etc.. On OpenBSD we can't tell the signal source, so we always set the failure flag. That's not how our IPC shall work, that breaks the IPC sooner or later. After: No SIGCHLD handling and no failure flag setting. Instead StartUnixWorker()'s wait loop uses waitpid(x,y,WNOHANG) to avoid false positives while watching the forked worker.	2022-09-07 11:46:46 +02:00
Yonas Habteab	31785b48fd	Expression: Decrease `frame.Depth` only when calling `IncreaseStackDepth()` succeeds This ensures that `frame.Depth` is only decreased when preceding `frame.IncreaseStackDepth()` callee was successful. This way, `frame.Depth` will have the same depth prior to and after evaluating a frame.	2022-09-07 09:41:16 +02:00
Alexander A. Klimov	5e9f95c007	Icinga DB: on every check result update state only 1x, not 3x in a row Before (time: vertical, stack: horizontal): * Checkable::ExecuteCheck * Checkable::UpdateNextCheck * IcingaDB::NextCheckChangedHandler * HSET icinga:host:state * HSET icinga:checksum:host:state * ZADD icinga:nextupdate:host * RandomCheckTask::ScriptFunc * Checkable::ProcessCheckResult * Checkable::UpdateNextCheck * IcingaDB::NextCheckChangedHandler * HSET icinga:host:state * HSET icinga:checksum:host:state * ZADD icinga:nextupdate:host * IcingaDB::NewCheckResultHandler * HSET icinga:host:state * HSET icinga:checksum:host:state * ZADD icinga:nextupdate:host * IcingaDB::StateChangeHandler * XADD icinga:runtime:state * IcingaDB::ForwardHistoryEntries * XADD icinga:history:stream:state After: * Checkable::ExecuteCheck * Checkable::UpdateNextCheck * RandomCheckTask::ScriptFunc * Checkable::ProcessCheckResult * Checkable::UpdateNextCheck * IcingaDB::NewCheckResultHandler * HSET icinga:host:state * HSET icinga:checksum:host:state * ZADD icinga:nextupdate:host * IcingaDB::StateChangeHandler * XADD icinga:runtime:state * IcingaDB::ForwardHistoryEntries * XADD icinga:history:stream:state The first state + nextupdate (for overdue) update comes from next_check being set to now + interval immediately before doing the actual check (not to trigger it twice). This update is not only not important for the end user, but even inappropriate. The end user SHALL see next_check being e.g. in -4s, not 5m, as the check is running at the moment. The second one is just redundant as IcingaDB::NewCheckResultHandler (the third one) is called anyway and will update state + nextupdate as well.	2022-09-06 10:10:14 +02:00
Alexander A. Klimov	01bc7d4043	Application::Exit(): don't exit(), but _exit(), even in debug build mode Case: 1. icinga2 api setup 2. icinga2 daemon -C -x debug Before: Second commands crashes at exit. After: No crash. As the comment between the removed lines clearly says: Our destructors haven't been built for static data. This is build type independent.	2022-08-23 13:12:21 +02:00
Alexander A. Klimov	790bad9250	Fix compile error on Solaris 11.4 by not using LOG_FTP which is not defined there.	2022-08-16 12:07:05 +02:00
Julian Brost	d3cca0e621	Merge pull request #9409 from Icinga/feature/disallow-empty-object-names Disallow empty object names	2022-08-11 16:53:28 +02:00
Alexander A. Klimov	a2362ebf17	IcingaDB::VersionChangedHandler(): don't handle not synced types not to surprise (and crash) the Icinga DB daemon with unknown types.	2022-08-10 13:24:44 +02:00
Alexander A. Klimov	32871ca40c	IcingaDB::SendCustomVarsChanged(): don't delete custom vars of not synced types not to surprise (and crash) the Icinga DB daemon with unknown types.	2022-08-10 11:40:53 +02:00
Alexander A. Klimov	c9d6eecc7f	Dump state file atomically not to corrupt it by using fsync(2) before close(2) and rename(2).	2022-07-28 18:00:37 +02:00
Alexander A. Klimov	600fb0e3c2	Introduce AtomicFile	2022-07-28 18:00:37 +02:00
Julian Brost	913566cbfa	Windows: output useful error message for syscall errors	2022-07-28 17:00:57 +02:00
Yonas Habteab	148f5b8416	ConfigObject: Mark object names as required This prevents a user from creating an object without a valid name such as `"", null`.	2022-07-15 15:51:58 +02:00
Julian Brost	a927ba39b7	Windows: only include critical messages in early log messages The point of logging to the Windows Event Log was to catch errors that happen before the full logging configuration has been loaded and enabled. Messages like the number of loaded objects per type just cause noise in the log and provide little benefit. Therefore raise the required log level at this stage. Note that this commit removes the (never documented) ability to use the -x flag to change the level. But doing so would require patching the command line of the service in the registry anyways.	2022-07-14 14:07:56 +02:00
Julian Brost	bd2118c4cd	Merge pull request #9420 from Icinga/IcingaDB-soft_state Icinga DB: icinga:*:state: rename state to soft_state	2022-06-29 12:24:52 +02:00
Alexander A. Klimov	ba9a5c614c	Icinga DB: icinga:*:state: rename state to soft_state	2022-06-29 11:49:06 +02:00
Julian Brost	9b24056e05	Merge pull request #9346 from Icinga/icingadb-check Introduce Icinga DB check (like the IDO one)	2022-06-28 18:24:29 +02:00
Julian Brost	3222fab05a	Icinga DB Check: don't check runtime update backlog during full sync	2022-06-28 13:33:00 +02:00
Julian Brost	4f125753bf	Icinga DB Check: ignore suppressed queries in Redis backlog check If some kind of query is not supposed to be processed at the moment, there is little point in checking it. During a full dump, state updates are suppressed (i.e. delayed), so when a dump takes very long, this would have resulted in a false Redis backlog warning.	2022-06-28 13:33:00 +02:00
Julian Brost	5550fb713c	Icinga DB Check: include ongoing dumps in OK message Also use the "current" and "full dump/sync" terminology in the other messages.	2022-06-28 13:33:00 +02:00
Julian Brost	3ded7a9268	Icinga DB Check: rename dump/sync related perfdata values Scope all values using current/last instead of takes/took.	2022-06-28 13:33:00 +02:00
Julian Brost	e36bc92a2c	Icinga DB Check: add unit hints to all rates	2022-06-28 13:33:00 +02:00
Julian Brost	eaae7d5863	Icinga DB Check: update not connected message The check makes no attempt to explicitly connect to Redis, it uses the connection of the IcingaDB feature, so this message better describes the state in this situation.	2022-06-28 13:33:00 +02:00
Julian Brost	2fafffb85f	Icinga DB Check: fix race-condition with IcingaDB::Start() IcingaDB::GetConnection() uses IcingaDB::m_Rcon which is only initialized in IcingaDB::Start(), therefore add a nullptr check to the check command. Additionally, as m_Rcon is potentially accessed concurrently, add a copy of the value that is safe for concurrent use.	2022-06-28 13:33:00 +02:00
Julian Brost	953e113465	Icinga DB Check: remove markdown headings from output icingadb-web shows multiple lines from the check output collapsed into a single line. The lines containing just minuses make this look cluttered and making making it a heading provides little to no benefit. Even when rendering markdown in the check output at some point, having the lists labeled using normal paragraphs would look just fine.	2022-06-28 13:33:00 +02:00
Julian Brost	c59d44cd8b	Icinga DB Check: rename perfdata values - Add icinga2_ and icingadb_ prefixes to make clear which component is responsible for the value. - Rename heartbeat_lag to heartbeat_age, describes it better in my opinion and sound a bit less like something that should be as close to zero as possible. - Rename redis_dump/database_sync into full_dump/full_sync as this is how these operations are refered to in log messages as well. - Rename Redis backlog into Redis query backlog, makes it a bit clearer in my opinion. - Rename runtime_backlog into runtime_update_backlog, as the component in Icinga DB is called that way and this naming is also exposed in log messages. - Rename dump_config/state/history into config/state/history_dump, makes it sound more natural.	2022-06-28 13:33:00 +02:00
Julian Brost	d0382f71ab	Icinga DB Check: rename variables from takes to duration Sounds more natural in my opinion and I doubt that many users would get that due to the difference between takes/took, this refers to ongoing dumps.	2022-06-28 13:33:00 +02:00
Julian Brost	3c29b15214	Icinga DB Check: use more natural names for sync/cleanup metrics	2022-06-28 13:33:00 +02:00
Julian Brost	d70a27b982	Icinga DB Check: report history and runtime update backlog separately Probably makes little difference for an end-user, but for support and development it's great to know which of the two is causing problems.	2022-06-28 13:33:00 +02:00
Julian Brost	2a4605f4b7	Icinga DB Check: clearly state Icinga 2 Redis backlog Should make it easier to understand that this refers to Redis queries issued by Icinga 2.	2022-06-28 13:33:00 +02:00
Julian Brost	5613412b81	Icinga DB Check: replace nested calls to fmax() with std::max() Improves readability, even more so after splitting it into separate lines.	2022-06-28 13:33:00 +02:00
Julian Brost	f3f1373f83	Icinga DB Check: spell out "error" in perfdata	2022-06-28 13:33:00 +02:00
Julian Brost	31c7dfee53	Icinga DB Check: fix error message on Redis query error Not only XREAD queries are performed, so the previous error message was incorrect.	2022-06-28 13:33:00 +02:00
Julian Brost	4f1f70f843	Icinga DB Check: remove unused includes	2022-06-28 13:33:00 +02:00
Julian Brost	2b310718e3	Icinga DB Check: rename keys in heartbeat stream In both C++ and Go, the keys are only used as constant strings, so namespacing them just adds clutter for the `general:*` keys, therefore remove it.	2022-06-28 13:33:00 +02:00
Julian Brost	d74fbbbb82	Icinga DB Check: remove _1sec metrics They add no additional information compared to the _1min values as it's always the same value divided by 60 anyways. Adding the actual value from the last second makes little sense for realistic values of check_interval.	2022-06-28 13:33:00 +02:00
Julian Brost	44cbd04088	Icinga DB Check: read performance data string from Redis Use the already existing format to pass performance data to Icinga 2 rather than some new JSON structure. Has the additional benefit of doing more things in Go than in C++.	2022-06-28 13:33:00 +02:00
Yonas Habteab	0ffef02c1d	IcingaDB: Adjust some column names according to the DB schema	2022-06-23 14:27:34 +02:00
Alexander A. Klimov	e4a36bc217	Introduce Icinga DB check (like the IDO one)	2022-06-23 11:14:31 +02:00
Alexander A. Klimov	88c8d29ee6	Remove Icinga DB perfdata from Icinga check as the Icinga DB check already yields it.	2022-06-22 13:25:29 +02:00
Julian Brost	6b4681ee9e	Icinga DB: make error message more helpful if API isn't set up	2022-06-20 14:57:19 +02:00
Alexander A. Klimov	8eef51afeb	Introduce IcingaDB::AddKvsToMap()	2022-06-20 13:47:39 +02:00
Alexander A. Klimov	2c3d2f8b87	RedisConnection::ReadRESP(): *-1\r\n is null, not [ ]	2022-06-20 13:47:39 +02:00
Alexander Aleksandrovič Klimov	4522522444	Merge pull request #9362 from Icinga/bugfix/remove-redundant-serialization Remove redundant call to Serialize() in ConfigItem::Commit()	2022-06-15 09:34:38 +02:00
Julian Brost	ad218c9a12	Icinga DB: initialize environment ID during config validation IcingaDB may receive callbacks from Boost signals before being fully started. This resulted in situations where m_EnvironmentId was used before it was initialized properly. This is fixed by initializing it earlier (during the config validation stage). However, at this stage, it should not yet write to disk, therefore, persisting the environment ID to disk is delayed until later in the startup process. Initializing at this stage has an extra benefit: if there is an error for some reason (possibly corrupt icingadb.env file), this now shows up as a nice error during config validation. Additionally, this replaces the use of std::call_once with std::mutex due to bug in libstdc++ (see inline comment for reference).	2022-06-10 14:19:58 +02:00
Yonas Habteab	45f536ca06	Bump Redis schema version to 5	2022-06-07 12:55:12 +02:00
Yonas Habteab	92becec37f	IcingaDB: Add `_name` suffix to columns referring to name	2022-05-31 16:41:40 +02:00
Eric Lippmann	18c8b4ad54	Merge pull request #9371 from Icinga/bugfix/icingadb-command-arguments-null IcingaDB: handle null (Empty) for value/set_if/separator in command arguments	2022-05-23 16:01:49 +02:00
Julian Brost	3220fecd4c	Merge pull request #7919 from Icinga/feature/parameter-delimiters-check-execution-6277 Introduce Command#arguments[].separator	2022-05-23 13:23:36 +02:00
Julian Brost	f110e26635	IcingaDB: handle null (Empty) for value/set_if/separator in command arguments Icinga 2 treats null (Empty) as if the corresponding attribute is not specified. However, without this commit, it would serialize the value as "null" (i.e. type string), so that it ends up in the database as this string instead of NULL. This commit adds handling for ValueEmpty so that is serialized as JSON null value and ends up in the database as NULL.	2022-05-23 11:53:41 +02:00
Alexander A. Klimov	069c3968d9	Introduce Command#arguments[].sep ... for letting check commands produce argv like --key=value, not just --key value. refs #6277	2022-05-11 17:50:12 +02:00
Julian Brost	4184dcd62c	Merge pull request #9354 from WuerthPhoenix/feature/return-correct-status-in-process-check-result-api Return correct status codes in process-check-result API	2022-05-05 15:30:09 +02:00
Julian Brost	abe2dfa763	Replace EventuallyAtomic with AtomicOrLocked which falls back to a mutex Apparently there was a reason for making the members of generated classes atomic. However, this was only done for some types, others were still accessed using non-atomic operations. For members of type T::Ptr (i.e. intrusive_ptr<T>), this can result in a double free when multiple threads access the same variable and at least one of them writes to the variable. This commit makes use of std::atomic<T> for more T (it removes the additional constraint sizeof(T) <= sizeof(void*)) and uses a type including a mutex for load and store operations as a fallback.	2022-05-03 12:02:46 +02:00
Julian Brost	2dcdae4470	Remove redundant call to Serialize() in ConfigItem::Commit() The very same object is already serialized a few lines above, the result is even stored in a variable, but that variable was not used before. Simply using this variable results in a noticeable improvement of config validation times.	2022-04-28 17:09:16 +02:00
Damiano Chini	9d9810b44d	Return correct status codes in process-check-result API	2022-04-26 13:33:59 +02:00
Julian Brost	51cd7e7b0b	Take host state into account when sending suppressed notifications Checkable::FireSuppressedNotifications() compares the time of the current checkable with the last recovery time of parents to avoid notification right after a parent recovered and before the current checkable was checked. This commit makes this check also include to host if the checkable is a service. This makes the behavior consistent with the documentation that states there is an implicit dependency on the host (which isn't realized as implicitly generating a Dependency object unfortunately).	2022-04-19 16:13:15 +02:00
Julian Brost	178aaaeca9	Merge pull request #9332 from Icinga/bugfix/compare-cluster-tickets-in-constant-time Compare cluster tickets in constant time	2022-04-11 15:32:32 +02:00
Julian Brost	b24a2fa2a5	Merge pull request #9179 from Icinga/Al2Klimov-patch-3 Let new cluster certificates expire after 397 days, not 15 years	2022-04-11 15:29:05 +02:00
Julian Brost	0e880048ee	Merge pull request #7961 from Icinga/bugfix/startup-log Place startup.log and status in /var/lib/icinga2/api, not /var/lib/icinga2/api/zones-stage	2022-04-11 14:41:07 +02:00
Alexander A. Klimov	b15763bd86	Compare cluster tickets in constant time Just to be sure.	2022-04-11 11:17:05 +02:00
Alexander A. Klimov	08a23f4035	Write also /var/lib/icinga2/api/zones-stage-startup-last-failed.log in addition to /var/lib/icinga2/api/zones-stage-startup.log to prevent the next success to overwrite the last failure.	2022-04-11 11:14:42 +02:00
Alexander A. Klimov	c9e4c016e0	Protect ApiListener#m_SSLContext with a mutex	2022-04-11 11:02:45 +02:00
Alexander A. Klimov	e490883577	Renew certificates also periodically	2022-04-11 11:02:39 +02:00
Alexander Aleksandrovič Klimov	39d642af75	Merge pull request #9321 from Icinga/perfdata-resume-signal Perfdata writers: disconnect handlers from signals in Pause()	2022-04-07 15:51:02 +02:00
Alexander A. Klimov	ce6d1b8961	Place startup.log and status in /var/lib/icinga2/api, not /var/lib/icinga2/api/zones-stage not to loose them.	2022-04-07 11:24:24 +02:00
Alexander Aleksandrovič Klimov	b29b95e882	Merge pull request #9267 from Icinga/bugfix/parallel-api-package-calls-do-not-finish-while-reload Worker process doesn't let parallel API package stage updates to complete when terminated	2022-04-06 13:27:44 +02:00
Alexander A. Klimov	56933b8877	Perfdata writers: disconnect handlers from signals in Pause() as they would be re-connected in Resume() (HA). Before they were still connected during pause and connected X+1 times after X split-brains (the same data was written X+1 times).	2022-04-06 13:09:26 +02:00
Alexander A. Klimov	3753f86c80	ApiListener#Start(): auto-renew own cert if CA owner otherwise that particular cert would expire.	2022-04-04 12:13:31 +02:00
Alexander A. Klimov	6d470a3ca5	Introduce ApiListener#RenewCert()	2022-04-04 12:12:31 +02:00
Alexander Aleksandrovič Klimov	f749c7556e	Merge pull request #9314 from Icinga/latin1 IDO MySQL: reason latin1 charset for actually UTF-8 bytes	2022-04-04 11:05:12 +02:00
Alexander A. Klimov	11b8d0f058	IDO MySQL: reason latin1 charset for actually UTF-8 bytes	2022-03-31 18:10:21 +02:00
Alexander Aleksandrovič Klimov	2fa26961ac	Merge pull request #9311 from Icinga/9308 IDO MySQL: explicitly use latin1	2022-03-31 16:44:11 +02:00
Alexander A. Klimov	245fbad1e5	IDO MySQL: explicitly use latin1 for the case the MySQL client lib is compiled with another default not to turn Unicode chars into ??.	2022-03-31 15:04:45 +02:00
Yonas Habteab	6193a911bf	ConfigStagesHandler: Don't allow concurrent package updates anymore To prevent Icinga2 from being restarted while one or more requests are still in progress and end up as corrupted stages without status file and startup logs.	2022-03-30 09:42:22 +02:00
Yonas Habteab	362adcab1a	ConfigPackageUtility: Don't reset ongoing package updates on config validation success and process is going to be reloaded	2022-03-30 09:42:22 +02:00
Yonas Habteab	575af4c980	Defer: Allow to cancel the callback before going out of scope	2022-03-30 09:42:22 +02:00
Alexander A. Klimov	9be2eb8e5e	Introduce IsCertUptodate()	2022-03-29 16:47:23 +02:00
Alexander A. Klimov	5f2e021390	Request certificate renewal also master2->master1 not only sat->master to prevent master2's certificate from expiring.	2022-03-29 16:47:23 +02:00
Alexander A. Klimov	e06b631f3a	Let new cluster certificates expire after 397 days, not 15 years https://cabforum.org/wp-content/uploads/CA-Browser-Forum-BR-1.7.3.pdf, section 6.3.2: "Subscriber Certificates issued on or after 1 September 2020 SHOULD NOT have a Validity Period greater than 397 days and MUST NOT have a Validity Period greater than 398 days."	2022-03-29 16:47:23 +02:00
Alexander Aleksandrovič Klimov	d171301b9d	Merge pull request #9298 from Icinga/bugfix/icingadb-remove-comment-history Icinga DB: discard comment removals with missing information	2022-03-29 11:25:01 +02:00
Alexander Aleksandrovič Klimov	bbc2b59b0d	Merge pull request #9287 from Icinga/9275 Icinga DB: correct ack comments' is_sticky	2022-03-28 22:42:52 +02:00
Julian Brost	d139bc31c8	Icinga DB: discard comment removals with missing information If comments get removed in unintended ways (i.e. not by expiring or by using the remove-comment API action), the comment object misses information to create a proper history event for Icinga DB. Therefor, discard these events.	2022-03-28 16:58:05 +02:00
Alexander A. Klimov	1220ad8a2f	Icinga DB: correct ack comments' is_sticky On ack Icinga first adds a comment, then acks the checkable so the ack event has the comment ID. But due to the yet missing ack the comment was missing is_sticky. That's corrected now.	2022-03-24 16:42:18 +01:00
Alexander A. Klimov	4399e82d9d	Introduce Comment#sticky Carries whether ack was sticky for ack comments.	2022-03-24 16:42:18 +01:00
Julian Brost	ba154d2a38	Merge pull request #7929 from Icinga/bugfix/override-default-template-apply-rules-7914 Apply rules: import default templates first	2022-03-23 11:30:51 +01:00
Julian Brost	cfa6f1c6a9	Merge pull request #9288 from Icinga/9272 IcingaDB#SendRemovedComment(): ignore ack comments like #SendAddedComment()	2022-03-22 15:06:06 +01:00
Alexander A. Klimov	27966c3c08	IcingaDB#SendRemovedComment(): ignore ack comments like #SendAddedComment() Icinga DB doesn't expect comment history for ack comments. Before: 1. Acked checkable recovers 2. Icinga clears ack comments w/o setting removal time 3. Icinga DB gets neither removal time, nor expire time 4. Icinga DB falls back to NULL and violates NOT NULL constraint	2022-03-21 17:06:35 +01:00
Julian Brost	9630e86997	Add missing array locking in IcingaDB::GetArrayDeletedValues() icinga::Array requires locking by the caller when iterating using Begin() and End(). This is only checked in debug builds but there it makes this function fail.	2022-03-09 14:29:44 +01:00
Julian Brost	bf5b905707	Merge pull request #9250 from Icinga/feature/fix-compiler-warning-do-not-move-local-variables Fix compiler warnings don't move local variables	2022-03-08 11:37:09 +01:00
Julian Brost	90848f602b	Checkable: Add test for state notifications after a suppression ends	2022-03-03 14:25:23 +01:00
Julian Brost	cbc0b21b86	Checkable: sync state_before_suppression in cluster This ensures that in case of a failover in an HA zone, the other can take over properly and has the required state to send the proper notifications.	2022-03-03 14:25:23 +01:00
Julian Brost	39cee3538a	Checkable: improve state notifications after suppression ends This commit changes the Checkable notification suppression logic (notifications are currently suppressed on the Checkable if it is unreachable, in a downtime, or acknowledged) to that after the suppression reason ends, a state notification is sent if and only if the first hard state after is different from the last hard state from before. If the checkable is in a soft state after the suppression ends, the notification is further suppressed until a hard state is reached. To achieve this behavior, a new attribute state_before_suppression is added to Checkable. This attribute is set to the last hard state the first time either a PROBLEM or a RECOVERY notification is suppressed. Compared to from before, neither of these two flags in the suppressed_notification will ever be cleared while the supression is still ongoing but only after the suppression ended and the current state is compared with the old state stored in state_before_suppression.	2022-03-03 14:25:23 +01:00
Alexander A. Klimov	6b5106ffdd	IcingaDB#Stop(): don't block shutdown, timeout instead	2022-03-02 16:39:44 +01:00
Alexander A. Klimov	3a8efcb4ea	IcingaDB#Send*(): don't enqueue any history once stopped	2022-03-02 16:39:44 +01:00
Alexander A. Klimov	cac22fe38b	RedisConnection#Connect(): wait for all promises to be completed by the read loop from the previous connection.	2022-03-02 16:39:44 +01:00
Alexander A. Klimov	9585a63fa0	Introduce IoEngine::YieldCurrentCoroutine()	2022-03-02 16:39:44 +01:00
Alexander A. Klimov	732d5c472d	RedisConnection#ReadLoop(): don't crash (silently) if a promise to be set is already set	2022-03-02 16:39:37 +01:00
Alexander A. Klimov	50fee6aeb9	Icinga DB: include amount of history kept in memory in /v1/status	2022-03-02 16:39:37 +01:00
Alexander A. Klimov	ad0fe764f7	Icinga DB: log amount of history kept in memory every 10s	2022-03-02 16:39:37 +01:00
Alexander A. Klimov	8ea62f7fc7	Icinga DB: keep history in memory until written to Redis by putting the messages into a Bulker and retrying each chunk.	2022-03-02 16:39:37 +01:00
Alexander A. Klimov	9a8d388734	Introduce Bulker	2022-03-02 16:39:37 +01:00
Alexander Aleksandrovič Klimov	3fee562e7a	Merge pull request #9256 from Icinga/bugfix/add-some-missing-locks Add some missing locks to prevent data races	2022-03-01 16:12:50 +01:00
Julian Brost	9d3eba8383	Merge pull request #9259 from Icinga/bugfix/event-handler-spamming-8704 Checkable#ExecuteEventHandler(): don't outsource event command run twice	2022-02-25 16:51:31 +01:00
Yonas Habteab	f00a3c9693	ConfigObject: Initialize local static var at declaration to ensure thread safety	2022-02-25 15:23:49 +01:00
Yonas Habteab	fb21345bfd	ConfigItem: Use atomic variables for notified and commited items count	2022-02-25 15:17:33 +01:00
Alexander A. Klimov	74935dad7b	Checkable#ExecuteEventHandler(): don't outsource event command run twice refs #8704	2022-02-24 14:03:57 +01:00
Yonas Habteab	a0607aceff	Fix compiler warnings don't move local variables	2022-02-22 17:51:43 +01:00
Julian Brost	5383df3c79	Merge pull request #9212 from Icinga/bugfix/multi-ido-notification-id IDO: fix incorrect contacts in notification history with multiple IDO instances on a single node	2022-02-21 11:40:46 +01:00
Julian Brost	8e81faf3e0	Merge pull request #9221 from Icinga/bugfix/processcheckresult-dependency-deadlock Prevent deadlock in ProcessCheckResult	2022-02-18 14:14:46 +01:00
Julian Brost	99008755b5	Merge pull request #9213 from Icinga/feature/icingadb-add-previous_soft_state-to-host_state-and-service_state-9210 IcingaDB: Add previous_soft_state to host_state and service_state	2022-02-18 14:09:35 +01:00
Julian Brost	3bb9cdb8cc	Prevent deadlock in ProcessCheckResult Without this commit, children and parents of a checkable were rescheduled on a state change while holding the lock for the current checkable. If both ends of a dependency are checked at the same time and both change state, they could end up in a deadlock waiting for each other. This commit fixes this problem by changing the code so that other checkables are rescheduled only after releasing the lock for the current checkable.	2022-02-17 16:13:25 +01:00
Alexander A. Klimov	c613e62454	IcingaDB: Add previous_soft_state to host_state and service_state refs #9210	2022-02-14 11:32:46 +01:00
Julian Brost	7c9d0fff01	IDO: use per-instance notification_id in history When there are multiple active IDO instances on the same node, before this commit, all of them would share a single DbValue object for the notification_id column of the icinga_contactnotifications table. This resulted in the issue that one database references the notification_id in another database. This commit fixes this by using a separate DbValue value for each IDO instance. This needs a new signal as the existing OnQuery and OnMultipleQueries signals perform the same queries on all IDO instances, but different queries are needed here per instance (they only differ in the referenced DbValue). Therefore, a new signal OnMakeQueries is added that takes a std::function which is called once per IDO instance and can access callbacks to perform one or multiple queries only on this specific IDO instance.	2022-02-10 16:36:35 +01:00
Julian Brost	1b0ad099f1	Merge pull request #9154 from Icinga/bugfix/icingadb-reachabilitychangehandler-9143 Icinga DB: ensure is_reachable and severity don't miss updates	2022-02-03 14:53:51 +01:00
Alexander A. Klimov	2ef3dd6a38	Checkable#ProcessCheckResult(): call Checkable::OnReachabilityChanged less often Call it only on state changes to reduce no-op Redis/IDO updates a lot. refs #9143	2022-02-03 11:12:53 +01:00
Alexander Aleksandrovič Klimov	ff712f6b23	Service#GetSeverity(): behave as the respective IDO query of Icinga Web which doesn't include host reachability.	2022-01-27 12:21:06 +01:00
Alexander A. Klimov	4c38715ef2	Checkable#ProcessCheckResult(): call Checkable::OnReachabilityChanged last to ensure Checkable#IsReachable() returns correctly for dependency children inside OnReachabilityChanged(). That needs the dependency parent to be already in the correct state. refs #9143	2022-01-25 13:33:46 +01:00
Alexander A. Klimov	84d09876b4	Icinga DB: ensure is_reachable and severity don't miss updates refs #9143	2022-01-25 13:33:46 +01:00
Julian Brost	185fab3761	Merge pull request #9144 from Icinga/bugfix/icingadb-state-history Icinga DB: don't write state history for ack/downtime/host problem changes	2022-01-20 12:00:24 +01:00
Julian Brost	6390911262	Merge pull request #9123 from Icinga/bugfix/icinga2-crashes-when-sending-notifications-8186 Avoid "type" key in dicts being part of object state attrs	2022-01-19 11:48:40 +01:00
Julian Brost	463b159414	Merge pull request #9171 from Icinga/bugfix/icinga-db-notification-history-might-use-incorrect-previous_hard_state-9132 IcingaDB#SendSentNotification(): make stream deterministic via CheckResult#previous_hard_state	2022-01-18 16:54:16 +01:00
Julian Brost	31da6a56e6	Icinga DB: remove obsolete StateChangeHandler overload This version of StateChangeHandler is no longer called anywhere as it was the wrong function for all previous callers anyways.	2022-01-18 12:26:43 +01:00
Julian Brost	cf73c6136b	Icinga DB: make host problem change events update the state tables but not write state history StateChangeHandler() is the function used when the actual hard/soft state changes and thus also writes state history. This is not desired in this case, instead, a runtime update should be generated, therefore call UpdateState() instead. refs #9063	2022-01-18 12:26:43 +01:00
Julian Brost	855e342b63	Icinga DB: make acknowledgement events update the state tables but not write state history StateChangeHandler() is the function used when the actual hard/soft state changes and thus also writes state history. This is not desired in this case, instead, a runtime update should be generated, therefore call UpdateState() instead. refs #9063	2022-01-18 12:26:43 +01:00
Julian Brost	f63268b0dd	Icinga DB: make downtime events update the state tables but not write state history StateChangeHandler() is the function used when the actual hard/soft state changes and thus also writes state history. This is not desired in this case, instead, a runtime update should be generated, therefore call UpdateState() instead. refs #9063	2022-01-18 12:26:43 +01:00
Julian Brost	447884be72	Icinga DB: don't reimplement volatile state update in SendConfigUpdate Sending a volatile state update is already implemented in UpdateState, so just use that function instead of generating the update queries.	2022-01-18 12:26:43 +01:00
Julian Brost	a6d6cb788e	Icinga DB: Merge SendStatusUpdate into UpdateState Previously, both funktions did related operations but had unclear and confusing naming: - UpdateState updated the icinga:{host,service}:state Redis keys. - SendStatusUpdate sent a runtime update for the icinga:{host,service}:state. This commit merges both functions into one with a new mode parameter. The following modes are now supported: - Volatile: Update the icinga:{host,service}:state Redis key. - Full: Perform the volatile state update and in addition send a corresponding runtime update so that this state update gets written through to the persistent database by a running icingadb process. - RuntimeOnly: Special mode for callers that can ensure that a volatile update for the current state was already performed but has to be upgraded to a full update. refs #9063	2022-01-18 12:26:43 +01:00
Alexander A. Klimov	1fee3f1b12	IcingaDB#SendSentNotification(): make stream deterministic via CheckResult#previous_hard_state Now it gets everything from one source, the CheckResult. refs #9132	2022-01-10 19:18:11 +01:00
Julian Brost	3d04b04172	Merge pull request #9138 from Icinga/bugfix/mysql-schema-versions Make MySQL schema version in full schema file and upgrade files consistent	2022-01-10 09:54:38 +01:00
Julian Brost	e518dc2436	Merge pull request #9112 from Icinga/bugfix/sync-missing-history-information Icinga DB: ensure consistent history streams in HA setup	2022-01-07 15:14:06 +01:00
Julian Brost	a99c04030c	Merge pull request #9150 from Icinga/bugfix/icingadb-cmd-arg-order-int Icinga DB: ensure icinga:*command:argument#order is an int	2022-01-05 16:07:30 +01:00
Julian Brost	3e73a262cc	Sync comment and downtime removal info for Icinga DB history When a comment or downtime is removed manually, the name of the requestor and timestamp have to be synced to other nodes in the cluster to allow all of them to generate a consistent Icinga DB history stream. refs #9101	2022-01-05 10:27:13 +01:00
Alexander Aleksandrovič Klimov	1b50d912a0	Merge pull request #9137 from Icinga/bugfix/influxdb-writer-synchronization Fix unsafe concurrent access to m_DataBuffer in InfluxdbCommonWriter	2022-01-04 17:37:28 +01:00
Alexander A. Klimov	e9e555468d	Handle "type" key in dicts being part of object state attrs i.e. the confusion of the state file deserializator with e.g. `"type":32` on startup. That would unexpectedly restore (the now ignored) null (not `{"type":32}`) as there's no type "32". refs #8186	2022-01-04 17:17:20 +01:00
Alexander Aleksandrovič Klimov	80663cf5e6	Merge pull request #9048 from Icinga/bugfix/timeperiod-dst-2.0 LegacyTimePeriod::ScriptFunc: fix DST edge-cases	2022-01-03 18:11:32 +01:00
Alexander A. Klimov	a8c9d19dae	Icinga DB: ensure icinga:command:argument#order is an int The config parser requires Command#arguments#order to be a Number, i.e. 42, 4.2 or even "4.2". That's int-casted where needed, now also for Icinga DB. Before: ``` object CheckCommand "9117" { command = [ "true" ] arguments = { "4.2" = { order = "4.2" } } } ``` 2022-01-03T13:25:07.166+0100 FATAL icingadb json: cannot unmarshal string into Go value of type int64	2022-01-03 13:28:19 +01:00
Julian Brost	33781496da	InfluxdbCommonWriter: use atomic_size_t to data buffer size from stats function m_DataBuffer may be modified concurrently while StatsFunc() is called, thus it's unsafe to call size() on it. As write access to m_DataBuffer is already synchronized by only modifying it from the single work queue thread, instead of adding a mutex, this commit adds a new std::atomic_size_t which is additionally updated when modifying m_DataBuffer and can safely be accessed in StatsFunc().	2022-01-03 12:24:26 +01:00
Julian Brost	e6300aacf9	InfluxdbCommonWriter: only flush from work queue There is no explicit synchronization of access to m_DataBuffer which is fine if it is only accessed from the single-threaded work queue. However, Stop() also called Flush() in another thread, leading to concurrent write access to m_DataBuffer which can result in a crash due to use after free/double free. Changes in this commit: * Flush() is renamed to FlushWQ() to show that it should only be called from the work queue. Additionally, it now asserts that it is running on the work queue. * Visibility of some data members is changed from protected to private. No other classes have to access these at the moment. By this change, accidental concurrent access from derived classes in the future is prevented. * Stop() now flushes by posting FlushWQ() to the work queue and joining it.	2022-01-03 12:24:26 +01:00
Julian Brost	23693248d4	Make MySQL schema version in full schema file and upgrade files consistent In the 2.12.6 release, the full schema file sets the version to 1.14.3, whereas the latest available upgrade file 2.11.0.sql sets it to 1.15.0. Therefore, ship a new upgrade file 2.12.7.sql for all users who imported their schema with version 2.11.0 or later and never performed an upgrade since then. Their databases incorrectly state schema version 1.14.3 and is bumped to the correct version 1.15.0 by the upgrade. In the 2.13.2 release, the full schema file sets the version to 1.15.0, whereas the latest available upgrade file 2.13.0.sql sets it to 1.15.1. Therefore, rename the incorrectly named upgrade file 2.13.1.sql (it was not shipped in this or any other release so far) to 2.13.3.sql for users who imported their schema with version 2.13.0 or later and never performed an upgrade since then. Their databases incorrectly state schema version 1.15.0 and are bumped to the correct version 1.15.1 by the upgrade. The full schema is not touched by this commit as for the current branch, this was already fixed by `815533b334`.	2021-12-16 15:48:12 +01:00
Julian Brost	13ea635188	Don't trigger a fixed downtime like a flexible one When creating a fixed downtime that starts immediately while the checkable is in a non-OK state, previously the code path for flexible downtimes was used to trigger this downtime. This is fixed by this commit which resolves two issued: 1. Missing downtime start notification: notifications work differently for fixed and flexible downtimes. This resulted in missing downtime start notifications under the conditions described above. 2. Incorrect downtime trigger time: this code path would incorrectly assume the timestamp of the last checkable as the trigger time which is incorrect for fixed downtimes.	2021-12-14 11:02:40 +01:00

... 8 9 10 11 12 ...

6874 commits