prometheus

mirror of https://github.com/prometheus/prometheus.git synced 2026-06-03 13:42:14 -04:00

Author	SHA1	Message	Date
Łukasz Mierzwa	d106b3beb7	Wrap db.blocks read in a read lock We don't hold db.mtx lock when trying to read db.blocks here so we need a read lock around this loop. Signed-off-by: Łukasz Mierzwa <l.mierzwa@gmail.com>	2025-01-09 17:06:05 +00:00
Łukasz Mierzwa	92788d313a	Remove TestTombstoneCleanRetentionLimitsRace This test ensures that running db.reloadBlocks() and db.CleanTombstones() at the same time doesn't race. The problem is that CleanTombstones() is a public method while reloadBlocks() is internal. CleanTombstones() sets db.cmtx lock while reloadBlocks() is not protected by any locks at all, it expects the public method through which it was called to do it. So having a race between these two is not unexpected and we shouldn't really be testing this. db.cmtx ensures that no other function can be modifying the list of open blocks and so the scenario tested here cannot happen. If it would happen it would be only because some other method doesn't aquire db.ctmx lock, something this test cannot detect. Signed-off-by: Łukasz Mierzwa <l.mierzwa@gmail.com>	2025-01-09 17:06:03 +00:00
Łukasz Mierzwa	b880cea613	Fix locks in db.reloadBlocks() This partially reverts `ae3d392aa9`. `ae3d392aa9` added a call to db.mtx.Lock() that lasts for the entire duration of db.reloadBlocks(), previous db.mtx would be locked only during critical part of db.reloadBlocks(). The motivation was to protect against races: `9e0351e161 (r555699794)` The 'reloads' being mentioned are (I think) reloadBlocks() calls, rather than db.reload() or other methods. TestTombstoneCleanRetentionLimitsRace was added to catch this but I wasn't able to ever get any error out of it, even after disabling all calls to db.mtx in reloadBlocks() and CleanTombstones(). To make things more complicated CleanupTombstones() itself calls reloadBlocks(), so it seems that the real issue is that we might have concurrent calls to reloadBlocks(). The problem with this change is that db.reloadBlocks() can take a very long time, that's because it might need to load very large blocks from disk, which is slow. While db.mtx is locked a large chunk of the db is locked, including queries, since db.mtx read lock is needed for db.Querier() call. One of the issues this manifests itself as is a gap in all metrics and blocked queries just after a large block compaction happens. When compaction merges multiple day-or-more blocks into a week-or-more block it create a single very big block. After that block is written it needs to be loaded and that seems to be taking many seconds (30-45), during which mtx is held and everything is blocked. Turns out that there is another lock that is more fine grained and aimed at this specific use case: // cmtx ensures that compactions and deletions don't run simultaneously. cmtx sync.Mutex All calls to reloadBlocks() are wrapped inside cmtx lock. The only exception is db.reload() which this change fixes. We can't add cmtx lock inside reloadBlocks() itself because it's called by a number of functions, some of which are already holding cmtx. Looking at the code I think it is sufficient to hold cmtx and skip a reloadBlocks() wide mtx call. Signed-off-by: Łukasz Mierzwa <l.mierzwa@gmail.com>	2025-01-09 17:05:39 +00:00
Arve Knudsen	f030894c2c	Fix issues raised by staticcheck (#15722 ) Fix issues raised by staticcheck We are not enabling staticcheck explicitly, though, because it has too many false positives. --------- Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>	2025-01-09 17:51:26 +01:00
Vandit Singh	6339989e25	web/api: Add a limit parameter to /query and /query_range (#15552 ) add limit param to query and rangeQuery --------- Signed-off-by: Vandit Singh <vanditsinghkv@gmail.com> Signed-off-by: Vandit Singh <107131545+Vandit1604@users.noreply.github.com> Co-authored-by: Björn Rabenstein <github@rabenste.in>	2025-01-09 17:27:39 +01:00
Neeraj Gartia	b3e30d52ce	[BUGFIX] PromQL: Fix `<aggr_over_time>` functions with histograms (#15711 ) fix aggr_over_time with histograms Signed-off-by: Neeraj Gartia <neerajgartia211002@gmail.com> --------- Signed-off-by: Neeraj Gartia <neerajgartia211002@gmail.com>	2025-01-09 16:38:42 +01:00
Matthias Loibl	d173c0b61c	Merge pull request #15334 from janhorstmann/feature/update-monitoring-mixin-dashboard Update mixin dashboard	2025-01-09 14:54:05 +00:00
Fiona Liao	9d6f88cb73	Add additional tests for operators over incompatible nhcb (#15787 ) * Add additional tests for operators over incompatible nhcb Signed-off-by: Fiona Liao <fiona.liao@grafana.com>	2025-01-09 10:29:57 +01:00
Julien Duchesne	a768a3b95e	Rule Concurrency: Test safe abort of rule evaluations (#15797 ) This test was added in the Grafana fork a while ago: https://github.com/grafana/mimir-prometheus/pull/714 and has been helpful to make sure we can safely terminate rule evaluations early The new rule evaluation logic (done here: https://github.com/prometheus/prometheus/pull/15681) does not have the bug, but the test was useful to verify that Signed-off-by: Julien Duchesne <julien.duchesne@grafana.com>	2025-01-08 16:32:48 +00:00
Björn Rabenstein	1ea9b72997	Merge pull request #15795 from prometheus/beorn7/promql promqltest: let eval_ordered ignore annotations and improve documentation	2025-01-08 16:04:14 +01:00
beorn7	d9a80a91e3	docs: Document eval_warn and eval_info This also improves the documentation in the following ways: - Clarifies that `eval` requires no annotations. - Clarifies that `eval_ordered` ignores annotations. - Clarifies that `eval_ordered` does not work with matrix returns (which could very well be created by instant queries). - Clarifies that there are more `eval` commands than just `eval`. - Improves wording for `eval_ordered`. - Replaces `...` by the typographical correct `…`. - Fixes a numerical error in an example. Signed-off-by: beorn7 <beorn@grafana.com>	2025-01-08 13:57:13 +01:00
beorn7	7687661453	promqltest: make eval_ordered ignore annotations Besides making eval_ordered ignore annotations, this does the following: - Adds a test to verify that eval_ordered indeed ignores an info annotations now, while eval complains about it, eval_info recognizes it and, eval_warn flags the missing of the warn annotation. - Refactors the annotation check into its own method. - Moves closing of the query to the appropriate place where it wasn't so far. Signed-off-by: beorn7 <beorn@grafana.com>	2025-01-08 12:55:27 +01:00
Julius Volz	02501e097e	Merge pull request #15789 from prometheus/beorn7/doc2 docs: fix spelling	2025-01-07 19:39:08 +01:00
Ben Ye	919a5b657e	Expose ListPostings Length via Len() method (#15678 ) tsdb: expose remaining ListPostings Length Signed-off-by: Ben Ye <benye@amazon.com> --------- Signed-off-by: Ben Ye <benye@amazon.com>	2025-01-07 17:58:26 +01:00
beorn7	df55e536b8	docs: fix spelling Signed-off-by: beorn7 <beorn@grafana.com>	2025-01-07 17:51:57 +01:00
Julius Volz	18bb8bf996	Merge pull request #15784 from sujalshah-bit/15394_server_name_and_time api: Add two new fields Hostname and ServerTime.	2025-01-07 13:36:53 +01:00
sujal shah	73a3438c1b	api: Add two new fields Node and ServerTime. This commit introduced two field in `/status` endpoint: - The node currently serving the request. - The current server time for debugging time drift issues. fixes #15394. Signed-off-by: sujal shah <sujalshah28092004@gmail.com>	2025-01-07 16:05:50 +05:30
Julien Duchesne	1a27ab29b8	Rules: Store dependencies instead of boolean (#15689 ) * Rules: Store dependencies instead of boolean To improve https://github.com/prometheus/prometheus/pull/15681 further, we'll need to store the dependencies and dependents of each Right now, if a rule has both (at least 1) dependents and dependencies, it is not possible to determine the order to run the rules and they must all run sequentially This PR only changes the dependents and dependencies attributes of rules, it does not implement a new topological sort algorithm Signed-off-by: Julien Duchesne <julien.duchesne@grafana.com> * Store a slice of Rule instead Signed-off-by: Julien Duchesne <julien.duchesne@grafana.com> * Add `BenchmarkRuleDependencyController_AnalyseRules` for future reference Signed-off-by: Julien Duchesne <julien.duchesne@grafana.com> --------- Signed-off-by: Julien Duchesne <julien.duchesne@grafana.com>	2025-01-06 20:48:38 +00:00
Julien Duchesne	8067f27971	`RuleConcurrencyController`: Add `SplitGroupIntoBatches` method (#15681 ) * `RuleConcurrencyController`: Add `SplitGroupIntoBatches` method The concurrency implementation can now return a slice of concurrent rule batches This allows for additional concurrency as opposed to the current interface which is limited by the order in which the rules have been loaded Also, I removed the `concurrencyController` attribute from the group. That information is duplicated in the opts.RuleConcurrencyController` attribute, leading to some confusing behavior, especially in tests. Signed-off-by: Julien Duchesne <julien.duchesne@grafana.com> * Address PR comments Signed-off-by: Julien Duchesne <julien.duchesne@grafana.com> * Apply suggestions from code review Co-authored-by: gotjosh <josue.abreu@gmail.com> Signed-off-by: Julien Duchesne <julienduchesne@live.com> --------- Signed-off-by: Julien Duchesne <julien.duchesne@grafana.com> Signed-off-by: Julien Duchesne <julienduchesne@live.com> Co-authored-by: gotjosh <josue.abreu@gmail.com>	2025-01-06 18:51:19 +00:00
Paulo Dias	c803f7e82f	Merge branch 'openstack-loadbalancer-discovery' of github.com:paulojmdias/prometheus into openstack-loadbalancer-discovery Signed-off-by: Paulo Dias <paulodias.gm@gmail.com>	2025-01-06 15:15:32 +00:00
Paulo Dias	816a5c94b9	fix: fix docs typo Signed-off-by: Paulo Dias <paulodias.gm@gmail.com>	2025-01-06 15:15:17 +00:00
Arthur Silva Sens	5fdec31401	otlp/translator: Use separate function for metric names with UTF8 characters (#15664 ) BuildCompliantName was renamed to BuildCompliantMetricName, and it no longer takes UTF8 support into consideration. It focuses on building a metric name that follows Prometheus conventions. A new function, BuildMetricName, was added to optionally add unit and type suffixes to OTLP metric names without translating any characters to underscores(_).	2025-01-06 11:30:39 -03:00
Hélia Barroso	56094197b5	[Docs] Note that scrape_timeout cannot be greater than scrape_interval (#15786 ) Signed-off-by: Hélia Barroso <helia_barroso@hotmail.com>	2025-01-06 14:13:17 +00:00
Bartlomiej Plotka	a441ad771e	Merge pull request #15467 from prometheus/cedwards/nhcb-wal-wbl feat(nhcb): support custom buckets in native histograms in the WAL/WBL	2025-01-03 22:33:21 +01:00
Arve Knudsen	4f67a38a39	template: Use cases.Title instead of deprecated strings.Title (#15721 ) Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>	2025-01-03 17:58:02 +01:00
Bryan Boreham	a6947a0369	Merge 3.1 into main (#15775 ) Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2025-01-03 14:28:51 +00:00
George Krajcsovits	cfcb00a716	perf(nhcbparse): unroll recursion (#15776 ) https://github.com/prometheus/prometheus/pull/15467#issuecomment-2563585979 Signed-off-by: György Krajcsovits <gyorgy.krajcsovits@grafana.com>	2025-01-02 15:51:52 +01:00
Paulo Dias	36ccf62692	Merge branch 'prometheus:main' into openstack-loadbalancer-discovery	2025-01-02 14:44:19 +00:00
Paulo Dias	d40e99c2ec	Merge branch 'openstack-loadbalancer-discovery' of github.com:paulojmdias/prometheus into openstack-loadbalancer-discovery Signed-off-by: Paulo Dias <paulodias.gm@gmail.com>	2025-01-02 14:43:46 +00:00
Paulo Dias	cb7254158b	feat: rename status to provisioning_status and add operating_status Signed-off-by: Paulo Dias <paulodias.gm@gmail.com>	2025-01-02 14:43:31 +00:00
György Krajcsovits	1e420ef373	Merge branch 'main' into cedwards/nhcb-wal-wbl # Conflicts: # tsdb/tsdbutil/histogram.go	2025-01-02 12:50:19 +01:00
György Krajcsovits	a7ccc8e091	record_test.go: avoid captures, simply return test refs Signed-off-by: György Krajcsovits <gyorgy.krajcsovits@grafana.com>	2025-01-02 12:45:20 +01:00
Bryan Boreham	096e2aa7bd	Merge pull request #14518 from bboreham/faster-listpostings-merge TSDB: Optimization: Merge postings using concrete type	2025-01-02 10:43:45 +00:00
Arve Knudsen	f37b5adfef	OTLP receiver: Optimize by initializing regexps at program start (#15733 ) Signed-off-by: Arve Knudsen <arve.knudsen@gmail.com>	2024-12-31 11:12:21 +01:00
Bryan Boreham	b2fa1c9524	TSDB benchmarks: Commit periodically to speed up init When creating dummy data for benchmarks, call `Commit()` periodically to avoid growing the appender to enormous size. Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2024-12-30 17:42:56 +00:00
Bryan Boreham	2dfb4fdafb	Merge pull request #15723 from machine424/eng-clos fix(main.go): avoid closing the query engine until it is guaranteed to no longer be in use.	2024-12-30 11:16:20 +00:00
TJ Hoplock	4cbd9ffb91	docs: update required go version in readme to 1.22 (#15447 ) It was bumped during 3.0 with the adoption of log/slog and other dep updates. ``` ~/go/src/github.com/prometheus/prometheus (main [ ]) -> grep '^go' go.mod go 1.22.0 ``` Signed-off-by: TJ Hoplock <t.hoplock@gmail.com>	2024-12-30 09:46:17 +01:00
machine424	9823a93c42	fix(main.go): avoid closing the query engine until it is guaranteed to no longer be in use. partially reverts https://github.com/prometheus/prometheus/pull/14064 fixes https://github.com/prometheus/prometheus/issues/15232 supersedes https://github.com/prometheus/prometheus/pull/15533 reusing Engine.Close() outside of tests will require more consideration. Signed-off-by: machine424 <ayoubmrini424@gmail.com>	2024-12-30 05:14:44 +01:00
dependabot[bot]	08c81b721a	chore(deps): bump actions/setup-go from 5.1.0 to 5.2.0 (#15581 ) Bumps [actions/setup-go](https://github.com/actions/setup-go) from 5.1.0 to 5.2.0. - [Release notes](https://github.com/actions/setup-go/releases) - [Commits](`41dfa10bad...3041bf56c9`) --- updated-dependencies: - dependency-name: actions/setup-go dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-12-29 17:08:18 +00:00
pinglanlu	6a61efcfc3	discovery: use a more direct and less error-prone return value (#15347 ) Signed-off-by: pinglanlu <pinglanlu@outlook.com>	2024-12-29 18:03:06 +01:00
johncming	061400e31b	tsdb: export CheckpointPrefix constant (#15636 ) Exported the CheckpointPrefix constant to be used in other packages. Updated references to the constant in db.go and checkpoint.go files. This change improves code readability and maintainability. Signed-off-by: johncming <johncming@yahoo.com> Co-authored-by: johncming <conjohn668@gmail.com>	2024-12-29 17:54:45 +01:00
dependabot[bot]	2c5502c114	chore(deps): bump actions/setup-go from 5.1.0 to 5.2.0 in /scripts (#15580 ) Bumps [actions/setup-go](https://github.com/actions/setup-go) from 5.1.0 to 5.2.0. - [Release notes](https://github.com/actions/setup-go/releases) - [Commits](`41dfa10bad...3041bf56c9`) --- updated-dependencies: - dependency-name: actions/setup-go dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-12-29 17:50:30 +01:00
dependabot[bot]	4d2c1c1d06	chore(deps): bump actions/cache from 4.1.2 to 4.2.0 (#15583 ) Bumps [actions/cache](https://github.com/actions/cache) from 4.1.2 to 4.2.0. - [Release notes](https://github.com/actions/cache/releases) - [Changelog](https://github.com/actions/cache/blob/main/RELEASES.md) - [Commits](`6849a64899...1bd1e32a3b`) --- updated-dependencies: - dependency-name: actions/cache dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-12-29 17:50:09 +01:00
dependabot[bot]	43fd40cae0	chore(deps): bump github/codeql-action from 3.27.5 to 3.27.7 (#15582 ) Bumps [github/codeql-action](https://github.com/github/codeql-action) from 3.27.5 to 3.27.7. - [Release notes](https://github.com/github/codeql-action/releases) - [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md) - [Commits](`f09c1c0a94...babb554ede`) --- updated-dependencies: - dependency-name: github/codeql-action dependency-type: direct:production update-type: version-update:semver-patch ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-12-29 17:49:51 +01:00
bwplotka	281306765e	scrape: Unified scrape loop benchmark. Signed-off-by: bwplotka <bwplotka@gmail.com>	2024-12-29 15:19:06 +00:00
Bryan Boreham	bc9210e393	[TESTS] Scrape: make caching work in benchmark Returning 0 from Append means 'unknown', so the series is never cached. Return arbitrary numbers instead. Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2024-12-29 15:13:47 +00:00
Bryan Boreham	b4ef38cfc8	Scraping: Add benchmark for protobuf format Extract helper function textToProto(). Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2024-12-29 15:13:38 +00:00
Bryan Boreham	8f4557b0b1	Scraping benchmark: more realistic test Don't repeat type and help text. Signed-off-by: Bryan Boreham <bjboreham@gmail.com>	2024-12-29 15:11:39 +00:00
Carrie Edwards	1508149184	Update benchmark test and comment	2024-12-27 09:09:13 -08:00
Bartlomiej Plotka	30967330ca	Merge pull request #14755 from prometheus/arthursens/appendct-prwv2 Append CT as zero sample from PRWv2	2024-12-27 12:44:54 +01:00

... 41 42 43 44 45 ...

17019 commits