Conversation

@baileycash-elastic
Contributor

@baileycash-elastic baileycash-elastic commented Nov 4, 2025

Closes #240356
Replaces this PR

Summary

This pull request introduces a new API route and supporting backend logic for bulk updating tags on alerts in the rule registry. The main changes include the addition of a patchTags method to the AlertsClient, a new route for bulk patching alert tags, and reusable scripts for tag updates. These changes make it possible to add, remove, or replace tags on multiple alerts efficiently, either by IDs or by query.

Alerting authorization

The AlertingAuthorization class was changed to support bulk authorization of multiple rule type IDs and consumers. ensureAuthorized was renamed to _ensureAuthorized and now accepts Array<{ ruleTypeId: string; consumers: string[] }>. The logic is the same as before, but it constructs all security actions from the input. This avoids doing one authorization call per (ruleTypeId, consumer) pair. Lastly, a bulkEnsureAuthorized method is exposed as a wrapper around the private _ensureAuthorized.
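As a rough illustration of the idea (the function and parameter names below are assumptions, not the actual AlertingAuthorization internals), the bulk variant can flatten every pair into one de-duplicated set of security actions so that a single privileges check covers the whole operation:

// Sketch only: names are assumptions, not the real AlertingAuthorization code.
interface BulkAuthorizationEntry {
  ruleTypeId: string;
  consumers: string[];
}

// Flattens every (ruleTypeId, consumer) pair into one de-duplicated set of
// security actions, so one privileges request covers the whole bulk operation.
function buildSecurityActions(
  entries: BulkAuthorizationEntry[],
  operation: string,
  getAction: (ruleTypeId: string, consumer: string, operation: string) => string
): string[] {
  const actions = new Set<string>();
  for (const { ruleTypeId, consumers } of entries) {
    for (const consumer of consumers) {
      actions.add(getAction(ruleTypeId, consumer, operation));
    }
  }
  return [...actions];
}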

Alerts client

The code in the AlertsClient is fairly outdated. For this reason, I decided not to reuse the existing functionality and instead wrote the new logic from scratch. A bulkUpdateTags method is introduced that bulk updates the tags of multiple alerts, either by alertIds or by a KQL query. In the IDs scenario, an aggregation retrieves the rule type ID and the consumer of each alert, and then we authorize in bulk. If the user has access, we proceed and update the tags of the alerts; if not, we throw an error. For the query scenario, we combine the authorization filter with the query so that alerts the user does not have access to are filtered out. Lastly, we audit log only once for the whole bulk operation, not once per alert found.
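A condensed sketch of that flow (all dependency and helper names here are illustrative assumptions, not the actual AlertsClient implementation):

// Illustrative only: dependency names are assumptions, not the actual AlertsClient.
interface BulkUpdateTagsParams {
  index: string;
  alertIds?: string[];
  query?: string; // KQL
  addTags?: string[];
  removeTags?: string[];
}

interface Deps {
  // Aggregates rule type IDs and consumers for the given alert IDs.
  getRuleTypesAndConsumers(index: string, alertIds: string[]): Promise<Array<{ ruleTypeId: string; consumers: string[] }>>;
  // Throws if the user is not authorized for any (ruleTypeId, consumer) pair.
  bulkEnsureAuthorized(entries: Array<{ ruleTypeId: string; consumers: string[] }>): Promise<void>;
  // Returns an ES filter limiting results to alerts the user can access.
  getAuthorizationFilter(): Promise<object>;
  // Runs the tag update (update by query with the reusable Painless scripts).
  updateTagsByQuery(args: { index: string; query: object; addTags?: string[]; removeTags?: string[] }): Promise<void>;
  auditLog(message: string): void;
}

async function bulkUpdateTagsSketch(params: BulkUpdateTagsParams, deps: Deps): Promise<void> {
  if (params.alertIds?.length) {
    // IDs path: authorize the aggregated (ruleTypeId, consumer) pairs in bulk first.
    const entries = await deps.getRuleTypesAndConsumers(params.index, params.alertIds);
    await deps.bulkEnsureAuthorized(entries);
    await deps.updateTagsByQuery({
      index: params.index,
      query: { ids: { values: params.alertIds } },
      addTags: params.addTags,
      removeTags: params.removeTags,
    });
  } else {
    // Query path: combine the authorization filter with the caller's query so
    // inaccessible alerts are filtered out (KQL-to-DSL conversion elided here).
    const authorizationFilter = await deps.getAuthorizationFilter();
    await deps.updateTagsByQuery({
      index: params.index,
      query: { bool: { filter: [authorizationFilter, { query_string: { query: params.query } }] } },
      addTags: params.addTags,
      removeTags: params.removeTags,
    });
  }
  // One audit log entry for the whole bulk operation, not one per alert.
  deps.auditLog('bulk_update_alert_tags');
}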

API and Backend Enhancements

  • Added a new API route POST /internal/rac/alerts/tags for bulk updating alert tags, supporting add, remove, and replace operations, with validation and error handling (a sketch of the request body shape follows this list).
  • Implemented the bulkUpdateTags method in AlertsClient, enabling tag updates on alerts by IDs or query, using Elasticsearch scripts for efficient bulk operations.
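As a rough sketch of what a request to the new route might carry (field names are inferred from the summary above and the examples later in this thread; the final schema may differ):

// Assumed request body shape, inferred from this thread; not the final schema.
interface BulkUpdateAlertTagsRequestBody {
  index: string;        // alert index or alias to target
  alertIds?: string[];  // update by explicit alert IDs ...
  query?: string;       // ... or by a KQL query
  addTags?: string[];
  removeTags?: string[];
}

// Example body matching the shape used in the review comments below.
const exampleBody: BulkUpdateAlertTagsRequestBody = {
  index: '.alerts-observability.slo.alerts-default', // hypothetical concrete index/alias
  alertIds: ['30b3b23c-27f5-4064-9947-1ebe376d4561'],
  addTags: ['alert_tag'],
};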

Reusable Update Scripts

  • Added reusable Painless scripts for adding, removing, and replacing alert tags, exported from alert_client_bulk_update_scripts.ts and integrated into the client logic (a sketch of such a script constant follows below).
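For illustration, such a constant might look roughly like this (a sketch based on the script fragment quoted later in the review; the actual exported scripts may differ, and ALERT_WORKFLOW_TAGS is assumed to resolve to the alert workflow tags field):

// Sketch based on the fragment quoted later in this thread; the real
// alert_client_bulk_update_scripts.ts may differ.
const ALERT_WORKFLOW_TAGS = 'kibana.alert.workflow_tags';

export const ADD_TAGS_SCRIPT = `
  if (ctx._source['${ALERT_WORKFLOW_TAGS}'] == null) {
    ctx._source['${ALERT_WORKFLOW_TAGS}'] = new ArrayList();
  }
  for (item in params.addTags) {
    if (!ctx._source['${ALERT_WORKFLOW_TAGS}'].contains(item)) {
      ctx._source['${ALERT_WORKFLOW_TAGS}'].add(item);
    }
  }
`;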

@baileycash-elastic
Contributor Author

/ci

@github-actions github-actions bot added the author:obs-ux-management PRs authored by the obs ux management team label Nov 4, 2025
@baileycash-elastic baileycash-elastic force-pushed the alerting-240356-patch branch 2 times, most recently from 1f4e1b4 to a6c1886 Compare November 4, 2025 23:01
@baileycash-elastic baileycash-elastic added Team:actionable-obs Formerly "obs-ux-management", responsible for SLO, o11y alerting, significant events, & synthetics. Team:ResponseOps Platform ResponseOps team (formerly the Cases and Alerting teams) release_note:skip Skip the PR/issue when compiling release notes backport:skip This PR does not require backporting labels Nov 4, 2025
@baileycash-elastic baileycash-elastic marked this pull request as ready for review November 4, 2025 23:03
@baileycash-elastic baileycash-elastic requested review from a team as code owners November 4, 2025 23:03
@elasticmachine
Contributor

Pinging @elastic/response-ops (Team:ResponseOps)

@elasticmachine
Contributor

Pinging @elastic/obs-ux-management-team (Team:obs-ux-management)

validate: {
  body: buildRouteValidation(
    t.intersection([
      t.type({
Contributor

I don't understand why we need to provide the index. Shouldn't the index be derived by the alert? It feels like our internal system should be able to know and keep track of this.

Also, what if we are trying to bulk update alerts that are spread across multiple indices, for example, add a tag to both a synthetics and apm rule that are for the same service? cc: @cnasikas there may be some context I'm missing here.

Member

@cnasikas cnasikas Nov 7, 2025

The code was developed before ResponseOps, but I assume it is needed because it would be very inefficient to resolve the indices of the alerts from their IDs. You might be able to do a bulk get using an alias, but for the query param this is not feasible; you would need to do an aggregation (search) over all alerting indices, handle aggregation limits, etc. Behind the scenes, the bulk update in the alerts client uses update by query, which supports updating multiple documents through an alias. So you can set the index route param to .alerts-observability-* or even alerts-observability-logs*,alerts-observability-apm*.

Contributor

@dominiqueclarke dominiqueclarke Nov 7, 2025

I'm attempting this without success so far

PATCH kbn:/internal/rac/alerts/tags
{
    "index": ".alerts-*",
    "addTags": ["alert_tag"],
    "alertIds": ["30b3b23c-27f5-4064-9947-1ebe376d4561"]
}

Result is

{
  "statusCode": 404,
  "error": "Not Found",
  "message": "alerts with ids 30b3b23c-27f5-4064-9947-1ebe376d4561 and index .alerts-* not found"
}

Attempting to use the UUID

[Screenshot: 2025-11-07 at 10:47 AM]

Contributor

@cnasikas Testing this, it seems I'm not able to use an index pattern in the way you've suggested.

Contributor Author

Users will have to provide the full index due to constraints with esClient.

Contributor

So from the UI, we will have to segment bulk requests so that we have one bulk request per index, it would seem.

Member

You are right, mget does not support index patterns. To avoid having the UI make multiple requests, instead of an mget we can do an aggregation to get the rule type IDs and consumers. Then we can construct the pairs and pass them to the ensureAllAuthorized. The index would still be needed to target the o11y + stack alerts (and not modify the security alerts), but it can be optional and fall back to .alerts-*.

GET .alerts-*/_search
{
  "query": {
    "ids": {
      "values": [
        "36d6d650-1204-44f5-b105-3af0c5d5acde"
      ]
    }
  },
  "aggs": {
    "ruleTypeId": {
      "terms": {
        "field": "kibana.alert.rule.rule_type_id",
        "size": 100
      },
      "aggs": {
        "consumer": {
          "terms": {
            "field": "kibana.alert.rule.consumer",
            "size": 100
          }
        }
      }
    }
  },
  "size": 0
}

@baileycash-elastic
Contributor Author

@elasticmachine merge upstream

});
});

describe('failure scenarios', () => {
Contributor

Could we please also mock the Not Found failure scenario?

noKibanaPrivileges,
];

addTests({
Contributor

Nice, I like how these are structured 🔥

.set('kbn-xsrf', 'true')
.send({
  alertIds: [alertId],
  addTags: ['new-tag'],
Contributor

I noticed all of these only use the addTags param. Could you please add one scenario for tag removal? (No need to duplicate all tests; just one that bulk removes alert tags would be enough.)

  ctx._source['${ALERT_WORKFLOW_TAGS}'] = new ArrayList();
}
for (item in params.addTags) {
  if (!ctx._source['${ALERT_WORKFLOW_TAGS}'].contains(item)) {
Contributor

I think this is not enough; if I use spaces, I am able to create duplicates:

[Screenshot: 2025-11-07 at 13:34]

Should we always trim the tags before applying?
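For example, a small normalization step on the Kibana side before the tags reach the scripts (a sketch, not part of this PR) would make ' alert_tag' and 'alert_tag' collapse into one value:

// Sketch only, not part of this PR: trim and de-duplicate tags before they are
// passed to the Painless scripts, so values differing only by whitespace collapse.
const normalizeTags = (tags: string[]): string[] =>
  Array.from(new Set(tags.map((tag) => tag.trim()).filter((tag) => tag.length > 0)));

normalizeTags([' alert_tag', 'alert_tag', '  ']); // ['alert_tag']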

@dominiqueclarke dominiqueclarke self-requested a review November 7, 2025 14:29
@dominiqueclarke
Contributor

The Sec alert was updated but not the O11y one, which is expected given the Security index used in the request. The response, though, showed a success result for the O11y alert too.

@cnasikas I was able to recreate this too. The difference between the success count and the length of the results array is confusing from a user-experience standpoint, and the "success" status is misleading.

Contributor

@dominiqueclarke dominiqueclarke left a comment

My user has access to the not_default space; I'm using the superuser. There are no alerts in that space, and definitely not the alerts with the UUIDs I provided. However, 403 seems like the wrong response to return here.

[Screenshot: 2025-11-18 at 11:18 AM]

@dominiqueclarke dominiqueclarke self-requested a review November 18, 2025 18:14
Contributor

@dominiqueclarke dominiqueclarke left a comment

When you use the query option, the results array is empty. Maybe it doesn't make sense to include an array item for every individual instance that matches the query, but perhaps we could return a single item for the query with status: success?

Contributor

@dominiqueclarke dominiqueclarke left a comment

Blocking on the bug reported by Umberto and me, where the success status can be incorrect.

@cnasikas
Member

cnasikas commented Nov 19, 2025

EDIT: When I tried to update the tags of all the o11y rules I created, only 3 out of 4 were actually updated:

@umbopepato Which user did you use and what is the consumer of each ES rule?

The Sec alert was updated but not the O11y one, which is expected given the Security index used in the request. The response, though, showed a success result for the O11y alert too.

@cnasikas I was able to recreate this too. The difference between the success count and the length of the results array is confusing from a user-experience standpoint, and the "success" status is misleading.

When you use the query option, the results array is empty. Maybe it doesn't make sense to include an array item for every individual instance that matches the query, but perhaps we could return a single item for the query with status: success?

@umbopepato @dominiqueclarke Given all the misleading findings, I would suggest removing the results array from the response entirely and returning a failures array only when failures exist. It is not possible to know which alerts were updated and which were not with an updateByQuery, and doing another call to find out is not performant and would increase the complexity of the code. Wdyt?
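Shape-wise, something roughly like this (a sketch; field names are assumptions, not the final response):

// Assumed response shape for the suggestion above; names are illustrative.
interface BulkUpdateAlertTagsResponse {
  total: number;   // documents that matched the query
  updated: number; // documents actually updated
  // Only populated when ES reports per-document failures (e.g. version conflicts).
  failures?: Array<{ id: string; message: string }>;
}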

My user has access to the not_default space; I'm using the superuser. There are no alerts in that space, and definitely not the alerts with the UUIDs I provided. However, 403 seems like the wrong response to return here.

I believe that the response is correct here. You are not authorized to perform any action on that space, so a 403 indicates that.

@umbopepato
Member

@umbopepato Which user did you use and what is the consumer of each ES rule?

I used the default elastic superuser

@dominiqueclarke
Contributor

I believe that the response is correct here. You are not authorized to perform any action on that space, so a 403 indicates that.

I'm not sure I fully understand the rationale, but this particular issue is not as critical to me as the other. My user does technically have access to perform all actions in that space (they are a superuser); it's just that there are no alerts in that space. It's more of a 404.

@dominiqueclarke
Contributor

@umbopepato @dominiqueclarke Given all the misleading findings, I would suggest removing the results array from the response entirely and returning a failures array only when failures exist. It is not possible to know which alerts were updated and which were not with an updateByQuery, and doing another call to find out is not performant and would increase the complexity of the code. Wdyt?

Fine by me. That'll help us iterate over the failures to populate the errors in the UI.

@cnasikas
Member

I'm not sure I fully understand the rationale, but this particular issue is not as critical to me as the other. My user does technically have access to perform all actions in that space (they are a superuser); it's just that there are no alerts in that space. It's more of a 404.

Oh, I see what you mean. I thought you did NOT have access to that space. Let me investigate, and I will come back to you.

@cnasikas
Member

@dominiqueclarke I changed the response to return only the failures if they exist. I also return a 404 for the scenario with alerts from a different space. @umbopepato I could not reproduce your bug. Was it a refresh issue with the alerts table?

@elastic-vault-github-plugin-prod elastic-vault-github-plugin-prod bot requested a review from a team as a code owner November 19, 2025 16:02
@cnasikas cnasikas self-assigned this Nov 20, 2025
@cnasikas
Member

@elasticmachine merge upstream

@dominiqueclarke
Contributor

@dominiqueclarke I changed the response to return only the failures if they exist. I also return a 404 for the scenario with alerts from a different space.

I'm really sorry to report that I'm having difficulties with returning failures when there are any. Here I have two IDs, and I'm querying against the SLO alert indices. One ID is from an SLO alert and the other is from a synthetics alert. As you can see, it's reporting total: 1, updated: 1, without reporting any failures.

[Screenshot: 2025-11-20 at 9:36 PM]

@cnasikas
Member

cnasikas commented Nov 21, 2025

I'm really sorry to report that I'm having difficulties with returning failures when there are any. Here I have two IDs, and I'm querying against the SLO alert indices. One ID is from an SLO alert and the other is from a synthetics alert. As you can see, it's reporting total: 1, updated: 1, without reporting any failures.

This is the expected response. We are using updateByQuery behind the scenes, which will update any alert that matches the query: { ids: { values: ['alert-id-1', ....] } }. ES will not report the docs that did not match the filter as failures; it only reports failures for docs that matched the filter but that ES could not update for some reason, such as conflict errors. We cannot throw an error when a doc is missing from the index by doing an mget (ES bulk get) before the query, because mget does not support index aliases. Lastly, the reason we are using updateByQuery for IDs is to be able to support index aliases. If the API becomes public in the future, we can document this behavior.
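To illustrate (a hedged sketch using the Elasticsearch JS client directly, not the AlertsClient code; the index and tag values are placeholders):

import { Client } from '@elastic/elasticsearch';

// Sketch only: shows why a non-matching ID is not reported as a failure.
async function exampleUpdateByQuery(client: Client) {
  const response = await client.updateByQuery({
    index: '.alerts-observability.slo.alerts-default', // index or alias (hypothetical)
    conflicts: 'proceed',
    query: { ids: { values: ['alert-id-1', 'alert-id-2'] } },
    script: {
      lang: 'painless',
      source:
        "if (ctx._source['kibana.alert.workflow_tags'] == null) { ctx._source['kibana.alert.workflow_tags'] = new ArrayList(); } ctx._source['kibana.alert.workflow_tags'].addAll(params.tags);",
      params: { tags: ['alert_tag'] },
    },
  });

  // total/updated only count docs that matched the ids filter; an ID that matched
  // nothing simply lowers the total, while failures lists docs that matched but
  // could not be updated (e.g. version conflicts).
  console.log(response.total, response.updated, response.failures);
}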

Contributor

@dominiqueclarke dominiqueclarke left a comment

We discussed and will move forward with merging this PR.

@cnasikas cnasikas removed the request for review from a team November 22, 2025 12:03
@elasticmachine
Contributor

💚 Build Succeeded

Metrics

Public APIs missing comments

Total count of every public API that lacks a comment. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats comments for more detailed information.

id before after diff
alerting 886 888 +2
ruleRegistry 210 212 +2
total +4

Public APIs missing exports

Total count of every type that is part of your API that should be exported but is not. This will cause broken links in the API documentation system. Target amount is 0. Run node scripts/build_api_docs --plugin [yourplugin] --stats exports for more detailed information.

id before after diff
alerting 62 63 +1
ruleRegistry 8 10 +2
total +3
Unknown metric groups

API count

id before after diff
alerting 923 925 +2
ruleRegistry 248 250 +2
total +4

ESLint disabled line counts

id before after diff
@kbn/test-suites-xpack-platform 160 161 +1

Total ESLint disabled count

id before after diff
@kbn/test-suites-xpack-platform 170 171 +1

History

cc @cnasikas

@cnasikas cnasikas merged commit 67de5ef into elastic:main Nov 22, 2025
12 checks passed

Development

Successfully merging this pull request may close these issues.

[Alert Tagging] Enhance bulk update api to support bulk tagging
