Agent - Custom blocklist matches (blocked terms)

Agent Blocklist Matches

Query

let window = 1d;
AppDependencies
| where TimeGenerated > ago(window)
| where isnotempty(Properties["microsoft.foundry.content_filter.results"])
| extend
    Agent     = tostring(Properties["gen_ai.agent.name"]),
    Model     = tostring(Properties["gen_ai.request.model"]),
    ConvId    = tostring(Properties["gen_ai.conversation.id"]),
    ProjectId = tostring(Properties["microsoft.foundry.project.id"]),
    Prompt    = tostring(Properties["gen_ai.input.messages"]),
    FilterArr = todynamic(tostring(Properties["microsoft.foundry.content_filter.results"]))
| mv-expand Entry = FilterArr
| extend
    SourceType = tostring(Entry.source_type),
    Cfr        = todynamic(Entry.content_filter_results)
// Custom blocklists surface as named keys inside content_filter_results
// (e.g. "Profanity", or your own blocklist name) alongside the standard
// harm categories. Treat any non-standard key that tripped as a blocklist
// hit so the rule works regardless of the blocklist name.
| mv-expand Key = bag_keys(Cfr)
| extend BlockListName = tostring(Key)
| where BlockListName !in~ ("jailbreak", "prompt_shield", "indirect_attack", "hate", "sexual", "violence", "self_harm", "protected_material", "protected_material_code", "protected_material_text")
| extend Verdict = todynamic(Cfr[BlockListName])
| where tobool(Verdict.filtered) or tobool(Verdict.detected)
| project
    TimeGenerated, SourceType, Agent, Model, ProjectId, ConvId,
    BlockListName, VerdictJson = tostring(Verdict), Prompt
| order by TimeGenerated desc

Explanation

This query is designed to monitor and analyze instances where a custom blocklist, defined by an organization, is triggered within the Azure AI Content Safety system. Here's a simplified breakdown of what the query does:

Time Frame: It looks at data from the past day (1d).
Data Source: It examines telemetry data from the AppDependencies table, specifically focusing on entries where the content filter results are not empty.
Data Extraction: The query extracts various pieces of information from the telemetry data, such as:
- The name of the AI agent involved.
- The model used for the AI request.
- The conversation ID and project ID.
- The prompt or input messages that were processed.
- The results of the content filter, which includes details about any blocklist matches.
Blocklist Analysis:
- It expands the content filter results to identify specific entries related to blocklist matches.
- It checks for any custom blocklist names that are not part of a predefined list of standard harm categories (like "jailbreak", "hate", "violence", etc.).
- It identifies whether any of these custom blocklist terms were flagged as filtered or detected.
Output: The query projects relevant information, such as the time of the event, source type, agent, model, project ID, conversation ID, blocklist name, verdict details, and the original prompt. The results are sorted by the time the event was generated, in descending order.
Purpose: This query helps confirm that custom blocklists are functioning as intended, identifies users who might be testing the boundaries of these blocklists, and provides insights into which terms are most frequently triggering the blocklists.
Security Context: The query is associated with tactics like Execution and Defense Evasion, and techniques such as T1059 (Command and Scripting Interpreter) and T1562 (Impair Defenses). It is tagged with keywords related to Sentinel, custom configurations, Foundry, AI, content safety, and blocklists.

Details

David Alonso

Released: June 8, 2026

Tables

AppDependencies

Keywords

AppDependenciesPropertiesAgentModelConvIdProjectIdPromptFilterArrEntrySourceTypeCfrBlockListNameVerdictTimeGeneratedVerdictJson

Operators

let|whereisnotemptyextendtostringtodynamicmv-expandbag_keys!in~toboolprojectorder bydesc

Tactics

ExecutionDefenseEvasion

MITRE Techniques

T1059 T1562

Actions

GitHub

KQL Search