Access Control for LLM Agent Tools Built with OPA — Designing a Runtime Context-Based Dynamic Policy Engine
Is it okay for this user to call this tool right now?
To answer this question in an MCP (Model Context Protocol) environment where LLM agents freely call external tools, many teams apply OAuth 2.0 scopes as is. MCP is an open standard announced by Anthropic in November 2024 that standardizes how LLMs communicate with tools such as file systems, databases, and external APIs. However, the standard specification itself delegates policy enforcement to the application layer. Due to this structural gap, static OAuth scopes alone cannot meet realistic requirements, such as "blocking financial inquiries between 00:00 and 06:00" or "allowing analysts to access only data with a sensitivity of less than 3." This is because runtime context, such as the user's role, call time, data sensitivity, and agent identifier, is currently missing from the policy decision process.
More serious threats grow from the same structural flaw. In the MCP ecosystem, attack vectors such as Tool Squatting and Rug Pull have emerged as substantial risks — to the extent that they have been catalogued in MITRE ATLAS since 2025. The problem that static scopes fail to reflect runtime context, and the problem that tool definitions themselves can be squatted on or replaced, both stem from the same design decision in the MCP specification: delegating the responsibility for verification to outer layers.
In this article, we explore how to design and implement a dual-defense architecture that statically guarantees the reliability of tool definitions using ETDI (Enhanced Tool Definition Interface) and dynamically evaluates the runtime context using OPA (Open Policy Agent). After reading this article, you will be able to apply Rego policy writing, FastMCP hook integration, JWT-based tool signature verification, and caching strategies to actual projects.
Key Concepts
Security Vacancy in MCP Tool Calls
The MCP acts as a "USB hub" through which the LLM communicates with the outside world. When the LLM calls a tool such as query_database, the MCP server connects to the actual database and returns the result. The problem is that there is no standardized answer in the MCP specification regarding "which user can call this tool under what conditions."
Limitations of the existing OAuth 2.0 scope method:
| Requirement | OAuth Scope Approach | Runtime Context Needed? |
|---|---|---|
| Block financial queries between 00:00 and 06:00 | Not possible (no time information) | Yes |
| Block analysts from data with sensitivity 3 or higher | Not possible (no data attributes) | Yes |
| Block tool access for specific agent versions | Not possible (no agent identifier) | Yes |
| Block sessions suspected of prompt injection | Not possible (no session state) | Yes |
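The gap in the table above can be sketched in a few lines of Python (illustrative only — the function names and rules here are hypothetical): a static scope check can only test set membership, while a context-aware check can layer time and data-attribute rules on top of the same scopes.

```python
from datetime import datetime, timezone

def scope_allows(granted_scopes: set[str], required_scope: str) -> bool:
    """Static OAuth-style check: pure set membership, no runtime context."""
    return required_scope in granted_scopes

def context_allows(scopes: set[str], required: str, role: str,
                   sensitivity: int, now: datetime) -> bool:
    """Context-aware check: adds time and data-attribute rules on top of scopes."""
    if required not in scopes:
        return False
    if now.hour < 6:  # block financial queries between 00:00 and 06:00 UTC
        return False
    if role == "analyst" and sensitivity >= 3:  # analysts: sensitivity below 3 only
        return False
    return True

scopes = {"finance:read"}
night = datetime(2026, 4, 14, 2, 0, tzinfo=timezone.utc)
day = datetime(2026, 4, 14, 10, 0, tzinfo=timezone.utc)

print(scope_allows(scopes, "finance:read"))                         # True — scope alone says yes
print(context_allows(scopes, "finance:read", "analyst", 2, night))  # False — nighttime block
print(context_allows(scopes, "finance:read", "analyst", 2, day))    # True
print(context_allows(scopes, "finance:read", "analyst", 3, day))    # False — sensitivity block
```

The scope check answers only "does this user hold this permission?"; every row of the table needs the extra inputs that only the second function receives.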
In addition to this, there are two structural attacks pointed out by the ETDI paper (arXiv:2506.01333).
Tool Squatting: An attack that registers a malicious tool on the MCP server under the same name as a legitimate tool (query_database), impersonating it so the LLM cannot tell the two apart.
Rug Pull: An attack that, after deployment, maliciously swaps out a tool definition the user initially approved as safe. It is hard to detect because the swap happens without re-approval.
OPA — Separation of Policy Decision-Making and Implementation
OPA is a CNCF graduation project and a general-purpose policy engine that completely separates policy logic from application code. The core model is simple.
Input (JSON) + Policy (Rego) + Data (JSON) → Decision (allow/deny)

The service sends the current context to OPA as JSON, and OPA evaluates the policy written in Rego and returns allow or deny. Security policies take effect immediately by swapping out the Rego files — no code redeployment required.
How to Read Rego: Like SQL, Rego is a declarative language — you state what you want, not how to compute it. allow if { conditions } yields allow when every condition in the body is true. If there are multiple allow rules, the result is allow when at least one of them holds (they are OR-combined). default allow := false treats every case the rules do not cover as a block.
The Rego example below is a policy that combines role, time, data sensitivity, and agent trust.
# policy/mcp_tool_access.rego
package mcp.tool.access
import future.keywords.if
import future.keywords.in
default allow := false
# Base allow rule: all conditions below must hold (AND)
allow if {
valid_role
within_working_hours
data_sensitivity_ok
trusted_agent
}
# Role check
valid_role if {
input.principal.role in {"analyst", "engineer", "admin"}
}
# Working-hours check (00:00–17:00 UTC)
# Note: env.time must be populated by the server in UTC.
# If the server's local timezone is not UTC, prefer injecting time.now_ns() on the server side instead.
within_working_hours if {
hour := time.clock(time.parse_rfc3339_ns(input.env.time))[0]
hour >= 0
hour < 17
}
# Data-sensitivity check (analysts may access only sensitivity below 3)
data_sensitivity_ok if {
input.principal.role == "analyst"
input.tool.data_sensitivity < 3
}
data_sensitivity_ok if {
input.principal.role in {"engineer", "admin"}
}
# Trusted-agent check (data.trusted_agents is managed as external JSON)
trusted_agent if {
input.client.agent_id in data.trusted_agents
}

Policy Decision Point (PDP): The component responsible for policy "decisions." By separating it from the Policy Enforcement Point (PEP), which actually "enforces" those decisions, policy logic can be centralized and reused across multiple services.
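To sanity-check how the Rego above combines its rules, here is an illustrative pure-Python mirror of the same logic (an assumption of this sketch: a local trusted_agents set stands in for the external data.trusted_agents document; in reality OPA, not this code, evaluates the policy). Note that the two data_sensitivity_ok rules OR-combine, while the allow body is an AND.

```python
from datetime import datetime

trusted_agents = {"claude-agent-v3.1"}  # stands in for data.trusted_agents

def data_sensitivity_ok(role: str, sensitivity: int) -> bool:
    # Two Rego rules sharing one name are OR-combined
    if role == "analyst" and sensitivity < 3:
        return True
    if role in {"engineer", "admin"}:
        return True
    return False

def allow(inp: dict) -> bool:
    # Every condition in the Rego allow body must hold (AND)
    role = inp["principal"]["role"]
    hour = datetime.fromisoformat(inp["env"]["time"]).hour
    return (
        role in {"analyst", "engineer", "admin"}
        and 0 <= hour < 17
        and data_sensitivity_ok(role, inp["tool"]["data_sensitivity"])
        and inp["client"]["agent_id"] in trusted_agents
    )

ok = {
    "principal": {"role": "analyst"},
    "client": {"agent_id": "claude-agent-v3.1"},
    "tool": {"data_sensitivity": 2},
    "env": {"time": "2026-04-14T10:00:00"},
}
print(allow(ok))                                        # True
print(allow({**ok, "tool": {"data_sensitivity": 3}}))   # False — analyst over the limit
```

Flipping any single input — role, hour, sensitivity, or agent — to a non-matching value flips the decision to False, which is exactly the default-deny behavior of default allow := false.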
ETDI — A Cryptographic Trust Chain for Tool Definitions
ETDI defends against Tool Squatting and Rug Pull with three pillars.
1. Cryptographic Signature: The tool provider signs the tool definition with an RSA private key, and the client verifies it with a public key. The signature is delivered in the format of a JSON Web Token (JWT).
2. Immutable Version Management: Issuing a new version and re-approving the user are required when changing tool definitions, code, or permissions.
3. JWT-based authorization scope: Specifies the tool's authorization scope using a signed JWT.
JWT Signature Structure: A JWT consists of three parts. The Header (algorithm specification) and Payload (claims) are Base64 encoded, and a value signed with a private key is attached as the third part (Signature). The signature is not included as a separate field within the payload but is embedded within the JWT structure itself.
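To make the three-part structure concrete, the sketch below assembles a minimal JWT-shaped token using only the standard library (an assumption of this demo: HS256 with a throwaway secret is used purely to stay dependency-free — ETDI tokens as described in this article use RS256).

```python
import base64
import hashlib
import hmac
import json

def b64url(data: bytes) -> str:
    """Base64url-encode without padding, as JWT requires."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode()

def pad(s: str) -> str:
    """Restore Base64 padding for decoding."""
    return s + "=" * (-len(s) % 4)

# NOTE: ETDI uses RS256; HS256 with a demo secret keeps this sketch stdlib-only.
header = {"alg": "HS256", "typ": "JWT"}
payload = {"tool_id": "query_database_v2", "provider": "internal-data-team", "version": "2.1.0"}

signing_input = f"{b64url(json.dumps(header).encode())}.{b64url(json.dumps(payload).encode())}"
signature = b64url(hmac.new(b"demo-secret", signing_input.encode(), hashlib.sha256).digest())
token = f"{signing_input}.{signature}"  # Header.Payload.Signature

print(token.count("."))  # 2 — three dot-separated parts
# The payload is only Base64-encoded, not encrypted — anyone can read it:
decoded = json.loads(base64.urlsafe_b64decode(pad(token.split(".")[1])))
print(decoded["tool_id"])  # query_database_v2
```

The signature covers the exact bytes of `Header.Payload`, so any change to a claim such as `version` invalidates the third part — this is what makes Rug Pull tampering detectable.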
# ETDI tool definition — JWT payload (decoded)
{
"tool_id": "query_database_v2",
"provider": "internal-data-team",
"version": "2.1.0",
"permissions": ["db:read", "schema:read"],
"issued_at": "2026-04-14T00:00:00Z"
}
# Actual wire format (Header.Payload.Signature):
# eyJhbGciOiJSUzI1NiIsInR5cCI6IkpXVCJ9.eyJ0b29sX2lkIjoicXVlcnlfZGF0YWJhc2VfdjIiLC4uLn0.<signature>

Practical Application
Examples 1 through 4 build up a complete architecture. Example 1 (OPA Integration Hook) creates the basic policy-enforcement layer, Example 2 (ETDI Verification) adds tool-trust verification in front of it, Example 3 (Caching) reduces latency in high-frequency environments, and Example 4 (Integrated Input Structure) completes the context schema passed to OPA.
The overall flow is as follows.
[LLM Agent]
↓ tool_call request
[MCP Host/Broker]
↓ ① ETDI signature verification (static layer)
├─ Validate the JWT signature
├─ Check version immutability (Rug Pull detection)
└─ Verify provider identity (Tool Squatting detection)
└─ On verification failure → deny + security log
↓ ② OPA PDP query (dynamic layer)
├─ Evaluate user role and attributes
├─ Evaluate time and environment context
└─ Evaluate agent identifier
└─ On deny → block + audit log
↓ allow
[MCP Server] → execute tool

Example 1: FastMCP + OPA Sidecar Integration
This pattern queries the OPA HTTP API before a FastMCP tool executes. Deploying OPA as a sidecar on the same host minimizes network latency.
# mcp_server.py
import httpx
from datetime import datetime, timezone
from fastmcp import FastMCP, Context
from fastmcp.exceptions import ToolError
mcp = FastMCP("secure-data-server")
OPA_URL = "http://localhost:8181/v1/data/mcp/tool/access/allow"
async def check_policy(
principal: dict,
client: dict,
tool_name: str,
tool_metadata: dict,
args: dict,
) -> bool:
    """Send a policy-evaluation request to the OPA PDP."""
input_payload = {
"input": {
"principal": principal,
"client": client,
"tool": {
"name": tool_name,
**tool_metadata,
},
"args": args,
"env": {
                # Environment info must be populated directly on the server side.
                # Trusting client- or agent-supplied values as-is enables policy bypass.
"time": datetime.now(timezone.utc).isoformat(),
"network": "internal",
},
}
}
    async with httpx.AsyncClient(timeout=0.5) as http:  # 500 ms timeout
try:
resp = await http.post(OPA_URL, json=input_payload)
resp.raise_for_status()
return resp.json().get("result", False)
except (httpx.TimeoutException, httpx.HTTPError):
            # On OPA failure, fall back to deny rather than allow — the deny-safe principle
return False
@mcp.tool()
async def query_financial_records(
table: str,
filter_expr: str,
    ctx: Context,  # FastMCP injects the Context object automatically
) -> dict:
    """Queries financial records."""
    # How to extract caller info from Context depends on the MCP server implementation.
    # Below, we use values injected into ctx by authentication middleware.
principal = getattr(ctx, "principal", {"role": "unknown"})
client_info = getattr(ctx, "client", {"agent_id": "unknown"})
allowed = await check_policy(
principal=principal,
client=client_info,
tool_name="query_financial_records",
tool_metadata={"server": "postgres-mcp", "data_sensitivity": 3},
args={"table": table, "filter": filter_expr},
)
    if not allowed:
        # FastMCP's ToolError takes a plain message; the denial code is embedded in it.
        raise ToolError(
            "[POLICY_DENIED] Access denied: the current context does not grant "
            "permission to use this tool."
        )
    # Actual DB query logic goes here
    return {"records": [], "total": 0}

# pyproject.toml — minimal dependencies
[project]
dependencies = [
"fastmcp>=2.0",
"httpx>=0.27",
"pyjwt[crypto]>=2.8",
"cryptography>=41.0",
]

| Code Section | Explanation |
|---|---|
| OPA_URL | Queries a specific OPA policy path directly: /v1/data/{package path}/allow |
| timeout=0.5 | Treats the call as failed after 500 ms, protecting agent responsiveness |
| return False (fallback) | Falls back to deny rather than allow on OPA failure — the deny-safe principle |
| Server-side injection of env.time | Populating the time in UTC on the server prevents context manipulation |
| ctx: Context | FastMCP's standard context-injection mechanism (_ctx is treated as a private parameter and excluded from injection) |
Example 2: ETDI Signature Verification Middleware
This Python middleware verifies the tool JWT's signature before the OPA query.
RSA Signature Verification: The provider signs the JWT with an RSA private key, and the verifier checks that the signature has not been forged using the provider's public key (PEM format). algorithms=["RS256"] specifies a SHA-256-based RSA signature algorithm. The PEM format is a text encoding scheme starting with -----BEGIN PUBLIC KEY-----.
# etdi_verifier.py
import jwt
from cryptography.hazmat.primitives import serialization
from dataclasses import dataclass
@dataclass
class ETDIToolDefinition:
tool_id: str
provider: str
version: str
permissions: list[str]
    raw_token: str  # original JWT string (Header.Payload.Signature)
class ETDIVerificationError(Exception):
pass
class ETDIVerifier:
def __init__(self, trusted_providers: dict[str, str]):
"""
        trusted_providers: {provider_id: PEM-format public-key string}
"""
        # With cryptography 41+, load_pem_public_key is called without a backend argument.
self._keys = {
pid: serialization.load_pem_public_key(pem.encode())
for pid, pem in trusted_providers.items()
}
        # Version registry: {tool_id: approved version}
        # In production, persist this to a DB or Redis and restore it on restart.
self._approved_versions: dict[str, str] = {}
def verify(self, token: str) -> ETDIToolDefinition:
        """Verify the JWT signature and return an ETDIToolDefinition."""
        # Step 1: read the provider ID from the unverified claims (before signature verification)
unverified = jwt.decode(token, options={"verify_signature": False})
provider_id = unverified.get("provider")
if provider_id not in self._keys:
raise ETDIVerificationError(
                f"Unknown provider: {provider_id}. Possible Tool Squatting."
)
        # Step 2: verify the signature with the public key (detects JWT forgery and tampering)
try:
payload = jwt.decode(
token,
key=self._keys[provider_id],
algorithms=["RS256"],
)
except jwt.InvalidSignatureError as e:
            raise ETDIVerificationError("Tool signature verification failed: suspected Rug Pull attack") from e
tool_id = payload["tool_id"]
current_version = payload["version"]
        # Step 3: check version immutability (Rug Pull detection)
if tool_id in self._approved_versions:
approved = self._approved_versions[tool_id]
if approved != current_version:
raise ETDIVerificationError(
                    f"Tool version mismatch — approved version: {approved}, "
                    f"current version: {current_version}. Re-approval is required."
)
return ETDIToolDefinition(
tool_id=tool_id,
provider=provider_id,
version=current_version,
permissions=payload.get("permissions", []),
raw_token=token,
)
def approve_version(self, tool_id: str, version: str) -> None:
        """Explicitly approve a specific tool version on the user's behalf."""
        self._approved_versions[tool_id] = version

Example 3: OPA Policy Decision Caching
In a high-frequency tool call environment, querying the OPA REST API with every call accumulates latency. Performance can be improved by adding a TTL-based decision cache.
# policy_cache.py
import asyncio
import hashlib
import json
import time
from typing import Any
class PolicyDecisionCache:
    """Caches OPA policy-decision results with a TTL."""
def __init__(self, default_ttl: float = 30.0):
self._cache: dict[str, tuple[bool, float]] = {}
self._default_ttl = default_ttl
self._lock = asyncio.Lock()
def _make_key(self, input_data: dict[str, Any]) -> str:
serialized = json.dumps(input_data, sort_keys=True)
return hashlib.sha256(serialized.encode()).hexdigest()
async def get(self, input_data: dict[str, Any]) -> bool | None:
key = self._make_key(input_data)
async with self._lock:
if key in self._cache:
result, expires_at = self._cache[key]
if time.monotonic() < expires_at:
return result
del self._cache[key]
        return None  # cache miss
async def set(
self,
input_data: dict[str, Any],
decision: bool,
ttl: float | None = None,
) -> None:
key = self._make_key(input_data)
expires_at = time.monotonic() + (ttl or self._default_ttl)
async with self._lock:
self._cache[key] = (decision, expires_at)
async def invalidate_all(self) -> None:
        """Invalidate the entire cache when the policy bundle is refreshed."""
async with self._lock:
            self._cache.clear()

# opa_client.py — OPA client with decision caching
import httpx
from policy_cache import PolicyDecisionCache
class OPAClient:
def __init__(
self,
opa_url: str = "http://localhost:8181",
cache_ttl: float = 30.0,
):
self._base_url = opa_url
self._cache = PolicyDecisionCache(default_ttl=cache_ttl)
async def decide(
self,
        policy_path: str,  # e.g. "mcp.tool.access" → /v1/data/mcp/tool/access
input_data: dict,
ttl_override: float | None = None,
) -> bool:
cached = await self._cache.get(input_data)
if cached is not None:
return cached
        # Convert the OPA Rego package name (dot-separated) to a URL path (slash-separated):
        # "mcp.tool.access" → "/v1/data/mcp/tool/access"
path_segment = policy_path.replace(".", "/")
url = f"{self._base_url}/v1/data/{path_segment}"
async with httpx.AsyncClient(timeout=0.5) as client:
resp = await client.post(url, json={"input": input_data})
resp.raise_for_status()
decision: bool = resp.json().get("result", False)
        # Deny results get a short TTL so that policy changes propagate quickly.
effective_ttl = ttl_override or (5.0 if not decision else 30.0)
await self._cache.set(input_data, decision, ttl=effective_ttl)
        return decision

Short TTL for deny results: "deny" decisions expire after 5 seconds so that policy changes propagate quickly, while "allow" decisions use a 30-second TTL to reduce overhead from high-frequency calls.
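The cache key in Example 3 works because json.dumps(..., sort_keys=True) produces a canonical serialization: logically identical inputs map to the same SHA-256 key regardless of dict ordering. A quick standalone sketch (the sample inputs are hypothetical):

```python
import hashlib
import json

def make_key(input_data: dict) -> str:
    # Canonical serialization: sort_keys makes dict ordering irrelevant
    return hashlib.sha256(json.dumps(input_data, sort_keys=True).encode()).hexdigest()

a = {"principal": {"role": "analyst"}, "tool": {"name": "query_db"}}
b = {"tool": {"name": "query_db"}, "principal": {"role": "analyst"}}  # same content, different order
c = {"principal": {"role": "admin"}, "tool": {"name": "query_db"}}

print(make_key(a) == make_key(b))  # True — same cache entry
print(make_key(a) == make_key(c))  # False — different context, different entry
```

One caveat: a volatile field like env.time with per-call precision would make every key unique and defeat the cache entirely, so consider excluding or bucketing such fields from the hashed input.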
Example 4: Designing an Integrated Policy Input Structure
For OPA to evaluate runtime context in depth, the design of the input structure is crucial. Below is an example of a production-grade input schema.
{
"principal": {
"id": "user-123",
"role": "analyst",
"department": "finance",
"clearance_level": 2
},
"client": {
"agent_id": "claude-agent-v3.1",
"session_id": "sess_abc123",
"session_age_minutes": 15,
"ip_address": "10.0.1.45"
},
"tool": {
"name": "query_financial_records",
"server": "postgres-mcp",
"version": "2.1.0",
"data_sensitivity": 3,
"provider": "internal-data-team"
},
"args": {
"table": "revenue_q1_2026",
"filter": "region = 'APAC'"
},
"env": {
"time": "2026-04-14T10:30:00Z",
"network_zone": "internal",
"request_id": "req_xyz789"
}
}

A Rego policy receiving this input can evaluate complex combinations of conditions.
# policy/financial_tools.rego
package mcp.tool.financial
import future.keywords.if
import future.keywords.in
default allow := false
# Conditions for financial-tool access (all required — AND)
allow if {
authorized_role
sensitivity_within_clearance
within_business_hours
trusted_agent_version
not suspicious_session
}
authorized_role if {
input.principal.role in {"analyst", "finance_manager", "auditor"}
}
# The user's clearance level must be at least the tool's sensitivity
sensitivity_within_clearance if {
input.principal.clearance_level >= input.tool.data_sensitivity
}
# Note: env.time must be populated by the server in UTC. Never use client-supplied values.
within_business_hours if {
hour := time.clock(time.parse_rfc3339_ns(input.env.time))[0]
hour >= 9
hour < 18
}
# List of trusted agent versions (managed externally via data.json)
trusted_agent_version if {
input.client.agent_id in data.trusted_agent_ids
}
# High-sensitivity access from sessions older than 30 minutes requires re-authentication
suspicious_session if {
input.tool.data_sensitivity >= 4
input.client.session_age_minutes > 30
}

Pros and Cons Analysis
Advantages
| Item | Content | Conditions/Limitations |
|---|---|---|
| Hierarchical Defense | Defends against Tool Squatting, Rug Pull, and Runtime Privilege Abuse respectively with a dual layer of ETDI (Static) + OPA (Dynamic) | Both layers must be implemented to complete protection |
| Policy Separation | Immediately reflect security policies by updating only Rego files without code redeployment | Requires implementation of cache invalidation hook after policy changes |
| Expressiveness | Supports complex condition combinations such as RBAC, ABAC, time-based, location-based, and sensitivity-based | Complex Rego policies become difficult to maintain without unit tests |
| Ecosystem Maturity | OPA is a CNCF-graduated project proven in major production environments such as Netflix, Goldman Sachs, Google Cloud, and T-Mobile | OPA's founders moved to Apple in 2025 — monitoring the community's direction is recommended |
| Audit Trail | Supports regulatory compliance such as SOC2, ISO27001, by logging policy evaluation results before every tool call | — |
| Prompt Injection Defense | Policy is enforced at the tool-call layer rather than the agent layer → even if the agent is fooled, OPA still blocks the call | — |
Disadvantages and Precautions
| Item | Content | Response Plan |
|---|---|---|
| Rego Learning Curve | Its declarative nature means an initial learning curve for Python/JS developers | Adopt import rego.v1 for stricter, more predictable syntax; unit-test policies with the opa test command |
| Latency Overhead | PDP query generated on every tool call | Minimize network latency with TTL-based decision cache + OPA sidecar deployment |
| Immature ETDI Ecosystem | Standardization is still in progress (Python SDK PR #845); broad MCP server support is lacking | Implement an in-house JWT signing/verification wrapper and plan a migration once standardization completes |
| Bundle Synchronization Complexity | Managing policy consistency between the OPA Bundle Server and each instance in a distributed environment | Utilizing the OPA Bundle API, implementing a hook to invalidate the entire cache upon policy update |
| Cold Start · Fallback | Fallback to stale policy upon bundle source failure → Degradation of security freshness | Apply deny-safe principle: Default deny when OPA is unresponsive |
Bundle: A package unit in OPA that bundles policies (.rego) and data (.json) for deployment. OPA instances periodically poll policies from a central bundle server to keep them up to date.
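As a sketch, an OPA instance can poll such a bundle server with a configuration like the following (the service name, URL, and bundle path are placeholders to adapt to your environment):

```yaml
# opa-config.yaml — placeholder service name, URL, and bundle path
services:
  policy_registry:
    url: https://bundles.example.internal
bundles:
  mcp_policies:
    service: policy_registry
    resource: bundles/mcp-tool-access.tar.gz
    polling:
      min_delay_seconds: 30
      max_delay_seconds: 120
```

Started with opa run --server --config-file opa-config.yaml, the instance downloads the bundle at startup and re-polls within the configured delay window, which is what keeps distributed instances converging on the same policy version.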
The Most Common Mistakes in Practice
- Falling back to allow=True when OPA fails to respond — If you fall back to "allow" for availability, an OPA outage itself becomes a security vulnerability. Follow the deny-safe principle; for availability-critical tools, it is safer to explicitly design a separate exception-handling path instead.
- Letting clients supply environment information such as env.time — If the LLM agent or client populates and sends values such as time or network_zone, the policy can be bypassed with a manipulated context. The server must populate these directly from trusted sources (system time, network metadata).
- Keeping the ETDI version registry in memory only — If approved-version information is lost on server restart, Rug Pull detection is neutralized. Write approved tool versions to a persistent store (DB, Redis, etc.) and add logic to restore them on restart.
In Conclusion
Regarding the question posed at the beginning of this article, "Is this user allowed to call this tool right now?", OAuth scopes can only answer "Does this user have access to this tool?". ETDI answers "Did this tool come from a trusted source without tampering?" and OPA answers "Does this call comply with the policy in this context at this very moment?", respectively; a complete answer is produced only when these two layers are combined.
3 Steps to Start Right Now:

- Run OPA locally → upload the policy → test immediately with curl — You can launch OPA with a single Docker command, upload the Rego policy above, and check the evaluation result against real input. (With no data.trusted_agents uploaded yet, the trusted_agent rule fails and this request returns false; push agent data to /v1/data/trusted_agents to see an allow.)

docker run -p 8181:8181 openpolicyagent/opa:latest run --server

curl -X PUT localhost:8181/v1/policies/mcp \
  -H "Content-Type: text/plain" --data-binary @policy/mcp_tool_access.rego

curl -X POST localhost:8181/v1/data/mcp/tool/access/allow \
  -H "Content-Type: application/json" \
  -d '{"input": {"principal": {"role": "analyst"}, "client": {"agent_id": "test"}, "tool": {"data_sensitivity": 2}, "env": {"time": "2026-04-14T10:00:00Z"}}}'

- Insert check_policy() into only the most sensitive tool on your existing MCP server — We recommend verifying latency overhead and policy behavior with one pilot tool rather than applying it to every tool from the start.
- Add the ETDI verification layer with an ETDIVerifier wrapper — Generate an RSA key pair with openssl genrsa -out tool_provider.pem 2048 and store approved versions persistently in SQLite or Redis. The ETDI Python SDK is still in review (PR #845); until it merges, you can use the ETDIVerifier wrapper from this article as is.
Next Post: We will cover configuring an OPA Bundle Server to securely deploy Rego policies and strategies to ensure policy consistency in a distributed MCP environment.
Reference Materials
- ETDI: Mitigating Tool Squatting and Rug Pull Attacks in MCP (arXiv:2506.01333)
- ETDI: Enhanced Tool Definition Interface | MCP Security Framework
- ETDI Python SDK PR #845 | modelcontextprotocol/python-sdk
- MCP Access Control: OPA vs Cedar Comparison | Natoma
- Why Open Policy Agent is the Missing Guardrail for Your AI Agents | CodiLime
- Advanced authentication and authorization for MCP Gateway | Red Hat Developer
- IBM ContextForge: Unified Policy Decision Point Feature
- IBM ContextForge AI Gateway
- MCP Security: Implementing Robust Authentication and Authorization | Red Hat
- Implementing Ultra Fine-Grained RBAC/ABAC in MCP Servers with Cerbos and OPA
- Guardrails and Policy Enforcement in Agentic AI Workflows
- Open Policy Agent Official Documentation
- Securing the Model Context Protocol (MCP): Risks, Controls, and Governance (arXiv:2511.20920)
- Authorization for MCP: OAuth 2.1, PRMs, and Best Practices | Oso
- We built the security layer MCP always needed | Trail of Bits