What should an agent trace record?

2026-06-16 3 minute read

Summary

An agent trace is not a place to store the entire conversation by default. It is an operational record that should explain which input, permission, tool call, guardrail, approval, and verification result led to the next step. That same chain is what later makes incident review and approval audit possible.

Document Information

Written date: 2026-04-29
Verification date: 2026-04-29
Document type: analysis tutorial
Test environment: No live execution. This post summarizes trace design from OpenAI Agents SDK and Codex security documentation.
Test version: OpenAI Agents SDK and Codex documentation checked on 2026-04-29. No local SDK version is fixed.
Evidence level: official documentation, security project documentation

Problem Statement

If an agent trace is treated as “save every log”, two failures are likely. The trace may miss the decision path needed for incident review, while still storing too much raw prompt, output, or tool data. That can turn observability into a new sensitive-data exposure path.

This post separates what an agent trace should record from what it should minimize or redact.

Verified Facts

The OpenAI Agents SDK tracing documentation describes LLM generations, tool calls, handoffs, guardrails, and custom events as traceable events during an agent run. Evidence: OpenAI Agents SDK Tracing
The same documentation describes traces as workflow-level records and spans as operations with start time, end time, trace id, parent id, and span data. Evidence: OpenAI Agents SDK Traces and spans
LLM generation spans and function call spans may capture inputs and outputs, so sensitive-data capture must be configured deliberately. Evidence: OpenAI Agents SDK Sensitive data
Codex documentation separates sandbox and approval boundaries, including approval for edits outside the workspace, network access, and connector/tool calls with side effects. Evidence: Codex agent approvals and security

Reproduction Steps

Use this checklist when designing a trace schema.

Define the execution unit: trace_id, workflow_name, group_id, requester, repository/project, and task id.
Record input origin: user request, file, web result, MCP/resource, or memory.
Record model calls: model name, generation span, useful usage totals, failed calls, and retries.
Record tool calls: tool name, argument summary, target resource, permission scope, approval requirement, and result summary.
Record handoffs: receiving agent, reason, forwarded context scope, and filtered context.
Record guardrails: input/output/tool guardrail name, allow/block/tripwire result, and reason.
Record approvals: approval reason, approver or policy, approved/denied/timed out result, and whether execution actually followed.
Record verification: tests that ran, checks that failed, and checks that could not run.
Record sensitive-data handling: raw data policy, redaction method, retention, and external trace processor export.

Prefer classifications, hashes, lengths, and resource ids over raw secret values or full document bodies.

trace_id: trace_...
workflow_name: "repository maintenance"
input_source: user_request | file | web | mcp | memory
tool_call: { name, target, permission_scope, approved, result_summary }
guardrail: { name, stage, outcome, reason }
verification: { ran, failed, not_run, reason }
sensitive_data: { raw_input_saved: false, redaction: "token/value redacted" }

Observations

Raw prompts and final answers alone do not explain why a tool was allowed to run.
Storing every input and output can make debugging easier while moving secrets, personal data, or internal documents into the trace backend.
The highest-value trace records are usually boundaries: permissions, tool calls, guardrails, approvals, and verification outcomes.

Interpretation

In my view, the purpose of an agent trace is not to preserve the model’s hidden reasoning. It is to preserve an auditable operational chain: input origin, authority, action, control, result, and verification.

Opinion: the default should be minimal raw-content capture. If every prompt and tool output is always stored, the trace itself becomes a data exposure surface. In a postmortem, the first records that should be visible are the approval chain and verification trail, not the full raw prompt by default.

Limitations

This post does not replace organization-specific audit, privacy, or retention requirements.
OpenAI Agents SDK and Codex tracing or approval behavior may change by version.
Real SIEM, APM, DLP, or trace processor integrations require product-specific data modeling.

References

Change Log

2026-04-29: Initial draft.
2026-04-29: Added concrete trace fields, sensitive-data boundary, and official-documentation evidence.

Twitter Facebook LinkedIn

What should an agent trace record?

Summary

Document Information

Problem Statement

Verified Facts

Reproduction Steps

Observations

Interpretation

Limitations

References

Change Log

공유하기

댓글남기기

You may also enjoy

Cloud IAM and service account least privilege

AI agents and sensitive data boundaries

Container image signing and scan result interpretation

Threat modeling AI coding agents

Summary

Document Information

Problem Statement

Verified Facts

Reproduction Steps

Observations

Interpretation

Related Posts

Limitations

References

Change Log

공유하기

댓글남기기

You may also enjoy

Cloud IAM and service account least privilege

AI agents and sensitive data boundaries

Container image signing and scan result interpretation

Threat modeling AI coding agents