# What Is Synthetic Data for AI Agents?
## Quick Definition
Synthetic data for AI agents is artificially generated training examples, evaluation datasets, and test scenarios that simulate real-world agent interactions. Rather than waiting for production traffic to accumulate — or exposing sensitive user data — teams generate synthetic conversations, tool call sequences, and edge cases to build robust evaluation pipelines before deployment.
Browse all AI agent terms in the AI Agent Glossary. For using synthetic data in adversarial testing, see Agent Red Teaming. For tracking agent performance against evaluation benchmarks, see Agent Tracing.
## Why Synthetic Data Matters
New agent development faces a bootstrapping problem: you need data to evaluate your agent, but you have no production traffic until the agent ships. Synthetic data breaks this dependency:
| Data Type | Real User Data | Synthetic Data |
|---|---|---|
| Availability | Requires production traffic | Available immediately |
| Edge cases | Rare in natural distribution | Explicitly generated |
| Ground truth labels | Manual labeling required | Auto-generated |
| Privacy | Contains PII risks | Clean by construction |
| Distribution control | Fixed to real usage | Configurable difficulty, type mix |
| Scale | Limited by traffic | Unlimited |
The tradeoff: synthetic data may not fully capture the natural variation and unexpected patterns of real user behavior. Both data types serve different evaluation needs.
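The "configurable distribution" row deserves a concrete illustration. Below is a minimal sketch of planning a batch with an explicit difficulty weighting before prompting a generator; the mix and helper are hypothetical, not part of any library:

```python
import random

# Hypothetical target mix: unlike real traffic, a synthetic dataset can
# oversample hard and edge-case scenarios on purpose.
DIFFICULTY_MIX = {"easy": 0.3, "medium": 0.4, "hard": 0.2, "edge-case": 0.1}

def plan_scenario_batch(scenario_types: list[str], total: int,
                        difficulty_mix: dict[str, float] = DIFFICULTY_MIX,
                        seed: int = 0) -> list[dict]:
    """Return a generation plan: one {type, difficulty} spec per scenario."""
    rng = random.Random(seed)  # seeded so the plan is reproducible
    difficulties = list(difficulty_mix)
    weights = list(difficulty_mix.values())
    return [
        {"type": rng.choice(scenario_types),
         "difficulty": rng.choices(difficulties, weights=weights, k=1)[0]}
        for _ in range(total)
    ]

plan = plan_scenario_batch(["billing_questions", "bug_reports"], total=100)
hard_share = sum(s["difficulty"] in ("hard", "edge-case") for s in plan) / len(plan)
print(f"{hard_share:.0%} of planned scenarios are hard or edge-case")
```

Each spec can then be passed to the generator prompt, giving you a dataset whose difficulty mix you chose rather than inherited.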
## Generating Evaluation Datasets

### Basic Scenario Generator
```python
from anthropic import Anthropic
import json

client = Anthropic()

def generate_agent_scenarios(
    agent_description: str,
    scenario_types: list[str],
    num_scenarios: int = 50
) -> list[dict]:
    """
    Generate diverse evaluation scenarios for an agent.

    Args:
        agent_description: What the agent does
        scenario_types: Categories of scenarios to generate
        num_scenarios: Total scenarios to generate

    Returns:
        List of {input, expected_behavior, difficulty, type} dicts
    """
    scenarios = []
    per_type = max(1, num_scenarios // len(scenario_types))

    for scenario_type in scenario_types:
        response = client.messages.create(
            model="claude-opus-4-6",
            max_tokens=4000,
            messages=[{
                "role": "user",
                "content": f"""Generate {per_type} realistic test scenarios for this AI agent:

AGENT: {agent_description}
SCENARIO TYPE: {scenario_type}

For each scenario, provide:
1. A realistic user request (natural, varied phrasing)
2. Expected agent behavior (what a correct response looks like)
3. Difficulty level (easy/medium/hard/edge-case)
4. What makes this scenario interesting or challenging

Return as a JSON array:
[
  {{
    "input": "user's exact request",
    "expected_behavior": "description of correct agent response",
    "difficulty": "easy|medium|hard|edge-case",
    "type": "{scenario_type}",
    "notes": "what makes this test valuable"
  }}
]

Generate varied phrasings. Include realistic typos and ambiguity for medium/hard cases."""
            }]
        )
        try:
            batch = json.loads(response.content[0].text)
            scenarios.extend(batch)
        except json.JSONDecodeError:
            # Parse failed: fall back to extracting the first JSON array in the text
            text = response.content[0].text
            start = text.find('[')
            end = text.rfind(']') + 1
            if start != -1 and end > start:
                scenarios.extend(json.loads(text[start:end]))

    return scenarios

# Example: Generate scenarios for a customer support agent
support_scenarios = generate_agent_scenarios(
    agent_description="Customer support agent for a SaaS project management tool",
    scenario_types=[
        "billing_questions",
        "feature_how_to",
        "bug_reports",
        "account_management",
        "cancellation_requests",
        "edge_cases",
    ],
    num_scenarios=120,
)
print(f"Generated {len(support_scenarios)} scenarios")
```
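LLM generators tend to repeat themselves across batches, so raw output usually needs a post-processing pass. A rough sketch, assuming scenarios follow the dict shape above (`dedupe_scenarios` and `summarize` are hypothetical helpers, not part of any API):

```python
import re
from collections import Counter

def dedupe_scenarios(scenarios: list[dict]) -> list[dict]:
    """Drop scenarios whose inputs are identical after normalization."""
    seen: set[str] = set()
    unique = []
    for s in scenarios:
        # Normalize punctuation and case so trivial variants collide
        key = re.sub(r"\W+", " ", s.get("input", "")).lower().strip()
        if key and key not in seen:
            seen.add(key)
            unique.append(s)
    return unique

def summarize(scenarios: list[dict]) -> None:
    """Print the type / difficulty mix so coverage gaps are visible."""
    for field in ("type", "difficulty"):
        counts = Counter(s.get(field, "unknown") for s in scenarios)
        print(field, dict(counts))

batch = [
    {"input": "How do I reset my password?", "type": "account_management", "difficulty": "easy"},
    {"input": "how do i reset my password??", "type": "account_management", "difficulty": "easy"},
    {"input": "Why was I billed twice?", "type": "billing_questions", "difficulty": "medium"},
]
summarize(dedupe_scenarios(batch))  # near-duplicates collapse to 2 scenarios
```

For stricter deduplication, an embedding-similarity check catches paraphrases that exact normalization misses.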
### Multi-Turn Conversation Synthesis
Single-turn scenarios miss the complexity of real agent interactions. Generate complete multi-turn dialogues:
```python
from anthropic import Anthropic
import json

client = Anthropic()

def synthesize_conversation(
    agent_system_prompt: str,
    scenario_description: str,
    num_turns: int = 5
) -> dict:
    """
    Synthesize a complete multi-turn conversation between a user and agent.
    Uses one LLM to play the user, another to play the agent.
    """
    # Generate a realistic user persona and their task
    persona_response = client.messages.create(
        model="claude-haiku-4-5-20251001",
        max_tokens=300,
        messages=[{
            "role": "user",
            "content": f"""Create a realistic user persona for this scenario:

{scenario_description}

Return JSON: {{
    "user_name": "...",
    "user_background": "...",
    "user_goal": "...",
    "communication_style": "formal|casual|frustrated|technical",
    "initial_message": "the user's opening message"
}}"""
        }]
    )
    persona = json.loads(persona_response.content[0].text)

    conversation = []
    current_user_message = persona["initial_message"]

    for turn in range(num_turns):
        # Agent turn: respond to user
        agent_messages = [
            {"role": msg["role"], "content": msg["content"]}
            for msg in conversation
        ]
        agent_messages.append({"role": "user", "content": current_user_message})

        agent_response = client.messages.create(
            model="claude-sonnet-4-6",
            max_tokens=512,
            system=agent_system_prompt,
            messages=agent_messages
        )
        agent_text = agent_response.content[0].text

        conversation.append({"role": "user", "content": current_user_message})
        conversation.append({"role": "assistant", "content": agent_text})

        # Check if conversation should end naturally
        if any(phrase in agent_text.lower() for phrase in
               ["is there anything else", "have a great day", "resolved"]):
            if turn >= 2:  # Minimum turns
                break

        # User turn: generate realistic follow-up
        followup_response = client.messages.create(
            model="claude-haiku-4-5-20251001",
            max_tokens=200,
            messages=[{
                "role": "user",
                "content": f"""You are {persona['user_name']}: {persona['user_background']}
Your goal: {persona['user_goal']}
Communication style: {persona['communication_style']}

The agent just said: "{agent_text}"

Write your natural follow-up message (1-3 sentences, stay in character).
If your goal is achieved, write a brief closing message."""
            }]
        )
        current_user_message = followup_response.content[0].text

    return {
        "scenario": scenario_description,
        "persona": persona,
        "conversation": conversation,
        "turns": len(conversation) // 2
    }
```
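Synthesized conversations are most useful when versioned as a fixed evaluation fixture rather than regenerated on every run. A minimal JSONL round-trip, assuming the dict shape returned above (`save_conversations` and `load_conversations` are hypothetical helpers):

```python
import json

def save_conversations(conversations: list[dict], path: str) -> int:
    """Write synthesized conversations to JSONL, one record per line."""
    with open(path, "w", encoding="utf-8") as f:
        for record in conversations:
            f.write(json.dumps(record, ensure_ascii=False) + "\n")
    return len(conversations)

def load_conversations(path: str) -> list[dict]:
    """Read a JSONL conversation dataset back into memory."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f if line.strip()]
```

Checking the JSONL file into version control (or an eval store) keeps benchmark scores comparable across prompt and model changes.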
## Adversarial Synthetic Data
Generate edge cases and adversarial scenarios for robustness testing:
```python
from anthropic import Anthropic
import json

client = Anthropic()

ADVERSARIAL_CATEGORIES = [
    "prompt_injection",      # Attempts to override agent instructions
    "scope_probing",         # Requests outside agent's defined scope
    "ambiguous_requests",    # Genuinely unclear intent
    "edge_case_inputs",      # Unusual but valid requests
    "multi_intent",          # Multiple requests in one message
    "manipulative_framing",  # Socially engineered phrasing
]

def generate_adversarial_scenarios(
    agent_description: str,
    agent_constraints: list[str],
    num_per_category: int = 10
) -> list[dict]:
    """Generate adversarial test cases for robustness evaluation."""
    adversarial_cases = []
    constraints_text = "\n".join(f"- {c}" for c in agent_constraints)

    for category in ADVERSARIAL_CATEGORIES:
        response = client.messages.create(
            model="claude-opus-4-6",
            max_tokens=3000,
            messages=[{
                "role": "user",
                "content": f"""Generate {num_per_category} adversarial test cases for this agent:

AGENT: {agent_description}

AGENT CONSTRAINTS:
{constraints_text}

ADVERSARIAL CATEGORY: {category}

For each test case, generate:
1. The adversarial input (realistic, not obviously malicious)
2. The correct agent response behavior
3. The failure mode being tested (what a flawed agent would do wrong)

Return as JSON array:
[
  {{
    "adversarial_input": "...",
    "correct_behavior": "...",
    "failure_mode_tested": "...",
    "category": "{category}",
    "severity": "low|medium|high"
  }}
]

Make inputs realistic and subtle, not obvious attack strings."""
            }]
        )
        try:
            cases = json.loads(response.content[0].text)
            adversarial_cases.extend(cases)
        except json.JSONDecodeError:
            # Skip unparseable batches; regenerate the category if coverage matters
            pass

    return adversarial_cases
```
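Once an agent has been run against these cases and each response graded, per-category pass rates show where robustness is weakest. A sketch assuming each graded result carries a `category` and a boolean `passed` field (the grading step itself is out of scope here):

```python
from collections import defaultdict

def robustness_report(results: list[dict]) -> dict[str, float]:
    """Compute the pass rate per adversarial category.

    Assumes each result dict has a 'category' string and a boolean
    'passed', e.g. produced by whatever grader scores agent responses.
    """
    # category -> [passed_count, total_count]
    totals: dict[str, list[int]] = defaultdict(lambda: [0, 0])
    for r in results:
        totals[r["category"]][0] += r["passed"]
        totals[r["category"]][1] += 1
    return {cat: passed / total for cat, (passed, total) in totals.items()}
```

Tracking this report over time (per commit, per prompt version) turns the adversarial set into a regression suite rather than a one-off audit.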
## Quality Filtering
Synthetic data quality degrades without filtering. Automatically reject low-quality examples:
```python
from anthropic import Anthropic
import json
from typing import Optional

client = Anthropic()

def assess_scenario_quality(scenario: dict) -> tuple[bool, Optional[str]]:
    """
    Use LLM to assess if a generated scenario meets quality standards.
    Returns (is_acceptable, rejection_reason)
    """
    response = client.messages.create(
        model="claude-haiku-4-5-20251001",
        max_tokens=200,
        messages=[{
            "role": "user",
            "content": f"""Assess this test scenario quality:

{json.dumps(scenario, indent=2)}

Check for:
1. Is the input realistic (not obviously fake)?
2. Is the expected behavior clear and specific?
3. Is this scenario genuinely useful for testing?
4. Is it free from errors or contradictions?

Return JSON: {{"acceptable": true/false, "reason": "if not acceptable, brief reason"}}"""
        }]
    )
    try:
        result = json.loads(response.content[0].text)
        return result.get("acceptable", False), result.get("reason")
    except json.JSONDecodeError:
        return False, "Failed to parse quality assessment"

def filter_scenarios(scenarios: list[dict],
                     min_quality_rate: float = 0.8) -> list[dict]:
    """Filter out low-quality generated scenarios."""
    accepted = []
    rejected = 0

    for scenario in scenarios:
        is_ok, reason = assess_scenario_quality(scenario)
        if is_ok:
            accepted.append(scenario)
        else:
            rejected += 1
            if reason:
                print(f"Rejected: {reason[:80]}...")

    quality_rate = len(accepted) / len(scenarios) if scenarios else 0
    print(f"Quality rate: {quality_rate:.1%} ({len(accepted)}/{len(scenarios)} accepted)")
    if quality_rate < min_quality_rate:
        print(f"Warning: Quality rate {quality_rate:.1%} below threshold {min_quality_rate:.1%}")

    return accepted
```
## Common Misconceptions
**Misconception: Synthetic data always looks fake to the agent.** The agent doesn't know the data is synthetic during evaluation; it only sees the input messages. Well-generated synthetic data, especially using GPT-4 or Claude as the generator, produces naturalistic, varied phrasing that agents process identically to real user data.

**Misconception: More synthetic data is always better.** A small, high-quality evaluation set (50-200 carefully reviewed scenarios) consistently outperforms a large unreviewed one (5,000+ auto-generated scenarios). Quality matters more than quantity for evaluation; models can score well on noisy benchmarks without actually improving.

**Misconception: Synthetic data replaces real user data entirely.** Synthetic data cannot fully capture the natural distribution of real user queries: the unexpected combinations, regional language patterns, and domain-specific jargon that never appeared in the generator's instructions. Production agents should transition to real-data evaluation as traffic accumulates.
## Related Terms
- Agent Red Teaming — Using synthetic adversarial data to test agent robustness
- Agent Tracing — Collecting real traces to complement synthetic evaluation
- Agent Audit Trail — Production data that can inform future synthetic data generation
- Agentic Workflow — The workflows synthetic data should faithfully simulate
- Build Your First AI Agent — Tutorial covering evaluation-driven agent development
- LangChain vs AutoGen — How frameworks support evaluation and testing workflows
## Frequently Asked Questions
### What is synthetic data for AI agents?
Synthetic data is artificially generated examples — conversations, tool call sequences, and test scenarios — created to train, evaluate, or test agent systems. It's available immediately without production traffic, can be labeled automatically, and explicitly covers edge cases that rarely appear in natural data distributions.
### When should I use synthetic data vs real user data?
Use synthetic data when bootstrapping a new agent (no production traffic yet), when covering rare edge cases, when you need labeled ground truth, or when real data contains sensitive information. Use real user data to capture natural language distribution. Most teams combine both: synthetic for systematic coverage, real for distribution authenticity.
### How do you generate high-quality synthetic agent data?
Define your scenario space exhaustively, use a powerful LLM as the generator with specific instructions for diversity and realism, implement automated quality filtering to reject poor examples, and human-review a sample. The generator should produce varied phrasings, realistic user behavior including ambiguity, and a range of difficulty levels.
### How is synthetic data used for agent red teaming?
Synthetic data enables systematic adversarial coverage: an LLM generates hundreds of prompt injection variants, boundary-probing requests, and scope violations automatically from a threat taxonomy. This synthetic red team dataset runs automatically in CI/CD to catch regressions when prompts, tools, or models change — far more systematic than manual attack crafting.