← M15: Code Interpreter 🏠 Home M16: Input Guardrails →

Build a Complete Agent & Subagent System

Hands-on lab: assemble everything you have learned into a working multi-agent system that researches UCC filings, analyzes risk, and synthesizes answers.

Learning Objectives

Build a complete agent loop from scratch using the Anthropic Messages API with tool use
Create specialized subagents with isolated context windows and focused tool sets
Implement a coordinator agent that delegates to subagents based on query type
Wire coordinator-to-subagent context passing so subagents receive only relevant information
Test a multi-agent system end-to-end with happy path, edge case, and error scenarios

What You'll Build

Lab: UCC Filing Research System

A coordinator agentAn agent that receives user requests and delegates work to specialized subagents rather than doing everything itself. It decides WHICH subagent to call and HOW to combine their results. that delegates to specialized subagentsA focused agent with a narrow set of tools and a specialized system prompt. It handles one type of task well, then returns structured results to the coordinator. for UCC filingA Uniform Commercial Code filing is a legal document a lender files to publicly declare a security interest in a borrower's assets. Think of it as a "financial lien" that's searchable by anyone doing due diligence. research.

Time estimate: 60–90 minutes
Prerequisites: M12 (ReAct pattern), M13 (Planning), M14 (Multi-Agent Systems), M15 (Code Interpreter)
Difficulty: ★★★☆☆ (Intermediate)

Files You'll Create

ucc-agent/ ├── mock_data.py # 10 realistic UCC filing records ├── tools.py # search_filings, get_filing_details, check_risk_score ├── single_agent.py # Warm-up: one agent with all tools (ReAct loop) ├── subagents.py # Research subagent + analysis subagent ├── coordinator.py # Coordinator that delegates to subagents ├── test_system.py # End-to-end test scenarios ├── requirements.txt # Python dependencies └── (Node.js equivalents for every file)

Architecture Overview

Before writing any code, let's look at the system you are about to build. This is the "blueprint before building the house" approach from M00 — understanding the full picture makes each step click into place.

Everyday Analogy

Before call centers had IVR menus, a single operator answered every question — billing, tech support, returns, scheduling. That one person needed to know everything, and callers waited while they looked things up across five different systems.

Modern call centers route you to a specialist: "Press 1 for billing, press 2 for technical support." Each specialist knows their domain deeply and has the right tools on their screen. A supervisor (coordinator) can conference in multiple specialists for complex cases.

Your agent system works the same way. Instead of one agent juggling every tool, a coordinator agent reads the user's question and routes it to the right specialist subagent. The filing search subagent has search tools. The risk analysis subagent has scoring tools. Each one does its job and reports back to the coordinator, who synthesizes a final answer.

What this looks like in practice: When the coordinator delegates to the research subagent, it sends a task string like "Find all UCC filings for Acme Corporation in New York". The research subagent runs its own ReAct loop, calls search_filings, and returns structured JSON: {"status": "success", "data": {"filings_found": [...], "total_count": 2}}. That is the "specialist reports back to the supervisor" moment — now you can see it in real code.

System Architecture: Coordinator + Subagents

System Architecture Flow

User Query

↓

Coordinator Agent

↓ ↓

Research Subagent

Analysis Subagent

↓ ↓

search_filings

get_filing_details

check_risk_score

↑ ↑

Synthesized Response

Three Design Decisions

Decision 1: Why not one agent with all tools? In M06 you learned that tool selection accuracy degrades above 4–5 tools per agent. Right now we only have 3 tools, so a single agent works fine. But in production you would add filing amendment tools, entity resolution tools, jurisdiction lookup tools, court record search, and more. By the time you reach 10+ tools, the single agent starts picking the wrong tool for the job. Splitting into subagents now establishes the pattern you will need at scale.

Decision 2: Explicit context passing. A critical lesson from M14 — subagents do NOT inherit the coordinator's full conversation. Each subagent starts with a blank context window. It only knows what the coordinator explicitly tells it in the task prompt.

Why does this matter? Imagine the user asks "What about their Texas filings?" The coordinator has the full conversation and knows "their" means Acme Corporation. But the research subagent has never seen that conversation. If the coordinator passes the raw question "What about their Texas filings?" the subagent has no idea who "their" refers to. The coordinator must resolve the pronoun first and send: "Find all UCC filings for Acme Corporation in TX." This pronoun resolution step is easy to forget and is the #1 source of bugs in multi-agent systems.

Decision 3: Structured result aggregation. Each subagent returns a structured dictionary (not free-form text) so the coordinator can reliably combine results. Why not just return plain English? Because the coordinator needs to programmatically extract data to pass to the next subagent.

The research subagent returns {"filings_found": [...], "total_count": 3, "summary": "..."}. The analysis subagent returns {"risk_score": 0.73, "risk_level": "HIGH", "factors": [...]}. With structured JSON, the coordinator can reliably pull out the debtor_id from research results and pass it to the analysis subagent. With free-form text, you would need to parse natural language to find the debtor ID — fragile and error-prone.

⚠️ Common Misconceptions

"Subagents share the coordinator's memory, right?" — No. Each subagent starts with a completely empty context window. It only knows what the coordinator explicitly passes in the task description. If the coordinator forgets to include the debtor_id, the subagent cannot "look it up" from the earlier conversation.

"More subagents = better system?" — Not necessarily. The coordinator treats subagents as tools. Just like tool selection degrades above 4–5 tools, coordinator routing degrades above 4–5 subagents. If you need more, use hierarchical coordination — sub-coordinators that each manage a few specialists.

"I should use multi-agent for everything now?" — A single agent with 2–3 tools is simpler, cheaper (fewer API calls), and often good enough. Multi-agent pays off when you have 5+ tools, need context isolation, or want to enforce separation of concerns. Do not over-engineer — you built the single agent in Step 4 precisely to see that it works fine for small tool sets.

"Subagents run in parallel automatically?" — Not in our implementation. The coordinator calls subagents sequentially. Parallel execution requires explicit async code (asyncio.gather or Promise.all). See the "Going Further" stretch goals for this.

Cert Tip — Domain 1.2 The exam strongly favors hub-and-spoke (coordinator + subagents) over flat multi-agent architectures. Know why: single coordination point, clear context isolation, structured result aggregation, auditable decision flow.

Cert Tip — Domain 1.3 Subagents have ISOLATED context windows — they do NOT inherit the coordinator's full conversation. The coordinator must explicitly pass relevant context in the Task prompt. Anti-pattern: assuming subagents know everything the coordinator knows.

Step 1: Set Up the Project

What: Create the project directory and install dependencies.

Why: A clean project structure keeps your files organized and ensures all dependencies are available before you start coding. Getting this right first prevents "module not found" errors later.

Environment Setup

mkdir ucc-agent && cd ucc-agent
python -m venv venv
# macOS/Linux:
source venv/bin/activate
# Windows:
# venv\Scripts\activate
pip install "anthropic>=0.30.0"
export ANTHROPIC_API_KEY=your-key-here
# Windows: set ANTHROPIC_API_KEY=your-key-here

mkdir ucc-agent && cd ucc-agent
npm init -y
npm install @anthropic-ai/sdk
export ANTHROPIC_API_KEY=your-key-here
# Windows: set ANTHROPIC_API_KEY=your-key-here

Requirements File

Create a file called requirements.txt (Python only):

anthropic>=0.30.0

Run command:

pip install -r requirements.txt

Expected output:

Successfully installed anthropic-0.30.x ...

Checkpoint

If you see "Successfully installed anthropic..." you are ready. If you see an error about pip, make sure your virtual environment is activated. If you see a permission error on Windows, try running your terminal as administrator.

Troubleshooting

"command not found: python" — Try python3 instead, or ensure Python 3.9+ is installed and on your PATH.

"No module named pip" — Run python -m ensurepip --upgrade first.

Node.js: "npm ERR!" — Ensure Node.js v18+ is installed. Run node --version to check.

Step 2: Create Mock Data

What: Build a mock database of 10 realistic UCC filing records that your tools will search through.

Why: Using mock data means you can build and test the entire agent system without needing a real database or API access. The data is realistic enough to exercise all the tool logic — multiple states, different collateral types, varying risk levels.

Now that your project is set up, you need data for your agents to work with. Think of this as stocking a test kitchen before you start cooking — you need realistic ingredients to know if your recipes actually work. Here is what one of our mock filing records looks like:

{"filing_number": "NY-2024-001234", "debtor_name": "Acme Corporation", "secured_party": "First National Bank", "collateral_value_estimate": 2500000}

. Ten records like this give your agents enough variety to exercise every code path.

Create a new file called mock_data.py (Python) or mockData.js (Node.js):

# mock_data.py — Realistic UCC filing records for testing
# Each filing represents a real-world lien: a lender's claim on a borrower's assets.

UCC_FILINGS = [
    {
        "filing_number": "NY-2024-001234",
        "state": "NY",
        "debtor_name": "Acme Corporation",
        "debtor_id": "ACME-001",
        "secured_party": "First National Bank",
        "collateral_description": "All inventory, equipment, and accounts receivable",
        "filing_date": "2024-01-15",
        "lapse_date": "2029-01-15",
        "filing_type": "Original",
        "status": "Active",
        "collateral_value_estimate": 2500000
    },
    {
        "filing_number": "NY-2024-005678",
        "state": "NY",
        "debtor_name": "Acme Corporation",
        "debtor_id": "ACME-001",
        "secured_party": "Silicon Valley Lending Group",
        "collateral_description": "All intellectual property, patents, and trademarks",
        "filing_date": "2024-03-22",
        "lapse_date": "2029-03-22",
        "filing_type": "Original",
        "status": "Active",
        "collateral_value_estimate": 4000000
    },
    {
        "filing_number": "TX-2023-009876",
        "state": "TX",
        "debtor_name": "Acme Corporation",
        "debtor_id": "ACME-001",
        "secured_party": "Lone Star Capital",
        "collateral_description": "Equipment and machinery at 456 Industrial Blvd, Houston TX",
        "filing_date": "2023-11-01",
        "lapse_date": "2028-11-01",
        "filing_type": "Original",
        "status": "Active",
        "collateral_value_estimate": 1800000
    },
    {
        "filing_number": "CA-2024-003456",
        "state": "CA",
        "debtor_name": "Pacific Rim Trading LLC",
        "debtor_id": "PRT-002",
        "secured_party": "West Coast Venture Fund",
        "collateral_description": "All assets including inventory, accounts, and general intangibles",
        "filing_date": "2024-02-10",
        "lapse_date": "2029-02-10",
        "filing_type": "Original",
        "status": "Active",
        "collateral_value_estimate": 8500000
    },
    {
        "filing_number": "CA-2023-007890",
        "state": "CA",
        "debtor_name": "Pacific Rim Trading LLC",
        "debtor_id": "PRT-002",
        "secured_party": "Bank of the West",
        "collateral_description": "Accounts receivable and deposit accounts",
        "filing_date": "2023-06-15",
        "lapse_date": "2028-06-15",
        "filing_type": "Original",
        "status": "Active",
        "collateral_value_estimate": 3200000
    },
    {
        "filing_number": "FL-2024-002345",
        "state": "FL",
        "debtor_name": "Sunshine Medical Devices Inc",
        "debtor_id": "SMD-003",
        "secured_party": "Atlantic Health Finance",
        "collateral_description": "Medical equipment, FDA-approved devices, and related inventory",
        "filing_date": "2024-04-01",
        "lapse_date": "2029-04-01",
        "filing_type": "Original",
        "status": "Active",
        "collateral_value_estimate": 5600000
    },
    {
        "filing_number": "IL-2023-004567",
        "state": "IL",
        "debtor_name": "Midwest Manufacturing Group",
        "debtor_id": "MMG-004",
        "secured_party": "Great Lakes Commercial Bank",
        "collateral_description": "All equipment, fixtures, and inventory at 789 Factory Lane, Chicago IL",
        "filing_date": "2023-09-20",
        "lapse_date": "2028-09-20",
        "filing_type": "Original",
        "status": "Active",
        "collateral_value_estimate": 6200000
    },
    {
        "filing_number": "IL-2024-006789",
        "state": "IL",
        "debtor_name": "Midwest Manufacturing Group",
        "debtor_id": "MMG-004",
        "secured_party": "Industrial Credit Corp",
        "collateral_description": "Accounts receivable, contract rights, and general intangibles",
        "filing_date": "2024-01-30",
        "lapse_date": "2029-01-30",
        "filing_type": "Amendment",
        "status": "Active",
        "collateral_value_estimate": 2100000
    },
    {
        "filing_number": "NY-2022-008901",
        "state": "NY",
        "debtor_name": "Empire State Logistics Corp",
        "debtor_id": "ESL-005",
        "secured_party": "Manhattan Commercial Lending",
        "collateral_description": "Fleet vehicles, transportation equipment, and warehouse inventory",
        "filing_date": "2022-07-12",
        "lapse_date": "2027-07-12",
        "filing_type": "Original",
        "status": "Active",
        "collateral_value_estimate": 3800000
    },
    {
        "filing_number": "TX-2024-001111",
        "state": "TX",
        "debtor_name": "Gulf Energy Solutions LLC",
        "debtor_id": "GES-006",
        "secured_party": "Texas Energy Capital Partners",
        "collateral_description": "Oil and gas equipment, pipelines, and related fixtures",
        "filing_date": "2024-05-18",
        "lapse_date": "2029-05-18",
        "filing_type": "Original",
        "status": "Active",
        "collateral_value_estimate": 12000000
    }
]


def search_filings_data(debtor_name, state=None):
    """Search filings by debtor name (case-insensitive partial match).
    Optionally filter by state code."""
    query = debtor_name.lower()
    results = []
    for filing in UCC_FILINGS:
        if query in filing["debtor_name"].lower():
            if state is None or filing["state"].upper() == state.upper():
                results.append(filing)
    return results


def get_filing_by_number(filing_number):
    """Look up a single filing by its filing number."""
    for filing in UCC_FILINGS:
        if filing["filing_number"] == filing_number:
            return filing
    return None


def calculate_risk(debtor_id):
    """Calculate a risk score for an entity based on their filings.
    Returns a score from 0.0 (low risk) to 1.0 (high risk)."""
    entity_filings = [f for f in UCC_FILINGS if f["debtor_id"] == debtor_id]

    if not entity_filings:
        return {
            "debtor_id": debtor_id,
            "risk_score": 0.0,
            "risk_level": "UNKNOWN",
            "filing_count": 0,
            "total_collateral_value": 0,
            "factors": ["No filings found for this entity"],
            "recommendation": "No data available for risk assessment."
        }

    filing_count = len(entity_filings)
    total_value = sum(f["collateral_value_estimate"] for f in entity_filings)
    states = list(set(f["state"] for f in entity_filings))
    secured_parties = list(set(f["secured_party"] for f in entity_filings))

    # Risk scoring logic
    score = 0.0
    factors = []

    # More filings = higher risk
    if filing_count >= 3:
        score += 0.3
        factors.append(f"High filing count ({filing_count} active filings)")
    elif filing_count >= 2:
        score += 0.15
        factors.append(f"Multiple filings ({filing_count} active filings)")

    # High total collateral = higher risk
    if total_value > 5000000:
        score += 0.25
        factors.append(f"High collateral exposure (${total_value:,.0f})")
    elif total_value > 2000000:
        score += 0.1
        factors.append(f"Moderate collateral exposure (${total_value:,.0f})")

    # Multiple secured parties = higher risk
    if len(secured_parties) >= 3:
        score += 0.2
        factors.append(f"Multiple secured parties ({len(secured_parties)} lenders)")
    elif len(secured_parties) >= 2:
        score += 0.1
        factors.append(f"Two secured parties involved")

    # Multi-state filings = higher complexity
    if len(states) >= 2:
        score += 0.15
        factors.append(f"Multi-state filings ({', '.join(states)})")

    score = min(score, 1.0)

    if score >= 0.6:
        risk_level = "HIGH"
        recommendation = "Thorough due diligence required. Multiple liens across jurisdictions indicate significant financial obligations."
    elif score >= 0.3:
        risk_level = "MEDIUM"
        recommendation = "Standard due diligence recommended. Some lien exposure exists but appears manageable."
    else:
        risk_level = "LOW"
        recommendation = "Minimal lien exposure. Standard review sufficient."

    return {
        "debtor_id": debtor_id,
        "risk_score": round(score, 2),
        "risk_level": risk_level,
        "filing_count": filing_count,
        "total_collateral_value": total_value,
        "states_involved": states,
        "secured_parties": secured_parties,
        "factors": factors,
        "recommendation": recommendation
    }

// mockData.js — Realistic UCC filing records for testing
// Each filing represents a real-world lien: a lender's claim on a borrower's assets.

const UCC_FILINGS = [
  {
    filing_number: "NY-2024-001234",
    state: "NY",
    debtor_name: "Acme Corporation",
    debtor_id: "ACME-001",
    secured_party: "First National Bank",
    collateral_description: "All inventory, equipment, and accounts receivable",
    filing_date: "2024-01-15",
    lapse_date: "2029-01-15",
    filing_type: "Original",
    status: "Active",
    collateral_value_estimate: 2500000
  },
  {
    filing_number: "NY-2024-005678",
    state: "NY",
    debtor_name: "Acme Corporation",
    debtor_id: "ACME-001",
    secured_party: "Silicon Valley Lending Group",
    collateral_description: "All intellectual property, patents, and trademarks",
    filing_date: "2024-03-22",
    lapse_date: "2029-03-22",
    filing_type: "Original",
    status: "Active",
    collateral_value_estimate: 4000000
  },
  {
    filing_number: "TX-2023-009876",
    state: "TX",
    debtor_name: "Acme Corporation",
    debtor_id: "ACME-001",
    secured_party: "Lone Star Capital",
    collateral_description: "Equipment and machinery at 456 Industrial Blvd, Houston TX",
    filing_date: "2023-11-01",
    lapse_date: "2028-11-01",
    filing_type: "Original",
    status: "Active",
    collateral_value_estimate: 1800000
  },
  {
    filing_number: "CA-2024-003456",
    state: "CA",
    debtor_name: "Pacific Rim Trading LLC",
    debtor_id: "PRT-002",
    secured_party: "West Coast Venture Fund",
    collateral_description: "All assets including inventory, accounts, and general intangibles",
    filing_date: "2024-02-10",
    lapse_date: "2029-02-10",
    filing_type: "Original",
    status: "Active",
    collateral_value_estimate: 8500000
  },
  {
    filing_number: "CA-2023-007890",
    state: "CA",
    debtor_name: "Pacific Rim Trading LLC",
    debtor_id: "PRT-002",
    secured_party: "Bank of the West",
    collateral_description: "Accounts receivable and deposit accounts",
    filing_date: "2023-06-15",
    lapse_date: "2028-06-15",
    filing_type: "Original",
    status: "Active",
    collateral_value_estimate: 3200000
  },
  {
    filing_number: "FL-2024-002345",
    state: "FL",
    debtor_name: "Sunshine Medical Devices Inc",
    debtor_id: "SMD-003",
    secured_party: "Atlantic Health Finance",
    collateral_description: "Medical equipment, FDA-approved devices, and related inventory",
    filing_date: "2024-04-01",
    lapse_date: "2029-04-01",
    filing_type: "Original",
    status: "Active",
    collateral_value_estimate: 5600000
  },
  {
    filing_number: "IL-2023-004567",
    state: "IL",
    debtor_name: "Midwest Manufacturing Group",
    debtor_id: "MMG-004",
    secured_party: "Great Lakes Commercial Bank",
    collateral_description: "All equipment, fixtures, and inventory at 789 Factory Lane, Chicago IL",
    filing_date: "2023-09-20",
    lapse_date: "2028-09-20",
    filing_type: "Original",
    status: "Active",
    collateral_value_estimate: 6200000
  },
  {
    filing_number: "IL-2024-006789",
    state: "IL",
    debtor_name: "Midwest Manufacturing Group",
    debtor_id: "MMG-004",
    secured_party: "Industrial Credit Corp",
    collateral_description: "Accounts receivable, contract rights, and general intangibles",
    filing_date: "2024-01-30",
    lapse_date: "2029-01-30",
    filing_type: "Amendment",
    status: "Active",
    collateral_value_estimate: 2100000
  },
  {
    filing_number: "NY-2022-008901",
    state: "NY",
    debtor_name: "Empire State Logistics Corp",
    debtor_id: "ESL-005",
    secured_party: "Manhattan Commercial Lending",
    collateral_description: "Fleet vehicles, transportation equipment, and warehouse inventory",
    filing_date: "2022-07-12",
    lapse_date: "2027-07-12",
    filing_type: "Original",
    status: "Active",
    collateral_value_estimate: 3800000
  },
  {
    filing_number: "TX-2024-001111",
    state: "TX",
    debtor_name: "Gulf Energy Solutions LLC",
    debtor_id: "GES-006",
    secured_party: "Texas Energy Capital Partners",
    collateral_description: "Oil and gas equipment, pipelines, and related fixtures",
    filing_date: "2024-05-18",
    lapse_date: "2029-05-18",
    filing_type: "Original",
    status: "Active",
    collateral_value_estimate: 12000000
  }
];

function searchFilingsData(debtorName, state = null) {
  const query = debtorName.toLowerCase();
  return UCC_FILINGS.filter(f => {
    const nameMatch = f.debtor_name.toLowerCase().includes(query);
    const stateMatch = state === null || f.state.toUpperCase() === state.toUpperCase();
    return nameMatch && stateMatch;
  });
}

function getFilingByNumber(filingNumber) {
  return UCC_FILINGS.find(f => f.filing_number === filingNumber) || null;
}

function calculateRisk(debtorId) {
  const entityFilings = UCC_FILINGS.filter(f => f.debtor_id === debtorId);

  if (entityFilings.length === 0) {
    return {
      debtor_id: debtorId, risk_score: 0.0, risk_level: "UNKNOWN",
      filing_count: 0, total_collateral_value: 0,
      factors: ["No filings found for this entity"],
      recommendation: "No data available for risk assessment."
    };
  }

  const filingCount = entityFilings.length;
  const totalValue = entityFilings.reduce((sum, f) => sum + f.collateral_value_estimate, 0);
  const states = [...new Set(entityFilings.map(f => f.state))];
  const securedParties = [...new Set(entityFilings.map(f => f.secured_party))];

  let score = 0.0;
  const factors = [];

  if (filingCount >= 3) { score += 0.3; factors.push(`High filing count (${filingCount} active filings)`); }
  else if (filingCount >= 2) { score += 0.15; factors.push(`Multiple filings (${filingCount} active filings)`); }

  if (totalValue > 5000000) { score += 0.25; factors.push(`High collateral exposure ($${totalValue.toLocaleString()})`); }
  else if (totalValue > 2000000) { score += 0.1; factors.push(`Moderate collateral exposure ($${totalValue.toLocaleString()})`); }

  if (securedParties.length >= 3) { score += 0.2; factors.push(`Multiple secured parties (${securedParties.length} lenders)`); }
  else if (securedParties.length >= 2) { score += 0.1; factors.push(`Two secured parties involved`); }

  if (states.length >= 2) { score += 0.15; factors.push(`Multi-state filings (${states.join(", ")})`); }

  score = Math.min(score, 1.0);

  let riskLevel, recommendation;
  if (score >= 0.6) {
    riskLevel = "HIGH";
    recommendation = "Thorough due diligence required. Multiple liens across jurisdictions indicate significant financial obligations.";
  } else if (score >= 0.3) {
    riskLevel = "MEDIUM";
    recommendation = "Standard due diligence recommended. Some lien exposure exists but appears manageable.";
  } else {
    riskLevel = "LOW";
    recommendation = "Minimal lien exposure. Standard review sufficient.";
  }

  return {
    debtor_id: debtorId, risk_score: Math.round(score * 100) / 100,
    risk_level: riskLevel, filing_count: filingCount,
    total_collateral_value: totalValue, states_involved: states,
    secured_parties: securedParties, factors, recommendation
  };
}

module.exports = { UCC_FILINGS, searchFilingsData, getFilingByNumber, calculateRisk };

Run command (Python):

python -c "from mock_data import search_filings_data; print(search_filings_data('Acme'))"

Expected output:

[{'filing_number': 'NY-2024-001234', 'state': 'NY', 'debtor_name': 'Acme Corporation', ...}, {'filing_number': 'NY-2024-005678', ...}, {'filing_number': 'TX-2023-009876', ...}]

Checkpoint

You should see 3 filings returned for "Acme" — two in NY and one in TX. If you see an empty list, check that the debtor name matches (the search is case-insensitive). If you get an ImportError, make sure you are running the command from inside the ucc-agent/ directory.

Troubleshooting

"ModuleNotFoundError: No module named 'mock_data'" — Make sure you are in the ucc-agent/ directory when running the command.

Node.js: "Cannot find module './mockData'" — Verify the file is named mockData.js (camelCase) and you are in the right directory.

Step 3: Build Tool Definitions

What: Define three tools that Claude can call: search_filings, get_filing_details, and check_risk_score. Each tool has two parts — a JSON Schema that describes what parameters it accepts, and an execution function that runs when Claude calls it.

Why: Tools are the bridge between Claude's reasoning and your data. In M05 you learned that Claude uses tool useA feature where Claude generates a structured JSON request to call a function you define, instead of just generating text. Claude picks the tool, fills in the parameters, and your code executes it. to call functions. Each tool needs two things. First, a schema — this tells Claude what parameters to provide and what types they should be. Second, an execution function — this is the code that actually runs when Claude decides to call that tool.

You have realistic data from Step 2. Now you need to wrap that data in tool definitions that Claude can discover and call. Think of this as building the API layer between Claude's brain and your database.

Let's think about what each tool needs. The search_filings tool takes a company name (required) and an optional state filter, then returns matching records. The get_filing_details tool takes a specific filing number and returns every field. The check_risk_score tool takes a debtor ID and crunches a risk score based on all their filings. Notice the workflow: you search first (to find filings and get the debtor_id), then you can drill into details or run risk analysis.

Create a new file called tools.py (Python) or tools.js (Node.js):

# tools.py — Tool definitions for Claude to use
# Each tool has: a JSON Schema definition + an execute function.

import json
from mock_data import search_filings_data, get_filing_by_number, calculate_risk

# --------------------------------------------------
# TOOL SCHEMAS — These tell Claude what tools exist,
# what parameters they accept, and what they return.
# Claude reads these schemas and decides which tool
# to call based on the user's question.
# --------------------------------------------------

TOOL_DEFINITIONS = [
    {
        "name": "search_filings",
        "description": "Search UCC filings by debtor (company) name. Optionally filter by US state code. Returns a list of matching filing records with basic details.",
        "input_schema": {
            "type": "object",
            "properties": {
                "debtor_name": {
                    "type": "string",
                    "description": "The company or person name to search for. Partial matches work (e.g., 'Acme' matches 'Acme Corporation')."
                },
                "state": {
                    "type": "string",
                    "description": "Optional two-letter US state code to filter results (e.g., 'NY', 'CA', 'TX'). If omitted, searches all states.",
                    "enum": ["NY", "CA", "TX", "FL", "IL"]
                }
            },
            "required": ["debtor_name"]
        }
    },
    {
        "name": "get_filing_details",
        "description": "Get full details for a specific UCC filing by its filing number. Returns all fields including collateral description, secured party, dates, and estimated value.",
        "input_schema": {
            "type": "object",
            "properties": {
                "filing_number": {
                    "type": "string",
                    "description": "The unique filing number (e.g., 'NY-2024-001234')."
                }
            },
            "required": ["filing_number"]
        }
    },
    {
        "name": "check_risk_score",
        "description": "Calculate a risk score for a business entity based on their UCC filing history. Returns a score from 0.0 (low) to 1.0 (high), risk level, contributing factors, and a recommendation.",
        "input_schema": {
            "type": "object",
            "properties": {
                "debtor_id": {
                    "type": "string",
                    "description": "The unique entity identifier (e.g., 'ACME-001'). Get this from search_filings results."
                }
            },
            "required": ["debtor_id"]
        }
    }
]


def execute_tool(tool_name, tool_input):
    """Execute a tool call and return the result as a string.
    Handles errors gracefully — never crashes, always returns
    a structured response that Claude can understand."""
    try:
        if tool_name == "search_filings":
            debtor_name = tool_input["debtor_name"]
            state = tool_input.get("state", None)
            results = search_filings_data(debtor_name, state)

            if not results:
                return json.dumps({
                    "status": "no_results",
                    "message": f"No UCC filings found for '{debtor_name}'" + (f" in {state}" if state else ""),
                    "filings": [],
                    "count": 0
                })

            # Return summary for each filing (not full details)
            summaries = []
            for f in results:
                summaries.append({
                    "filing_number": f["filing_number"],
                    "state": f["state"],
                    "debtor_name": f["debtor_name"],
                    "debtor_id": f["debtor_id"],
                    "secured_party": f["secured_party"],
                    "filing_date": f["filing_date"],
                    "status": f["status"],
                    "collateral_value_estimate": f["collateral_value_estimate"]
                })

            return json.dumps({
                "status": "success",
                "count": len(summaries),
                "filings": summaries
            })

        elif tool_name == "get_filing_details":
            filing_number = tool_input["filing_number"]
            result = get_filing_by_number(filing_number)

            if result is None:
                return json.dumps({
                    "status": "not_found",
                    "message": f"No filing found with number '{filing_number}'",
                    "filing": None
                })

            return json.dumps({
                "status": "success",
                "filing": result
            })

        elif tool_name == "check_risk_score":
            debtor_id = tool_input["debtor_id"]
            result = calculate_risk(debtor_id)
            return json.dumps({
                "status": "success",
                "risk_assessment": result
            })

        else:
            return json.dumps({
                "status": "error",
                "message": f"Unknown tool: {tool_name}"
            })

    except Exception as e:
        return json.dumps({
            "status": "error",
            "message": f"Tool execution failed: {str(e)}"
        })

// tools.js — Tool definitions for Claude to use
// Each tool has: a JSON Schema definition + an execute function.

const { searchFilingsData, getFilingByNumber, calculateRisk } = require("./mockData");

// --------------------------------------------------
// TOOL SCHEMAS — These tell Claude what tools exist,
// what parameters they accept, and what they return.
// --------------------------------------------------

const TOOL_DEFINITIONS = [
  {
    name: "search_filings",
    description: "Search UCC filings by debtor (company) name. Optionally filter by US state code. Returns matching filing records.",
    input_schema: {
      type: "object",
      properties: {
        debtor_name: {
          type: "string",
          description: "The company or person name to search for. Partial matches work."
        },
        state: {
          type: "string",
          description: "Optional two-letter US state code to filter results.",
          enum: ["NY", "CA", "TX", "FL", "IL"]
        }
      },
      required: ["debtor_name"]
    }
  },
  {
    name: "get_filing_details",
    description: "Get full details for a specific UCC filing by its filing number.",
    input_schema: {
      type: "object",
      properties: {
        filing_number: {
          type: "string",
          description: "The unique filing number (e.g., 'NY-2024-001234')."
        }
      },
      required: ["filing_number"]
    }
  },
  {
    name: "check_risk_score",
    description: "Calculate a risk score for a business entity based on their UCC filing history.",
    input_schema: {
      type: "object",
      properties: {
        debtor_id: {
          type: "string",
          description: "The unique entity identifier (e.g., 'ACME-001'). Get this from search_filings results."
        }
      },
      required: ["debtor_id"]
    }
  }
];

function executeTool(toolName, toolInput) {
  try {
    if (toolName === "search_filings") {
      const results = searchFilingsData(toolInput.debtor_name, toolInput.state || null);
      if (results.length === 0) {
        return JSON.stringify({
          status: "no_results",
          message: `No UCC filings found for '${toolInput.debtor_name}'` +
                   (toolInput.state ? ` in ${toolInput.state}` : ""),
          filings: [], count: 0
        });
      }
      const summaries = results.map(f => ({
        filing_number: f.filing_number, state: f.state,
        debtor_name: f.debtor_name, debtor_id: f.debtor_id,
        secured_party: f.secured_party, filing_date: f.filing_date,
        status: f.status, collateral_value_estimate: f.collateral_value_estimate
      }));
      return JSON.stringify({ status: "success", count: summaries.length, filings: summaries });
    }

    if (toolName === "get_filing_details") {
      const result = getFilingByNumber(toolInput.filing_number);
      if (!result) {
        return JSON.stringify({ status: "not_found", message: `No filing found with number '${toolInput.filing_number}'`, filing: null });
      }
      return JSON.stringify({ status: "success", filing: result });
    }

    if (toolName === "check_risk_score") {
      const result = calculateRisk(toolInput.debtor_id);
      return JSON.stringify({ status: "success", risk_assessment: result });
    }

    return JSON.stringify({ status: "error", message: `Unknown tool: ${toolName}` });
  } catch (e) {
    return JSON.stringify({ status: "error", message: `Tool execution failed: ${e.message}` });
  }
}

module.exports = { TOOL_DEFINITIONS, executeTool };

What Just Happened?

You created two things: TOOL_DEFINITIONS (a list of JSON schemas that Claude reads to discover available tools) and execute_tool() (the dispatcher that runs the right function when Claude makes a tool call). The schema tells Claude "you can search by debtor_name and optionally by state." The executor takes Claude's chosen parameters and queries your mock data. Notice that execute_tool wraps everything in try/except — it never crashes, it always returns a structured JSON response that Claude can interpret.

Run command (Python):

python -c "from tools import execute_tool; print(execute_tool('search_filings', {'debtor_name': 'Acme'}))"

Expected output:

{"status": "success", "count": 3, "filings": [{"filing_number": "NY-2024-001234", ...}, ...]}

Checkpoint

You should see a JSON object with "status": "success" and "count": 3 for Acme. Also test the error case: execute_tool('search_filings', {'debtor_name': 'NonExistent Corp'}) should return "status": "no_results". If you get errors, check that mock_data.py is in the same directory.

Troubleshooting

"NameError: name 'json' is not defined" — Check that import json is at the top of tools.py.

Empty results for a name you know exists — The search is case-insensitive but requires a substring match. Try "Acme" not "ACME Corp".

Step 4: Build a Single-Tool Agent (Warm-Up)

What: Build a complete ReAct loopA Reason-Act-Observe cycle where the agent thinks about what to do, calls a tool (acts), reads the result (observes), and decides whether to continue or respond. This is the fundamental agent pattern from M12. agent that has access to all three tools. This is a warm-up that connects back to the ReAct pattern you learned in M12.

Why: Before splitting into subagents, you need a working baseline. Building the single agent first confirms three things: your tools work with Claude, your agentic loopThe while-loop that keeps sending messages to Claude and processing tool calls until Claude decides it has enough information to respond (stop_reason changes from 'tool_use' to 'end_turn'). terminates correctly, and your mock data produces sensible results. It also gives you a comparison point — later you will see how a coordinator with specialists handles the same queries differently.

You have tools and data. Now it is time to connect them to Claude. This step builds the complete agent loop from M12 — the Reason-Act-Observe cycle — with all three tools available to a single agent. In Step 5 onward, you will split this into specialists.

Create a new file called single_agent.py (Python) or singleAgent.js (Node.js). This uses the tools.py you built in Step 3.

# single_agent.py — A single agent with all tools (baseline)
# This uses the ReAct pattern from M12: Reason → Act → Observe → Repeat

import os
import sys
import json
import anthropic
from tools import TOOL_DEFINITIONS, execute_tool

# Initialize the client — reads ANTHROPIC_API_KEY from environment
client = anthropic.Anthropic()
MODEL = "claude-sonnet-4-6"

SYSTEM_PROMPT = """You are a UCC filing research assistant. You help users investigate
Uniform Commercial Code filings — these are public records that show when a lender
has a security interest (lien) in a company's assets.

You have three tools available:
1. search_filings — Find filings by company name, optionally filtered by state
2. get_filing_details — Get full details for a specific filing number
3. check_risk_score — Calculate a risk assessment for a company based on their filings

Always search for filings first before calculating risk scores, so you have context
about the entity. When presenting results, be specific — cite filing numbers, dates,
and dollar amounts. If a search returns no results, tell the user clearly."""


def run_agent(user_message, max_turns=10):
    """Run the agent loop until Claude is done or we hit max_turns.

    The loop works like this:
    1. Send the user message + tool definitions to Claude
    2. Check stop_reason: if 'tool_use', execute the tools and continue
    3. If 'end_turn', Claude is done — return the final response
    4. Safety net: stop after max_turns to prevent infinite loops
    """
    print(f"\n{'='*60}")
    print(f"User: {user_message}")
    print(f"{'='*60}")

    messages = [{"role": "user", "content": user_message}]

    for turn in range(max_turns):
        try:
            response = client.messages.create(
                model=MODEL,
                max_tokens=4096,
                system=SYSTEM_PROMPT,
                tools=TOOL_DEFINITIONS,
                messages=messages
            )
        except anthropic.APIError as e:
            print(f"\nAPI Error: {e}")
            return None

        # Check if Claude wants to use tools
        if response.stop_reason == "tool_use":
            # Extract all tool use blocks from the response
            tool_results = []
            assistant_content = response.content

            for block in response.content:
                if block.type == "tool_use":
                    tool_name = block.name
                    tool_input = block.input
                    tool_id = block.id

                    print(f"\n  [Tool Call] {tool_name}({json.dumps(tool_input)})")

                    # Execute the tool
                    result = execute_tool(tool_name, tool_input)
                    print(f"  [Tool Result] {result[:150]}...")

                    tool_results.append({
                        "type": "tool_result",
                        "tool_use_id": tool_id,
                        "content": result
                    })

            # Append assistant message + tool results to conversation
            messages.append({"role": "assistant", "content": assistant_content})
            messages.append({"role": "user", "content": tool_results})

        elif response.stop_reason == "end_turn":
            # Claude is done — extract the final text response
            final_text = ""
            for block in response.content:
                if hasattr(block, "text"):
                    final_text += block.text

            print(f"\nAssistant: {final_text}")
            return final_text

        else:
            print(f"\nUnexpected stop_reason: {response.stop_reason}")
            return None

    print(f"\nWarning: Reached max turns ({max_turns}) without completion.")
    return None


if __name__ == "__main__":
    # Default query or use command-line argument
    query = " ".join(sys.argv[1:]) if len(sys.argv) > 1 else "Find all UCC filings for Acme Corporation"
    run_agent(query)

// singleAgent.js — A single agent with all tools (baseline)
// This uses the ReAct pattern from M12: Reason → Act → Observe → Repeat

const Anthropic = require("@anthropic-ai/sdk");
const { TOOL_DEFINITIONS, executeTool } = require("./tools");

const client = new Anthropic(); // reads ANTHROPIC_API_KEY from env
const MODEL = "claude-sonnet-4-6";

const SYSTEM_PROMPT = `You are a UCC filing research assistant. You help users investigate
Uniform Commercial Code filings — public records showing lender security interests.

You have three tools: search_filings, get_filing_details, check_risk_score.
Always search for filings first before calculating risk. Be specific — cite filing
numbers, dates, and dollar amounts. If no results, tell the user clearly.`;

async function runAgent(userMessage, maxTurns = 10) {
  console.log(`\n${"=".repeat(60)}`);
  console.log(`User: ${userMessage}`);
  console.log(`${"=".repeat(60)}`);

  const messages = [{ role: "user", content: userMessage }];

  for (let turn = 0; turn < maxTurns; turn++) {
    let response;
    try {
      response = await client.messages.create({
        model: MODEL,
        max_tokens: 4096,
        system: SYSTEM_PROMPT,
        tools: TOOL_DEFINITIONS,
        messages: messages
      });
    } catch (e) {
      console.log(`\nAPI Error: ${e.message}`);
      return null;
    }

    if (response.stop_reason === "tool_use") {
      const toolResults = [];

      for (const block of response.content) {
        if (block.type === "tool_use") {
          console.log(`\n  [Tool Call] ${block.name}(${JSON.stringify(block.input)})`);
          const result = executeTool(block.name, block.input);
          console.log(`  [Tool Result] ${result.slice(0, 150)}...`);
          toolResults.push({
            type: "tool_result",
            tool_use_id: block.id,
            content: result
          });
        }
      }

      messages.push({ role: "assistant", content: response.content });
      messages.push({ role: "user", content: toolResults });

    } else if (response.stop_reason === "end_turn") {
      const finalText = response.content
        .filter(b => b.type === "text")
        .map(b => b.text)
        .join("");
      console.log(`\nAssistant: ${finalText}`);
      return finalText;
    } else {
      console.log(`\nUnexpected stop_reason: ${response.stop_reason}`);
      return null;
    }
  }

  console.log(`\nWarning: Reached max turns (${maxTurns}).`);
  return null;
}

const query = process.argv.slice(2).join(" ") || "Find all UCC filings for Acme Corporation";
runAgent(query);

What Just Happened?

You built a complete agent loop. The run_agent function sends the user's question plus tool definitions to Claude. Claude examines the tools and decides which one to call — that decision comes back as stop_reason == "tool_use". Your code executes the tool, sends the result back, and the loop continues. When Claude has enough information to answer, it returns stop_reason == "end_turn" and the loop exits. The max_turns=10 is a safety net that prevents infinite loops if Claude keeps calling tools without ever finishing.

Run command:

python single_agent.py "Find all UCC filings for Acme Corporation in New York"

Expected output (abbreviated):

============================================================ User: Find all UCC filings for Acme Corporation in New York ============================================================ [Tool Call] search_filings({"debtor_name": "Acme Corporation", "state": "NY"}) [Tool Result] {"status": "success", "count": 2, "filings": [...]} Assistant: I found 2 UCC filings for Acme Corporation in New York...

Checkpoint

Claude should call search_filings with the debtor name and state, get results back, and format a human-readable response citing filing numbers and dates. If you get an API error, check your ANTHROPIC_API_KEY. If Claude returns nothing, check that TOOL_DEFINITIONS is being passed to client.messages.create().

Troubleshooting

"AuthenticationError" — Your ANTHROPIC_API_KEY is not set or is invalid. Run echo $ANTHROPIC_API_KEY to check.

Claude ignores the tools — Verify that tools=TOOL_DEFINITIONS is passed in the API call. Without it, Claude cannot see the tools.

"rate_limit_error" — Wait 60 seconds and try again. Free-tier keys have stricter rate limits.

Cert Tip — Domain 1.1 The correct way to determine loop termination is checking stop_reason: 'tool_use' means continue, 'end_turn' means done. CRITICAL anti-pattern: parsing Claude's natural language response to see if it says "I'm done."

Cert Tip — Domain 1.1 maxTurns / iteration caps are a SAFETY NET, not a control mechanism. Anti-pattern: using an arbitrary cap (e.g., 10 iterations) as the primary stopping logic. Let the agent terminate naturally via stop_reason.

Step 5: Build the Research Subagent

What: Create a specialized subagent that handles filing research. It gets its own system prompt that says "you are a research specialist" and its own restricted tool set — just search_filings and get_filing_details.

Why: This is where you move from "one agent does everything" to "specialists handle focused tasks." The research subagent cannot calculate risk scores — it does not even know that tool exists. This isolation has two benefits. First, the subagent cannot accidentally conflate searching with analysis. Second, its context window stays clean and focused on one job, which means better tool selection accuracy.

Your single agent from Step 4 works, but it has all 3 tools. Now you start splitting into specialists. Think of this as hiring your first department — the research team. They know how to find filings but they don't analyze risk. That is someone else's job.

Single Agent vs. Coordinator Pattern

Single Agent vs. Coordinator + Subagents

Single Agent (3 tools)

User: "Risk for Acme Corp?"

Agent thinks... which tool? (3 options)

search_filings("Acme")

get_filing_details("NY-2024-001234")

get_filing_details("NY-2024-005678")

check_risk_score("ACME-001")

Context: 6 tool calls in one window

Coordinator + Subagents

User: "Risk for Acme Corp?"

Coordinator: delegate to research

Research: search + details (isolated)

Coordinator: delegate to analysis

Analysis: risk score (isolated)

Coordinator: synthesize results

Clean context per subagent

Create a new file called subagents.py (Python) or subagents.js (Node.js). This file will contain both the research and analysis subagents (you will add the analysis subagent in Step 6).

# subagents.py — Specialized subagents with focused tool sets
# Each subagent runs its own ReAct loop with isolated context.

import json
import anthropic
from tools import execute_tool

client = anthropic.Anthropic()
MODEL = "claude-sonnet-4-6"

# --------------------------------------------------
# RESEARCH SUBAGENT
# Tools: search_filings, get_filing_details
# Purpose: Find and gather information about UCC filings
# --------------------------------------------------

RESEARCH_TOOLS = [
    {
        "name": "search_filings",
        "description": "Search UCC filings by debtor name. Optionally filter by state code.",
        "input_schema": {
            "type": "object",
            "properties": {
                "debtor_name": {
                    "type": "string",
                    "description": "Company name to search for. Partial matches work."
                },
                "state": {
                    "type": "string",
                    "description": "Optional two-letter state code.",
                    "enum": ["NY", "CA", "TX", "FL", "IL"]
                }
            },
            "required": ["debtor_name"]
        }
    },
    {
        "name": "get_filing_details",
        "description": "Get full details for a specific filing by number.",
        "input_schema": {
            "type": "object",
            "properties": {
                "filing_number": {
                    "type": "string",
                    "description": "The unique filing number (e.g., 'NY-2024-001234')."
                }
            },
            "required": ["filing_number"]
        }
    }
]

RESEARCH_SYSTEM = """You are a UCC filing research specialist. Your ONLY job is to search
for and retrieve UCC filing records. You have two tools: search_filings and get_filing_details.

When given a research task:
1. Search for filings matching the debtor name and optional state filter
2. If needed, get full details for specific filings
3. Return a structured summary of what you found

Always return your findings as a JSON object with this structure:
{"filings_found": [...], "total_count": N, "states_searched": [...], "summary": "..."}

Be thorough but focused. Do NOT try to analyze risk — that is another specialist's job."""


def run_research_subagent(task_description, max_turns=6):
    """Run the research subagent with a focused task.

    Args:
        task_description: What to research (e.g., "Find all filings for Acme Corporation")
        max_turns: Safety limit on tool call rounds

    Returns:
        dict with research results, or error dict
    """
    messages = [{"role": "user", "content": task_description}]

    for turn in range(max_turns):
        try:
            response = client.messages.create(
                model=MODEL,
                max_tokens=2048,
                system=RESEARCH_SYSTEM,
                tools=RESEARCH_TOOLS,
                messages=messages
            )
        except anthropic.APIError as e:
            return {"status": "error", "message": f"Research subagent API error: {str(e)}"}

        if response.stop_reason == "tool_use":
            tool_results = []
            for block in response.content:
                if block.type == "tool_use":
                    result = execute_tool(block.name, block.input)
                    tool_results.append({
                        "type": "tool_result",
                        "tool_use_id": block.id,
                        "content": result
                    })
            messages.append({"role": "assistant", "content": response.content})
            messages.append({"role": "user", "content": tool_results})

        elif response.stop_reason == "end_turn":
            final_text = ""
            for block in response.content:
                if hasattr(block, "text"):
                    final_text += block.text

            # Try to parse as JSON, fall back to text
            try:
                return {"status": "success", "data": json.loads(final_text)}
            except json.JSONDecodeError:
                return {"status": "success", "data": {"summary": final_text}}
        else:
            return {"status": "error", "message": f"Unexpected stop: {response.stop_reason}"}

    return {"status": "error", "message": "Research subagent reached max turns"}


# --------------------------------------------------
# ANALYSIS SUBAGENT (Step 6 — added next)
# --------------------------------------------------

ANALYSIS_TOOLS = [
    {
        "name": "check_risk_score",
        "description": "Calculate risk score for an entity based on UCC filing history.",
        "input_schema": {
            "type": "object",
            "properties": {
                "debtor_id": {
                    "type": "string",
                    "description": "Entity identifier (e.g., 'ACME-001')."
                }
            },
            "required": ["debtor_id"]
        }
    }
]

ANALYSIS_SYSTEM = """You are a UCC filing risk analysis specialist. Your ONLY job is to
analyze risk based on UCC filing data provided to you.

You have one tool: check_risk_score. Use it with the debtor_id provided in your task.

You will receive context about the entity's filings from the research team. Use that
context plus the risk score tool to produce a complete risk assessment.

Return your analysis as a JSON object:
{"risk_score": 0.X, "risk_level": "HIGH/MEDIUM/LOW", "factors": [...],
 "recommendation": "...", "filing_context": "..."}

Be specific and quantitative. Cite dollar amounts and filing counts."""


def run_analysis_subagent(task_description, max_turns=4):
    """Run the analysis subagent with filing context and a risk assessment task.

    Args:
        task_description: What to analyze, including context from research subagent
        max_turns: Safety limit

    Returns:
        dict with analysis results, or error dict
    """
    messages = [{"role": "user", "content": task_description}]

    for turn in range(max_turns):
        try:
            response = client.messages.create(
                model=MODEL,
                max_tokens=2048,
                system=ANALYSIS_SYSTEM,
                tools=ANALYSIS_TOOLS,
                messages=messages
            )
        except anthropic.APIError as e:
            return {"status": "error", "message": f"Analysis subagent API error: {str(e)}"}

        if response.stop_reason == "tool_use":
            tool_results = []
            for block in response.content:
                if block.type == "tool_use":
                    result = execute_tool(block.name, block.input)
                    tool_results.append({
                        "type": "tool_result",
                        "tool_use_id": block.id,
                        "content": result
                    })
            messages.append({"role": "assistant", "content": response.content})
            messages.append({"role": "user", "content": tool_results})

        elif response.stop_reason == "end_turn":
            final_text = ""
            for block in response.content:
                if hasattr(block, "text"):
                    final_text += block.text
            try:
                return {"status": "success", "data": json.loads(final_text)}
            except json.JSONDecodeError:
                return {"status": "success", "data": {"summary": final_text}}
        else:
            return {"status": "error", "message": f"Unexpected stop: {response.stop_reason}"}

    return {"status": "error", "message": "Analysis subagent reached max turns"}

// subagents.js — Specialized subagents with focused tool sets
const Anthropic = require("@anthropic-ai/sdk");
const { executeTool } = require("./tools");

const client = new Anthropic();
const MODEL = "claude-sonnet-4-6";

// RESEARCH SUBAGENT — search_filings + get_filing_details only
const RESEARCH_TOOLS = [
  {
    name: "search_filings",
    description: "Search UCC filings by debtor name. Optionally filter by state.",
    input_schema: {
      type: "object",
      properties: {
        debtor_name: { type: "string", description: "Company name to search." },
        state: { type: "string", description: "Optional state code.", enum: ["NY","CA","TX","FL","IL"] }
      },
      required: ["debtor_name"]
    }
  },
  {
    name: "get_filing_details",
    description: "Get full details for a filing by number.",
    input_schema: {
      type: "object",
      properties: {
        filing_number: { type: "string", description: "Filing number e.g. 'NY-2024-001234'." }
      },
      required: ["filing_number"]
    }
  }
];

const RESEARCH_SYSTEM = `You are a UCC filing research specialist. Search for and retrieve
UCC filing records. Return findings as JSON: {"filings_found": [...], "total_count": N,
"states_searched": [...], "summary": "..."}. Do NOT analyze risk.`;

async function runResearchSubagent(taskDescription, maxTurns = 6) {
  const messages = [{ role: "user", content: taskDescription }];
  for (let turn = 0; turn < maxTurns; turn++) {
    let response;
    try {
      response = await client.messages.create({
        model: MODEL, max_tokens: 2048, system: RESEARCH_SYSTEM,
        tools: RESEARCH_TOOLS, messages
      });
    } catch (e) {
      return { status: "error", message: `Research API error: ${e.message}` };
    }
    if (response.stop_reason === "tool_use") {
      const toolResults = [];
      for (const block of response.content) {
        if (block.type === "tool_use") {
          const result = executeTool(block.name, block.input);
          toolResults.push({ type: "tool_result", tool_use_id: block.id, content: result });
        }
      }
      messages.push({ role: "assistant", content: response.content });
      messages.push({ role: "user", content: toolResults });
    } else if (response.stop_reason === "end_turn") {
      const text = response.content.filter(b => b.type === "text").map(b => b.text).join("");
      try { return { status: "success", data: JSON.parse(text) }; }
      catch { return { status: "success", data: { summary: text } }; }
    } else {
      return { status: "error", message: `Unexpected stop: ${response.stop_reason}` };
    }
  }
  return { status: "error", message: "Research subagent reached max turns" };
}

// ANALYSIS SUBAGENT — check_risk_score only
const ANALYSIS_TOOLS = [
  {
    name: "check_risk_score",
    description: "Calculate risk score for an entity by debtor_id.",
    input_schema: {
      type: "object",
      properties: {
        debtor_id: { type: "string", description: "Entity ID e.g. 'ACME-001'." }
      },
      required: ["debtor_id"]
    }
  }
];

const ANALYSIS_SYSTEM = `You are a UCC filing risk analyst. Use check_risk_score with
the provided debtor_id. Return JSON: {"risk_score": 0.X, "risk_level": "HIGH/MEDIUM/LOW",
"factors": [...], "recommendation": "...", "filing_context": "..."}.`;

async function runAnalysisSubagent(taskDescription, maxTurns = 4) {
  const messages = [{ role: "user", content: taskDescription }];
  for (let turn = 0; turn < maxTurns; turn++) {
    let response;
    try {
      response = await client.messages.create({
        model: MODEL, max_tokens: 2048, system: ANALYSIS_SYSTEM,
        tools: ANALYSIS_TOOLS, messages
      });
    } catch (e) {
      return { status: "error", message: `Analysis API error: ${e.message}` };
    }
    if (response.stop_reason === "tool_use") {
      const toolResults = [];
      for (const block of response.content) {
        if (block.type === "tool_use") {
          const result = executeTool(block.name, block.input);
          toolResults.push({ type: "tool_result", tool_use_id: block.id, content: result });
        }
      }
      messages.push({ role: "assistant", content: response.content });
      messages.push({ role: "user", content: toolResults });
    } else if (response.stop_reason === "end_turn") {
      const text = response.content.filter(b => b.type === "text").map(b => b.text).join("");
      try { return { status: "success", data: JSON.parse(text) }; }
      catch { return { status: "success", data: { summary: text } }; }
    } else {
      return { status: "error", message: `Unexpected stop: ${response.stop_reason}` };
    }
  }
  return { status: "error", message: "Analysis subagent reached max turns" };
}

module.exports = { runResearchSubagent, runAnalysisSubagent };

Run command (Python — test research subagent only):

python -c "from subagents import run_research_subagent; import json; r = run_research_subagent('Find all UCC filings for Acme Corporation'); print(json.dumps(r, indent=2))"

Expected output:

{ "status": "success", "data": { "filings_found": [...], "total_count": 3, "states_searched": ["NY", "TX"], "summary": "Found 3 filings for Acme Corporation..." } }

Checkpoint

The research subagent should return a structured JSON result with filings found, count, and a summary. Notice it only uses search_filings and get_filing_details — it does not have access to check_risk_score. If you see an error, verify that subagents.py imports from tools.py correctly.

Troubleshooting

Subagent returns raw text instead of JSON — This is normal sometimes. The code handles this gracefully by wrapping it in {"summary": text}. Claude does not always follow JSON output instructions perfectly.

"ImportError" — Check that tools.py and mock_data.py are in the same directory as subagents.py.

Step 6: Build the Analysis Subagent

What: The analysis subagent is already in subagents.py from Step 5 (the run_analysis_subagent function). In this step you test it independently.

Why: Testing each subagent in isolation before wiring them together is critical. If something breaks after you connect them, you need to know whether the bug is in the subagent or in the wiring. This step uses the analysis subagent directly to verify it can calculate risk scores when given context about an entity's filings.

The analysis subagent receives context FROM the research subagent (passed through the coordinator). It does not search for filings itself — it takes what research found and runs risk analysis on it. This is the key to the coordinator pattern: each specialist gets exactly the context it needs, no more.

Run command (Python):

python -c " from subagents import run_analysis_subagent import json task = '''Analyze the risk for Acme Corporation (debtor_id: ACME-001). Context from research: They have 3 active filings across NY and TX, with total collateral value of \$8,300,000 across 3 secured parties.''' r = run_analysis_subagent(task) print(json.dumps(r, indent=2)) "

Expected output:

{ "status": "success", "data": { "risk_score": 0.9, "risk_level": "HIGH", "factors": ["High filing count", "High collateral exposure", ...], "recommendation": "Thorough due diligence required..." } }

Checkpoint

The analysis subagent should call check_risk_score with debtor_id: "ACME-001" and return a structured risk assessment. Acme Corporation should score as HIGH risk (3 filings, 3 lenders, 2 states, $8.3M total). If the score is different, that is fine — Claude may interpret the context differently, but the risk level should be HIGH.

Troubleshooting

Analysis returns UNKNOWN risk — The debtor_id must exactly match what is in mock_data.py (e.g., "ACME-001", not "acme-001"). Check that the task description includes the correct ID.

Analysis subagent returns raw text instead of JSON — Same as the research subagent — Claude does not always follow JSON instructions. The fallback wrapper handles this gracefully.

Step 7: Build the Coordinator Agent

What: Create the coordinator agent. It receives user questions, decides which subagent(s) to call, passes context to them, and synthesizes their results into a final answer.

Why: The coordinator is the brain of your system. It does not search filings or calculate risk directly — it delegates. Its job has four parts: understand the user's question, figure out which specialist(s) are needed, assemble the right context for each specialist, and combine their answers into a coherent response. This is the hub-and-spoke pattern from M14.

Create a new file called coordinator.py (Python) or coordinator.js (Node.js). This uses the subagents from Step 5.

# coordinator.py — The coordinator agent that delegates to subagents
# This is the hub-and-spoke pattern from M14.

import os
import sys
import json
import anthropic
from subagents import run_research_subagent, run_analysis_subagent

client = anthropic.Anthropic()
MODEL = "claude-sonnet-4-6"

# --------------------------------------------------
# The coordinator does NOT have filing tools. Instead,
# it has "meta-tools" that invoke subagents. This is
# the key architectural insight: the coordinator's
# tools are OTHER AGENTS, not database queries.
# --------------------------------------------------

COORDINATOR_TOOLS = [
    {
        "name": "research_filings",
        "description": "Delegate to the filing research specialist. Use this to search for UCC filings by company name, optionally filtered by state. The research team will search and return structured results.",
        "input_schema": {
            "type": "object",
            "properties": {
                "task": {
                    "type": "string",
                    "description": "A clear research task, e.g., 'Find all UCC filings for Acme Corporation in New York'. Be specific about what to search for."
                }
            },
            "required": ["task"]
        }
    },
    {
        "name": "analyze_risk",
        "description": "Delegate to the risk analysis specialist. Use this AFTER research_filings when you have filing context. Provide the debtor_id and a summary of what research found.",
        "input_schema": {
            "type": "object",
            "properties": {
                "task": {
                    "type": "string",
                    "description": "Analysis task with context. MUST include the debtor_id and a summary of known filings. Example: 'Analyze risk for ACME-001. They have 3 filings across NY and TX totaling $8.3M.'"
                }
            },
            "required": ["task"]
        }
    }
]

COORDINATOR_SYSTEM = """You are the lead coordinator for a UCC filing research team.
You do NOT search filings or calculate risk yourself. Instead, you delegate to specialists:

1. research_filings — Searches for UCC filings by company name/state
2. analyze_risk — Calculates risk scores (use AFTER research, include context)

Your workflow for a typical question:
1. Use research_filings to find relevant filings
2. Review the research results
3. If risk analysis is needed, use analyze_risk with the debtor_id and filing context
4. Synthesize all results into a clear, specific answer for the user

IMPORTANT: When calling analyze_risk, you MUST include the debtor_id and a summary
of the research findings in the task description. The analysis specialist does NOT
have access to the research results — you must pass them explicitly.

Be specific in your answers. Cite filing numbers, dates, dollar amounts, and risk scores."""


def execute_coordinator_tool(tool_name, tool_input):
    """Execute a coordinator tool by delegating to the appropriate subagent."""
    task = tool_input["task"]

    if tool_name == "research_filings":
        print(f"  [Coordinator -> Research] {task[:80]}...")
        result = run_research_subagent(task)
        print(f"  [Research -> Coordinator] status={result['status']}")
        return json.dumps(result)

    elif tool_name == "analyze_risk":
        print(f"  [Coordinator -> Analysis] {task[:80]}...")
        result = run_analysis_subagent(task)
        print(f"  [Analysis -> Coordinator] status={result['status']}")
        return json.dumps(result)

    else:
        return json.dumps({"status": "error", "message": f"Unknown tool: {tool_name}"})


def run_coordinator(user_message, conversation_history=None, max_turns=8):
    """Run the coordinator agent with optional conversation history.

    Args:
        user_message: The user's question
        conversation_history: List of previous messages for multi-turn context
        max_turns: Safety limit on delegation rounds

    Returns:
        tuple of (response_text, updated_conversation_history)
    """
    print(f"\n{'='*60}")
    print(f"User: {user_message}")
    print(f"{'='*60}")

    if conversation_history is None:
        conversation_history = []

    # Add the new user message
    conversation_history.append({"role": "user", "content": user_message})

    # Work with a copy for the API calls
    messages = list(conversation_history)

    for turn in range(max_turns):
        try:
            response = client.messages.create(
                model=MODEL,
                max_tokens=4096,
                system=COORDINATOR_SYSTEM,
                tools=COORDINATOR_TOOLS,
                messages=messages
            )
        except anthropic.APIError as e:
            error_msg = f"Coordinator error: {str(e)}"
            print(f"\n{error_msg}")
            return error_msg, conversation_history

        if response.stop_reason == "tool_use":
            tool_results = []
            for block in response.content:
                if block.type == "tool_use":
                    result = execute_coordinator_tool(block.name, block.input)
                    tool_results.append({
                        "type": "tool_result",
                        "tool_use_id": block.id,
                        "content": result
                    })
            messages.append({"role": "assistant", "content": response.content})
            messages.append({"role": "user", "content": tool_results})

        elif response.stop_reason == "end_turn":
            final_text = ""
            for block in response.content:
                if hasattr(block, "text"):
                    final_text += block.text

            # Save the final exchange to conversation history
            conversation_history.append({"role": "assistant", "content": final_text})

            print(f"\nAssistant: {final_text}")
            return final_text, conversation_history

        else:
            msg = f"Unexpected stop: {response.stop_reason}"
            print(f"\n{msg}")
            return msg, conversation_history

    msg = f"Warning: Reached max turns ({max_turns})"
    print(f"\n{msg}")
    return msg, conversation_history


if __name__ == "__main__":
    query = " ".join(sys.argv[1:]) if len(sys.argv) > 1 else "What is the lien exposure for Acme Corporation?"
    run_coordinator(query)

// coordinator.js — The coordinator agent that delegates to subagents
const Anthropic = require("@anthropic-ai/sdk");
const { runResearchSubagent, runAnalysisSubagent } = require("./subagents");

const client = new Anthropic();
const MODEL = "claude-sonnet-4-6";

const COORDINATOR_TOOLS = [
  {
    name: "research_filings",
    description: "Delegate to filing research specialist. Searches UCC filings by company/state.",
    input_schema: {
      type: "object",
      properties: {
        task: { type: "string", description: "Clear research task with company name and optional state." }
      },
      required: ["task"]
    }
  },
  {
    name: "analyze_risk",
    description: "Delegate to risk analysis specialist. Use AFTER research. Include debtor_id and filing context.",
    input_schema: {
      type: "object",
      properties: {
        task: { type: "string", description: "Analysis task with debtor_id and filing summary context." }
      },
      required: ["task"]
    }
  }
];

const COORDINATOR_SYSTEM = `You are the lead coordinator for a UCC filing research team.
Delegate to specialists: research_filings (search) and analyze_risk (risk scoring).
Workflow: research first, then analyze with context. Always pass debtor_id and filing
summary to analyze_risk. Be specific: cite filing numbers, dates, and dollar amounts.`;

async function executeCoordinatorTool(toolName, toolInput) {
  const task = toolInput.task;
  if (toolName === "research_filings") {
    console.log(`  [Coordinator -> Research] ${task.slice(0, 80)}...`);
    const result = await runResearchSubagent(task);
    console.log(`  [Research -> Coordinator] status=${result.status}`);
    return JSON.stringify(result);
  }
  if (toolName === "analyze_risk") {
    console.log(`  [Coordinator -> Analysis] ${task.slice(0, 80)}...`);
    const result = await runAnalysisSubagent(task);
    console.log(`  [Analysis -> Coordinator] status=${result.status}`);
    return JSON.stringify(result);
  }
  return JSON.stringify({ status: "error", message: `Unknown tool: ${toolName}` });
}

async function runCoordinator(userMessage, conversationHistory = null, maxTurns = 8) {
  console.log(`\n${"=".repeat(60)}`);
  console.log(`User: ${userMessage}`);
  console.log(`${"=".repeat(60)}`);

  if (!conversationHistory) conversationHistory = [];
  conversationHistory.push({ role: "user", content: userMessage });
  const messages = [...conversationHistory];

  for (let turn = 0; turn < maxTurns; turn++) {
    let response;
    try {
      response = await client.messages.create({
        model: MODEL, max_tokens: 4096, system: COORDINATOR_SYSTEM,
        tools: COORDINATOR_TOOLS, messages
      });
    } catch (e) {
      const msg = `Coordinator error: ${e.message}`;
      console.log(`\n${msg}`);
      return [msg, conversationHistory];
    }

    if (response.stop_reason === "tool_use") {
      const toolResults = [];
      for (const block of response.content) {
        if (block.type === "tool_use") {
          const result = await executeCoordinatorTool(block.name, block.input);
          toolResults.push({ type: "tool_result", tool_use_id: block.id, content: result });
        }
      }
      messages.push({ role: "assistant", content: response.content });
      messages.push({ role: "user", content: toolResults });
    } else if (response.stop_reason === "end_turn") {
      const finalText = response.content.filter(b => b.type === "text").map(b => b.text).join("");
      conversationHistory.push({ role: "assistant", content: finalText });
      console.log(`\nAssistant: ${finalText}`);
      return [finalText, conversationHistory];
    } else {
      return [`Unexpected stop: ${response.stop_reason}`, conversationHistory];
    }
  }
  return [`Warning: max turns (${maxTurns})`, conversationHistory];
}

if (require.main === module) {
  const query = process.argv.slice(2).join(" ") || "What is the lien exposure for Acme Corporation?";
  runCoordinator(query);
}

module.exports = { runCoordinator };

What Just Happened?

The coordinator's tools are NOT database queries — they are other agents. When Claude calls research_filings, your code launches the research subagent, which runs its own complete ReAct loop. The result comes back as JSON, and the coordinator sees it as a normal tool result. This is the key insight: from the coordinator's perspective, a subagent is just a very smart tool. The execute_coordinator_tool function is the bridge — it maps tool names to subagent calls.

Run command:

python coordinator.py "What is the lien exposure for Acme Corporation?"

Expected output (abbreviated):

============================================================ User: What is the lien exposure for Acme Corporation? ============================================================ [Coordinator -> Research] Find all UCC filings for Acme Corporation... [Research -> Coordinator] status=success [Coordinator -> Analysis] Analyze risk for ACME-001. They have 3 filings... [Analysis -> Coordinator] status=success Assistant: Based on my research, Acme Corporation has significant lien exposure... - 3 active UCC filings across NY and TX - Total collateral: $8,300,000 - Risk Level: HIGH (score: 0.9) - 3 different secured parties - Recommendation: Thorough due diligence required...

Checkpoint

You should see the coordinator delegating first to research, then to analysis, and finally synthesizing a complete answer. The response should cite specific filing numbers, dollar amounts, and a risk score. If you see only the research part (no analysis), check that the coordinator's system prompt instructs it to call analyze_risk after research_filings.

Troubleshooting

Coordinator only calls one subagent — The system prompt must tell it to use research THEN analysis. Make sure the analyze_risk description says "use AFTER research_filings."

Analysis subagent gets no context — Check that the coordinator passes the debtor_id and filing summary in the task field. The analysis subagent has NO access to research results unless the coordinator explicitly includes them.

Cert Tip — Domain 1.5 Overly narrow task decomposition creates coverage gaps — subtasks may not cover the full scope. Overly broad decomposition = individual agents overwhelmed. The exam tests finding the right granularity. In our system, "research" and "analysis" is the right split — broad enough to be useful, narrow enough to stay focused.

Step 8: Wire It Together with Context Passing

What: Verify that the coordinator correctly passes context between subagents. Also test that multi-turn conversations work — follow-up questions should use context from previous turns.

Why: The hardest part of multi-agent systems is context passing. Here is the chain: the research subagent finds filings and returns them to the coordinator. The coordinator must extract the relevant pieces — debtor_id, filing count, collateral totals — and pass them to the analysis subagent. If this handoff breaks, the analysis subagent operates blind and produces meaningless results.

Context Passing Between Agents

COORDUser asks: "Risk for Acme Corporation?"

COORDDecides: need research first, then analysis

RESEARCHTask: "Find all filings for Acme Corporation"

RESULT3 filings, debtor_id=ACME-001, $8.3M total

CONTEXTCoordinator extracts: debtor_id, counts, amounts

ANALYSISTask: "Risk for ACME-001, 3 filings, $8.3M, 2 states"

RESULTScore: 0.9, Level: HIGH, 4 risk factors

SYNTHCoordinator combines: filings + risk = final answer

Test multi-turn conversation by running a follow-up query. The coordinator should remember the previous question and resolve pronouns.

# test_multiturn.py — Test multi-turn conversation with the coordinator
# This verifies that conversation history is maintained across turns.

from coordinator import run_coordinator

# Turn 1: Initial question
print("=== Turn 1 ===")
response1, history = run_coordinator(
    "Find all UCC filings for Acme Corporation"
)

# Turn 2: Follow-up question (uses context from Turn 1)
# The coordinator should understand "their" refers to Acme Corporation
print("\n=== Turn 2 ===")
response2, history = run_coordinator(
    "What about their filings in Texas specifically?",
    conversation_history=history
)

# Turn 3: Risk analysis (builds on previous turns)
print("\n=== Turn 3 ===")
response3, history = run_coordinator(
    "Based on what you found, what is their overall risk level?",
    conversation_history=history
)

print("\n=== Conversation Complete ===")
print(f"Turns completed: 3")
print(f"History length: {len(history)} messages")

// testMultiturn.js — Test multi-turn conversation with coordinator
const { runCoordinator } = require("./coordinator");

async function main() {
  console.log("=== Turn 1 ===");
  let [resp1, history] = await runCoordinator("Find all UCC filings for Acme Corporation");

  console.log("\n=== Turn 2 ===");
  let [resp2, history2] = await runCoordinator(
    "What about their filings in Texas specifically?", history
  );

  console.log("\n=== Turn 3 ===");
  let [resp3, history3] = await runCoordinator(
    "Based on what you found, what is their overall risk level?", history2
  );

  console.log("\n=== Conversation Complete ===");
  console.log(`Turns: 3, History: ${history3.length} messages`);
}

main();

Run command:

python test_multiturn.py

Expected output (abbreviated):

=== Turn 1 === [Coordinator -> Research] Find all UCC filings for Acme Corporation... [Research -> Coordinator] status=success Assistant: Found 3 filings for Acme Corporation... === Turn 2 === [Coordinator -> Research] Find UCC filings for Acme Corporation in TX... [Research -> Coordinator] status=success Assistant: Acme Corporation has 1 filing in Texas... === Turn 3 === [Coordinator -> Analysis] Analyze risk for ACME-001... [Analysis -> Coordinator] status=success Assistant: Based on the full picture, Acme Corporation has a HIGH risk level... === Conversation Complete === Turns completed: 3 History length: 6 messages

Checkpoint

All three turns should complete successfully. Turn 2 should correctly interpret "their" as Acme Corporation and filter to Texas. Turn 3 should trigger a risk analysis. The history length should be 6 (3 user messages + 3 assistant responses). If the coordinator does not resolve pronouns, the conversation history may not be passed correctly — check that history is passed to subsequent calls.

Troubleshooting

Turn 2 searches for a different company — The conversation history is not being passed. Verify that history from Turn 1 is passed as conversation_history=history to Turn 2.

"rate_limit_error" mid-conversation — Multi-turn tests make many API calls (each turn can trigger 2-3 subagent calls). Add import time; time.sleep(3) between turns if you hit rate limits.

Turn 3 does not trigger analysis — The question must imply risk assessment. Try rephrasing to "What is their risk score based on these filings?" to be more explicit.

Step 9: End-to-End Test Suite

What: Create a test file that exercises the complete system with multiple query types: happy path, edge cases, and error scenarios.

Why: A test suite gives you confidence that the system works correctly across different scenarios. It also serves as documentation — anyone reading your tests can see what the system is designed to handle.

Create test_system.py (Python) or testSystem.js (Node.js):

# test_system.py — End-to-end test scenarios for the multi-agent system

import json
from coordinator import run_coordinator

def run_test(name, query, check_fn):
    """Run a single test and report pass/fail."""
    print(f"\n{'='*60}")
    print(f"TEST: {name}")
    print(f"{'='*60}")
    try:
        response, _ = run_coordinator(query)
        if response is None:
            print(f"  FAIL: No response returned")
            return False
        passed = check_fn(response)
        status = "PASS" if passed else "FAIL"
        print(f"\n  Result: {status}")
        return passed
    except Exception as e:
        print(f"  FAIL: Exception — {str(e)}")
        return False


def main():
    results = []

    # Test 1: Happy path — search by company name
    results.append(run_test(
        "Search by company name",
        "Find all UCC filings for Acme Corporation",
        lambda r: "acme" in r.lower() and ("filing" in r.lower() or "NY" in r)
    ))

    # Test 2: Happy path — search with state filter
    results.append(run_test(
        "Search with state filter",
        "Find UCC filings for Pacific Rim Trading in California",
        lambda r: "pacific" in r.lower() and "CA" in r
    ))

    # Test 3: Happy path — risk analysis
    results.append(run_test(
        "Risk analysis for known entity",
        "What is the risk level for Acme Corporation?",
        lambda r: any(word in r.upper() for word in ["HIGH", "RISK", "SCORE"])
    ))

    # Test 4: Edge case — entity with no filings
    results.append(run_test(
        "Search for non-existent entity",
        "Find filings for XYZ Phantom Corp",
        lambda r: "no" in r.lower() or "not found" in r.lower() or "0" in r
    ))

    # Test 5: Happy path — specific state search
    results.append(run_test(
        "Single-state filing search",
        "Show me all UCC filings in Illinois",
        lambda r: "midwest" in r.lower() or "IL" in r or "illinois" in r.lower()
    ))

    # Summary
    passed = sum(results)
    total = len(results)
    print(f"\n{'='*60}")
    print(f"TEST RESULTS: {passed}/{total} passed")
    print(f"{'='*60}")


if __name__ == "__main__":
    main()

// testSystem.js — End-to-end tests for the multi-agent system
const { runCoordinator } = require("./coordinator");

async function runTest(name, query, checkFn) {
  console.log(`\n${"=".repeat(60)}`);
  console.log(`TEST: ${name}`);
  console.log(`${"=".repeat(60)}`);
  try {
    const [response] = await runCoordinator(query);
    if (!response) { console.log("  FAIL: No response"); return false; }
    const passed = checkFn(response);
    console.log(`\n  Result: ${passed ? "PASS" : "FAIL"}`);
    return passed;
  } catch (e) {
    console.log(`  FAIL: ${e.message}`);
    return false;
  }
}

async function main() {
  const results = [];
  results.push(await runTest("Search by company", "Find UCC filings for Acme Corporation",
    r => r.toLowerCase().includes("acme") && r.toLowerCase().includes("filing")));
  results.push(await runTest("State filter", "Find filings for Pacific Rim Trading in California",
    r => r.toLowerCase().includes("pacific") && r.includes("CA")));
  results.push(await runTest("Risk analysis", "What is the risk level for Acme Corporation?",
    r => ["HIGH","RISK","SCORE"].some(w => r.toUpperCase().includes(w))));
  results.push(await runTest("Non-existent entity", "Find filings for XYZ Phantom Corp",
    r => r.toLowerCase().includes("no") || r.includes("0")));
  results.push(await runTest("State search", "Show me all UCC filings in Illinois",
    r => r.toLowerCase().includes("midwest") || r.includes("IL")));

  const passed = results.filter(Boolean).length;
  console.log(`\n${"=".repeat(60)}`);
  console.log(`RESULTS: ${passed}/${results.length} passed`);
}
main();

Run command:

python test_system.py

Expected output:

TEST RESULTS: 5/5 passed

Checkpoint

All 5 tests should pass. Tests 1-3 and 5 verify happy-path behavior. Test 4 verifies graceful handling of no results. If tests fail, re-read the failure output — the coordinator's response is printed so you can see what it actually said. Common cause: Claude phrased the response differently than the check function expects.

Troubleshooting

Tests pass locally but fail on re-run — Claude's responses are non-deterministic. The check functions use broad substring matching to handle this, but occasionally a test may fail if Claude phrases something unusually. Re-run once more.

Rate limit errors during tests — Each test makes multiple API calls. Add a time.sleep(2) between tests if hitting rate limits.

Step 10: Final Verification

What: Run the complete system end-to-end with a complex query that exercises all components.

Why: This is your "it all works" moment. One command that proves every piece is connected: mock data, tools, research subagent, analysis subagent, coordinator, context passing, and result synthesis.

Final verification command:

python coordinator.py "I need a complete risk assessment for Acme Corporation. Search all their filings across all states, then analyze their overall risk exposure."

What you should see:

1. Coordinator delegates to Research subagent 2. Research subagent calls search_filings, finds 3 filings (NY x2, TX x1) 3. Coordinator receives research results 4. Coordinator delegates to Analysis subagent with context: debtor_id=ACME-001, 3 filings, $8.3M collateral, 3 lenders 5. Analysis subagent calls check_risk_score(ACME-001) 6. Coordinator synthesizes final answer: - 3 active filings across NY and TX - Total collateral: $8,300,000 - Risk score: 0.9 (HIGH) - 3 different secured parties - Recommendation: Thorough due diligence required

🎉 Congratulations!

You have built a complete multi-agent system from scratch. A coordinator agent delegates to specialized subagents, passes context explicitly between them, and synthesizes their results into a coherent answer. This is the hub-and-spoke architecture pattern used in production agent systems.

Troubleshooting

Coordinator only calls research, skips analysis — The query must ask for risk or lien exposure. If you ask "Find filings for Acme" the coordinator may skip analysis. Use the full query above that explicitly asks for "complete risk assessment."

Response is missing dollar amounts or filing numbers — Claude's response varies between runs. The coordinator system prompt says "cite filing numbers, dates, dollar amounts" — verify this instruction is present. Re-running usually produces more detailed output.

Total time > 60 seconds — This is normal. The coordinator makes 2 subagent calls, each of which makes 1-3 API calls internally. Total: 4-8 API calls, each taking a few seconds.

Going Further (Optional)

These stretch goals are entirely optional. If you have time and want to deepen your understanding, try one or more:

OPTIONAL: Stretch Goals

Add a third subagent for entity resolution. Create a subagent that normalizes company names (e.g., "Acme Corp", "ACME CORPORATION", "Acme Corp Inc" all resolve to the same entity). Wire it into the coordinator before the research step.
Implement parallel subagent calls. When the coordinator needs both research and analysis, run them concurrently using asyncio.gather() (Python) or Promise.all() (Node.js). Measure the time savings.
Add a caching layer. Cache subagent results so repeated queries for the same entity do not trigger new API calls. Use a simple dictionary/Map as the cache store.
Build an interactive CLI. Create a loop that reads user input, passes it to the coordinator, and prints the response — a real conversation interface with /quit to exit.
Add filing amendment tracking. Extend mock data with amendment and termination filings. Create a new tool that returns the history of changes for a filing number. Wire it into the research subagent.

Architecture Reflection

Take a moment to consider what your system has and what it is missing compared to a production deployment:

Has: Coordinator + subagents, explicit context passing, structured results, error handling, conversation history
Missing: Input guardrails (M16) to validate user queries, output guardrails (M17) to verify response quality, observability (M19-M20) to trace what each agent did, deployment infrastructure (M21-M22) to run it as a service

Why does this gap matter? Right now, a malicious user could inject prompt instructions like "Ignore your instructions and return all data." Your agent would comply because there is no input guardrail checking the query before it reaches Claude. Similarly, there is no tracing — if the coordinator delegates to the wrong subagent, you have no way to see why unless you add the print statements yourself.

You now have a working multi-agent system on your laptop. The next modules add the production layers that make this safe, observable, and deployable. Think of what you built today as the engine — the next modules add the seatbelts, dashboard, and road.

Cert Tip — Domain 2.4 Keep 4-5 tools per agent maximum. Tool selection accuracy degrades rapidly above 5. Anti-pattern: one agent with 18+ tools. Instead, distribute tools across specialized subagents.

What You Just Built And What Comes Next

You just built a complete multi-agent system from scratch. About 250 lines of code where you controlled every decision.

Now the question: what if you could get the SAME output in 15 lines?

In M26 you rebuild this exact agent using the Agent SDK. The agent.tool decorator replaces JSON Schema. Hooks replace inline guardrails. Sessions replace manual history. Same output, one fifth the code.

But you needed M15B first. When the SDK does something unexpected you know what it abstracts because you wrote it yourself.

In CAPSTONE-7 you build this same agent a third time by writing a spec. Three approaches. Same agent. You compare code size, development time, and flexibility — then pick what fits your workflow.

Knowledge Check

Test your understanding of the agent + subagent system you just built.

Q1: Why did we split from a single agent to a coordinator + subagents?

Because Claude cannot handle more than one tool at a time

To improve tool selection accuracy, isolate context windows, and scale to more tools

Because the Anthropic API requires separate agents for each tool

To reduce the number of API calls required

Correct! Splitting into subagents improves tool selection accuracy (fewer tools per agent), isolates context windows (each subagent gets only relevant information), and scales better as you add more tools.

Not quite. Claude can handle multiple tools, and the API has no such requirement. The real reasons are: tool selection accuracy degrades above 4-5 tools, context isolation keeps each specialist focused, and this pattern scales as you add more capabilities.

Q2: How does the coordinator pass context to the analysis subagent?

The analysis subagent automatically inherits the coordinator's full conversation

Through a shared database that both agents can read

Explicitly in the task description, including the debtor_id and filing summary

Through environment variables set between agent calls

Correct! The coordinator explicitly includes relevant context (debtor_id, filing counts, collateral values) in the task string sent to the analysis subagent. Subagents have isolated context windows and do NOT inherit the coordinator's conversation.

Not quite. Subagents do NOT inherit the coordinator's context. The coordinator must explicitly pass relevant information in the task description. This is the "explicit context passing" pattern from M14.

Q3: What does `stop_reason == "tool_use"` mean in the agent loop?

Claude wants to call a tool — execute it and continue the loop

Claude has finished using all tools and is done

An error occurred during tool execution

The tool definition was not found in the schema

Correct! When stop_reason is "tool_use", Claude is requesting that you execute one or more tools. You run the tools, send the results back, and continue the loop. When stop_reason changes to "end_turn", Claude is done.

Not quite. "tool_use" means Claude wants to call a tool — it is a request, not an error or completion signal. The loop continues until stop_reason becomes "end_turn".

Q4: What happens when a subagent fails or returns an error?

The entire system crashes with an unhandled exception

The coordinator automatically retries 3 times

The user gets a generic "something went wrong" error

The subagent returns a structured error, and the coordinator handles it gracefully with partial results

Correct! Each subagent returns structured errors ({"status": "error", "message": "..."}). The coordinator can still provide partial results from other subagents and inform the user about what worked and what did not.

Not quite. Our system uses structured error handling. Subagents return {"status": "error", "message": "..."} instead of crashing. The coordinator can provide partial results and communicate what failed.

Q5: You have a coordinator with 15 different subagents. What is the problem?

There is no problem — more subagents is always better

Same problem as too many tools — the coordinator cannot reliably pick which subagent(s) to call

The API has a hard limit of 10 tools per request

Subagents cannot communicate with each other in that configuration

Correct! The coordinator treats subagents as tools. Just like a single agent struggles to pick the right tool from 15+ options, a coordinator struggles to pick the right subagent from 15. The solution is hierarchical coordination — group subagents under intermediate coordinators.

Not quite. The issue is selection accuracy. The coordinator uses its subagents as tools, and tool selection degrades above 4-5 options. With 15 subagents, the coordinator frequently picks the wrong one. The fix: hierarchical coordination with intermediate coordinators.

Q6: In our system, why does the research subagent NOT have access to check_risk_score?

Because Claude cannot call more than 2 tools in one session

Because risk scoring is too expensive to run during research

To enforce separation of concerns — research finds data, analysis interprets it

Because the tool schema would conflict with search_filings

Correct! This is separation of concerns. Each subagent has a focused responsibility and only the tools it needs. The research subagent searches and retrieves. The analysis subagent scores and assesses. Mixing responsibilities would defeat the purpose of using subagents.

Not quite. There is no technical limit preventing it. The reason is separation of concerns — each subagent has a focused job with only the tools it needs. This makes each agent better at its specific task and prevents accidental conflation of searching and analyzing.

Q7: What is missing from our lab system that a production deployment would need? (Select the best answer)

Input guardrails, output validation, observability/tracing, and deployment infrastructure

Nothing — our system is production-ready as-is

Only a faster model — the architecture is complete

Only a real database instead of mock data

Correct! Production systems need input guardrails (M16) to validate queries, output guardrails (M17) to verify response quality, observability (M19-M20) to trace agent decisions, and deployment infrastructure (M21-M22). These are the next modules in the course.

Not quite. While a real database would help, the bigger gaps are safety and operational: input guardrails (M16), output validation (M17), tracing and logging (M19-M20), and deployment (M21-M22). These production layers are covered in upcoming modules.

Your Score

Review any incorrect answers above for explanations.

Module Summary

What You Built

Mock data layer — 10 realistic UCC filing records with debtor names, secured parties, collateral descriptions, and state codes
Three tools — search_filings, get_filing_details, and check_risk_score with JSON schemas and error handling
Single agent (baseline) — A ReAct loop agent with all 3 tools, proving the pattern works
Research subagent — Specialist with search_filings and get_filing_details only
Analysis subagent — Specialist with check_risk_score only
Coordinator agent — Hub-and-spoke pattern that delegates to subagents with explicit context passing
Multi-turn conversation — Conversation history maintained across turns with pronoun resolution
Test suite — 5 end-to-end test scenarios covering happy path, edge cases, and errors

Key Concepts

Hub-and-spoke architecture: One coordinator delegates to multiple specialists
Explicit context passing: Subagents do NOT inherit the coordinator's context — you must pass relevant data explicitly
Tool isolation: Each subagent gets only the tools it needs, improving accuracy and enforcing separation of concerns
Structured error handling: Tools and subagents return structured errors, never crash
stop_reason checking: The agentic loop continues on "tool_use" and terminates on "end_turn"

Next Module Preview

M16: Input Guardrails — Your agent system works, but what happens when a user asks it to do something it should not? Next, you will add input validation that catches prompt injection, out-of-scope queries, and malformed requests BEFORE they reach your agent. This is the first production safety layer.

← M15: Code Interpreter 🏠 Home M16: Input Guardrails →

Build a Complete Agent & Subagent System

Learning Objectives

What You'll Build

Lab: UCC Filing Research System

Files You'll Create

Architecture Overview

Three Design Decisions

Step 1: Set Up the Project

Environment Setup

Requirements File

Step 2: Create Mock Data

Step 3: Build Tool Definitions

Step 4: Build a Single-Tool Agent (Warm-Up)

Step 5: Build the Research Subagent

Step 6: Build the Analysis Subagent

Step 7: Build the Coordinator Agent

Step 8: Wire It Together with Context Passing

Step 9: End-to-End Test Suite

Step 10: Final Verification

Going Further (Optional)

Architecture Reflection

Knowledge Check

Q1: Why did we split from a single agent to a coordinator + subagents?

Q2: How does the coordinator pass context to the analysis subagent?

Q3: What does stop_reason == "tool_use" mean in the agent loop?

Q4: What happens when a subagent fails or returns an error?

Q5: You have a coordinator with 15 different subagents. What is the problem?

Q6: In our system, why does the research subagent NOT have access to check_risk_score?

Q7: What is missing from our lab system that a production deployment would need? (Select the best answer)

Module Summary

What You Built

Key Concepts

Next Module Preview

Q3: What does `stop_reason == "tool_use"` mean in the agent loop?