Technical Details

The Pandects MCP is a read-only remote MCP server at https://api.pandects.org/mcp.

It is designed for human users working through LLM clients, but this page describes the technical surface available behind the scenes.

High-Level Capability Areas

The current MCP surface covers:

Agreement discovery and retrieval
Section search and section listing within one agreement
Section-level retrieval
Concept-to-taxonomy mapping and focused snippet extraction
Tax clause retrieval
Filter and taxonomy lookup
Counsel, NAICS, summary, trend, and server-introspection reference data

Tool Groups

Agreement Discovery And Retrieval

search_agreements
list_agreements
get_agreement

Use these when the client needs to find the right agreement first or fetch one agreement directly.

Section Research

search_sections
list_agreement_sections
list_agreement_sections_batch
get_section
get_section_snippet
get_section_snippets_batch
get_sections_batch

Use these when the client needs to search clause language across the corpus, navigate sections inside one agreement, inspect a known section directly, or extract a shorter plain-text excerpt from one section. list_agreement_sections_batch, get_section_snippets_batch, and get_sections_batch accept a list of agreement or section UUIDs and return results in a single call, reducing round-trips for multi-agreement workflows. get_sections_batch returns full section XML (capped by default at 10 000 characters per section); get_section_snippets_batch returns focused plain-text excerpts and is the right choice when the full XML is not needed.

Discovery Helpers

suggest_clause_families

Use this when the client knows the business or legal concept but does not know the right taxonomy standard_id yet. The tool returns ranked clause-family candidates with their taxonomy paths and matched terms. Each match also reports whether it is a canonical fit, a proxy, or a broader semantic match.

Tax Clause Research

get_agreement_tax_clauses
get_section_tax_clauses

Use these when the task is specifically about extracted tax-module clauses rather than the full agreement or section text.

Research Bootstrap

list_filter_options
get_clause_taxonomy
get_tax_clause_taxonomy

Use these when the client needs valid structured inputs before searching.

Reference And Context

get_counsel_catalog
get_naics_catalog
get_agreements_summary
get_agreement_trends
get_server_capabilities
get_server_metrics

Use these when the client needs canonical lookup data, corpus-level context, or MCP introspection metadata.

Current Tool List

The current MCP tools are:

search_agreements
search_sections
list_agreements
list_agreement_sections
list_agreement_sections_batch
get_agreement
get_section
get_section_snippet
get_section_snippets_batch
get_sections_batch
get_agreement_tax_clauses
get_section_tax_clauses
list_filter_options
suggest_clause_families
get_server_metrics
get_server_capabilities
get_clause_taxonomy
get_tax_clause_taxonomy
get_counsel_catalog
get_naics_catalog
get_agreements_summary
get_agreement_trends

Design Notes

The server is read-only
Clients should typically choose the right tools automatically
search_agreements is the discovery-oriented agreement search; accepts a standard_id list to filter to agreements that contain at least one section tagged with any of the given taxonomy ids
list_agreements is the exact-filter, cursor-based agreement listing surface; also accepts standard_id for taxonomy-based agreement filtering
list_agreement_sections is an MCP convenience tool for within-agreement navigation
search_sections is a clause-language retrieval surface, not a normalized document-facts surface; it has no free-text/keyword parameter — it searches by clause-family standard_id and the structured M&A filters only. Use suggest_clause_families (or get_clause_taxonomy) to translate a plain-English concept into a standard_id, then filter by it
search_sections exposes count_mode and returns count_metadata plus interpretation so clients can tell when totals are exact versus estimated and when taxonomy is acting as a proxy
search_agreements returns exact totals today and also includes count_metadata plus interpretation
standard_id values are opaque 16-character hex node IDs obtained from get_clause_taxonomy or suggest_clause_families — they are not dotted decimal numbers. Unrecognized IDs are ignored (they silently match nothing) and are echoed back under interpretation.unrecognized_standard_ids, alongside interpretation.taxonomy_filters, so clients can detect a mistyped or stale ID rather than mistaking an empty result for "no matches"
suggest_clause_families exists to bridge plain-English concepts to taxonomy ids and now reports fit/confidence metadata so clients can distinguish canonical matches from broader proxies
get_section_snippet is a focused reading aid, not a replacement for get_section or a canonical extracted-facts surface
get_section_snippets_batch and list_agreement_sections_batch accept arrays of UUIDs and collapse multiple single-item calls into one round-trip; use them when a workflow would otherwise fan out across many agreements or sections
get_sections_batch fetches full section XML for up to 10 sections in one call; XML is capped at max_xml_chars per section (default 10 000, range 500–20 000) to prevent context overload; when a section is truncated the result includes xml_truncated: true; pass max_xml_chars: null only if uncapped XML is explicitly needed
search_sections results include filing_date and transaction_price_total inline on every result without needing to request them via metadata
get_section_snippet, get_section_snippets_batch, and get_sections_batch all include a monetary_values list — dollar amounts and value expressions extracted from the section text — so clients can surface deal economics without parsing XML
search_sections and search_agreements both accept filed_after and filed_before (ISO 8601 date strings, YYYY-MM-DD) for sub-year filing-date precision; year/year_min/year_max filter on the agreement year; filed_after/filed_before filter on the exact filing date
get_agreement returns metadata only by default (the response reports xml_included: false) to keep the payload small; pass include_xml: true to also return the agreement XML, which preserves the existing redaction and full-text access behavior (focus_section_uuid/neighbor_sections only take effect when include_xml is true and the response is redacted)
get_agreement_trends accepts a sections selector (ownership, target_industries, pairings, naics_catalog) and defaults to [ownership, target_industries]; the large pairings (every industry-by-industry cell) and naics_catalog (full NAICS hierarchy) sections are opt-in, and the response echoes sections_returned
get_server_capabilities is the main machine-readable semantics surface; it includes auth guidance, field inventory, concept notes, and negative guidance about when not to use a tool
The server exposes a small set of MCP resources (pandects://capabilities, pandects://auth-help) that mirror get_server_capabilities for clients that prefer the resources/read primitive over calling a tool
The server exposes curated MCP prompts (compare_agreements, clause_family_survey, deal_trend_brief) as research templates; they orchestrate the primitive retrieval tools rather than introducing new functionality

Transport

POST /mcp is the primary JSON-RPC endpoint. It supports content negotiation: clients that advertise Accept: text/event-stream receive an SSE-framed response; clients that prefer application/json receive a plain JSON body. This matches the Streamable HTTP behaviour required by Claude Code.
GET /mcp returns an SSE retry probe for clients that opportunistically open a server-to-client stream.
DELETE /mcp is accepted as an authenticated session-termination signal and returns 204.
initialize responses carry an Mcp-Session-Id header. The server is stateless, so the id is informational — clients are not required to echo it, but Claude Code does.
Every response carries an MCP-Protocol-Version header echoing the negotiated protocol version.
Advertised server capabilities: tools, resources (listChanged=false, subscribe=false), prompts (listChanged=false), and logging (logging/setLevel is accepted as a no-op).

Tool error codes

tools/call failures use distinct JSON-RPC error codes so a client can tell a fixable request from a server-side fault. The codes are identical across the JSON and SSE response paths:

-32602 — invalid tool arguments (schema/validation failure or a malformed identifier)
-32002 — the tool ran but the requested resource was not found (e.g. an unknown agreement or section UUID); the error data is {"category": "not_found", "status_code": 404}. Verify the identifier with a search or list tool and retry — this is not a transient error.
-32003 — missing scope / authorization
-32603 — the tool result violated its advertised output schema
-32004 — a genuine tool failure (server-side)

In get_server_metrics, a not-found tool result is counted under its own not_found error category (not the generic http_exception), since a bad identifier is a client mistake rather than a server fault. Note that metrics counters are per-process: in a multi-worker deployment each get_server_metrics call reports only the worker that served it, not a server-wide total.

Progress notifications

When a tools/call request includes params._meta.progressToken and the client advertises Accept: text/event-stream, the server returns a multi-event SSE stream:

notifications/progress with progress=0, total=1, and a Starting <tool> message
notifications/progress with progress=1, total=1, and a <tool> complete message
The final tools/call JSON-RPC result (or error)

This keeps intermediary proxies and client UIs aware of in-flight work on long calls. Clients that do not set a progress token, or do not accept SSE, receive the usual single-response behaviour.

OAuth discovery and Dynamic Client Registration

The server is protected by an embedded OAuth authorization server whose issuer lives under /v1/auth/oauth. To make OAuth discovery work with clients that implement RFC 8414 strictly (including Claude Code), authorization-server metadata is exposed at three locations:

GET /.well-known/oauth-authorization-server — host-root fallback
GET /.well-known/oauth-authorization-server/v1/auth/oauth — RFC 8414 host-root + issuer-path form
GET /v1/auth/oauth/.well-known/oauth-authorization-server — issuer-prefixed form (original)

GET /.well-known/openid-configuration is also exposed at the host root for OIDC-leaning clients.

The metadata document advertises registration_endpoint (/v1/auth/oauth/register), so compliant clients can self-register via Dynamic Client Registration (RFC 7591) without a manual out-of-band step. Only public, PKCE (S256), authorization-code clients are supported.

Authentication

MCP uses normal Pandects account login
MCP does not use Pandects API keys
codex mcp add only registers the server; in Codex, codex mcp login <name> starts OAuth
Auth failures return structured remediation metadata so clients can distinguish missing-token, expired-token, unverified-account, and unlinked-subject cases

High-Level Capability Areas​

Tool Groups​

Agreement Discovery And Retrieval​

Section Research​

Discovery Helpers​

Tax Clause Research​

Research Bootstrap​

Reference And Context​

Current Tool List​

Design Notes​

Transport​

Tool error codes​

Progress notifications​

OAuth discovery and Dynamic Client Registration​

Authentication​

Related Pages​