client.evals.extract is the SDK surface for running extraction evals against labeled datasets. It gives you the full workflow: create an eval from a schema, attach ground-truth documents, create iterations with draft overrides, run documents through an eval or iteration, and measure quality with iteration metrics.

Resource Map

client.evals.extract
|- create / get / list / update / delete / publish
|- process
|- process_stream (Python only)
|- datasets
|  |- create / get / list / update / delete / duplicate
|  |- add_document / get_document / list_documents / update_document / delete_document
|  |- process_document
|  `- iterations
|     |- create / get / list / update_draft / delete / finalize
|     |- get_schema / process_documents / get_metrics
|     `- get_document / list_documents / update_document / delete_document / process_document
`- templates
   |- list / get / clone
   `- list_builder_document_previews / list_builder_documents

Quick Start

from retab import Retab
from retab.types.mime import MIMEData
from retab.types.projects.predictions import PredictionData

client = Retab()

invoice_schema = {
    "type": "object",
    "properties": {
        "invoice_number": {"type": "string"},
        "total_amount": {"type": "number"},
    },
    "required": ["invoice_number", "total_amount"],
}

eval_project = client.evals.extract.create(
    name="Invoice extraction eval",
    json_schema=invoice_schema,
)

dataset = client.evals.extract.datasets.create(
    eval_project.id,
    name="March invoices",
)

dataset_document = client.evals.extract.datasets.add_document(
    eval_project.id,
    dataset.id,
    mime_data=MIMEData(
        filename="invoice.pdf",
        url="data:application/pdf;base64,JVBERi0xLjQKJ...",
    ),
    prediction_data=PredictionData(
        prediction={
            "invoice_number": "INV-1001",
            "total_amount": 1299.50,
        }
    ),
)

iteration = client.evals.extract.datasets.iterations.create(
    eval_project.id,
    dataset.id,
)

client.evals.extract.datasets.iterations.process_documents(
    eval_project.id,
    dataset.id,
    iteration.id,
    dataset_document.id,
)

metrics = client.evals.extract.datasets.iterations.get_metrics(
    eval_project.id,
    dataset.id,
    iteration.id,
)

print(metrics.overall_metrics.accuracy)

Core Resources

Eval

Use the top-level resource to manage the extraction eval itself.
  • create(name, json_schema, ...): create a new extraction eval from a JSON schema.
  • get(eval_id), list(...), update(eval_id, ...), delete(eval_id): standard CRUD methods.
  • publish(eval_id, origin=None): promote the current draft config to the published config.
  • process(eval_id, iteration_id=None, document=..., ...): run a document directly against the eval or a specific iteration.
  • process_stream(...) in Python: stream RetabParsedChatCompletionChunk values while the structured output fills in.
process() returns a RetabParsedChatCompletion, so the structured result is available on the parsed message content. Prefer process(): the older Python alias client.evals.extract.extract(...) is deprecated in the SDK.

Datasets

Datasets hold labeled examples for an eval.
  • datasets.create(eval_id, name, base_json_schema=None): create a dataset for an eval.
  • datasets.duplicate(...): clone an existing dataset.
  • datasets.add_document(...): attach a document and optional prediction_data ground truth.
  • datasets.update_document(...): revise labels, validation flags, or attach an extraction_id.
  • datasets.process_document(...): run one dataset document through the base eval configuration.
For extraction evals, ground truth usually lives in prediction_data.prediction.

Iterations

Iterations are where you compare model settings and schema changes.
  • iterations.create(...): create a new draft iteration for a dataset.
  • iterations.update_draft(...): stage new inference_settings or schema_overrides.
  • iterations.get_schema(..., use_draft=True) / getSchema(..., { useDraft: true }): preview the materialized schema after draft overrides.
  • iterations.finalize(...): promote the draft into the finalized iteration.
  • iterations.process_documents(...): create an iteration document from a dataset document and queue processing.
  • iterations.get_metrics(...): compute accuracy and field-level metrics for the iteration.
  • iterations.process_document(...): rerun a specific iteration document.
The draft/finalize split matters here: you edit the draft first, inspect it, then finalize when you want those changes locked in.

Templates

Templates let you start from existing eval blueprints.
  • templates.list(...): browse extraction eval templates.
  • templates.get(template_id): fetch one template.
  • templates.clone(template_id, name=None): turn a template into a new eval project.
  • templates.list_builder_documents(template_id): fetch the sample documents attached to a template.
  • templates.list_builder_document_previews(template_ids): fetch preview MIME metadata for multiple templates.

SDK Notes

  • Python list() methods on evals, datasets, and iterations return model lists. Node returns paginated API payloads with data.
  • Python templates list() returns a paginated wrapper with data and list_metadata.
  • Extraction eval processing accepts a single document. Passing documents= raises an SDK error in Python.