Reasoning

When extracting data, LLMs sometimes make errors with calculations, unit conversions, or complex logic. Reasoning helps by encouraging the LLM to “show its work” before providing the final answer. Reasoning uses a special JSON Schema annotation to create auxiliary fields where the LLM can explain its thought process:

X-ReasoningPrompt - Generates reasoning fields alongside data fields, helping LLMs provide step-by-step justification for extracted values in complex scenarios.

This approach improves accuracy while keeping your original schema structure intact.
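To make the idea concrete, here is a minimal sketch of how an annotated schema could be expanded into one that carries reasoning fields. It is an illustration only, not the library's actual implementation: the helper name `expand_reasoning_fields` is hypothetical, and the `reasoning___` prefix is taken from the example output later on this page.

```python
import copy

# Assumptions for illustration: the prefix matches the example output on this
# page; the expansion logic itself is a guess at the general idea.
REASONING_PREFIX = "reasoning___"
ANNOTATION = "X-ReasoningPrompt"


def expand_reasoning_fields(schema: dict) -> dict:
    """Return a copy of `schema` with a string reasoning field inserted
    immediately before every property that carries the annotation."""
    expanded = copy.deepcopy(schema)
    new_properties = {}
    for name, prop in expanded.get("properties", {}).items():
        prompt = prop.pop(ANNOTATION, None)
        if prompt is not None:
            # The reasoning field comes first so the model writes its
            # explanation before committing to a value.
            new_properties[REASONING_PREFIX + name] = {
                "type": "string",
                "description": prompt,
            }
        new_properties[name] = prop
    expanded["properties"] = new_properties
    return expanded


schema = {
    "type": "object",
    "properties": {
        "location": {"type": "string"},
        "temperature": {
            "type": "number",
            "description": "temperature in Fahrenheit",
            "X-ReasoningPrompt": "Convert Celsius to Fahrenheit explicitly if needed.",
        },
    },
}

expanded = expand_reasoning_fields(schema)
print(list(expanded["properties"]))
# ['location', 'reasoning___temperature', 'temperature']
```

The original schema is left untouched, which mirrors the point above: the reasoning fields are auxiliary and the data fields keep their original shape.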

Reasoning Prompt

An X-ReasoningPrompt tag in the schema generates a reasoning field alongside the data field. This is particularly useful for calculations, unit conversions, or multi-step logic.

Example: Temperature Unit Conversion

Temperature_report.md

| Date        | Location | Temperature (°C)| Humidity | Conditions    |
|-------------|----------|-----------------|----------|---------------|
| 2024-01-15  | New York | 22.5            | 65       | Partly Cloudy |

Let’s say we have a temperature report in Celsius, but our data model expects the temperature in Fahrenheit:

from pydantic import BaseModel, Field
from datetime import date

# You can define the custom annotations in the `pydantic.Field` class using the `json_schema_extra` field.

class TemperatureReport(BaseModel):
    date: date
    location: str
    temperature: float = Field(...,
        description="temperature in Fahrenheit",
        json_schema_extra={
            "X-ReasoningPrompt": "If the temperature is given in Celsius, make the explicit computation to convert it to Fahrenheit. If the temperature is given in Fahrenheit, leave it as is.",
        }
    )
    humidity: float
    conditions: str

# If you need a json_schema, you can call TemperatureReport.model_json_schema()

Without reasoning, the LLM might copy 22.5 straight into the Fahrenheit field instead of converting it from Celsius. With reasoning, it produces:

{
  "date": "2024-01-15",
  "location": "New York",
  "reasoning___temperature": "The temperature is given as 22.5°C. To convert to Fahrenheit: F = (C × 9/5) + 32. So: F = (22.5 × 9/5) + 32 = 72.5°F",
  "temperature": 72.5,
  "humidity": 65,
  "conditions": "Partly Cloudy"
}

As you can see, the “reasoning___” fields help the LLM show its work while the final output follows your schema structure.
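If you receive the raw JSON shown above and want to handle the reasoning separately from the data, a small helper can split the two. This is a sketch under the assumption that reasoning keys are exactly the data field name with a `reasoning___` prefix, as in the example; the function name `split_reasoning` is hypothetical.

```python
import json

REASONING_PREFIX = "reasoning___"  # prefix observed in the example output


def split_reasoning(raw: str) -> tuple[dict, dict]:
    """Split a raw JSON object into (data fields, reasoning keyed by field name)."""
    payload = json.loads(raw)
    data, reasoning = {}, {}
    for key, value in payload.items():
        if key.startswith(REASONING_PREFIX):
            reasoning[key[len(REASONING_PREFIX):]] = value
        else:
            data[key] = value
    return data, reasoning


raw_output = """
{
  "date": "2024-01-15",
  "location": "New York",
  "reasoning___temperature": "The temperature is given as 22.5°C. To convert to Fahrenheit: F = (C × 9/5) + 32. So: F = (22.5 × 9/5) + 32 = 72.5°F",
  "temperature": 72.5,
  "humidity": 65,
  "conditions": "Partly Cloudy"
}
"""

data, reasoning = split_reasoning(raw_output)
print(list(data))       # ['date', 'location', 'temperature', 'humidity', 'conditions']
print(list(reasoning))  # ['temperature']
```

The `data` dictionary then has the same shape as the original schema, while `reasoning` can be logged or shown to a reviewer.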

Key Benefits

Accuracy: LLMs perform calculations more reliably when they can show their work
Transparency: You can see exactly how the LLM arrived at its answer
Debugging: Easy to identify where conversions or calculations went wrong
Trust: Users can verify the logic behind complex transformations
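Because the reasoning spells out the formula, the result can also be re-checked programmatically. The sketch below recomputes the conversion from the example, F = C × 9/5 + 32, and compares it with the extracted value; the function names and the tolerance are illustrative choices, not part of any library.

```python
import math


def celsius_to_fahrenheit(celsius: float) -> float:
    """Standard conversion: F = C * 9/5 + 32."""
    return celsius * 9 / 5 + 32


def conversion_is_consistent(
    source_celsius: float, extracted_fahrenheit: float, tolerance: float = 0.05
) -> bool:
    """Check that the extracted value matches the recomputed conversion."""
    expected = celsius_to_fahrenheit(source_celsius)
    return math.isclose(expected, extracted_fahrenheit, abs_tol=tolerance)


print(celsius_to_fahrenheit(22.5))           # 72.5
print(conversion_is_consistent(22.5, 72.5))  # True
print(conversion_is_consistent(22.5, 22.5))  # False (value copied without converting)
```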

Best Practices

Use reasoning fields for any calculations, unit conversions, or multi-step logic
Make the reasoning description specific to guide the LLM’s thought process
Place reasoning fields before the fields that depend on the reasoning
Keep reasoning concise but complete enough to follow the logic
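The ordering practice can be checked on a raw output. The following sketch reports any reasoning key that appears after the field it explains, again assuming the `reasoning___` prefix convention from the example above; `misplaced_reasoning_fields` is a hypothetical helper name.

```python
import json

REASONING_PREFIX = "reasoning___"  # prefix observed in the example output


def misplaced_reasoning_fields(raw: str) -> list[str]:
    """Return reasoning keys that come after the data field they belong to."""
    keys = list(json.loads(raw))  # json.loads preserves key order
    position = {key: index for index, key in enumerate(keys)}
    misplaced = []
    for key in keys:
        if key.startswith(REASONING_PREFIX):
            target = key[len(REASONING_PREFIX):]
            if target in position and position[target] < position[key]:
                misplaced.append(key)
    return misplaced


good = '{"reasoning___temperature": "22.5 C is 72.5 F", "temperature": 72.5}'
bad = '{"temperature": 72.5, "reasoning___temperature": "22.5 C is 72.5 F"}'

print(misplaced_reasoning_fields(good))  # []
print(misplaced_reasoning_fields(bad))   # ['reasoning___temperature']
```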
