.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "auto_examples/figure_generation/plot_sm3_log_offsets.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_auto_examples_figure_generation_plot_sm3_log_offsets.py>`
        to download the full example code.

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_auto_examples_figure_generation_plot_sm3_log_offsets.py:


SM3 log offsets: learning where the inferred fields drift from their priors
==============================================================================

This example teaches you how to read the GeoPrior
SM3 log-offset diagnostics.

The identifiability figures answer questions such as:

- can the model recover tau?
- does the recovery lie on a ridge?
- are some regimes more identifiable than others?

This page answers a simpler but very important question:

**How far did the inferred physics fields drift from their prior
reference values?**

That is what the log-offset figure is for.

What this figure shows
----------------------
The real script builds two diagnostics:

1. a 2×2 histogram grid for

   - ``delta_logK``
   - ``delta_logSs``
   - ``delta_logHd``
   - ``delta_log_tau``

2. a small scatter plot of

   - :math:`\log_{10}(\tau_{prior})`
   - versus
   - :math:`\delta_{\tau}`

The offset variables are log10 differences. For example,

.. math::

   \delta_{\log K}
   =
   \log_{10}(K)
   -
   \log_{10}(K_{prior})

and similarly for :math:`S_s`, :math:`H_d`, and :math:`\tau`.

Why this matters
----------------
A model can produce a plausible output while still leaning very
far away from its prior field assumptions.

This is not automatically bad.

Sometimes the data genuinely force the model away from the prior.
But large or asymmetric offset distributions are scientifically
useful because they tell you:

- whether the prior is being respected,
- whether one field moves much more than the others,
- whether the shifts are centered or biased,
- and whether tau errors depend on the prior timescale itself.

This gallery page uses a compact synthetic payload so the example
is fully executable during the documentation build.

.. GENERATED FROM PYTHON SOURCE LINES 71-77

Imports
-------
We use the real table-building helpers from the project script,
then render the figures inline for the gallery page. This keeps
the lesson faithful to the real implementation while making the
page easy to read directly in Sphinx-Gallery.

.. GENERATED FROM PYTHON SOURCE LINES 77-93

.. code-block:: Python


    from __future__ import annotations

    import json
    import tempfile
    from pathlib import Path

    import matplotlib.pyplot as plt
    import numpy as np
    import pandas as pd

    from geoprior.scripts.plot_sm3_log_offsets import (
        build_offsets_table,
        summarise_offsets,
    )


.. GENERATED FROM PYTHON SOURCE LINES 94-110

Step 1 - Build a compact synthetic payload
------------------------------------------
The real helper ``build_offsets_table(...)`` expects a payload
containing at least:

- tau
- tau_prior (or tau_closure / tau_cl)
- K
- Ss
- Hd (or H)

Optional prior fields can either be embedded directly in the
payload or passed as scalar values.

For this lesson we build one synthetic payload with moderate,
interpretable log-offset structure.

.. GENERATED FROM PYTHON SOURCE LINES 110-125

.. code-block:: Python


    rng = np.random.default_rng(21)
    n = 2200

    # Prior fields
    K_prior = 10.0 ** rng.normal(-6.0, 0.18, size=n)
    Ss_prior = 10.0 ** rng.normal(-4.8, 0.14, size=n)
    Hd_prior = np.clip(
        rng.normal(24.0, 3.2, size=n),
        10.0,
        None,
    )

    tau_prior = 10.0 ** rng.normal(7.2, 0.28, size=n)


.. GENERATED FROM PYTHON SOURCE LINES 126-136

Step 2 - Create inferred fields with structured drift
-----------------------------------------------------
We deliberately give each field a different offset behavior:

- K: modest positive shift
- Ss: broader, near-centered spread
- Hd: slight negative shift
- tau: a weak dependence on tau_prior

This makes the final plots more instructive.

.. GENERATED FROM PYTHON SOURCE LINES 136-165

.. code-block:: Python


    delta_logK = rng.normal(0.10, 0.14, size=n)
    delta_logSs = rng.normal(0.02, 0.18, size=n)
    delta_logHd = rng.normal(-0.06, 0.10, size=n)

    # Slight trend in tau offset as prior tau changes.
    log10_tau_prior = np.log10(tau_prior)
    delta_log_tau = (
        -0.10
        + 0.055 * (log10_tau_prior - log10_tau_prior.mean())
        + rng.normal(0.0, 0.12, size=n)
    )

    K = K_prior * (10.0 ** delta_logK)
    Ss = Ss_prior * (10.0 ** delta_logSs)
    Hd = Hd_prior * (10.0 ** delta_logHd)
    tau = tau_prior * (10.0 ** delta_log_tau)

    payload = {
        "tau": tau,
        "tau_prior": tau_prior,
        "K": K,
        "Ss": Ss,
        "Hd": Hd,
        "K_prior": K_prior,
        "Ss_prior": Ss_prior,
        "Hd_prior": Hd_prior,
    }


.. GENERATED FROM PYTHON SOURCE LINES 166-171

Step 3 - Build the real offsets table
-------------------------------------
This step uses the actual project helper. The resulting DataFrame
contains the log10 fields and the delta columns used by the
plotting script.

.. GENERATED FROM PYTHON SOURCE LINES 171-182

.. code-block:: Python


    df = build_offsets_table(
        payload,
        K_prior=None,
        Ss_prior=None,
        Hd_prior=None,
    )

    print("Offsets table preview")
    print(df.head().to_string(index=False))


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Offsets table preview
     log10_tau  log10_tau_prior  delta_log_tau  log10_K  log10_Ss  log10_Hd  delta_logK  delta_logSs  delta_logHd  index
        7.1137           7.3478        -0.2340  -6.0260   -4.7874    1.3012     -0.0906       0.0384      -0.0774      0
        6.8553           7.0719        -0.2166  -5.6563   -4.8547    1.3647      0.0718      -0.0279      -0.0533      1
        6.6535           6.7959        -0.1424  -6.1955   -4.6485    1.3964      0.1260       0.0587      -0.0625      2
        7.5677           7.6482        -0.0805  -5.6475   -4.8300    1.2547      0.0489       0.2225      -0.0913      3
        7.4665           7.4480         0.0185  -5.9712   -4.4286    1.4450      0.0373       0.0962       0.1908      4


.. GENERATED FROM PYTHON SOURCE LINES 183-188

Step 4 - Build the summary table
--------------------------------
The real script also computes a compact summary over all
``delta_*`` columns, including mean, standard deviation, and
selected quantiles.

.. GENERATED FROM PYTHON SOURCE LINES 188-195

.. code-block:: Python


    summary = summarise_offsets(df)

    print("")
    print("Summary statistics")
    print(summary.to_string())


.. rst-class:: sphx-glr-script-out

 .. code-block:: none


    Summary statistics
                     mean    std     p05     p50    p95
    metric                                             
    delta_log_tau -0.1005 0.1233 -0.3051 -0.1010 0.1003
    delta_logK     0.0952 0.1380 -0.1251  0.0954 0.3220
    delta_logSs    0.0206 0.1817 -0.2792  0.0222 0.3326
    delta_logHd   -0.0546 0.0986 -0.2180 -0.0532 0.1001


.. GENERATED FROM PYTHON SOURCE LINES 196-203

Step 5 - Render the histogram figure
------------------------------------
The real script draws a 2×2 histogram grid for the four main
delta columns.

In the gallery page we render the same logic inline so the image
appears directly as part of the lesson.

.. GENERATED FROM PYTHON SOURCE LINES 203-236

.. code-block:: Python


    delta_cols = [
        "delta_logK",
        "delta_logSs",
        "delta_logHd",
        "delta_log_tau",
    ]

    fig = plt.figure(
        figsize=(7.2, 4.2),
        constrained_layout=True,
    )
    gs = fig.add_gridspec(2, 2)

    for i, col in enumerate(delta_cols):
        ax = fig.add_subplot(gs[i // 2, i % 2])
        ax.spines["top"].set_visible(False)
        ax.spines["right"].set_visible(False)
        ax.tick_params(direction="out", length=3, width=0.6)

        x = df[col].to_numpy(float)
        x = x[np.isfinite(x)]

        ax.hist(x, bins=45)
        ax.set_xlabel(col)
        ax.set_ylabel("Count")

    fig.suptitle(
        "Synthetic SM3 log-offset diagnostics",
        x=0.02,
        ha="left",
    )


.. image-sg:: /auto_examples/figure_generation/images/sphx_glr_plot_sm3_log_offsets_001.png
   :alt: Synthetic SM3 log-offset diagnostics
   :srcset: /auto_examples/figure_generation/images/sphx_glr_plot_sm3_log_offsets_001.png
   :class: sphx-glr-single-img


.. rst-class:: sphx-glr-script-out

 .. code-block:: none


    Text(0.02, 0.9900785714285715, 'Synthetic SM3 log-offset diagnostics')


.. GENERATED FROM PYTHON SOURCE LINES 237-244

Step 6 - Render the tau-offset scatter
--------------------------------------
The second figure in the real script compares the prior tau
scale against the tau offset.

This is useful because a centered histogram alone does not tell
us whether the error depends on the regime.

.. GENERATED FROM PYTHON SOURCE LINES 244-267

.. code-block:: Python


    fig2 = plt.figure(
        figsize=(3.6, 3.0),
        constrained_layout=True,
    )
    ax2 = fig2.add_subplot(1, 1, 1)

    ax2.spines["top"].set_visible(False)
    ax2.spines["right"].set_visible(False)
    ax2.tick_params(direction="out", length=3, width=0.6)

    ax2.scatter(
        df["log10_tau_prior"],
        df["delta_log_tau"],
        s=6,
        alpha=0.35,
        rasterized=True,
    )
    ax2.axhline(0.0, linestyle="--", linewidth=0.9)
    ax2.set_xlabel(r"$\log_{10}\tau_{\mathrm{prior}}$")
    ax2.set_ylabel(r"$\delta_\tau$")
    ax2.set_title("Tau offset versus prior timescale")


.. image-sg:: /auto_examples/figure_generation/images/sphx_glr_plot_sm3_log_offsets_002.png
   :alt: Tau offset versus prior timescale
   :srcset: /auto_examples/figure_generation/images/sphx_glr_plot_sm3_log_offsets_002.png
   :class: sphx-glr-single-img


.. rst-class:: sphx-glr-script-out

 .. code-block:: none


    Text(0.5, 1.0, 'Tau offset versus prior timescale')


.. GENERATED FROM PYTHON SOURCE LINES 268-274

Step 7 - Read the summary like a scientist
------------------------------------------
Let us compute one compact interpretation helper from the table:

- which field is most shifted on average?
- which field is most spread out?

.. GENERATED FROM PYTHON SOURCE LINES 274-291

.. code-block:: Python


    mean_abs = (
        summary["mean"]
        .abs()
        .sort_values(ascending=False)
    )

    spread = summary["std"].sort_values(ascending=False)

    print("")
    print("Largest mean offset")
    print(mean_abs.to_string())

    print("")
    print("Largest spread")
    print(spread.to_string())


.. rst-class:: sphx-glr-script-out

 .. code-block:: none


    Largest mean offset
    metric
    delta_log_tau   0.1005
    delta_logK      0.0952
    delta_logHd     0.0546
    delta_logSs     0.0206

    Largest spread
    metric
    delta_logSs     0.1817
    delta_logK      0.1380
    delta_log_tau   0.1233
    delta_logHd     0.0986


.. GENERATED FROM PYTHON SOURCE LINES 292-326

Step 8 - Learn how to read the histogram panels
-----------------------------------------------
A histogram panel answers three questions immediately.

1. Is the distribution centered near zero?
   If yes, the inferred field is not systematically pushed far
   above or below the prior.

2. Is it narrow or wide?
   A narrow histogram means the inferred field stays close to
   the prior for most samples. A wide one means the field is
   moving more aggressively.

3. Is it symmetric?
   If the mass is clearly shifted left or right, that suggests a
   systematic bias relative to the prior.

Interpreting the four offsets
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
``delta_logK``
    Measures how much inferred hydraulic conductivity moved away
    from its prior.

``delta_logSs``
    Measures drift in specific storage.

``delta_logHd``
    Measures drift in effective drainage thickness.

``delta_log_tau``
    Measures drift in the relaxation timescale itself.

These four are useful together because they tell you whether the
model is changing one field much more than the others.

.. GENERATED FROM PYTHON SOURCE LINES 328-343

Step 9 - Learn how to read the tau scatter
------------------------------------------
The tau scatter answers a different question:

"Does the tau offset depend on the prior timescale regime?"

If the cloud is centered horizontally around zero with no
visible slope, then the tau drift is roughly regime-neutral.

If the cloud trends upward or downward, then the prior timescale
itself is influencing the direction of the inferred shift.

In this synthetic lesson we built a mild slope on purpose, so
the page teaches that even when the tau histogram looks
reasonable overall, there can still be a regime dependence.

.. GENERATED FROM PYTHON SOURCE LINES 345-361

Step 10 - Practical takeaway
----------------------------
This figure is useful because it gives a fast, field-by-field
explanation of prior drift.

It helps answer:

- which inferred field moves the most?
- is the movement centered or biased?
- are the shifts moderate or very broad?
- and does tau drift depend on the prior timescale itself?

In practice, this page is especially helpful after the main SM3
identifiability figures. The identifiability figures show
whether recovery is stable. This page shows *how the fields
actually moved*.

.. GENERATED FROM PYTHON SOURCE LINES 363-408

Command-line version
--------------------
The same diagnostics can be produced from the command line.

The real script supports:

- ``--src`` to auto-discover a physics payload under a run
  directory,
- ``--payload`` for an explicit NPZ / CSV / parquet payload,
- optional scalar priors through
  ``--K-prior``, ``--Ss-prior``, and ``--Hd-prior``,
- ``--bins`` and ``--dpi``,
- ``--out-raw-csv``, ``--out-summary-csv``, and ``--out-json``,
- plus the shared plot text arguments such as ``--out``,
  ``--show-title``, and ``--title``. :contentReference[oaicite:3]{index=3}

Legacy dispatcher:

.. code-block:: bash

   python -m scripts plot-sm3-log-offsets \
     --src results/sm3_synth_1d \
     --bins 50 \
     --out sm3-log-offsets

Explicit payload:

.. code-block:: bash

   python -m scripts plot-sm3-log-offsets \
     --payload results/.../physics_payload_run_val.npz \
     --K-prior 1e-7 \
     --Ss-prior 1e-5 \
     --Hd-prior 40 \
     --out sm3-log-offsets

Modern CLI:

.. code-block:: bash

   geoprior plot sm3-log-offsets \
     --payload results/.../physics_payload_run_val.npz \
     --out sm3-log-offsets

The gallery page teaches the figure.
The command line reproduces it in a workflow.


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** (0 minutes 0.535 seconds)


.. _sphx_glr_download_auto_examples_figure_generation_plot_sm3_log_offsets.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_sm3_log_offsets.ipynb <plot_sm3_log_offsets.ipynb>`

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_sm3_log_offsets.py <plot_sm3_log_offsets.py>`

    .. container:: sphx-glr-download sphx-glr-download-zip

      :download:`Download zipped: plot_sm3_log_offsets.zip <plot_sm3_log_offsets.zip>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_