.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "auto_examples/figure_generation/plot_uncertainty_extras.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_auto_examples_figure_generation_plot_uncertainty_extras.py>`
        to download the full example code.

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_auto_examples_figure_generation_plot_uncertainty_extras.py:


Expanded uncertainty diagnostics: learning what the main uncertainty figure still hides
=========================================================================================

This example teaches you how to read the GeoPrior
uncertainty-extras figure.

The main uncertainty figure answers broad questions about:

- overall reliability,
- horizon-wise reliability,
- and the sharpness–coverage balance.

This supplementary page goes further. It asks:

**What does uncertainty look like when we inspect PIT behavior,
bootstrap confidence intervals, horizon-wise calibration factors,
and quantile residual drift together?**

That is why this figure is useful. It is a deeper uncertainty
audit, not just a second version of the same plot.

What the figure shows
---------------------
The real plotting backend builds six uncertainty views on one
page:

1. reliability small multiples by horizon,
2. PIT histogram for Nansha,
3. PIT histogram for Zhongshan,
4. coverage80 versus horizon with bootstrap confidence bands,
5. sharpness80 versus horizon with bootstrap confidence bands,
6. optional horizon-wise calibration factors,
7. quantile residuals by horizon.

In the actual script, the figure is created by ``plot_s5(...)``,
the summary tables are written by ``write_tables(...)``, and the
CLI entrypoint is ``supp_figS5_uncertainty_extras_main(...)``.
The script accepts the text controls
``show_legend``, ``show_labels``, ``show_ticklabels``,
``show_title``, and ``show_panel_titles``. It does **not**
use panel-label controls. 

Why this matters
----------------
A model can look acceptably calibrated in one headline figure and
still reveal important problems when you look closer:

- the PIT may still be far from uniform,
- later horizons may lose coverage,
- interval widths may widen too fast,
- and one city may need stronger post-hoc calibration factors than
  the other.

This gallery page creates compact synthetic forecast tables and
small metadata blocks so the example is fully executable during
documentation builds.

.. GENERATED FROM PYTHON SOURCE LINES 61-66

Imports
-------
We use the real helper functions and plotting backend from the
project script. For gallery use, we surface the PNG output
directly.

.. GENERATED FROM PYTHON SOURCE LINES 66-80

.. code-block:: Python


    from __future__ import annotations

    import tempfile
    from pathlib import Path

    import matplotlib.image as mpimg
    import matplotlib.pyplot as plt
    import numpy as np
    import pandas as pd

    from geoprior.scripts import plot_uncertainty_extras as pux


.. GENERATED FROM PYTHON SOURCE LINES 81-94

Step 1 - Build compact synthetic forecast tables
------------------------------------------------
The real script expects calibrated forecast CSVs with:

- forecast_step
- subsidence_q10
- subsidence_q50
- subsidence_q90
- subsidence_actual

We build two synthetic cities over three forecast horizons.
Nansha is made a little better calibrated; Zhongshan is a bit
wider and slightly more biased.

.. GENERATED FROM PYTHON SOURCE LINES 94-172

.. code-block:: Python


    rng = np.random.default_rng(31)


    def make_city_forecast(
        *,
        city: str,
        n_per_h: int,
        bias_scale: float,
        spread_scale: float,
        noise_scale: float,
    ) -> pd.DataFrame:
        rows: list[pd.DataFrame] = []

        for h in [1, 2, 3]:
            base = 10.0 + 4.5 * h
            latent = rng.normal(
                base,
                3.8 + 0.8 * h,
                size=n_per_h,
            )

            q50 = (
                0.93 * latent
                + bias_scale * h
                + rng.normal(0.0, 0.9, size=n_per_h)
            )

            width = (
                spread_scale * (4.2 + 1.6 * h)
                + 0.12 * np.abs(q50 - np.median(q50))
            )

            q10 = q50 - width
            q90 = q50 + width

            y = latent + rng.normal(
                0.0,
                noise_scale * (1.0 + 0.18 * h),
                size=n_per_h,
            )

            rows.append(
                pd.DataFrame(
                    {
                        "forecast_step": np.full(n_per_h, h),
                        "subsidence_actual": y,
                        "subsidence_q10": q10,
                        "subsidence_q50": q50,
                        "subsidence_q90": q90,
                        "city": city,
                    }
                )
            )

        return pd.concat(rows, ignore_index=True)


    ns_df = make_city_forecast(
        city="Nansha",
        n_per_h=180,
        bias_scale=0.10,
        spread_scale=0.95,
        noise_scale=0.95,
    )

    zh_df = make_city_forecast(
        city="Zhongshan",
        n_per_h=180,
        bias_scale=0.30,
        spread_scale=1.08,
        noise_scale=1.08,
    )

    print("Synthetic forecast rows")
    print(f"  Nansha:    {len(ns_df)}")
    print(f"  Zhongshan: {len(zh_df)}")


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Synthetic forecast rows
      Nansha:    540
      Zhongshan: 540


.. GENERATED FROM PYTHON SOURCE LINES 173-182

Step 2 - Build small JSON-like calibration metadata
---------------------------------------------------
The real script can extract:

  interval_calibration.factors_per_horizon

from a GeoPrior physics JSON file. It accepts both a list form
and a dict form such as {"H1": ..., "H2": ...}. We use one of
each here to show that both work. 

.. GENERATED FROM PYTHON SOURCE LINES 182-207

.. code-block:: Python


    ns_meta = {
        "interval_calibration": {
            "factors_per_horizon": [1.02, 1.06, 1.11]
        }
    }

    zh_meta = {
        "interval_calibration": {
            "factors_per_horizon": {
                "H1": 1.05,
                "H2": 1.11,
                "H3": 1.18,
            }
        }
    }

    factors_n = pux._extract_factors_per_horizon(ns_meta)
    factors_z = pux._extract_factors_per_horizon(zh_meta)

    print("")
    print("Extracted calibration factors")
    print(f"  Nansha:    {factors_n}")
    print(f"  Zhongshan: {factors_z}")


.. rst-class:: sphx-glr-script-out

 .. code-block:: none


    Extracted calibration factors
      Nansha:    [1.02, 1.06, 1.11]
      Zhongshan: [1.05, 1.11, 1.18]


.. GENERATED FROM PYTHON SOURCE LINES 208-217

Step 3 - Build the real per-city summaries
------------------------------------------
The script uses summarize_city(...) to compute:

- horizon-wise coverage80 and sharpness80,
- bootstrap confidence intervals,
- reliability points by horizon,
- PIT values and KS statistics,
- quantile residuals by horizon.

.. GENERATED FROM PYTHON SOURCE LINES 217-238

.. code-block:: Python


    nansha = pux.summarize_city(
        ns_df,
        city="Nansha",
        B=300,
    )

    zhongshan = pux.summarize_city(
        zh_df,
        city="Zhongshan",
        B=300,
    )

    print("")
    print("Nansha summary keys")
    print(sorted(nansha.keys()))

    print("")
    print("Zhongshan summary keys")
    print(sorted(zhongshan.keys()))


.. rst-class:: sphx-glr-script-out

 .. code-block:: none


    Nansha summary keys
    ['city', 'coverage80', 'coverage80_ci', 'horizons', 'ks_D', 'ks_p', 'pit', 'qresiduals', 'reliability', 'sharpness80', 'sharpness80_ci']

    Zhongshan summary keys
    ['city', 'coverage80', 'coverage80_ci', 'horizons', 'ks_D', 'ks_p', 'pit', 'qresiduals', 'reliability', 'sharpness80', 'sharpness80_ci']


.. GENERATED FROM PYTHON SOURCE LINES 239-245

Step 4 - Render the real S5 figure
----------------------------------
We call the actual plotting backend. For the gallery page we
keep only the PNG output by temporarily replacing the shared
saver. This keeps the lesson aligned with your current docs
preference: show the PNG, not PDF/EPS.

.. GENERATED FROM PYTHON SOURCE LINES 245-292

.. code-block:: Python


    tmp_dir = Path(
        tempfile.mkdtemp(prefix="gp_sg_uncx_")
    )

    out_base = tmp_dir / "uncertainty_extras_gallery"
    table_csv = tmp_dir / "uncertainty_extras_gallery.csv"
    table_tex = tmp_dir / "uncertainty_extras_gallery.tex"

    _orig_save_figure = pux.utils.save_figure


    def _save_png_only(fig, out, *, dpi):
        p = Path(out)
        if p.suffix:
            p = p.with_suffix("")
        fig.savefig(
            str(p) + ".png",
            dpi=int(dpi),
            bbox_inches="tight",
        )
        plt.close(fig)


    pux.utils.save_figure = _save_png_only

    try:
        pux.plot_s5(
            nansha=nansha,
            zhongshan=zhongshan,
            factors_n=factors_n,
            factors_z=factors_z,
            out_stem=out_base,
            dpi=160,
            show_legends=True,
            show_labels=True,
            show_ticklabels=True,
            show_title=True,
            show_panel_titles=True,
            title=(
                "Synthetic uncertainty extras: PIT, horizon drift, "
                "coverage and sharpness diagnostics"
            ),
        )
    finally:
        pux.utils.save_figure = _orig_save_figure


.. GENERATED FROM PYTHON SOURCE LINES 293-297

Step 5 - Write the real summary tables
--------------------------------------
The script also writes a CSV and a TeX table with the per-horizon
coverage80, sharpness80, and PIT KS statistics. We keep that

.. GENERATED FROM PYTHON SOURCE LINES 297-306

.. code-block:: Python


    pux.write_tables(
        nansha=nansha,
        zhongshan=zhongshan,
        out_csv=table_csv,
        out_tex=table_tex,
    )


.. GENERATED FROM PYTHON SOURCE LINES 307-311

Step 6 - Show the PNG produced by the backend
---------------------------------------------
The gallery page displays the actual PNG result generated by the
project plotting code.

.. GENERATED FROM PYTHON SOURCE LINES 311-318

.. code-block:: Python


    img = mpimg.imread(str(out_base) + ".png")

    fig, ax = plt.subplots(figsize=(10.0, 8.0))
    ax.imshow(img)
    ax.axis("off")


.. image-sg:: /auto_examples/figure_generation/images/sphx_glr_plot_uncertainty_extras_001.png
   :alt: plot uncertainty extras
   :srcset: /auto_examples/figure_generation/images/sphx_glr_plot_uncertainty_extras_001.png
   :class: sphx-glr-single-img


.. rst-class:: sphx-glr-script-out

 .. code-block:: none


    (np.float64(-0.5), np.float64(1381.5), np.float64(1309.5), np.float64(-0.5))


.. GENERATED FROM PYTHON SOURCE LINES 319-323

Step 7 - Read the exported summary table
----------------------------------------
The exported table is a compact audit of the key horizon-wise
uncertainty quantities.

.. GENERATED FROM PYTHON SOURCE LINES 323-330

.. code-block:: Python


    tbl = pd.read_csv(table_csv)

    print("")
    print("Exported summary table")
    print(tbl.to_string(index=False))


.. rst-class:: sphx-glr-script-out

 .. code-block:: none


    Exported summary table
         city  horizon  coverage80  coverage80_lo  coverage80_hi  sharpness80  sharpness80_lo  sharpness80_hi   ks_D   ks_p
       Nansha        1      1.0000         1.0000         1.0000      11.7904         11.7207         11.8863 0.3601 0.0000
       Nansha        2      1.0000         1.0000         1.0000      15.0881         14.9613         15.2011 0.3601 0.0000
       Nansha        3      1.0000         1.0000         1.0000      18.1467         18.0246         18.2590 0.3601 0.0000
    Zhongshan        1      1.0000         1.0000         1.0000      13.3503         13.2661         13.4376 0.3554 0.0000
    Zhongshan        2      1.0000         1.0000         1.0000      17.0388         16.9232         17.1555 0.3554 0.0000
    Zhongshan        3      1.0000         1.0000         1.0000      20.6914         20.5636         20.7978 0.3554 0.0000


.. GENERATED FROM PYTHON SOURCE LINES 331-345

Step 8 - Learn how to read the reliability row
----------------------------------------------
The top row contains one small reliability panel per horizon.

Each panel compares:

- nominal quantiles
- empirical quantiles

The dashed diagonal is the ideal relationship.

This row is useful because it reveals horizon-specific drift
directly. A model that looks acceptable overall can still become
underconfident or overconfident at later horizons.

.. GENERATED FROM PYTHON SOURCE LINES 347-358

Step 9 - Learn how to read the PIT panels
-----------------------------------------
The PIT histograms are a stronger calibration stress test.

A well-calibrated forecast tends to produce a PIT distribution
that is close to uniform.

The script also reports a KS statistic and an approximate KS
p-value in the panel title. A large KS statistic or a strongly
shaped histogram suggests that the forecast distribution is not
matching the observations well. 

.. GENERATED FROM PYTHON SOURCE LINES 360-381

Step 10 - Learn how to read coverage and sharpness
--------------------------------------------------
The next two panels summarize interval behavior across horizon.

Coverage panel
~~~~~~~~~~~~~~
This shows empirical 80% interval coverage together with
bootstrap confidence bands and a dashed 0.80 target line.

Sharpness panel
~~~~~~~~~~~~~~~
This shows the mean 80% interval width together with bootstrap
confidence bands.

The key lesson is:

- coverage tells you whether the interval captures reality often
  enough,
- sharpness tells you how wide the interval had to be to do that.

The two should always be read together.

.. GENERATED FROM PYTHON SOURCE LINES 383-394

Step 11 - Learn how to read the calibration-factor panel
--------------------------------------------------------
If horizon-specific post-hoc calibration factors are available,
the script shows them as grouped bars by city.

This panel is especially useful because it answers:

"How much extra interval scaling did each horizon need?"

Larger factors usually mean that the raw predictive intervals
were too narrow and needed to be widened more strongly.

.. GENERATED FROM PYTHON SOURCE LINES 396-414

Step 12 - Learn how to read quantile residuals
----------------------------------------------
The bottom panel shows horizon-wise residuals for q10, q50,
and q90:

  empirical - nominal

A residual near zero is good.

Positive residuals mean the empirical quantile lies above the
nominal target; negative residuals mean it lies below.

This panel is valuable because it separates where the calibration
error lives:

- lower tail,
- median,
- or upper tail.

.. GENERATED FROM PYTHON SOURCE LINES 416-436

Step 13 - Practical takeaway
----------------------------
This figure is one of the best uncertainty-audit pages in the
whole gallery because it combines:

- reliability by horizon,
- PIT behavior,
- coverage with confidence bands,
- sharpness with confidence bands,
- optional horizon calibration factors,
- and quantile residual drift.

In practice, it helps move from:

"The intervals look okay"

to:

"The intervals are okay, here is where they drift, and here is
how much recalibration each horizon needed."

.. GENERATED FROM PYTHON SOURCE LINES 438-490

Command-line version
--------------------
The same figure can be produced from the command line.

The real script supports:

- ``--ns-src`` and ``--zh-src`` for artifact discovery,
- ``--ns-forecast`` and ``--zh-forecast`` for direct CSV
  overrides,
- ``--ns-phys-json`` and ``--zh-phys-json`` for metadata
  overrides,
- ``--split`` with ``auto | val | test``,
- ``--bootstrap`` for CI draws,
- ``--tables-stem`` for the CSV/TEX summary table,
- the shared text flags added through
  ``utils.add_plot_text_args(..., default_out="supp-fig-s5-uncertainty-extras")``,
- and the backward-friendly ``--no-legends`` switch. 

Legacy dispatcher:

.. code-block:: bash

   python -m scripts plot-uncertainty-extras \
     --ns-src results/nansha_stage2_run \
     --zh-src results/zhongshan_stage2_run \
     --split auto \
     --bootstrap 1000 \
     --out supp-fig-s5-uncertainty-extras \
     --tables-stem supp-table-s5-reliability

Direct CSV override:

.. code-block:: bash

   python -m scripts plot-uncertainty-extras \
     --ns-forecast results/ns_calibrated.csv \
     --zh-forecast results/zh_calibrated.csv \
     --ns-phys-json results/ns_phys.json \
     --zh-phys-json results/zh_phys.json \
     --no-legends \
     --out supp-fig-s5-uncertainty-extras

Modern CLI:

.. code-block:: bash

   geoprior plot uncertainty-extras \
     --ns-src results/nansha_stage2_run \
     --zh-src results/zhongshan_stage2_run \
     --out supp-fig-s5-uncertainty-extras

The gallery page teaches the figure.
The command line reproduces it in a workflow.


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** (0 minutes 1.036 seconds)


.. _sphx_glr_download_auto_examples_figure_generation_plot_uncertainty_extras.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_uncertainty_extras.ipynb <plot_uncertainty_extras.ipynb>`

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_uncertainty_extras.py <plot_uncertainty_extras.py>`

    .. container:: sphx-glr-download sphx-glr-download-zip

      :download:`Download zipped: plot_uncertainty_extras.zip <plot_uncertainty_extras.zip>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_