.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "auto_examples/diagnostics/plot_r2_in_overview.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_auto_examples_diagnostics_plot_r2_in_overview.py>`
        to download the full example code.

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_auto_examples_diagnostics_plot_r2_in_overview.py:


Compare independent regression pairs with ``plot_r2_in``
========================================================

This lesson explains how to use
:func:`geoprior.plot.r2.plot_r2_in` when you do **not** have one shared
truth vector, but instead several separate ``(y_true, y_pred)`` pairs.

Why this function matters
-------------------------
There are many situations where each prediction task has its own target
vector:

- train vs validation predictions,
- city A vs city B,
- one forecast year vs another,
- one output variable vs another,
- or one experiment split vs another.

In those cases, the helper :func:`plot_r2` is not the most natural fit,
because it assumes one common ``y_true`` and many competing prediction
arrays.

``plot_r2_in`` solves that problem by accepting an alternating sequence
of pairs:

- ``y_true_1, y_pred_1, y_true_2, y_pred_2, ...``

and then building one diagnostic subplot for each pair.

What makes this helper special
------------------------------
Compared with ``plot_r2``, this version adds one useful diagnostic
feature: an optional fitted regression line and displayed line equation.
That turns the page into more than a score report. It also helps users
see whether a pair suffers from:

- slope shrinkage,
- amplitude inflation,
- offset bias,
- or broader scatter.

.. GENERATED FROM PYTHON SOURCE LINES 48-57

.. code-block:: Python


    from __future__ import annotations

    import matplotlib.pyplot as plt
    import numpy as np

    from geoprior.plot.r2 import plot_r2_in


.. GENERATED FROM PYTHON SOURCE LINES 58-72

Build three independent regression tasks
----------------------------------------

To make the purpose of ``plot_r2_in`` clear, we create three different
truth/prediction pairs rather than one shared truth vector.

Here we mimic:

- a training split,
- a validation split,
- and a more difficult external split.

This is exactly the kind of situation where alternating pairs are more
natural than the single-truth design of ``plot_r2``.

.. GENERATED FROM PYTHON SOURCE LINES 72-92

.. code-block:: Python


    rng = np.random.default_rng(7)

    n_train = 110
    n_valid = 90
    n_external = 100

    x_train = np.linspace(0.0, 1.0, n_train)
    x_valid = np.linspace(0.0, 1.0, n_valid)
    x_external = np.linspace(0.0, 1.0, n_external)

    y_true_train = 12.0 + 45.0 * x_train + 4.0 * np.sin(3.0 * np.pi * x_train)
    y_true_valid = 11.0 + 46.0 * x_valid + 4.5 * np.sin(3.0 * np.pi * x_valid)
    y_true_external = 10.0 + 48.0 * x_external + 5.5 * np.sin(3.2 * np.pi * x_external)

    y_pred_train = y_true_train + rng.normal(0.0, 1.8, n_train)
    y_pred_valid = 0.95 * y_true_valid + 1.2 + rng.normal(0.0, 2.8, n_valid)
    y_pred_external = 0.86 * y_true_external + 3.0 + rng.normal(0.0, 4.5, n_external)


.. GENERATED FROM PYTHON SOURCE LINES 93-105

Start with the simplest independent-pairs comparison
----------------------------------------------------

The first lesson use is to pass alternating pairs directly. This is
the core mental model of the function.

Read the resulting figure in this order:

1. compare the R² annotations,
2. compare the width of the scatter clouds,
3. check whether the fitted line stays close to the perfect-fit line,
4. inspect whether slope and intercept suggest systematic bias.

.. GENERATED FROM PYTHON SOURCE LINES 105-127

.. code-block:: Python


    fig = plot_r2_in(
        y_true_train,
        y_pred_train,
        y_true_valid,
        y_pred_valid,
        y_true_external,
        y_pred_external,
        titles=["Training split", "Validation split", "External split"],
        xlabel="Observed subsidence",
        ylabel="Predicted subsidence",
        scatter_colors=["#1f77b4", "#ff7f0e", "#d62728"],
        line_colors=["#3b3b3b", "#3b3b3b", "#3b3b3b"],
        line_styles=["--", "--", "--"],
        annotate=True,
        fit_eq=True,
        fit_line_color="#2ca02c",
        show_grid=True,
        max_cols=2,
    )


.. image-sg:: /auto_examples/diagnostics/images/sphx_glr_plot_r2_in_overview_001.png
   :alt: Training split, Validation split, External split
   :srcset: /auto_examples/diagnostics/images/sphx_glr_plot_r2_in_overview_001.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 128-144

Why the fitted line matters
---------------------------

This helper is especially valuable when the fitted line tells a story
that the R² score alone does not.

For example:

- a slope clearly below 1 suggests amplitude compression,
- a large intercept suggests offset bias,
- and a fitted line that diverges from the 1:1 line reveals systematic
  distortion even when the cloud still looks correlated.

In the external split above, the fitted line is often the fastest way
to notice that the model is not only noisier, but also less well
calibrated in amplitude.

.. GENERATED FROM PYTHON SOURCE LINES 147-153

Add RMSE and MAE to each pair
-----------------------------

``plot_r2_in`` can annotate extra metrics on each subplot. This is
useful when each pair represents a different operational condition and
you want a compact diagnostic card for each one.

.. GENERATED FROM PYTHON SOURCE LINES 153-176

.. code-block:: Python


    fig = plot_r2_in(
        y_true_train,
        y_pred_train,
        y_true_valid,
        y_pred_valid,
        y_true_external,
        y_pred_external,
        titles=["Training split", "Validation split", "External split"],
        xlabel="Observed subsidence",
        ylabel="Predicted subsidence",
        scatter_colors=["#17becf", "#9467bd", "#8c564b"],
        line_colors=["#444444", "#444444", "#444444"],
        line_styles=[":", ":", ":"],
        other_metrics=["rmse", "mae"],
        annotate=True,
        fit_eq=True,
        fit_line_color="#e377c2",
        show_grid=True,
        max_cols=3,
    )


.. image-sg:: /auto_examples/diagnostics/images/sphx_glr_plot_r2_in_overview_002.png
   :alt: Training split, Validation split, External split
   :srcset: /auto_examples/diagnostics/images/sphx_glr_plot_r2_in_overview_002.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 177-183

Use the function without the fitted equation when you want a cleaner page
-------------------------------------------------------------------------

When many pairs are compared, the figure can become text-heavy. In that
case, disabling the fitted equation is often the better teaching and
reporting choice.

.. GENERATED FROM PYTHON SOURCE LINES 183-205

.. code-block:: Python


    fig = plot_r2_in(
        y_true_train,
        y_pred_train,
        y_true_valid,
        y_pred_valid,
        y_true_external,
        y_pred_external,
        titles=["Training split", "Validation split", "External split"],
        xlabel="Observed subsidence",
        ylabel="Predicted subsidence",
        scatter_colors=["#4daf4a", "#377eb8", "#e41a1c"],
        line_colors=["#222222", "#222222", "#222222"],
        line_styles=["--", "--", "--"],
        other_metrics=["rmse"],
        annotate=True,
        fit_eq=False,
        show_grid=True,
        max_cols=3,
    )


.. image-sg:: /auto_examples/diagnostics/images/sphx_glr_plot_r2_in_overview_003.png
   :alt: Training split, Validation split, External split
   :srcset: /auto_examples/diagnostics/images/sphx_glr_plot_r2_in_overview_003.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 206-214

Demonstrate pair-wise cleaning with imperfect arrays
----------------------------------------------------

The implementation uses ``process_y_pairs`` to validate and clean the
alternating pairs. That is helpful when one pair contains missing
values but the others remain usable.

Here we introduce a few NaNs into one pair only.

.. GENERATED FROM PYTHON SOURCE LINES 214-243

.. code-block:: Python


    y_true_valid_nan = y_true_valid.copy()
    y_pred_valid_nan = y_pred_valid.copy()
    y_true_valid_nan[[10, 17]] = np.nan
    y_pred_valid_nan[[17, 25]] = np.nan

    fig = plot_r2_in(
        y_true_train,
        y_pred_train,
        y_true_valid_nan,
        y_pred_valid_nan,
        y_true_external,
        y_pred_external,
        titles=["Training split", "Validation split with NaNs", "External split"],
        xlabel="Observed subsidence",
        ylabel="Predicted subsidence",
        scatter_colors=["#a6cee3", "#fb9a99", "#b2df8a"],
        line_colors=["#4c4c4c", "#4c4c4c", "#4c4c4c"],
        line_styles=["--", "--", "--"],
        other_metrics=["rmse"],
        annotate=True,
        fit_eq=True,
        fit_line_color="#1b9e77",
        show_grid=True,
        max_cols=2,
    )


.. image-sg:: /auto_examples/diagnostics/images/sphx_glr_plot_r2_in_overview_004.png
   :alt: Training split, Validation split with NaNs, External split
   :srcset: /auto_examples/diagnostics/images/sphx_glr_plot_r2_in_overview_004.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 244-261

When should you choose ``plot_r2_in`` instead of ``plot_r2``?
--------------------------------------------------------------

Use ``plot_r2_in`` when each subplot should represent its own
independent diagnostic pair.

Good examples:

- train, validation, and test predictions,
- Nansha, Zhongshan, and an external city,
- groundwater and subsidence outputs,
- different forecast years evaluated separately.

A simple rule is:

- **shared truth, many predictions** -> ``plot_r2``
- **many independent truth/prediction pairs** -> ``plot_r2_in``

.. GENERATED FROM PYTHON SOURCE LINES 264-286

How to use it on your own data
------------------------------

A realistic workflow might look like this:

.. code-block:: python

   plot_r2_in(
       train_df["subsidence_actual"].to_numpy(),
       train_df["subsidence_q50"].to_numpy(),
       valid_df["subsidence_actual"].to_numpy(),
       valid_df["subsidence_q50"].to_numpy(),
       test_df["subsidence_actual"].to_numpy(),
       test_df["subsidence_q50"].to_numpy(),
       titles=["Train", "Validation", "Test"],
       other_metrics=["rmse", "mae"],
       fit_eq=True,
       max_cols=2,
   )

This is particularly useful in diagnostics pages because it makes the
quality difference across operating conditions visible at a glance.

.. GENERATED FROM PYTHON SOURCE LINES 289-301

A compact reading checklist
---------------------------

When reading a ``plot_r2_in`` figure, ask:

- Which pair has the strongest R²?
- Which pair has the narrowest scatter cloud?
- Does the fitted line remain close to the perfect-fit line?
- Is the slope near 1 and the intercept near 0?
- Do RMSE and MAE confirm the same ranking?

That combination gives a much richer diagnostic than a single R² table.

.. GENERATED FROM PYTHON SOURCE LINES 301-303

.. code-block:: Python


    plt.show()


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** (0 minutes 1.680 seconds)


.. _sphx_glr_download_auto_examples_diagnostics_plot_r2_in_overview.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_r2_in_overview.ipynb <plot_r2_in_overview.ipynb>`

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_r2_in_overview.py <plot_r2_in_overview.py>`

    .. container:: sphx-glr-download sphx-glr-download-zip

      :download:`Download zipped: plot_r2_in_overview.zip <plot_r2_in_overview.zip>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_