Add populate_cache for batched multi-RHS warming of virtual matrices by jd-lara · Pull Request #314 · Sienna-Platform/PowerNetworkMatrices.jl

jd-lara · 2026-06-02T15:05:59Z

Summary

Adds a new exported function populate_cache for VirtualPTDF, VirtualLODF, and VirtualMODF that lets callers pre-warm the lazy row caches over an iterable of components/contingencies using the solver backends' batched multi-RHS solve_sparse!, instead of the one-single-RHS-solve-per-row that getindex performs today.

This is targeted at psy6 for integration into the re-designed back end. The goal is to accelerate the VirtualPTDF and VirtualMODF queries that dominate the build of large optimization problems.

Motivation

getindex on the virtual matrices computes one row at a time, issuing a single right-hand-side solve through the ABA factorization per row. When an optimization-problem build queries many rows (every monitored branch, every contingency), those solves dominate runtime. Both solver backends (KLULinSolveCache, AAFactorCache) already expose a batched solve_sparse!(K, B, out; block=64) primitive — used today only by the full PTDF/LODF builders. populate_cache reuses that primitive on the virtual path.

What changed

VirtualPTDF / VirtualLODF — resolve requested rows, build one sparse RHS of the corresponding BA[valid_ix, rows] columns, solve in a single batched call, then scatter back to bus space (PTDF) or apply the LODF map (A·X) .* inv_PTDF_A_diag across all columns at once. Accepts integer arc indices, arc bus-pair tuples (from, to), or branch-name strings. The per-row math is identical to the existing single-row _compute_*_row.
VirtualMODF — batches the pre-contingency B⁻¹·BA[:,arc] solves over the union of every contingency's modified arcs and the user-supplied monitored set. Because the pre-contingency monitored-arc solve is contingency-independent, it is computed once and reused; the per-contingency Woodbury factors are assembled from these precomputed solves with no further linear solves. This collapses O(n_contingencies × (M + n_monitored)) one-at-a-time solves into a single batched solve over the distinct arcs. The monitored set is supplied by the user (monitored keyword).
RowCache — adds set_persistent_row! / pin_row! so populated rows are pinned (added to persistent_cache_keys) and never evicted by later lazy lookups, plus warn_if_over_capacity to flag undersized caches.
Backend dispatch — routes to KLU's solve_sparse! or AccelerateWrapper.solve_sparse!, with a column-by-column fallback for any other factorization backend.
Exports populate_cache; the public API docs page picks it up automatically via @autodocs.

API

populate_cache(vptdf::VirtualPTDF, components)            # arc indices / tuples / branch names
populate_cache(vlodf::VirtualLODF, components)
populate_cache(vmodf::VirtualMODF, contingencies; monitored)  # NetworkModification / ContingencySpec / Outage / UUID

Tests

test/test_populate_cache.jl covers, for KLU and (when available) AppleAccelerate:

PTDF/LODF/MODF equivalence between the batched populate_cache path and the lazy getindex path,
row pinning (persistence) semantics,
identifier resolution (integer / tuple / outage UUID),
the LODF self-element convention, and
the unregistered-contingency error path.

Note

Julia was not available in the execution environment, so the suite was not run locally — correctness was validated by close review against the existing solve/cache code paths and Aqua constraints (exports, ambiguities, unbound args). CI on this PR will exercise it.

https://claude.ai/code/session_011TFwa4Hsdse4XYEeuHmYro

Generated by Claude Code

Introduce `populate_cache` for VirtualPTDF, VirtualLODF, and VirtualMODF so callers can pre-warm row caches over an iterable of components/contingencies using the solver backends' batched multi-RHS `solve_sparse!` instead of one single-RHS solve per row. This amortizes the ABA factorization solves that dominate optimization-problem builds on large systems. - VirtualPTDF / VirtualLODF: gather requested arc rows, build one sparse RHS of the corresponding BA columns, solve in a single batched call, then scatter (PTDF) or apply the LODF map (A*X .* inv_PTDF_A_diag) across all columns at once. Accepts integer indices, arc bus-pair tuples, or branch-name strings. - VirtualMODF: batch the pre-contingency BA-column solves over the union of all monitored arcs and every contingency's modified arcs. The monitored-arc solve is contingency-independent, so it is computed once and reused; Woodbury factors per contingency are assembled from the precomputed solves with no further linear solves. Monitored set is supplied by the user. - RowCache: add set_persistent_row!/pin_row! so populated rows are pinned and never evicted by later lazy lookups, plus warn_if_over_capacity. - Backend dispatch routes to KLU's solve_sparse! or AccelerateWrapper's, with a column-by-column fallback for other factorization backends. - Export populate_cache; add tests covering PTDF/LODF/MODF equivalence with the lazy getindex path, pinning, identifier resolution, and error handling. https://claude.ai/code/session_011TFwa4Hsdse4XYEeuHmYro

…ache-KFeWX

github-actions · 2026-06-02T15:15:32Z

Performance Results

Precompile Time

Main	This Branch	Delta
2.188 s	2.244 s	+2.6%

Execution Time

Test	Main	This Branch	Delta
matpower_ACTIVSg2000_sys-Build PTDF	FAILED	N/A	N/A
matpower_ACTIVSg2000_sys-Build Ybus	FAILED	N/A	N/A
matpower_ACTIVSg2000_sys-Build LODF	FAILED	N/A	N/A
matpower_ACTIVSg2000_sys-Build VirtualMODF	FAILED	N/A	N/A
matpower_ACTIVSg2000_sys-Radial network reduction	FAILED	N/A	N/A
matpower_ACTIVSg2000_sys-Degree two network reduction	FAILED	N/A	N/A
Base_Eastern_Interconnect_515GW-Build Ybus	FAILED	N/A	N/A
Base_Eastern_Interconnect_515GW-Radial network reduction	FAILED	N/A	N/A
Base_Eastern_Interconnect_515GW-Degree two network reduction	FAILED	N/A	N/A
matpower_ACTIVSg2000_sys-Build PTDF First	N/A	1.744 s	N/A
matpower_ACTIVSg2000_sys-Build PTDF Second	N/A	126.2 ms	N/A
matpower_ACTIVSg2000_sys-Build Ybus First	N/A	6.2 ms	N/A
matpower_ACTIVSg2000_sys-Build Ybus Second	N/A	5.6 ms	N/A
matpower_ACTIVSg2000_sys-Build LODF First	N/A	416.7 ms	N/A
matpower_ACTIVSg2000_sys-Build LODF Second	N/A	121.9 ms	N/A
matpower_ACTIVSg2000_sys-Build VirtualMODF First	N/A	3.718 s	N/A
matpower_ACTIVSg2000_sys-Build VirtualMODF Second	N/A	1.113 s	N/A
matpower_ACTIVSg2000_sys-VirtualMODF Query 10 rows	N/A	489.6 ms	N/A
matpower_ACTIVSg2000_sys-Radial network reduction First	N/A	511.1 ms	N/A
matpower_ACTIVSg2000_sys-Radial network reduction Second	N/A	0.6 ms	N/A
matpower_ACTIVSg2000_sys-Degree two network reduction First	N/A	1.762 s	N/A
matpower_ACTIVSg2000_sys-Degree two network reduction Second	N/A	1.0 ms	N/A
Base_Eastern_Interconnect_515GW-Build Ybus First	N/A	1.105 s	N/A
Base_Eastern_Interconnect_515GW-Build Ybus Second	N/A	277.9 ms	N/A
Base_Eastern_Interconnect_515GW-Radial network reduction First	N/A	32.1 ms	N/A
Base_Eastern_Interconnect_515GW-Radial network reduction Second	N/A	31.0 ms	N/A
Base_Eastern_Interconnect_515GW-Degree two network reduction First	N/A	396.5 ms	N/A
Base_Eastern_Interconnect_515GW-Degree two network reduction Second	N/A	35.0 ms	N/A

…property Merge of origin/psy6 moved the virtual matrices' shared state into VirtualFactorCore. Rework populate_cache so every wrapper field is read through its proper getter (get_core, get_cache, get_cache_lock, get_dist_slack, get_dist_slack_normalized, get_inv_PTDF_A_diag, get_row_caches, get_woodbury_cache, get_max_cache_size_bytes, get_contingency_cache, get_arc_lookup, get_tol, get_ref_bus_position) instead of relying on the wrappers' getproperty forwarding. Core container fields are accessed directly on the VirtualFactorCore, matching _compute_ptdf_row/_compute_lodf_row and the Woodbury kernel. The batched multi-RHS solve and the from-precomputed-solve Woodbury assembly are unchanged and remain byte-for-byte consistent with the post-merge woodbury_kernel.jl. Tests read cache/core/contingency state through getters too.

The redesigned backend routes array getindex through to_index, which resolves the row against the arc-tuple lookup, so vptdf[i, :] with an integer row no longer works (integer+integer and arc-tuple+colon do). Rework the PTDF/LODF tests to index rows by arc tuple, and cover integer-keyed population by inspecting the row cache directly. VirtualMODF keeps its integer getindex(::Int, contingency) method, so those tests are unchanged in spirit. Also apply the JuliaFormatter wrapping flagged by reviewdog on src/populate_cache.jl (generic _solve_multi_rhs!, the mods comprehension, and the _woodbury_factors_from_base call) and restructure the solver-looped testsets to a begin/for form that stays within the line-length margin.

Copilot

Pull request overview

Adds a new public API populate_cache to bulk pre-warm and pin lazy row caches for VirtualPTDF, VirtualLODF, and VirtualMODF using the existing batched multi-RHS solve_sparse! backends, reducing “one-solve-per-row” overhead during large optimization-model builds.

Changes:

Introduces populate_cache implementations for VirtualPTDF/LODF/MODF, including batched BA-column solves and MODF Woodbury assembly from precomputed base solves.
Extends RowCache with pinned-row helpers (set_persistent_row!, pin_row!) and a capacity warning (warn_if_over_capacity).
Adds a dedicated test file covering equivalence vs lazy getindex, pinning semantics, identifier resolution, and key error paths.

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 3 comments.

File	Description
`src/populate_cache.jl`	New `populate_cache` API + batched solve plumbing and MODF batched warm path.
`src/row_cache.jl`	Adds APIs to pin rows persistently and warn when pinned rows exceed configured capacity.
`src/PowerNetworkMatrices.jl`	Exports `populate_cache` and includes the new implementation file.
`test/test_populate_cache.jl`	Adds coverage for populate_cache behavior across PTDF/LODF/MODF and backends.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+# Generic fallback for any other factorization backend: solve column-by-column.
+function _solve_multi_rhs!(
+    K,
+    B::SparseArrays.SparseMatrixCSC{Float64, Int},
+    out::Matrix{Float64},
+)
+    n = size(B, 1)
+    col = zeros(n)
+    @inbounds for j in axes(B, 2)
+        fill!(col, 0.0)
+        for p in SparseArrays.nzrange(B, j)
+            col[SparseArrays.rowvals(B)[p]] = SparseArrays.nonzeros(B)[p]
+        end


+# --- Component resolution --------------------------------------------------
+
+# Resolve a user-supplied component to an integer arc/row index. Accepts the
+# same identifiers as `getindex`: an integer index, an arc bus-pair tuple, or a


+    if !use_dist_slack && !isempty(dist_slack)
+        error("Distributed bus specification doesn't match the number of buses.")
+    end
+    tol = get_tol(core)
+
+    # Only solve rows not already resident; existing rows are pinned below.
+    new_rows = @lock cache_lock Int[r for r in rows if !haskey(cache, r)]
+
+    if !isempty(new_rows)
+        sol = @lock core.solver_lock _solve_arc_columns(core, new_rows)
+        valid_ix = core.valid_ix
+        @lock cache_lock begin
+            full = zeros(buscount)
+            for (j, r) in enumerate(new_rows)
+                haskey(cache, r) && continue  # lost a race; keep the winner


…back - VirtualMODF populate_cache resolves monitored arc tuples through _monitored_arc_index, so an arc eliminated by network reduction yields the descriptive VirtualMODF error instead of a raw KeyError. - VirtualPTDF/VirtualLODF populate_cache build each stored row (scatter, dist-slack, sparsify) outside cache_lock and take the lock only for the double-check + insert, matching the cached_row_lookup pattern and shortening the critical section. - The generic _solve_multi_rhs! fallback now checks `applicable` and raises a clear, actionable error when a backend implements neither solve_sparse! nor _solve_factorization, instead of surfacing a raw MethodError. - Add shared RowCacheValue alias for the cached-row value type.

Replace the remaining `vmodf.arc_susceptances` field reads in the populate_cache tests with the `_get_arc_susceptances` accessor, so no code in this change reaches struct fields through the wrappers' getproperty forwarding.

jd-lara requested a review from Copilot June 2, 2026 15:06

Copilot started reviewing on behalf of jd-lara June 2, 2026 15:06 View session

Merge remote-tracking branch 'origin/psy6' into claude/pnm-populate-c…

132600a

…ache-KFeWX

claude added 2 commits June 2, 2026 15:16

Copilot AI reviewed Jun 2, 2026

View reviewed changes

claude added 2 commits June 2, 2026 15:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add populate_cache for batched multi-RHS warming of virtual matrices#314

Add populate_cache for batched multi-RHS warming of virtual matrices#314
jd-lara wants to merge 6 commits into
psy6from
claude/pnm-populate-cache-KFeWX

jd-lara commented Jun 2, 2026

Uh oh!

github-actions Bot commented Jun 2, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jd-lara commented Jun 2, 2026

Summary

Motivation

What changed

API

Tests

Note

Uh oh!

github-actions Bot commented Jun 2, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Performance Results

Precompile Time

Execution Time

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

github-actions Bot commented Jun 2, 2026 •

edited

Loading