Skip to content

Skip auto-DDL replication for commands referencing temporary relations#486

Open
danolivo wants to merge 3 commits into
mainfrom
spoc-569
Open

Skip auto-DDL replication for commands referencing temporary relations#486
danolivo wants to merge 3 commits into
mainfrom
spoc-569

Conversation

@danolivo

@danolivo danolivo commented May 28, 2026

Copy link
Copy Markdown
Contributor

Problem

Spock's automatic DDL replication ships the raw command text to be re-parsed and executed on every subscriber. When a permanent table is created from a temporary one, the shipped command is valid on the origin but cannot run on the subscriber, where the temporary relation does not exist:

  • CREATE TABLE foo (LIKE some_temp_table)
  • CREATE TABLE foo AS SELECT ... FROM some_temp_table

The replicated command fails on the subscriber with relation "..." does not exist, and because queued changes apply in order, the apply worker stalls — every later change from that origin (including spock.sync_event()) is blocked behind the un-appliable command. The existing temp-relation guard only checked the target of the CREATE, not relations it copies from.

Root cause

By the time auto-DDL runs (post-execution, in the ProcessUtility hook) transformCreateStmt() has already expanded the LIKE clause out of CreateStmt->tableElts, so the executed parse tree no longer shows the source relation. Inspecting that tree finds nothing. The information only survives in the raw command text — the exact text we would ship and the subscriber would re-parse.

Solution

In spock_auto_replicate_ddl(), detect temporary-relation references and skip replication with an explanatory message instead of shipping a command that cannot apply:

  • CREATE TABLE ... (LIKE ...) — re-parse the raw command text (which still carries the LIKE clause), resolve each source relation against the catalogue, and check its persistence. Name resolution uses the live search_path, matching what the executor did on the origin.
  • CREATE TABLE AS — walk the analysed defining query with the existing isQueryUsingTempRelation(), which catches references at any nesting level (e.g. inside a subquery).

spock_auto_replicate_ddl() now returns whether the statement was queued. spock_autoddl_process() adds the relation to the replication set only when it was — otherwise a table whose CREATE was not replicated would have its later DML fail to apply downstream, reintroducing the same stall.

Limitations

Detection covers only relations named directly in the statement. A temporary relation reached indirectly — through a function body or dynamic SQL — cannot be detected statically; that is a separate, fundamentally undecidable case.

Also in this branch

Replaces the multi-PG mesh installcheck rig with a single-PG18 one (tests/run-single-pg18-installcheck.sh + GitHub Actions workflow). The multi-version rig conflated genuine bugs with inherent cross-version DDL-shipping fragility (e.g. a PG16-built regress.dylib failing to load on a PG18 subscriber), which is expected and out of scope for the regression mesh. The single-PG18 rig builds PostgreSQL REL_18_STABLE and Spock once, wires three PG18 nodes into a full mesh, stresses it with make installcheck, and asserts every subscription stays enabled and spock.sync_event() round-trips on every edge.

@danolivo danolivo self-assigned this May 28, 2026
@danolivo danolivo added enhancement New feature or request skip-test-nightly Skip this PR in the nightly TAP workflow labels May 28, 2026
@coderabbitai

coderabbitai Bot commented May 28, 2026

Copy link
Copy Markdown

Review Change Stack

📝 Walkthrough

Walkthrough

Two independent changes: (1) spock_auto_replicate_ddl is changed to return bool and now skips replication for DDL referencing temporary relations (CREATE TABLE LIKE or CREATE TABLE AS from temp sources), a new append_set_search_path helper fixes empty search_path quoting, spock_autoddl_process gates add_ddl_to_repset on that bool, and regression tests cover the new behaviors; (2) a new Bash harness and GitHub Actions workflow build PG18 once, spin up three Spock nodes, wire a full subscription mesh, run installcheck-parallel, and verify spock.sync_event propagation.

Changes

DDL Replication Safety Fixes

Layer / File(s) Summary
search_path quoting helper and replicate_ddl_command refactor
src/spock_functions.c
Adds append_set_search_path() to produce a safe SET search_path TO ...; fragment for empty/whitespace values; refactors spock_replicate_ddl_command to use it instead of inline SplitIdentifierString quoting.
spock_auto_replicate_ddl bool return and temp-relation detection
include/spock.h, src/spock_functions.c
Changes declaration and implementation from void to bool; adds query_temp_like_source() for CREATE TABLE ... LIKE from temp sources; adds CREATE TABLE AS temp-source check; updates EXPLAIN, search_path, and skip_ddl paths to return false.
Conditional add_ddl_to_repset
src/spock_autoddl.c
Wraps add_ddl_to_repset(parsetree) behind if (spock_auto_replicate_ddl(...)), preventing replication-set tracking when DDL replication is skipped.
Structured relcache error
src/spock_relcache.c
Replaces bare elog(ERROR) for unknown remote relations with ereport(ERROR, ERRCODE_INTERNAL_ERROR) carrying errmsg, errdetail, and errhint.
Regression tests
tests/regress/sql/autoddl.sql
Adds test cases: replication with whitespace-only search_path; CREATE TABLE LIKE from temp source (replication skipped); CREATE TABLE AS from temp source and nested subquery (replication skipped); provider-side cleanup.

PG18 Single-Node Installcheck Infrastructure

Layer / File(s) Summary
CI workflow and .gitignore
.github/workflows/installcheck-single-pg18.yml, .gitignore
New workflow with manual/push triggers, apt dependency install, script invocation with nproc parallelism, and conditional log artifact upload on failure; working directory added to .gitignore.
Script bootstrap, helpers, and CLI parsing
tests/run-single-pg18-installcheck.sh
Bash strict mode, logging helpers (log/say/fail), ERR trap, CLI parsing (--base-dir, --keep, --force, --jobs), log directory creation, and per-node DSN/psql helpers.
PG18 and Spock build pipeline
tests/run-single-pg18-installcheck.sh
Clones REL_18_STABLE, applies Spock patches with idempotency markers, configures and builds PostgreSQL including pg_regress, then builds and installs the Spock extension against the shared pg_config.
Node initialization, lifecycle, and mesh bootstrap
tests/run-single-pg18-installcheck.sh
initdb per node with Spock config and local-only pg_hba.conf; pg_ctl start/stop; pg_isready readiness probing; creates regression DBs, enables Spock, calls spock.node_create, and wires full cross-node subscriptions via spock.sub_create.
Installcheck execution and subscription monitoring
tests/run-single-pg18-installcheck.sh
Runs make -k installcheck-parallel against n1 with --use-existing; polls spock.sub_show_status() pre/post installcheck until all subscriptions reach status='replicating'; prints subscription state and connection parameters.
Sync-event verification and main orchestration
tests/run-single-pg18-installcheck.sh
check_sync_event_propagation() confirms spock.sync_event() delivery on every mesh edge; verify_subs_enabled() detects disabled subscriptions; main() coordinates all phases and returns exit code 2 on failure.

Poem

🐇 A bunny hopped through temp tables with care,
"No replication for you!" it declared with flair.
search_path gets quoted, no blank SQL in sight,
Three PG18 nodes mesh up overnight.
With sync_event verified hop by hop—
The regression suite runs without a flop! 🎉

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name Status Explanation Resolution
Docstring Coverage ⚠️ Warning Docstring coverage is 43.69% which is insufficient. The required threshold is 80.00%. Write docstrings for the functions missing them to satisfy the coverage threshold.
✅ Passed checks (4 passed)
Check name Status Explanation
Title check ✅ Passed The title accurately describes the main change: preventing auto-DDL replication for commands that reference temporary relations, which is the core logic change across multiple files.
Linked Issues check ✅ Passed Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check ✅ Passed Check skipped because no linked issues were found for this pull request.
Description check ✅ Passed The pull request description is directly related to the changeset, describing the multi-PG mesh installcheck harness with detailed layout and verification requirements.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
📝 Generate docstrings
  • Create stacked PR
  • Commit on current branch
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch spoc-569

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@codacy-production

codacy-production Bot commented May 28, 2026

Copy link
Copy Markdown

Up to standards ✅

🟢 Issues 0 issues

Results:
0 new issues

View in Codacy

🟢 Metrics 0 duplication

Metric Results
Duplication 0

View in Codacy

NEW Get contextual insights on your PRs based on Codacy's metrics, along with PR and Jira context, without leaving GitHub. Enable AI reviewer
TIP This summary will be updated as you push new changes.

@coderabbitai coderabbitai Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 4

🧹 Nitpick comments (1)
.github/workflows/installcheck-multi-pg.yml (1)

50-50: 🏗️ Heavy lift

Consider pinning actions to commit SHAs for supply-chain security.

Currently, the workflow references actions/checkout@v4 and actions/upload-artifact@v4 using mutable tags. Pinning to full commit SHAs (e.g., actions/checkout@<sha>) prevents tag-moving attacks and improves supply-chain security by ensuring the exact action code is reviewed and locked.

Example pinning (verify current SHAs before applying):

uses: actions/checkout@b4ffde65f46336ab88eb53be808477a3936bae11  # v4.1.1

Note: This requires looking up and maintaining SHA references, which adds maintenance overhead when updating action versions.

Also applies to: 67-67

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In @.github/workflows/installcheck-multi-pg.yml at line 50, The workflow uses
mutable action tags (actions/checkout@v4 and actions/upload-artifact@v4);
replace those with the corresponding immutable commit SHAs (e.g.,
actions/checkout@<commit-sha> and actions/upload-artifact@<commit-sha>) by
looking up the current, verified commit SHAs for the desired versions and
updating the `uses:` entries for the steps that reference `actions/checkout@v4`
and `actions/upload-artifact@v4` so the workflow pins to exact commits.
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In @.github/workflows/installcheck-multi-pg.yml:
- Around line 49-50: Update the GitHub Actions checkout step named "Checkout
spock" (the actions/checkout@v4 usage) to include persist-credentials: false to
avoid persisting the GITHUB_TOKEN to disk; locate the step with name "Checkout
spock" and add a persist-credentials: false key under that step (properly
indented) so the checkout action explicitly disables credential persistence.

In `@tests/run-multi-pg-installcheck.sh`:
- Around line 204-218: on_err currently logs failure and kills builders but does
not stop any started Postgres nodes; update on_err to invoke stop_all_nodes (or
the existing node-teardown function used in main) unless the run was started
with the "--keep" option, i.e. check the same keep-flag variable used elsewhere
in the script and call stop_all_nodes || true before exit; retain
dump_logs_on_failure and kill_outstanding_builders behavior and make the
stop_all_nodes call idempotent/safe so it can run from the ERR trap without
breaking command-substitution contexts.
- Around line 338-342: When a partial patch application leaves a checkout in a
modified state, reruns hit already-applied patches because _do_patch_pg() only
writes .spock-patches-applied at the end; update the logic so that before
reapplying patches (in patch_pg() or at the start of _do_patch_pg()) you detect
the absence of the .spock-patches-applied marker and then either reset the
existing checkout (e.g., git reset --hard && git clean -fdx) or force a fresh
clone via clone_pg(); ensure the reset/clean happens whenever the marker is not
present but the repository directory exists so partial applies are wiped before
attempting the patch sequence again.
- Around line 232-260: Validate BASE_DIR (after resolving with BASE_DIR="$(cd
"${BASE_DIR}" && pwd)") before running mkdir/rm: ensure it's non-empty and not a
dangerous root like "/" or other top-level system roots (at minimum reject ""
and "/"), or require a harness-owned sentinel file (e.g. check for
"${BASE_DIR}/.spock-harness") before performing rm -rf on LOG_DIR and PID_DIR;
if validation fails, print an error and exit non‑zero. Update the logic around
BASE_DIR, SOCK_DIR, LOG_DIR, PID_DIR, mkdir -p and rm -rf to enforce this guard.

---

Nitpick comments:
In @.github/workflows/installcheck-multi-pg.yml:
- Line 50: The workflow uses mutable action tags (actions/checkout@v4 and
actions/upload-artifact@v4); replace those with the corresponding immutable
commit SHAs (e.g., actions/checkout@<commit-sha> and
actions/upload-artifact@<commit-sha>) by looking up the current, verified commit
SHAs for the desired versions and updating the `uses:` entries for the steps
that reference `actions/checkout@v4` and `actions/upload-artifact@v4` so the
workflow pins to exact commits.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: cca87c39-c1bb-4f8e-ab01-1367688b0037

📥 Commits

Reviewing files that changed from the base of the PR and between 5345184 and 80d6a74.

📒 Files selected for processing (3)
  • .github/workflows/installcheck-multi-pg.yml
  • .gitignore
  • tests/run-multi-pg-installcheck.sh

Comment on lines +49 to +50
- name: Checkout spock
uses: actions/checkout@v4

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Set persist-credentials: false to prevent credential leakage.

The checkout action should explicitly set persist-credentials: false to ensure the GitHub token is not persisted to disk after checkout completes. While the artifacts being uploaded are only logs and test output (not the git checkout itself), this is a security best practice to prevent accidental credential exposure.

🔒 Proposed fix
       - name: Checkout spock
         uses: actions/checkout@v4
+        with:
+          persist-credentials: false
📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
- name: Checkout spock
uses: actions/checkout@v4
- name: Checkout spock
uses: actions/checkout@v4
with:
persist-credentials: false
🧰 Tools
🪛 zizmor (1.25.2)

[warning] 49-50: credential persistence through GitHub Actions artifacts (artipacked): does not set persist-credentials: false

(artipacked)


[error] 50-50: unpinned action reference (unpinned-uses): action is not pinned to a hash (required by blanket policy)

(unpinned-uses)

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In @.github/workflows/installcheck-multi-pg.yml around lines 49 - 50, Update the
GitHub Actions checkout step named "Checkout spock" (the actions/checkout@v4
usage) to include persist-credentials: false to avoid persisting the
GITHUB_TOKEN to disk; locate the step with name "Checkout spock" and add a
persist-credentials: false key under that step (properly indented) so the
checkout action explicitly disables credential persistence.

Comment thread tests/run-multi-pg-installcheck.sh Outdated
Comment on lines +204 to +218
trap 'on_err $? $LINENO' ERR

on_err() {
local rc=$1 line=$2
log "Aborted: exit ${rc} at line ${line}"
dump_logs_on_failure || true
# Deliberately NO stop_all_nodes here. It used to live in this
# trap, but combined with set -E it would fire from inside command
# substitutions and silently kill every cluster on a single
# transient psql hiccup. Teardown lives in main()'s normal flow
# (where it has the correct context); kill_outstanding_builders is
# still safe to call here because it operates on PIDs we know are
# ours.
kill_outstanding_builders || true
exit "${rc}"

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Always tear down started clusters on failure unless --keep is set.

Right now the only stop_all_nodes() call is on the success path in main(). Any error after some postmasters have started—including the already-hit wait_for_builders || fail "one or more builders failed" path—exits without cleanup, so a failed run can leave sockets/processes behind even though the script documents “servers will be stopped on script exit” by default.

Also applies to: 1012-1016, 1082-1086

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/run-multi-pg-installcheck.sh` around lines 204 - 218, on_err currently
logs failure and kills builders but does not stop any started Postgres nodes;
update on_err to invoke stop_all_nodes (or the existing node-teardown function
used in main) unless the run was started with the "--keep" option, i.e. check
the same keep-flag variable used elsewhere in the script and call stop_all_nodes
|| true before exit; retain dump_logs_on_failure and kill_outstanding_builders
behavior and make the stop_all_nodes call idempotent/safe so it can run from the
ERR trap without breaking command-substitution contexts.

Comment thread tests/run-multi-pg-installcheck.sh Outdated
Comment on lines +232 to +260
while [ "$#" -gt 0 ]; do
case "$1" in
--base-dir) BASE_DIR="$2"; shift 2 ;;
--keep) KEEP_RUNNING=1; shift ;;
--force) FORCE_REBUILD=1; shift ;;
--jobs) JOBS_TOTAL="$2"; shift 2 ;;
-h|--help) usage; exit 0 ;;
*) fail "unknown argument: $1" 4 ;;
esac
done

mkdir -p "${BASE_DIR}/src" \
"${BASE_DIR}/bin" \
"${BASE_DIR}/spock-build" \
"${BASE_DIR}/pgdata" \
"${BASE_DIR}/log" \
"${BASE_DIR}/sock" \
"${BASE_DIR}/pid"
BASE_DIR="$(cd "${BASE_DIR}" && pwd)"
SOCK_DIR="${BASE_DIR}/sock"
LOG_DIR="${BASE_DIR}/log"
PID_DIR="${BASE_DIR}/pid"

# Fresh log and pid directories per run: stale lines from a previous
# run would otherwise commingle with new output and make diagnosis
# painful. Deliberately scoped to log/ and pid/ -- src/, bin/,
# spock-build/, and pgdata/ are preserved so reuse-on-rerun still works.
rm -rf "${LOG_DIR}" "${PID_DIR}"
mkdir -p "${LOG_DIR}" "${PID_DIR}"

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Guard --base-dir before the script starts deleting under it.

BASE_DIR is accepted verbatim and then used for mkdir -p plus rm -rf "${LOG_DIR}" "${PID_DIR}". A typo like --base-dir /var or --base-dir / turns this harness into a host-filesystem cleanup tool. Please reject unsafe roots (at minimum empty, /, and other top-level system paths) or require a harness-owned sentinel file before removing anything.

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/run-multi-pg-installcheck.sh` around lines 232 - 260, Validate BASE_DIR
(after resolving with BASE_DIR="$(cd "${BASE_DIR}" && pwd)") before running
mkdir/rm: ensure it's non-empty and not a dangerous root like "/" or other
top-level system roots (at minimum reject "" and "/"), or require a
harness-owned sentinel file (e.g. check for "${BASE_DIR}/.spock-harness") before
performing rm -rf on LOG_DIR and PID_DIR; if validation fails, print an error
and exit non‑zero. Update the logic around BASE_DIR, SOCK_DIR, LOG_DIR, PID_DIR,
mkdir -p and rm -rf to enforce this guard.

@coderabbitai coderabbitai Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 2

♻️ Duplicate comments (3)
tests/run-multi-pg-installcheck.sh (3)

243-260: ⚠️ Potential issue | 🟠 Major

Validate --base-dir before deleting under it.

This path is accepted verbatim and then used for rm -rf "${LOG_DIR}" "${PID_DIR}". A typo like --base-dir / or another top-level system path turns the harness into a host-filesystem cleanup step.

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/run-multi-pg-installcheck.sh` around lines 243 - 260, The script
currently accepts --base-dir verbatim and then runs rm -rf "${LOG_DIR}"
"${PID_DIR}" which can wipe critical paths; before deleting, validate and
canonicalize BASE_DIR (the variable set by --base-dir) and assert it is
non-empty, not "/" (or other top-level paths), and that LOG_DIR and PID_DIR are
actual subdirectories of that canonicalized BASE_DIR; if the checks fail, abort
with an error. Use the existing BASE_DIR, LOG_DIR, PID_DIR variables (and the
canonicalization step BASE_DIR="$(cd "${BASE_DIR}" && pwd)") to perform these
checks and refuse to run rm -rf unless the safe-guard conditions pass.

206-218: ⚠️ Potential issue | 🟠 Major

Default cleanup still does not run on failure.

on_err() only kills builders, and the explicit wait_for_builders || fail ... / wait_for_all_ready || fail ... paths in main() bypass the ERR trap entirely. A failed run can still leave started postmasters and sockets behind even when --keep was not set.

Also applies to: 1012-1016

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/run-multi-pg-installcheck.sh` around lines 206 - 218, The failure paths
bypass the ERR trap so started postmasters/sockets can be left running; update
cleanup so failures always perform default teardown: modify on_err() to call
stop_all_nodes and the normal teardown (or the same function main() uses for
cleanup) in addition to kill_outstanding_builders and dump_logs_on_failure,
honoring the --keep flag if set, and change the explicit failure paths in main()
(the points that call wait_for_builders || fail... and wait_for_all_ready ||
fail...) to invoke on_err with the proper exit code (or call the shared teardown
function) instead of exiting directly so cleanup runs consistently.

363-381: ⚠️ Potential issue | 🟠 Major

A partial patch failure still leaves reruns stuck on a dirty checkout.

The marker is only written after the entire patch loop succeeds. If one patch fails after earlier ones applied, the next run reuses that modified tree and re-enters git apply on already-applied patches until the checkout is manually reset.

Also applies to: 390-394

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/run-multi-pg-installcheck.sh` around lines 363 - 381, The loop in
_do_patch_pg applies patches but always writes the .spock-patches-applied marker
even if some git apply invocations fail; modify _do_patch_pg so that each git
apply is checked and on any failure the function immediately prints an error,
returns non-zero (or exits), and does not write the marker, and only touch
"${src}/.spock-patches-applied" after the loop completes successfully (i.e.,
after all git apply calls returned success and any==1); reference the
_do_patch_pg function, the git apply invocation and the .spock-patches-applied
marker when making this change.
🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In @.github/workflows/installcheck-multi-pg.yml:
- Around line 52-53: The workflow uses mutable action tags; replace the tagged
usages of the actions with the provided immutable commit SHAs so the steps
"uses: actions/checkout@v4" and "uses: actions/upload-artifact@v4" are updated
to "uses: actions/checkout@34e114876b0b11c390a56381ad16ebd13914f8d5" and "uses:
actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02" respectively
to pin the actions to immutable SHAs.

In `@tests/run-multi-pg-installcheck.sh`:
- Around line 338-342: The --force flag is currently not forcing a fresh
source+patch because clone_pg() reuses existing checkouts and patch_pg() skips
when a marker exists; update both to honor FORCE by having clone_pg() delete or
reclone the ${src} directory (or run a fresh git clone) when FORCE/--force is
set so it cannot reuse a stale checkout, and have patch_pg() remove or ignore
the existing marker (the patch-applied sentinel used there) when FORCE is set so
patches are re-applied; modify the logic around clone_pg() and patch_pg() to
check the FORCE variable and remove the existing source tree and marker before
proceeding.

---

Duplicate comments:
In `@tests/run-multi-pg-installcheck.sh`:
- Around line 243-260: The script currently accepts --base-dir verbatim and then
runs rm -rf "${LOG_DIR}" "${PID_DIR}" which can wipe critical paths; before
deleting, validate and canonicalize BASE_DIR (the variable set by --base-dir)
and assert it is non-empty, not "/" (or other top-level paths), and that LOG_DIR
and PID_DIR are actual subdirectories of that canonicalized BASE_DIR; if the
checks fail, abort with an error. Use the existing BASE_DIR, LOG_DIR, PID_DIR
variables (and the canonicalization step BASE_DIR="$(cd "${BASE_DIR}" && pwd)")
to perform these checks and refuse to run rm -rf unless the safe-guard
conditions pass.
- Around line 206-218: The failure paths bypass the ERR trap so started
postmasters/sockets can be left running; update cleanup so failures always
perform default teardown: modify on_err() to call stop_all_nodes and the normal
teardown (or the same function main() uses for cleanup) in addition to
kill_outstanding_builders and dump_logs_on_failure, honoring the --keep flag if
set, and change the explicit failure paths in main() (the points that call
wait_for_builders || fail... and wait_for_all_ready || fail...) to invoke on_err
with the proper exit code (or call the shared teardown function) instead of
exiting directly so cleanup runs consistently.
- Around line 363-381: The loop in _do_patch_pg applies patches but always
writes the .spock-patches-applied marker even if some git apply invocations
fail; modify _do_patch_pg so that each git apply is checked and on any failure
the function immediately prints an error, returns non-zero (or exits), and does
not write the marker, and only touch "${src}/.spock-patches-applied" after the
loop completes successfully (i.e., after all git apply calls returned success
and any==1); reference the _do_patch_pg function, the git apply invocation and
the .spock-patches-applied marker when making this change.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 36b9fb51-6a70-4c1d-bc1a-50ab582519ee

📥 Commits

Reviewing files that changed from the base of the PR and between 80d6a74 and ed892a8.

📒 Files selected for processing (3)
  • .github/workflows/installcheck-multi-pg.yml
  • .gitignore
  • tests/run-multi-pg-installcheck.sh
✅ Files skipped from review due to trivial changes (1)
  • .gitignore

Comment on lines +52 to +53
- name: Checkout spock
uses: actions/checkout@v4

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟠 Major | ⚡ Quick win

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash
set -euo pipefail

for repo in actions/checkout actions/upload-artifact; do
  type="$(gh api "repos/${repo}/git/ref/tags/v4" --jq '.object.type')"
  sha="$(gh api "repos/${repo}/git/ref/tags/v4" --jq '.object.sha')"

  if [ "$type" = tag ]; then
    sha="$(gh api "repos/${repo}/git/tags/${sha}" --jq '.object.sha')"
  fi

  printf '%s@v4 -> %s\n' "$repo" "$sha"
done

Repository: pgEdge/spock

Length of output: 191


🏁 Script executed:

#!/bin/bash
set -euo pipefail

FILE=".github/workflows/installcheck-multi-pg.yml"

echo "== File excerpt around lines 45-85 =="
nl -ba "$FILE" | sed -n '45,85p'

echo
echo "== All uses of `@v4` in that workflow =="
rg -n "uses:\s*.*`@v4`" "$FILE" || true

echo
echo "== All uses of actions/checkout@v4 and actions/upload-artifact@v4 =="
rg -n "uses:\s*actions/(checkout|upload-artifact)`@v4`" "$FILE" || true

Repository: pgEdge/spock

Length of output: 155


🏁 Script executed:

#!/bin/bash
set -euo pipefail

FILE=".github/workflows/installcheck-multi-pg.yml"

echo "== File excerpt around lines 45-85 =="
awk 'NR>=45 && NR<=85 {printf "%5d %s\n", NR, $0}' "$FILE"

echo
echo "== All uses of `@v4` in that workflow =="
rg -n "uses:\s*.*`@v4`" "$FILE" || true

echo
echo "== All uses of actions/checkout@v4 and actions/upload-artifact@v4 =="
rg -n "uses:\s*actions/(checkout|upload-artifact)`@v4`" "$FILE" || true

Repository: pgEdge/spock

Length of output: 1850


Pin GitHub Actions versions to immutable SHAs.

This workflow uses mutable tags:

  • Line 53: actions/checkout@v4actions/checkout@34e114876b0b11c390a56381ad16ebd13914f8d5
  • Line 73: actions/upload-artifact@v4actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02
🧰 Tools
🪛 zizmor (1.25.2)

[warning] 52-53: credential persistence through GitHub Actions artifacts (artipacked): does not set persist-credentials: false

(artipacked)


[error] 53-53: unpinned action reference (unpinned-uses): action is not pinned to a hash (required by blanket policy)

(unpinned-uses)

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In @.github/workflows/installcheck-multi-pg.yml around lines 52 - 53, The
workflow uses mutable action tags; replace the tagged usages of the actions with
the provided immutable commit SHAs so the steps "uses: actions/checkout@v4" and
"uses: actions/upload-artifact@v4" are updated to "uses:
actions/checkout@34e114876b0b11c390a56381ad16ebd13914f8d5" and "uses:
actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02" respectively
to pin the actions to immutable SHAs.

Comment thread tests/run-multi-pg-installcheck.sh Outdated
Comment on lines +338 to +342
if [ -d "${src}/.git" ] \
&& [ -f "${src}/src/test/regress/parallel_schedule" ]; then
log "${node}: [pg-clone] PG${ver} source already present, skipping"
return 0
fi

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟠 Major | ⚡ Quick win

--force does not actually force a fresh source+patch state.

clone_pg() still reuses an existing checkout, and patch_pg() still skips when the marker exists, so --force only rebuilds binaries from whatever source tree was already on disk. That contradicts the CLI contract and can silently test stale PostgreSQL or stale patches.

🛠️ Proposed fix
 clone_pg() {
 	local ver="$1"
 	local node; node="$(ver_to_node "${ver}")"
 	local branch="REL_${ver}_STABLE"
 	local src; src="$(src_for "${ver}")"
 
+	if [ "${FORCE_REBUILD}" -ne 0 ]; then
+		rm -rf "${src}"
+	fi
+
 	if [ -d "${src}/.git" ] \
 		&& [ -f "${src}/src/test/regress/parallel_schedule" ]; then
 		log "${node}: [pg-clone] PG${ver} source already present, skipping"
 		return 0
 	fi
@@
 patch_pg() {
 	local ver="$1"
 	local node; node="$(ver_to_node "${ver}")"
 	local src;  src="$(src_for "${ver}")"
 	local patch_dir="${SPOCK_SRC}/patches/${ver}"
 
-	if [ -f "${src}/.spock-patches-applied" ]; then
+	if [ "${FORCE_REBUILD}" -eq 0 ] && [ -f "${src}/.spock-patches-applied" ]; then
 		log "${node}: [pg-patch] patches already applied (marker present), skipping"
 		return 0
 	fi
 	run_phase "${node}" pg-patch _do_patch_pg "${ver}" "${src}" "${patch_dir}"
 }

Also applies to: 390-394

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/run-multi-pg-installcheck.sh` around lines 338 - 342, The --force flag
is currently not forcing a fresh source+patch because clone_pg() reuses existing
checkouts and patch_pg() skips when a marker exists; update both to honor FORCE
by having clone_pg() delete or reclone the ${src} directory (or run a fresh git
clone) when FORCE/--force is set so it cannot reuse a stale checkout, and have
patch_pg() remove or ignore the existing marker (the patch-applied sentinel used
there) when FORCE is set so patches are re-applied; modify the logic around
clone_pg() and patch_pg() to check the FORCE variable and remove the existing
source tree and marker before proceeding.

danolivo added 3 commits June 15, 2026 14:00
The auto-DDL path in spock_auto_replicate_ddl() interpolated the
search_path GUC value into the queued command with a bare "%s", guarded
only by strlen() > 0.  That is unsafe because GetConfigOptionByName()
does not always return something that is valid SQL on its own.

Fix this issue. Change error message on missed entry in spock cache:
highlight the fact that the root of the issue might be schema mismatch.
Auto-DDL ships the raw command text to be re-parsed and executed on
subscribers.  A permanent table built from a temporary one -
CREATE TABLE ... (LIKE temp) or CREATE TABLE ... AS SELECT ... FROM temp
- would therefore ship a command that cannot run on the subscriber,
where the temporary relation does not exist, stalling the apply worker.

Detect those cases in spock_auto_replicate_ddl() and skip replication
with an explanatory message instead:

  - CREATE TABLE ... (LIKE ...): by the time auto-DDL runs (post
    execution) transformCreateStmt() has already expanded the LIKE
    clause out of tableElts, so the executed parse tree no longer shows
    the source relation.  Re-parse the raw command text -- the exact
    text we would ship and the subscriber would re-parse -- which still
    carries the LIKE clause, and resolve each source against the
    catalogue.

  - CREATE TABLE AS: walk the analysed defining query with the existing
    isQueryUsingTempRelation(), which catches references at any nesting
    level.

spock_auto_replicate_ddl() now returns whether the statement was
queued, and spock_autoddl_process() adds the relation to the
replication set only when it was.  Otherwise a table whose CREATE was
not replicated would have its later DML fail to apply downstream.

Note this detects only relations named directly in the statement; a
temporary relation reached indirectly (e.g. through a function body)
cannot be detected statically.

Add regression coverage for LIKE-of-temp, CTAS-of-temp (including a
nested subquery), and a permanent-LIKE sanity case.
Add a self-contained test rig (tests/run-single-pg18-installcheck.sh)
that builds PostgreSQL REL_18_STABLE and the Spock extension once, wires
three single-node PG18 clusters into a full Spock mesh (6 subscriptions,
exception_behaviour='discard', auto-DDL on), runs `make installcheck`
against one node as a stress workload, and asserts that every
subscription stays enabled and that spock.sync_event() round-trips on
every directed edge.
@danolivo danolivo changed the title tests: add multi-PG mesh installcheck tests: add single-PG mesh installcheck Jun 16, 2026
@danolivo danolivo changed the title tests: add single-PG mesh installcheck Add single-PG mesh installcheck Jun 16, 2026
@danolivo danolivo changed the title Add single-PG mesh installcheck Skip auto-DDL replication for commands referencing temporary relations Jun 16, 2026
@danolivo danolivo added the bug Something isn't working label Jun 16, 2026

@coderabbitai coderabbitai Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 2

♻️ Duplicate comments (1)
.github/workflows/installcheck-single-pg18.yml (1)

51-52: 🛠️ Refactor suggestion | 🟠 Major | ⚡ Quick win

Unpinned GitHub Actions references. Both action references use mutable tags (@v4) instead of immutable commit SHAs, creating a supply-chain security risk. The checkout action should also set persist-credentials: false.

  • .github/workflows/installcheck-single-pg18.yml#L51-L52: Pin actions/checkout@v4 to actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 and add persist-credentials: false.
  • .github/workflows/installcheck-single-pg18.yml#L72-L72: Pin actions/upload-artifact@v4 to actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02.
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In @.github/workflows/installcheck-single-pg18.yml around lines 51 - 52, Replace
mutable GitHub Actions tags with immutable commit SHAs to eliminate supply-chain
security risks. At .github/workflows/installcheck-single-pg18.yml lines 51-52,
update the actions/checkout action reference from `@v4` to the pinned commit SHA
and add persist-credentials: false to the checkout step to prevent storing
credentials. At .github/workflows/installcheck-single-pg18.yml lines 72-72,
update the actions/upload-artifact action reference from `@v4` to its pinned
commit SHA.

Source: Linters/SAST tools

🤖 Prompt for all review comments with AI agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/spock_functions.c`:
- Around line 2363-2370: The query_temp_like_source() function only checks
top-level CreateStmt nodes by iterating through results of
pg_parse_query(query), but when the original query is wrapped in EXPLAIN
ANALYZE, the top-level statement is ExplainStmt rather than CreateStmt, causing
the CREATE TABLE ... LIKE detection to be missed. At the recursion point around
line 2644-2646, the original query text (still containing EXPLAIN wrapper) is
being passed to query_temp_like_source(), which fails to detect the nested
CREATE TABLE ... LIKE statement. Strip the EXPLAIN wrapper from the query before
passing it to the recursive query_temp_like_source() call at line 2644-2646 so
that the function can properly identify and skip temp-source CREATE statements
at all nesting levels.

In `@tests/run-single-pg18-installcheck.sh`:
- Line 717: The variable `${out}` in the command substitution within the log
statement needs to be double-quoted to prevent word splitting and globbing
issues. Wrap `${out}` with double quotes in the echo command that pipes to tr to
ensure the variable is treated as a single word and special characters are
properly handled.

---

Duplicate comments:
In @.github/workflows/installcheck-single-pg18.yml:
- Around line 51-52: Replace mutable GitHub Actions tags with immutable commit
SHAs to eliminate supply-chain security risks. At
.github/workflows/installcheck-single-pg18.yml lines 51-52, update the
actions/checkout action reference from `@v4` to the pinned commit SHA and add
persist-credentials: false to the checkout step to prevent storing credentials.
At .github/workflows/installcheck-single-pg18.yml lines 72-72, update the
actions/upload-artifact action reference from `@v4` to its pinned commit SHA.
🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

  • Push a commit to this branch (recommended)
  • Create a new PR with the fixes

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 15678d0c-1949-48e1-b7b3-83eb783f731f

📥 Commits

Reviewing files that changed from the base of the PR and between ed892a8 and 87616fa.

⛔ Files ignored due to path filters (2)
  • tests/regress/expected/autoddl.out is excluded by !**/*.out
  • tests/regress/expected/exception_row_capture.out is excluded by !**/*.out
📒 Files selected for processing (8)
  • .github/workflows/installcheck-single-pg18.yml
  • .gitignore
  • include/spock.h
  • src/spock_autoddl.c
  • src/spock_functions.c
  • src/spock_relcache.c
  • tests/regress/sql/autoddl.sql
  • tests/run-single-pg18-installcheck.sh
✅ Files skipped from review due to trivial changes (1)
  • .gitignore

Comment thread src/spock_functions.c
Comment on lines +2363 to +2370
foreach(rawlc, pg_parse_query(query))
{
RawStmt *raw = lfirst_node(RawStmt, rawlc);
CreateStmt *cstmt;
ListCell *cell;

if (!IsA(raw->stmt, CreateStmt))
continue;

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟠 Major | ⚡ Quick win

EXPLAIN-wrapped CREATE TABLE ... LIKE temp can bypass temp-source skip logic.

At Line 2644 recursion keeps the original query text (still EXPLAIN ...), but query_temp_like_source() (Line 2363 onward) only checks top-level CreateStmt nodes. This can miss CREATE TABLE ... LIKE inside EXPLAIN ANALYZE, causing unsafe DDL to be queued and fail on subscribers.

💡 Suggested fix
-		if (!IsA(raw->stmt, CreateStmt))
-			continue;
-
-		cstmt = (CreateStmt *) raw->stmt;
+		Node *stmt = raw->stmt;
+
+		/* Unwrap EXPLAIN wrapper in raw parse trees. */
+		if (IsA(stmt, ExplainStmt))
+			stmt = castNode(ExplainStmt, stmt)->query;
+
+		if (!IsA(stmt, CreateStmt))
+			continue;
+
+		cstmt = castNode(CreateStmt, stmt);

Also applies to: 2644-2646

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/spock_functions.c` around lines 2363 - 2370, The query_temp_like_source()
function only checks top-level CreateStmt nodes by iterating through results of
pg_parse_query(query), but when the original query is wrapped in EXPLAIN
ANALYZE, the top-level statement is ExplainStmt rather than CreateStmt, causing
the CREATE TABLE ... LIKE detection to be missed. At the recursion point around
line 2644-2646, the original query text (still containing EXPLAIN wrapper) is
being passed to query_temp_like_source(), which fails to detect the nested
CREATE TABLE ... LIKE statement. Strip the EXPLAIN wrapper from the query before
passing it to the recursive query_temp_like_source() call at line 2644-2646 so
that the function can properly identify and skip temp-source CREATE statements
at all nesting levels.

|| { any_bad=1; log "${node}: NOT reachable -- treating as failure"; continue; }
if [ -n "${out}" ]; then
any_bad=1
log "${node}: DISABLED subscriptions: $(echo ${out} | tr '\n' ' ')"

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

Quote variable to prevent word splitting.

The ${out} variable should be double-quoted to prevent word splitting and globbing issues.

🛠️ Suggested fix
-			log "${node}: DISABLED subscriptions: $(echo ${out} | tr '\n' ' ')"
+			log "${node}: DISABLED subscriptions: $(echo "${out}" | tr '\n' ' ')"
🧰 Tools
🪛 Shellcheck (0.11.0)

[info] 717-717: Double quote to prevent globbing and word splitting.

(SC2086)

🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/run-single-pg18-installcheck.sh` at line 717, The variable `${out}` in
the command substitution within the log statement needs to be double-quoted to
prevent word splitting and globbing issues. Wrap `${out}` with double quotes in
the echo command that pipes to tr to ensure the variable is treated as a single
word and special characters are properly handled.

Source: Linters/SAST tools

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working enhancement New feature or request skip-test-nightly Skip this PR in the nightly TAP workflow

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant