test_timeline_ancestor_detach_idempotent_success: increase timeout #10464

erikgrinaker · 2025-01-21T15:23:54Z

Consider increasing the timeout here, unless there are signs of actual problems.

See https://neonprod.grafana.net/d/fddp4rvg7k2dcf/regression-test-failures?orgId=1&var-test_name=test_timeline_ancestor_detach_idempotent_success%5Bshards_initial_after2%5D&from=now-7d&to=now&timezone=utc&var-restrict=true&var-max_count=100&var-reference=$__all&var-ignore_reference=refs%2Fpull%2F0000%2Fmerge

test_runner/regress/test_timeline_detach_ancestor.py:639: in test_timeline_ancestor_detach_idempotent_success
    env.storage_controller.reconcile_until_idle()
test_runner/fixtures/neon_fixtures.py:2091: in reconcile_until_idle
    raise RuntimeError("Timeout in reconcile_until_idle")
E   RuntimeError: Timeout in reconcile_until_idle

The text was updated successfully, but these errors were encountered:

arpad-m · 2025-01-23T14:40:02Z

Hmm yeah looking at the storcon logs of this run, filtering for "Applying optimization", it seems to migrate the shards one by one from one location to the other, so there is a constant progress, and not a hang. Same goes for two other flaky failures I looked at.

So increasing the timeout makes the most sense I think.

Sometimes, especially when the host running the tests is overloaded, we can run into reconcile timeouts in `test_timeline_ancestor_detach_idempotent_success`, making the test flaky. By increasing the timeouts from 30 seconds to 120 seconds, we can address the flakiness. Fixes #10464

erikgrinaker added a/test/flaky Area: related to flaky tests c/storage Component: storage labels Jan 21, 2025

erikgrinaker assigned arpad-m Jan 21, 2025

arpad-m mentioned this issue Jan 23, 2025

Increase reconciler timeout after shard split #10490

Merged

arpad-m closed this as completed in #10490 Jan 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test_timeline_ancestor_detach_idempotent_success: increase timeout #10464

test_timeline_ancestor_detach_idempotent_success: increase timeout #10464

erikgrinaker commented Jan 21, 2025

arpad-m commented Jan 23, 2025

test_timeline_ancestor_detach_idempotent_success: increase timeout #10464

test_timeline_ancestor_detach_idempotent_success: increase timeout #10464

Comments

erikgrinaker commented Jan 21, 2025

arpad-m commented Jan 23, 2025