high priority low complexity testing pending testing specialist Tier 3

Acceptance Criteria

Unit test confirms spacing token resolves to exactly 8.0 logical pixels at textScaleFactor 1.0
Widget test in Column mode measures the vertical gap between two adjacent children and asserts it is >= 8.0 pt using tester.getRect()
Widget test in Wrap mode confirms horizontal gap between adjacent children is >= 8.0 pt
Widget renders without overflow errors with 2, 5, and 10 child widgets in both Column and Wrap modes
Golden test at devicePixelRatio 1.0 matches committed baseline image
Golden test at devicePixelRatio 2.0 matches committed baseline image
All tests pass on flutter test --update-goldens baseline generation and subsequent runs

Technical Requirements

frameworks
Flutter
flutter_test
performance requirements
Golden image generation must complete within 30 seconds
Layout tests must not require real device rendering
ui components
InteractiveControlSpacingSystem

Execution Context

Execution Tier
Tier 3

Tier 3 - 413 tasks

Can start after Tier 2 completes

Implementation Notes

Measure gaps by calling tester.getRect() on consecutive children and computing (nextChild.top - previousChild.bottom) for Column or (nextChild.left - previousChild.right) for Wrap. Use SizedBox children of fixed size for deterministic geometry. For golden tests, use a fixed 400x800 surface size to keep baseline images consistent across machines. Register goldens in a separate test group so they can be skipped in fast local runs with --tags.

Testing Requirements

Widget tests for layout geometry (use tester.getRect() to measure gaps between child Rect boundaries). Golden tests using matchesGoldenFile for visual regression at 1x and 2x DPR — set via TestWidgetsFlutterBinding.setSurfaceSize and window.devicePixelRatio. Parameterize child count tests with a list [2, 5, 10] to avoid duplication. Store golden files in test/goldens/control_spacing/.

Run golden tests in CI with --update-goldens only on intentional design changes.

Epic Risks (4)
medium impact high prob integration

Flutter's textScaleFactor behaviour differs between iOS and Android, and third-party widgets used across the app (date pickers, bottom sheets, chips) may not respect the per-role scale caps applied by the dynamic-type-scale-service, causing overflow in screens this epic cannot directly control.

Mitigation & Contingency

Mitigation: Enumerate all third-party widget usages that render text. For each, verify whether they honour the inherited DefaultTextStyle and MediaQuery.textScaleFactor or use hardcoded sizes. File issues with upstream packages and wrap non-compliant widgets in MediaQuery overrides scoped to the safe cap for that role.

Contingency: If upstream packages cannot be patched within the sprint, implement a global MediaQuery wrapper at the app root that clamps textScaleFactor to the highest per-role safe value (typically 1.6–2.0), accepting that users at extreme OS scales see a safe cap rather than full scaling for those widgets.

high impact medium prob dependency

The CI accessibility lint runner depends on the Dart CLI toolchain and potentially custom_lint or a bespoke Dart script. CI environments differ from local dev environments in Dart SDK version, pub cache configuration, and platform availability, risking intermittent CI failures that block all pull requests.

Mitigation & Contingency

Mitigation: Pin the Dart SDK version in the CI workflow configuration. Package the lint runner as a self-contained Dart script with all dependencies vendored or declared in a dedicated pubspec.yaml. Add a CI smoke test that runs the runner against a known-compliant fixture and a known-violating fixture to verify the exit codes are correct.

Contingency: If the custom runner proves too fragile, fall back to running dart analyze with the flutter-accessibility-lint-config rules as the sole CI gate, and schedule the custom manifest validation as a separate non-blocking advisory check until the runner is stabilised.

medium impact medium prob technical

Wrapping all interactive widgets with a 44 pt minimum hit area via HitTestBehavior.opaque may cause unintended tap interception in widgets where interactive elements are closely stacked, particularly in the expense type selector, bulk confirmation screen, and notification filter bar.

Mitigation & Contingency

Mitigation: Conduct integration testing of the touch target wrapper specifically in dense layout scenarios (expense selector, filter bars, bottom sheets with multiple buttons). Use the Flutter Inspector to visualise hit areas and confirm no overlaps. Pair with the interactive-control-spacing-system to ensure minimum 8 dp gaps between expanded hit areas.

Contingency: If overlapping hit areas cause mis-tap regressions in specific screens, allow the touch target wrapper to accept an explicit hitAreaSize parameter that can be reduced below 44 pt only in contexts where the interactive-control-spacing-system guarantees sufficient gap, with a mandatory code review flag for any such override.

high impact medium prob scope

The contrast-safe-color-palette must guarantee WCAG AA ratios for both light and dark mode token sets. Dark mode color derivation is non-trivial — simply inverting a light palette often produces pairs that pass in one mode but fail in the other, and the token manifest must encode both sets explicitly.

Mitigation & Contingency

Mitigation: Define both light and dark token sets explicitly in the accessibility-token-manifest rather than deriving one from the other programmatically. Run the contrast-ratio-validator against both sets as part of the token manifest generation process and include both in the CI lint runner's validation scope.

Contingency: If time pressure forces a dark mode deferral, ship with light mode only and add a prominent in-app notice. Gate dark mode colour tokens behind a feature flag until the full dual-palette validation is complete.