Priority: high | Complexity: medium | Area: backend | Status: pending | Assignee: backend specialist | Tier: 3

Acceptance Criteria

SuspectedDuplicate model maps all columns from the suspected_duplicates table: id, activity_a_id, activity_b_id, org_id, similarity_score, status, detected_at, resolved_at, resolved_by, resolution_notes
DuplicateDetectionConfig model maps the org_id, similarity_threshold, auto_flag_threshold, and enabled fields from the detection_config table
ActivityFingerprint model captures the computed fingerprint fields: activity_id, title_hash, date_bucket, duration_bucket, location_hash
All three models implement fromJson(Map<String, dynamic>) factory constructors and toJson() methods
Models use immutable fields (final) and provide copyWith() for state updates
DuplicateActivityDetectorRepository exposes: fetchPendingDuplicates(String orgId) → Future<List<SuspectedDuplicate>>
Repository exposes: updateReviewStatus(String duplicateId, DuplicateStatus status, String? notes) → Future<void>
Repository exposes: loadDetectionConfig(String orgId) → Future<DuplicateDetectionConfig?>
All repository methods use the injected Supabase client — no static SupabaseClient.instance calls
Repository methods throw typed exceptions (DuplicateRepositoryException) on Supabase errors, not raw PostgrestException
Null safety is fully respected — no dynamic casts, no ! operators on potentially null fields
DuplicateStatus is a Dart enum with values: pending, confirmedDuplicate, falsePositive
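
The enum and one of the models above could be hand-rolled along these lines. This is a minimal sketch: the snake_case DB strings ('confirmed_duplicate', 'false_positive') are assumptions about what the status column stores, and copyWith is omitted for brevity.

```dart
enum DuplicateStatus {
  pending,
  confirmedDuplicate,
  falsePositive;

  /// Safely parses a DB value; unknown or null values fall back to [pending].
  static DuplicateStatus fromString(String? value) => switch (value) {
        'confirmed_duplicate' => DuplicateStatus.confirmedDuplicate,
        'false_positive' => DuplicateStatus.falsePositive,
        _ => DuplicateStatus.pending,
      };

  /// Serializes back to the assumed snake_case DB representation.
  String toDbString() => switch (this) {
        DuplicateStatus.pending => 'pending',
        DuplicateStatus.confirmedDuplicate => 'confirmed_duplicate',
        DuplicateStatus.falsePositive => 'false_positive',
      };
}

class SuspectedDuplicate {
  final String id;
  final String activityAId;
  final String activityBId;
  final String orgId;
  final double similarityScore;
  final DuplicateStatus status;
  final DateTime detectedAt;
  final DateTime? resolvedAt; // null while the pair is unresolved
  final String? resolvedBy;
  final String? resolutionNotes;

  const SuspectedDuplicate({
    required this.id,
    required this.activityAId,
    required this.activityBId,
    required this.orgId,
    required this.similarityScore,
    required this.status,
    required this.detectedAt,
    this.resolvedAt,
    this.resolvedBy,
    this.resolutionNotes,
  });

  factory SuspectedDuplicate.fromJson(Map<String, dynamic> json) =>
      SuspectedDuplicate(
        id: json['id'] as String,
        activityAId: json['activity_a_id'] as String,
        activityBId: json['activity_b_id'] as String,
        orgId: json['org_id'] as String,
        similarityScore: (json['similarity_score'] as num).toDouble(),
        status: DuplicateStatus.fromString(json['status'] as String?),
        detectedAt: DateTime.parse(json['detected_at'] as String),
        resolvedAt: json['resolved_at'] == null
            ? null
            : DateTime.parse(json['resolved_at'] as String),
        resolvedBy: json['resolved_by'] as String?,
        resolutionNotes: json['resolution_notes'] as String?,
      );

  Map<String, dynamic> toJson() => {
        'id': id,
        'activity_a_id': activityAId,
        'activity_b_id': activityBId,
        'org_id': orgId,
        'similarity_score': similarityScore,
        'status': status.toDbString(),
        'detected_at': detectedAt.toIso8601String(),
        'resolved_at': resolvedAt?.toIso8601String(),
        'resolved_by': resolvedBy,
        'resolution_notes': resolutionNotes,
      };
}
```

Note the `(json['similarity_score'] as num).toDouble()` cast: numeric columns can arrive from PostgREST as int or double, so casting through num avoids a runtime type error without resorting to dynamic.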

Technical Requirements

frameworks
Flutter
Supabase Flutter SDK (supabase_flutter)
Dart null safety
apis
Supabase PostgREST API (select, update)
Supabase RLS (policies from task-008 applied automatically)
data models
SuspectedDuplicate
DuplicateDetectionConfig
ActivityFingerprint
performance requirements
fetchPendingDuplicates must filter by status = 'pending' server-side — never fetch all rows and filter in Dart
Use .select() with explicit column list to avoid over-fetching
Repository calls must complete within 3 seconds on a standard mobile connection
security requirements
Repository must not expose raw SQL or table names to UI layer
Supabase client must be injected (not global singleton) to support testing with mock client
No sensitive fields (resolution_notes with PII) logged to console in production mode
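
The typed-exception requirement could be met with a small wrapper like the following sketch. `guard` is a hypothetical helper name, and the bare `catch` here stands in for catching PostgrestException from supabase_flutter in the real repository.

```dart
/// Typed exception surfaced to callers instead of raw Supabase errors.
class DuplicateRepositoryException implements Exception {
  final String message;
  final Object? cause; // original error, kept for debugging (never logged with PII)
  const DuplicateRepositoryException(this.message, [this.cause]);

  @override
  String toString() => 'DuplicateRepositoryException: $message';
}

/// Runs [op] and rethrows any failure as a DuplicateRepositoryException
/// carrying a readable, context-specific message.
Future<T> guard<T>(String context, Future<T> Function() op) async {
  try {
    return await op();
  } on DuplicateRepositoryException {
    rethrow; // already typed; avoid double-wrapping
  } catch (e) {
    throw DuplicateRepositoryException('$context failed', e);
  }
}
```

Each repository method then becomes a one-liner body wrapped in `guard('fetchPendingDuplicates', () => ...)`, which keeps the error-translation policy in one place.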

Execution Context

Execution Tier

Tier 3 (413 tasks)

Can start after Tier 2 completes

Implementation Notes

Follow the repository pattern already established in the codebase: inject SupabaseClient via the constructor. Place models in `lib/features/duplicate_detection/data/models/` and the repository in `lib/features/duplicate_detection/data/repositories/`. Use a Dart enum with string serialization helpers for DuplicateStatus so unknown values from the DB are handled safely (a `fromString` factory that falls back to `pending`). For the Supabase query in fetchPendingDuplicates, select the explicit column list rather than `*`, per the performance requirements: `.from('suspected_duplicates').select('id, activity_a_id, activity_b_id, org_id, similarity_score, status, detected_at, resolved_at, resolved_by, resolution_notes').eq('org_id', orgId).eq('status', 'pending').order('detected_at', ascending: false)`.
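
A sketch of the constructor-injected repository shape follows. `QueryClient` is a stand-in interface invented here so the example stays self-contained; the real class would take a SupabaseClient and use its query builder as shown in the note above, and would map each row through the model's fromJson instead of returning raw maps.

```dart
/// Stand-in for the Supabase client so this sketch is self-contained.
abstract class QueryClient {
  Future<List<Map<String, dynamic>>> select({
    required String table,
    required List<String> columns,
    required Map<String, Object> equals,
    String? orderBy,
    bool descending = false,
  });
}

class DuplicateActivityDetectorRepository {
  // Injected via constructor; never a global singleton.
  DuplicateActivityDetectorRepository(this._client);
  final QueryClient _client;

  // Explicit column list, mirroring the acceptance criteria.
  static const _columns = [
    'id', 'activity_a_id', 'activity_b_id', 'org_id', 'similarity_score',
    'status', 'detected_at', 'resolved_at', 'resolved_by', 'resolution_notes',
  ];

  /// Fetches pending rows with the status filter applied server-side;
  /// real code would deserialize each row into a SuspectedDuplicate.
  Future<List<Map<String, dynamic>>> fetchPendingDuplicates(String orgId) =>
      _client.select(
        table: 'suspected_duplicates',
        columns: _columns,
        equals: {'org_id': orgId, 'status': 'pending'},
        orderBy: 'detected_at',
        descending: true,
      );
}
```

Because the client arrives through the constructor, a test can pass any `QueryClient` implementation and assert on the filters it received, satisfying the mock-client requirement without touching a live backend.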

Use the freezed package for immutable models with copyWith if it is already used in the project; check existing model files for the pattern in use.

Testing Requirements

Write unit tests using flutter_test and a mock Supabase client (mockito or manual stub). Test cases must cover: (1) fetchPendingDuplicates returns correctly deserialized SuspectedDuplicate list, (2) fetchPendingDuplicates with empty result returns empty list without throwing, (3) updateReviewStatus sends correct status and notes payload, (4) loadDetectionConfig returns null when no config row exists for org, (5) any Supabase PostgrestException is rethrown as DuplicateRepositoryException with readable message. Also test all fromJson/toJson round-trips for all three model classes with representative fixture data. Target 90%+ line coverage on models and repository.
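
Test case (3) could take roughly this shape with a manual stub. This is an illustrative sketch: `StubUpdateClient` and the trimmed-down `ReviewRepository` are hypothetical names, plain asserts stand in for flutter_test's expect(), and the status is passed as its DB string for brevity.

```dart
/// Manual stub that records the last update payload it receives.
class StubUpdateClient {
  String? lastTable;
  Map<String, dynamic>? lastValues;
  Map<String, Object>? lastMatch;

  Future<void> update(String table, Map<String, dynamic> values,
      Map<String, Object> match) async {
    lastTable = table;
    lastValues = values;
    lastMatch = match;
  }
}

/// Minimal slice of the repository under test.
class ReviewRepository {
  ReviewRepository(this._client);
  final StubUpdateClient _client;

  Future<void> updateReviewStatus(
          String duplicateId, String status, String? notes) =>
      _client.update(
        'suspected_duplicates',
        {'status': status, 'resolution_notes': notes},
        {'id': duplicateId},
      );
}

Future<void> main() async {
  final client = StubUpdateClient();
  final repo = ReviewRepository(client);

  await repo.updateReviewStatus('dup-42', 'false_positive', 'same session');

  // In flutter_test these would be expect() calls.
  assert(client.lastTable == 'suspected_duplicates');
  assert(client.lastValues!['status'] == 'false_positive');
  assert(client.lastValues!['resolution_notes'] == 'same session');
  assert(client.lastMatch!['id'] == 'dup-42');
  print('updateReviewStatus payload test passed');
}
```

The same stub pattern covers case (2) (return an empty list and assert no throw) and case (5) (have the stub throw and assert the typed exception surfaces).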

Component
Duplicate Activity Detector
Category: infrastructure | Priority: high
Epic Risks (3)
Risk 1: technical | Impact: medium | Probability: high

Fingerprint-based similarity matching may produce high false-positive rates for common activity types (e.g., weekly group sessions with the same participants), causing alert fatigue among coordinators and undermining trust in the detection system.

Mitigation & Contingency

Mitigation: Start with conservative, high-confidence thresholds (exact peer mentor match + same date + same activity type) before adding looser fuzzy matching. Allow NHF administrators to tune thresholds based on observed false-positive rates. Log all detection decisions for retrospective threshold calibration.

Contingency: Introduce a snooze mechanism allowing coordinators to dismiss false positives for a configurable period. Track dismissal rates per activity type and automatically raise the similarity threshold for activity types with high dismissal rates.

Risk 2: technical | Impact: medium | Probability: medium

A database trigger on the activities insert path adds synchronous overhead to every activity registration. For HLF peer mentors with 380 annual registrations or coordinators doing bulk proxy registration, this could create perceptible latency or lock contention.

Mitigation & Contingency

Mitigation: Implement the trigger as a DEFERRED constraint trigger (fires after the transaction commits) or replace it with a LISTEN/NOTIFY pattern that queues detection work asynchronously via an Edge Function, completely decoupling detection from the registration write path.

Contingency: Disable the synchronous trigger entirely and rely solely on the scheduled Edge Function for batch detection. Accept a detection delay of up to the scheduling interval (e.g., 15 minutes) in exchange for zero impact on registration latency.

Risk 3: dependency | Impact: medium | Probability: medium

The duplicate detection logic must be validated and approved by NHF before go-live, including agreement on threshold values and the review workflow. NHF stakeholder availability for sign-off may delay this epic's release independently of technical readiness.

Mitigation & Contingency

Mitigation: Gate the feature behind the NHF-specific feature flag so technical deployment can proceed independently of business approval. Involve an NHF administrator in threshold calibration sessions during QA, reducing the formal sign-off surface to policy and workflow rather than technical details.

Contingency: Release the detection system in 'silent mode' — flagging duplicates internally without surfacing notifications to coordinators — until NHF approves the workflow. Use the silent period to collect real data on false-positive rates and refine thresholds before activating notifications.