Audit Log of All Duplicate Detection Events
Every time the duplicate detection system triggers a warning, the event should be logged with: the two record IDs involved, the coordinator who saw the warning, the resolution action taken (proceed, cancel, merge, mark distinct), a timestamp, and the chapter context. This log should be accessible from the coordinator's settings or admin area. During Bufdir export preparation, coordinators should be able to generate a summary of duplicate-reviewed records to include as supporting documentation.
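As a rough illustration of the fields listed above, a log entry could be modelled as an immutable record. This is a minimal sketch, not the actual schema: the names `DuplicateAuditEntry`, `ResolutionAction`, and `log_duplicate_event` are hypothetical, the in-memory list stands in for whatever storage the system uses, and only `PROCEED_WITH_DUPLICATE` is an identifier taken from the acceptance criteria (the other three spellings are assumed).

```python
from dataclasses import dataclass, asdict
from datetime import datetime, timezone
from enum import Enum


class ResolutionAction(str, Enum):
    # PROCEED_WITH_DUPLICATE appears in the acceptance criteria;
    # the remaining identifiers are assumed spellings of cancel/merge/mark distinct.
    PROCEED_WITH_DUPLICATE = "PROCEED_WITH_DUPLICATE"
    CANCEL = "CANCEL"
    MERGE = "MERGE"
    MARK_DISTINCT = "MARK_DISTINCT"


@dataclass(frozen=True)
class DuplicateAuditEntry:
    """One duplicate detection event (hypothetical shape)."""
    record_id_a: str
    record_id_b: str
    coordinator_id: str
    resolution: ResolutionAction
    chapter_id: str
    timestamp: datetime

    def to_row(self) -> dict:
        # Flatten to plain strings so the entry can be stored or exported.
        row = asdict(self)
        row["resolution"] = self.resolution.value
        row["timestamp"] = self.timestamp.isoformat()
        return row


def log_duplicate_event(log, record_id_a, record_id_b, coordinator_id,
                        resolution, chapter_id, now=None):
    """Append an entry to `log` (a plain list standing in for real storage)."""
    entry = DuplicateAuditEntry(
        record_id_a=record_id_a,
        record_id_b=record_id_b,
        coordinator_id=coordinator_id,
        resolution=resolution,
        chapter_id=chapter_id,
        # Default to the current UTC time when the caller supplies none.
        timestamp=now or datetime.now(timezone.utc),
    )
    log.append(entry)
    return entry
```

Freezing the dataclass reflects the intent that audit entries are written once and never edited; a real implementation would enforce that at the storage layer as well.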
User Story
Acceptance Criteria
- Given a coordinator dismisses a duplicate warning by selecting 'Submit Anyway', When the activity is saved, Then a log entry is written with the coordinator ID, both activity IDs, resolution action 'PROCEED_WITH_DUPLICATE', and timestamp
- Given a coordinator resolves a pair in the deduplication queue, When the resolution is saved, Then a log entry is written with both record IDs, the resolution action, resolver ID, and timestamp
- Given the coordinator opens the duplicate audit log, When they view the list, Then each entry shows a human-readable summary: date, peer mentor name (masked if sensitive), resolution action, and resolver name
- Given the coordinator is preparing a Bufdir export, When they access the export configuration, Then they can optionally append the duplicate audit log as a supplementary CSV
- Given a log entry exists for a record that was later deleted, When the coordinator views the audit log, Then the entry is preserved with a note that the referenced record was subsequently removed
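The viewing, export, and deleted-record criteria above could be sketched together as follows. This is an illustrative sketch under stated assumptions, not the intended implementation: `AuditRow`, `summarize`, and `export_csv` are hypothetical names, the `"***"` mask and the wording of the removal note are placeholders, and `existing_ids` stands in for a lookup against the live record store.

```python
import csv
import io
from dataclasses import dataclass
from datetime import datetime, timezone

FIELDS = ["date", "peer_mentor", "resolution", "resolver", "note"]


@dataclass(frozen=True)
class AuditRow:
    """Denormalized audit entry, as the log view might receive it (assumed shape)."""
    timestamp: datetime
    mentor_name: str
    mentor_sensitive: bool
    resolution: str
    resolver_name: str
    record_ids: tuple


def summarize(row, existing_ids):
    """Build the human-readable summary for one entry."""
    # An entry outlives its records: flag any that are no longer present.
    missing = [rid for rid in row.record_ids if rid not in existing_ids]
    if missing:
        note = "Referenced record subsequently removed: " + ", ".join(missing)
    else:
        note = ""
    return {
        "date": row.timestamp.date().isoformat(),
        # Mask the peer mentor name when it is marked sensitive.
        "peer_mentor": "***" if row.mentor_sensitive else row.mentor_name,
        "resolution": row.resolution,
        "resolver": row.resolver_name,
        "note": note,
    }


def export_csv(rows, existing_ids):
    """Render all entries as CSV text for the supplementary export file."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    for row in rows:
        writer.writerow(summarize(row, existing_ids))
    return buf.getvalue()
```

Deriving the removal note at read time, rather than rewriting the stored entry, keeps the log append-only while still telling the reader that a referenced record is gone.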
Business Value
Bufdir audits require organizations to demonstrate data integrity practices, not just clean data. A persistent, exportable audit trail proves that duplicate resolution is a deliberate, documented process rather than an absence of duplicates by chance. This directly supports NHF's compliance posture during annual grant reporting cycles and reduces the risk of funding disputes arising from data quality questions.
Components
- Duplicate Warning Event Logger (infrastructure)
- Duplicate Reviewed Flag Middleware (infrastructure)
- Duplicate Detection Service (service)
- Duplicate Check Repository (data)
- Export Data Query Builder (data)
- CSV Generation Service (service)