CC Data Integrity 49ac2c0dc8 Data integrity sweep: clanovi clean — 3 HNS dups merged, 1 trim normalized, 4 constraints active
Subagent A: merged 3 HNS profile/roster duplicate pairs (3243 → 3240 rows).
  Authoritative row chosen by preferring the /igraci/ source_url over the /klubovi/ roster scrape.
  Manuel Boras Mandić (id=481) reconciled — pozicija=Vratar, hns_igrac_id=436387.
Subagent B: 1 trim auto-applied (id=634); 4 ALL CAPS held for manual review.
Subagent C: 0 strict cross-klub transfers; 56 soft groups in review queue.
Subagent D: 4 constraints applied (no_camelcase, trimmed, hns_uniq partial, normalize trigger);
  2 skipped (length>=2 — 22 historical violators; klub+name+dob unique — 68 NULL-DOB groups).
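
Illustrative sketch only, not the code the subagents ran: one way the Subagent B
triage above (auto-apply trims, hold ALL CAPS names for manual review) could be
expressed. The function names, the (id, name) row shape, and the sample names are
hypothetical; "trim" is assumed to mean stripping leading and trailing whitespace.

```python
def is_all_caps(name: str) -> bool:
    """True when the name contains letters and every letter is upper-case."""
    letters = [ch for ch in name if ch.isalpha()]
    return bool(letters) and all(ch.isupper() for ch in letters)


def triage(rows):
    """Split (id, name) rows into auto-fixable trims and rows held for review.

    Assumption: ALL CAPS names are never auto-fixed, because the correct
    casing cannot be inferred safely; they go to the manual review list.
    """
    auto_fix, held = [], []
    for row_id, name in rows:
        cleaned = name.strip()
        if is_all_caps(cleaned):
            held.append((row_id, name))
        elif cleaned != name:
            auto_fix.append((row_id, cleaned))
    return auto_fix, held


# Hypothetical sample rows, not taken from pgz_sport.clanovi.
sample = [(1, " Ivan Horvat "), (2, "MARKO MARIĆ"), (3, "Ana Kovač")]
auto_fix, held = triage(sample)
```

Holding ALL CAPS rows rather than title-casing them keeps the automatic pass
limited to changes that cannot alter a name's meaning.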

Backup: pgz_sport.clanovi_backup_20260505_0836 (3243 rows untouched).
Audit: 5 sys_audit rows (3 PURGE, 1 NORMALIZE, 1 C_DETECTION_RUN).
Smoke: 5/5 endpoints 200; HNK Lovran 31 → 30 clanovi confirmed.

Full report: _audit/data_integrity_20260505_0836/CONSOLIDATED.md

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-05 09:08:35 +02:00