fix: Selective global propagation v2 - Date normalization + multi-pass
- Aggressive date normalization: generates 4 variants (/, ., -, spaces)
- Multi-pass replacement: with and without the 'Né(e) le' context
- Improved force_term: case-insensitive + word boundaries
- Post-anonymization validation tool
- Tests: 162 CROs, 0 date leaks, 0 CHCB leaks (100% success)
- Time: 0.1 s/doc

Fixes the 36 CROs with leaks identified in the initial audit.
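The normalization and multi-pass steps listed above can be sketched roughly as follows. This is a minimal illustration, not the code from this commit: the function names `date_variants`, `replace_birth_date`, and `force_term_sketch`, their signatures, and the `[ETABLISSEMENT]` placeholder are all hypothetical. Only the four separators, the 'Né(e) le' context, and the `[DATE_NAISSANCE]` placeholder come from the commit message and the audit file.

```python
import re

# Separators taken from the commit message: slash, space, dot, hyphen.
SEPARATORS = ["/", " ", ".", "-"]


def date_variants(date_str):
    """Return the same DD MM YYYY date written with each separator (hypothetical helper)."""
    parts = re.split(r"[/.\-\s]+", date_str.strip())
    if len(parts) != 3:
        return [date_str]  # not a three-part date: leave it untouched
    return [sep.join(parts) for sep in SEPARATORS]


def replace_birth_date(text, date_str, placeholder="[DATE_NAISSANCE]"):
    """Two-pass replacement: first with the 'Né(e) le' context, then the bare date."""
    variants = "|".join(re.escape(v) for v in date_variants(date_str))
    # Pass 1: the date together with its context ("né le", "née le", "Né(e) le").
    with_context = re.compile(
        r"\bn[ée]\(?e?\)?\s+le\s+(?:" + variants + r")(?!\d)", re.IGNORECASE
    )
    text = with_context.sub(placeholder, text)
    # Pass 2: any remaining bare occurrence, not glued to other digits.
    bare = re.compile(r"(?<!\d)(?:" + variants + r")(?!\d)")
    return bare.sub(placeholder, text)


def force_term_sketch(text, term, placeholder):
    """Case-insensitive, whole-word replacement of a forced term (assumed behaviour)."""
    pattern = re.compile(r"(?<!\w)" + re.escape(term) + r"(?!\w)", re.IGNORECASE)
    return pattern.sub(placeholder, text)
```

Running the context pass first lets the whole phrase collapse into one placeholder, which matches the audit entry where `né le 29/08/1939` maps to `[DATE_NAISSANCE]`. The bare pass then catches the same date elsewhere in the document, whatever separator was used.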
19
tests/ground_truth/pdfs/test_all_cro/614 CRO.audit.jsonl
Normal file
@@ -0,0 +1,19 @@
{"page": 0, "kind": "DATE_NAISSANCE", "original": "né le 29/08/1939", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "Pierre BRUNETEAU", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "Pierre Lou CUCUPHAT", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "BLANGIS On", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "RPPS", "original": "10100981090", "placeholder": "[RPPS]", "bbox_hint": null}
{"page": 0, "kind": "TEL", "original": "05 59 44 35 12", "placeholder": "[TEL]", "bbox_hint": null}
{"page": 0, "kind": "RPPS", "original": "10002828365", "placeholder": "[RPPS]", "bbox_hint": null}
{"page": 0, "kind": "TEL", "original": "05 59 44 40 84", "placeholder": "[TEL]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "Pierre BRUNETEAU", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "RPPS", "original": "10107546912", "placeholder": "[RPPS]", "bbox_hint": null}
{"page": 0, "kind": "TEL", "original": "05 59 44 40 59", "placeholder": "[TEL]", "bbox_hint": null}
{"page": 0, "kind": "RPPS", "original": "10102402095", "placeholder": "[RPPS]", "bbox_hint": null}
{"page": 0, "kind": "TEL", "original": "05 59 44 35 17", "placeholder": "[TEL]", "bbox_hint": null}
{"page": 0, "kind": "RPPS", "original": "10004431168", "placeholder": "[RPPS]", "bbox_hint": null}
{"page": 0, "kind": "TEL", "original": "05 59 44 31 35", "placeholder": "[TEL]", "bbox_hint": null}
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "29/08/1939", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "29 08 1939", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "29.08.1939", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "29-08-1939", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
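A post-anonymization check against an audit file like the one above might look like the sketch below. It assumes one JSON object per line with the `kind` and `original` fields shown in the diff; the function name `find_leaks` and the plain substring comparison are illustrative assumptions, not the validation tool shipped in this commit.

```python
import json


def find_leaks(audit_jsonl, anonymized_text):
    """Return (kind, original) pairs whose original value still appears in the text."""
    leaks = []
    haystack = anonymized_text.casefold()
    for line in audit_jsonl.splitlines():
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        entry = json.loads(line)
        # Case-insensitive substring check: any surviving original counts as a leak.
        if entry["original"].casefold() in haystack:
            leaks.append((entry["kind"], entry["original"]))
    return leaks
```

An empty result for every document would correspond to the "0 leaks" figures reported in the commit message, under the assumption that the audit file lists every value that had to be removed.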