Files
anonymisation/tests/ground_truth/pdfs/test_all_cro/CRO 23030611.audit.jsonl
Domi31tls f92da4d54e fix: Propagation globale sélective v2 - Normalisation dates + Multi-pass
- Normalisation agressive des dates : génère 4 variations (/, ., -, espaces)
- Remplacement multi-pass : avec/sans contexte 'Né(e) le'
- Amélioration force_term : case-insensitive + word boundaries
- Outil de validation post-anonymisation
- Tests : 162 CRO, 0 fuite dates, 0 fuite CHCB (100% succès)
- Temps: 0.1s/doc

Résout les 36 CRO avec fuites identifiées dans l'audit initial.
2026-03-02 12:22:58 +01:00

23 lines
2.4 KiB
JSON

{"page": 0, "kind": "NOM", "original": "Emmanuel DUPLACEAU", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "CODE_POSTAL", "original": "64220 ST JEAN PIED DE PORT", "placeholder": "[CODE_POSTAL]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "Floriane MINNE", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "Thomas GRELLETY", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "CHRISTINE GILSOUL", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "Sophie GHECK", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "Jules ISERENTANT", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "AGE", "original": "patiente de 52 ans", "placeholder": "[AGE]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "Sophie GHECK", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "DATE_NAISSANCE", "original": "Née le : 17/02/1971", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
{"page": 0, "kind": "CODE_POSTAL", "original": "64220 ST JEAN PIED DE PORT\nDr Floriane MINNE", "placeholder": "[CODE_POSTAL]", "bbox_hint": null}
{"page": 0, "kind": "AGE", "original": "patiente de 52 ans", "placeholder": "[AGE]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "Emmanuel DUPLACEAU", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "Thomas GRELLETY", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "CHRISTINE GILSOUL", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "Sophie GHECK", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "Jules ISERENTANT", "placeholder": "[NOM]", "bbox_hint": null}
{"page": 0, "kind": "NOM", "original": "Sophie GHECK", "placeholder": "[NOM]", "bbox_hint": null}
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "17-02-1971", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "17 02 1971", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "17.02.1971", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "17/02/1971", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}