- Normalisation agressive des dates : génère 4 variations (/, ., -, espaces) - Remplacement multi-pass : avec/sans contexte 'Né(e) le' - Amélioration force_term : case-insensitive + word boundaries - Outil de validation post-anonymisation - Tests : 162 CRO, 0 fuite dates, 0 fuite CHCB (100% succès) - Temps: 0.1s/doc Résout les 36 CRO avec fuites identifiées dans l'audit initial.
14 lines
1.4 KiB
JSON
14 lines
1.4 KiB
JSON
{"page": 0, "kind": "NOM", "original": "Julie BARDOU", "placeholder": "[NOM]", "bbox_hint": null}
|
|
{"page": 0, "kind": "ADRESSE", "original": "39 BD ALSACE LORRAINE", "placeholder": "[ADRESSE]", "bbox_hint": null}
|
|
{"page": 0, "kind": "CODE_POSTAL", "original": "64100 BAYONNE", "placeholder": "[CODE_POSTAL]", "bbox_hint": null}
|
|
{"page": 0, "kind": "NOM", "original": "JEAN-PIERRE ARTIGOLLES", "placeholder": "[NOM]", "bbox_hint": null}
|
|
{"page": 0, "kind": "DATE_NAISSANCE", "original": "Né le 20/11/1949", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
|
|
{"page": 0, "kind": "NOM", "original": "Clément KLEIN", "placeholder": "[NOM]", "bbox_hint": null}
|
|
{"page": 0, "kind": "NOM", "original": "Yann LAMMERTYN", "placeholder": "[NOM]", "bbox_hint": null}
|
|
{"page": 0, "kind": "NOM", "original": "Clément KLEIN", "placeholder": "[NOM]", "bbox_hint": null}
|
|
{"page": 0, "kind": "NOM", "original": "Yann LAMMERTYN", "placeholder": "[NOM]", "bbox_hint": null}
|
|
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "20-11-1949", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
|
|
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "20.11.1949", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
|
|
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "20 11 1949", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
|
|
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "20/11/1949", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
|