- Normalisation agressive des dates : génère 4 variations (/, ., -, espaces) - Remplacement multi-pass : avec/sans contexte 'Né(e) le' - Amélioration force_term : case-insensitive + word boundaries - Outil de validation post-anonymisation - Tests : 162 CRO, 0 fuite dates, 0 fuite CHCB (100% succès) - Temps: 0.1s/doc Résout les 36 CRO avec fuites identifiées dans l'audit initial.
13 lines
1.3 KiB
JSON
13 lines
1.3 KiB
JSON
{"page": 0, "kind": "NIR", "original": "244026402401863", "placeholder": "[NIR]", "bbox_hint": null}
|
|
{"page": 0, "kind": "DATE_NAISSANCE", "original": "Date de naissance: 29/02/1944", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
|
|
{"page": 0, "kind": "IPP", "original": "04021061", "placeholder": "[IPP]", "bbox_hint": null}
|
|
{"page": 0, "kind": "EPISODE", "original": "N° Episode 23066847", "placeholder": "[EPISODE]", "bbox_hint": null}
|
|
{"page": 1, "kind": "IPP", "original": "04021061", "placeholder": "[IPP]", "bbox_hint": null}
|
|
{"page": 1, "kind": "EPISODE", "original": "N° Episode 23066847", "placeholder": "[EPISODE]", "bbox_hint": null}
|
|
{"page": -1, "kind": "NIR_GLOBAL", "original": "244026402401863", "placeholder": "[NIR]", "bbox_hint": null}
|
|
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "29-02-1944", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
|
|
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "29 02 1944", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
|
|
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "29/02/1944", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
|
|
{"page": -1, "kind": "DATE_NAISSANCE_GLOBAL", "original": "29.02.1944", "placeholder": "[DATE_NAISSANCE]", "bbox_hint": null}
|
|
{"page": -1, "kind": "IPP_GLOBAL", "original": "04021061", "placeholder": "[IPP]", "bbox_hint": null}
|