feat(phase2): Gazetteers INSEE (36K prénoms + 34K communes) + silver annotations
- Prénoms INSEE renforcent la confiance NER (prénom connu → ne pas filtrer) - Communes INSEE disponibles pour distinction ville/nom de famille - Export 29 fichiers silver annotations (252K tokens, 12.8K entités) pour fine-tuning Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This commit is contained in:
33813
data/insee/communes_france.txt
Normal file
33813
data/insee/communes_france.txt
Normal file
File diff suppressed because it is too large
Load Diff
36112
data/insee/prenoms_france.txt
Normal file
36112
data/insee/prenoms_france.txt
Normal file
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user