tests: alias DLBCL + garde-fou Trackare + e2e PDFs réels + gold CRH + benchmark enrichi

- 11 tests unitaires : TestAliasAndConclusionBonus (7) + TestTrackareSymptomGuard (4)
- Tests e2e sur PDFs réels (skip si absent) : méningite A87.0 + DLBCL C83.3 top1
- Gold CRH enrichi : 5 cas (2 réels ajoutés : 115_23066188, 132_23080179)
- Benchmark synthese : récupération conclusion depuis source_excerpt des DAS/traitements
- .gitignore : protection anti-PHI (real_crh_pdfs/, data/crh_samples/*.pdf)
- docs/PHI_POLICY.md : 7 règles de sécurité PHI
- Rapports debug : case 132 REVIEW (garde-fou actif), top errors, DIM pack

1043 tests passent, 0 régression.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This commit is contained in:
dom
2026-02-24 14:35:57 +01:00
parent 06a1be5425
commit cad0dd22b1
16 changed files with 1513 additions and 11 deletions

View File

@@ -0,0 +1,6 @@
case_id,document_type,chosen_code,chosen_term,verdict,confidence,expected_code,acceptable_codes,acceptable_family3,strict_match,acceptable_match,family3_match,symptom_not_allowed,raw_pool_size,filtered_pool_size,topk_size,evidence_count,review_reason_tag,top1_score,top2_score,delta_top1_top2,top3_codes,top3_terms
132_23080179,trackare,R59.0,Adénopathie,REVIEW,medium,C83.3,,C83,False,False,False,True,23,0,0,2,other,0,0,0,,
74_23141536,crh,D50,Anémie,REVIEW,medium,I25.1,I25.1|I25.5,I25,False,False,False,False,3,3,3,1,low_delta,4.0,4.0,0.0,D50|I25.1|Z95.5,Anémie|SCA (Syndrome Coronarien Aigu)|Stent vasculaire
115_23066188,trackare,A87.0,Méningite à entérovirus,CONFIRMED,high,A87.0,,A87,True,True,True,False,6,0,0,1,other,0,0,0,,
106_23056475,trackare,I26.9,Embolie pulmonaire,REVIEW,medium,I26.9,I26.0|I26.9,I26,True,True,True,False,10,7,7,1,low_delta,6.0,5.0,1.0,I26.9|I26.9|Q53.9,Embolie pulmonaire|Embolie pulmonaire|Cryptorchidie
73_23139637,trackare,R06.0,Dyspnée,REVIEW,medium,R06.0,,R06,True,True,True,False,1,1,1,1,mono_fragile,1.0,0,1.0,R06.0,Dyspnée
1 case_id document_type chosen_code chosen_term verdict confidence expected_code acceptable_codes acceptable_family3 strict_match acceptable_match family3_match symptom_not_allowed raw_pool_size filtered_pool_size topk_size evidence_count review_reason_tag top1_score top2_score delta_top1_top2 top3_codes top3_terms
2 132_23080179 trackare R59.0 Adénopathie REVIEW medium C83.3 C83 False False False True 23 0 0 2 other 0 0 0
3 74_23141536 crh D50 Anémie REVIEW medium I25.1 I25.1|I25.5 I25 False False False False 3 3 3 1 low_delta 4.0 4.0 0.0 D50|I25.1|Z95.5 Anémie|SCA (Syndrome Coronarien Aigu)|Stent vasculaire
4 115_23066188 trackare A87.0 Méningite à entérovirus CONFIRMED high A87.0 A87 True True True False 6 0 0 1 other 0 0 0
5 106_23056475 trackare I26.9 Embolie pulmonaire REVIEW medium I26.9 I26.0|I26.9 I26 True True True False 10 7 7 1 low_delta 6.0 5.0 1.0 I26.9|I26.9|Q53.9 Embolie pulmonaire|Embolie pulmonaire|Cryptorchidie
6 73_23139637 trackare R06.0 Dyspnée REVIEW medium R06.0 R06 True True True False 1 1 1 1 mono_fragile 1.0 0 1.0 R06.0 Dyspnée