tests: alias DLBCL + garde-fou Trackare + e2e PDFs réels + gold CRH + benchmark enrichi
- 11 tests unitaires : TestAliasAndConclusionBonus (7) + TestTrackareSymptomGuard (4) - Tests e2e sur PDFs réels (skip si absent) : méningite A87.0 + DLBCL C83.3 top1 - Gold CRH enrichi : 5 cas (2 réels ajoutés : 115_23066188, 132_23080179) - Benchmark synthese : récupération conclusion depuis source_excerpt des DAS/traitements - .gitignore : protection anti-PHI (real_crh_pdfs/, data/crh_samples/*.pdf) - docs/PHI_POLICY.md : 7 règles de sécurité PHI - Rapports debug : case 132 REVIEW (garde-fou actif), top errors, DIM pack 1043 tests passent, 0 régression. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
This commit is contained in:
6
docs/gold_debug/NUKE3_GOLD_TOP_ERRORS.csv
Normal file
6
docs/gold_debug/NUKE3_GOLD_TOP_ERRORS.csv
Normal file
@@ -0,0 +1,6 @@
|
||||
case_id,document_type,chosen_code,chosen_term,verdict,confidence,expected_code,acceptable_codes,acceptable_family3,strict_match,acceptable_match,family3_match,symptom_not_allowed,raw_pool_size,filtered_pool_size,topk_size,evidence_count,review_reason_tag,top1_score,top2_score,delta_top1_top2,top3_codes,top3_terms
|
||||
132_23080179,trackare,R59.0,Adénopathie,REVIEW,medium,C83.3,,C83,False,False,False,True,23,0,0,2,other,0,0,0,,
|
||||
74_23141536,crh,D50,Anémie,REVIEW,medium,I25.1,I25.1|I25.5,I25,False,False,False,False,3,3,3,1,low_delta,4.0,4.0,0.0,D50|I25.1|Z95.5,Anémie|SCA (Syndrome Coronarien Aigu)|Stent vasculaire
|
||||
115_23066188,trackare,A87.0,Méningite à entérovirus,CONFIRMED,high,A87.0,,A87,True,True,True,False,6,0,0,1,other,0,0,0,,
|
||||
106_23056475,trackare,I26.9,Embolie pulmonaire,REVIEW,medium,I26.9,I26.0|I26.9,I26,True,True,True,False,10,7,7,1,low_delta,6.0,5.0,1.0,I26.9|I26.9|Q53.9,Embolie pulmonaire|Embolie pulmonaire|Cryptorchidie
|
||||
73_23139637,trackare,R06.0,Dyspnée,REVIEW,medium,R06.0,,R06,True,True,True,False,1,1,1,1,mono_fragile,1.0,0,1.0,R06.0,Dyspnée
|
||||
|
Reference in New Issue
Block a user