feat: working E2E replay, 25/25 actions, 0 retries, SomEngine via server

Validated on a Windows PC (DESKTOP-58D5CAC, 2560x1600):
- 8 clicks resolved visually (1 anchor_template, 1 som_text_match, 6 som_vlm)
- Mean score 0.75, mean time 1.6 s
- Text typed correctly (bonjour, test word, date, email)
- 0 retries, 2 actions unverified (OK)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Dom
2026-03-31 14:04:41 +02:00
parent 5e0b53cfd1
commit a7de6a488b
79542 changed files with 6091757 additions and 1 deletions

@@ -0,0 +1,19 @@
{
  "mode": "hybrid",
  "step_fail_streak_to_degraded": 3,
  "workflow_fail_window_s": 600,
  "workflow_fail_max_in_window": 10,
  "global_fail_max_in_window": 30,
  "min_confidence_normal": 0.72,
  "min_confidence_degraded": 0.82,
  "min_margin_top1_top2_degraded": 0.08,
  "disable_learning_in_degraded": true,
  "rollback_on_regression": true,
  "regression_window_steps": 50,
  "regression_fail_ratio": 0.20,
  "quarantine_duration_s": 1800,
  "max_versions_to_keep": 5
}
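
The commit does not show the code that consumes this config, so the following is only a sketch of how these keys might be read. The function names (`is_degraded`, `accept_match`, `should_rollback`) and the exact comparison rules (`>=` thresholds, requiring a full window before rollback) are assumptions made for illustration, not the project's actual logic.

```python
import json

# A subset of the keys from the config above, embedded so the sketch runs standalone.
CONFIG_JSON = """
{
  "step_fail_streak_to_degraded": 3,
  "min_confidence_normal": 0.72,
  "min_confidence_degraded": 0.82,
  "min_margin_top1_top2_degraded": 0.08,
  "rollback_on_regression": true,
  "regression_window_steps": 50,
  "regression_fail_ratio": 0.20
}
"""

cfg = json.loads(CONFIG_JSON)


def is_degraded(cfg, fail_streak):
    """Hypothetical rule: enter degraded mode after N consecutive step failures."""
    return fail_streak >= cfg["step_fail_streak_to_degraded"]


def accept_match(cfg, degraded, top1, top2):
    """Hypothetical gate on a visual match.

    top1 / top2 are the scores of the best and second-best candidates.
    In normal mode only the top score is checked; in degraded mode the
    threshold is stricter and the best candidate must also beat the
    runner-up by a minimum margin.
    """
    if not degraded:
        return top1 >= cfg["min_confidence_normal"]
    return (
        top1 >= cfg["min_confidence_degraded"]
        and (top1 - top2) >= cfg["min_margin_top1_top2_degraded"]
    )


def should_rollback(cfg, outcomes):
    """Hypothetical regression check.

    outcomes is a list of booleans, True meaning the step failed.
    Only the last `regression_window_steps` entries are considered, and
    nothing is decided until the window is full (an assumption).
    """
    if not cfg["rollback_on_regression"]:
        return False
    size = cfg["regression_window_steps"]
    window = outcomes[-size:]
    if len(window) < size:
        return False
    return sum(window) / len(window) >= cfg["regression_fail_ratio"]
```

Under these assumed rules, the reported mean score of 0.75 would pass in normal mode (threshold 0.72) but not in degraded mode (threshold 0.82), which is one way to read the intent of the two thresholds.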