rpa_vision_v3/agent_v0/server_v1 at 0a02a6ec9c498f590da7da26150eba225d3d1b88 - rpa_vision_v3 - Gitea Aivanov : Git with a cup of tea

Dom/rpa_vision_v3

Files

History

Dom 0a02a6ec9c

tests / Lint (ruff + black) (push) Successful in 15s

Details

tests / Tests unitaires (sans GPU) (push) Failing after 14s

Details

tests / Tests sécurité (critique) (push) Has been skipped

Details

feat(qw4): bench rigoureux LLM safety_checks → gemma4:latest par défaut

Bench 5 modèles × 5 scénarios × cold+warm sur RTX 5070 :
- gemma4:latest : warm 2.9s, JSON 92%, détection 46% → gagnant
- qwen2.5vl:7b : warm 6.6s, détection 23% (trop lent)
- qwen2.5vl:3b : warm 2.0s, détection 8% (vérifie pour vérifier)
- medgemma:4b : warm 0.5s, détection 0% (refuse de signaler) → mauvais
  défaut initial, corrigé
- qwen3-vl:8b : 0% JSON valide (ignore format=json Ollama) → écarté

Modifications safety_checks_provider.py :
- RPA_SAFETY_CHECKS_LLM_MODEL défaut: medgemma:4b → gemma4:latest
- RPA_SAFETY_CHECKS_LLM_TIMEOUT_S défaut: 5 → 7 (warm 2.9s + marge)

Doc complète : docs/BENCH_SAFETY_CHECKS_2026-05-06.md
Script : tools/bench_safety_checks_models.py (reproductible, ~10-15 min)

Limite assumée : 46% de détection. À présenter en démo comme aide médecin,
pas certification. Amélioration V2 = prompt plus dirigé sur champs à vérifier.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

2026-05-06 09:23:09 +02:00

..

__init__.py

chore: ajouter agent_v0/ au tracking git (était un repo embarqué)

2026-03-18 11:12:23 +01:00

agent_registry.py

feat(fleet): endpoints /agents/enroll|uninstall|fleet + SQLite

2026-04-15 09:07:19 +02:00

api_stream.py

feat(qw4): hook safety_checks_provider + extension /replay/resume avec acquittements

2026-05-05 23:45:22 +02:00

audit_trail.py

feat: pipeline complet MACRO/MÉSO/MICRO — Critic, Observer, Policy, Recovery, Learning, Audit Trail, TaskPlanner

2026-04-09 21:03:25 +02:00

chat_interface.py

feat: Léa chat + IRBuilder enrichi (stratégies V4 complètes)

2026-04-10 09:01:13 +02:00

domain_context.py

feat: Léa personnalité — langage métier multi-domaines

2026-04-10 09:01:52 +02:00

execution_plan_runner.py

fix: contrôle strict des étapes + routage par machine_id

2026-04-10 14:05:23 +02:00

live_session_manager.py

feat(qw1): enrichissement Agent V1 (monitor_index + monitors_geometry) + hook serveur

2026-05-05 23:05:44 +02:00

loop_detector.py

feat(qw2): LoopDetector composite (screen_static + action_repeat + retry)

2026-05-05 23:09:43 +02:00

monitor_router.py

chore(qw): cleanup post-review (préfixes BUS, événements monitor, import io)

2026-05-06 00:08:22 +02:00

replay_engine.py

feat(qw4): hook safety_checks_provider + extension /replay/resume avec acquittements

2026-05-05 23:45:22 +02:00

replay_failure_logger.py

chore: ajouter replay_failure_logger.py au tracking git

2026-04-12 10:35:51 +02:00

replay_learner.py

feat: premier replay E2E + mode apprentissage supervisé

2026-04-13 07:42:50 +02:00

replay_memory.py

feat(security): API streaming fail-closed + /image privé + target_memory prefix fix

2026-04-14 16:49:02 +02:00

replay_verifier.py

feat: pipeline complet MACRO/MÉSO/MICRO — Critic, Observer, Policy, Recovery, Learning, Audit Trail, TaskPlanner

2026-04-09 21:03:25 +02:00

resolve_engine.py

fix(stream+vwb): chaîne replay robuste — auth, anchor type_text, lock async, drift, prompt LLM

2026-05-02 00:32:57 +02:00

run_worker.py

feat: replay visuel VLM-first, worker séparé, package Léa, AZERTY, sécurité HTTPS

2026-03-26 10:19:18 +01:00

safety_checks_provider.py

feat(qw4): bench rigoureux LLM safety_checks → gemma4:latest par défaut

2026-05-06 09:23:09 +02:00

session_worker.py

chore: ajouter agent_v0/ au tracking git (était un repo embarqué)

2026-03-18 11:12:23 +01:00

stream_processor.py

feat(cognition): timing + écran attendu + auto-apprentissage Shadow + VLM qwen2.5vl

2026-04-20 21:52:45 +02:00

task_planner.py

feat: pipeline complet MACRO/MÉSO/MICRO — Critic, Observer, Policy, Recovery, Learning, Audit Trail, TaskPlanner

2026-04-09 21:03:25 +02:00

visual_wait.py

chore: ajouter agent_v0/ au tracking git (était un repo embarqué)

2026-03-18 11:12:23 +01:00

vm_controller.py

chore: ajouter agent_v0/ au tracking git (était un repo embarqué)

2026-03-18 11:12:23 +01:00

worker_stream.py

chore: ajouter agent_v0/ au tracking git (était un repo embarqué)

2026-03-18 11:12:23 +01:00

workflow_replay.py

feat: branchement workflow — actions magnétoscope enrichies avec CLIP

2026-04-05 16:30:27 +02:00