Dom
5b2afa3629
fix(p1w): make default VLM model DGX-safe (qwen2.5vl:7b-rpa)
...
Sans env RPA_VLM_MODEL/VLM_MODEL, get_vlm_model() tombait sur le default
gemma4:latest, qui peut etre absent du tunnel DGX (depull) -> 404 Ollama et
echec de tout le pipeline VLM avant un test Lea humain.
- core/detection/vlm_config.py : DEFAULT_VLM_MODEL gemma4:latest -> qwen2.5vl:7b-rpa
(confirme present DGX, deja default reasoning + fallback bbox grounding).
+ DGX_SAFE_VLM_MODELS allow-list documentee.
- tests/unit/test_vlm_default_dgx_safe.py : 5 tests (default != gemma4:latest,
default in allow-list, no-env -> DGX-safe, env garde priorite).
Logique de resolution inchangee, pas d'appel reseau a l'import.
gemma4:latest reste accessible via env explicite.
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com >
2026-06-05 12:06:10 +02:00
Dom
0f122a512f
feat(p1y-alpha): add OpenAI-compatible LeaBench adapter (benchmark only)
...
Adapter de benchmark isole (hors runtime Lea) ciblant un serveur
/v1/chat/completions a support vision (vLLM/SGLang/TGI), pour comparer
plus tard a Ollama via LeaBench. Ne controle jamais le desktop.
- core/evaluation/openai_compat_lea_bench_adapter.py : payload data-URL
image_url, parsing choices[0].message.content. Reutilise par import la
logique prompt/parse/normalisation de ollama_lea_bench_adapter (zero refactor).
- tools/lea_bench_openai_compat.py : wrapper CLI (--base-url defaut :8001).
- tests/unit/test_openai_compat_lea_bench_adapter.py : 6 tests mockes HTTP
(data URL, pas de fuite expectation/click_region, prediction valide,
abstain safe sur HTTP!=200 et reponse malformee, JSONL rechargeable).
Aucun runtime Lea modifie. Aucun service lance.
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com >
2026-06-04 16:49:53 +02:00
Dom
806cc04b82
feat(p1z): centralize V4 reasoning model resolution (DGX-safe)
...
Remplace le default runtime dangereux `qwen2.5vl:7b` (absent du tunnel DGX
-> 404) des chemins V4/reasoning par un helper central get_reasoning_model().
- core/detection/vlm_config.py : + get_reasoning_model() + DEFAULT_REASONING_MODEL
(qwen2.5vl:7b-rpa). Ordre : RPA_REASONING_MODEL -> RPA_VLM_MODEL/VLM_MODEL ->
default DGX-safe. Pas d'appel reseau (lazy, safe a l'import).
- core/execution/input_handler.py, observe_reason_act.py (x3),
core/cognition/vram_orchestrator.py : migration des 5 call-sites.
- tests/unit/test_reasoning_model.py : 8 tests (default DGX-safe, ordre de
resolution, non-regression wiring des 3 modules V4).
Hors scope (signale lot P1.w) : DEFAULT_VLM_MODEL=gemma4:latest reste fallback
de get_vlm_model(). Client gele non touche.
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com >
2026-06-04 16:23:10 +02:00
Dom
4dc7d840d6
feat(p1x): de-hardcode VLM models/endpoints to vlm_config (DGX-ready)
...
Migre les call-sites VLM serveur vers la configuration centrale pour
fonctionner sur DGX (tunnel Ollama 11434), où gemma4:* est absent et le
port Docker 11435 est mort.
- task_planner, replay_verifier, domain_context, ir_builder, resolve_engine
(popup): modele -> vlm_config.get_vlm_model(), defaut 11435 -> 11434
(override GEMMA4_PORT legacy conserve)
- resolve_engine (grounding bbox x2): nouvel helper
vlm_config.get_bbox_grounding_model() (var dediee RPA_BBOX_GROUNDING_MODEL,
fallback RPA_GROUNDING_MODEL puis qwen2.5vl:7b-rpa) -> desambiguise le
conflit D5-v3b, bbox_2d + num_ctx 4096 preserves
- safety_checks_provider: defaut -> get_vlm_model(), override
RPA_SAFETY_CHECKS_LLM_MODEL preserve
- ui_detector: default_factory + resolution lazy (corrige aussi un gel a
l'import), pas d'appel reseau a l'import
- field_extractor: property lazy via vlm_config
TDD strict (RED->GREEN), 305 tests verts, tests mockes HTTP (zero dependance
DGX reel), aucun alias Ollama.
Hors perimetre (arbitrage Dom): client Lea agent_v1/executor.py (gele),
chemin V4 observe_reason_act (RPA_REASONING_MODEL), core/config.py defaults.
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com >
2026-06-03 14:06:03 +02:00
Dom
4e7c2a7628
docs(coordination): dispatch dgx vlm model cleanup
2026-06-02 18:16:55 +02:00
Dom
3697e3ba0e
docs(coordination): record p11 option a decision
2026-06-02 17:46:22 +02:00
Dom
5289f3de48
feat(p11): learn from offline cross-session matches
2026-06-02 17:46:15 +02:00
Dom
4b3d5ce0d7
chore(gitignore): ignore local agent and runtime artifacts
2026-06-02 16:31:09 +02:00
Dom
9b8bdfdbbe
docs(coordination): sync agent inboxes and active decisions
2026-06-02 16:30:14 +02:00
Dom
f2e9aac6b7
docs: add POC specs, handoffs, and research notes
2026-06-02 16:28:34 +02:00
Dom
18ed6cb751
feat(vwb): add dashboard competence testing and health tools
2026-06-02 16:27:19 +02:00
Dom
d38f0b0f2f
feat(agent): add learn action flow and grounding guards
2026-06-02 16:24:10 +02:00
Dom
86b3c8f7e7
feat(p1): persist workflows and semantic learning artifacts
2026-06-02 16:20:38 +02:00
Dom
7a1a5cb6fd
fix(p0): secure agent revocation and R6 worker queue
2026-06-02 15:52:35 +02:00
Dom
2dd306724c
docs(coordination): report no-cli competence test patch
2026-06-01 12:10:01 +02:00
Dom
335d576830
feat(dashboard): launch supervised competence tests
2026-06-01 12:09:09 +02:00
Dom
1a58a0d1f1
docs(coordination): sync dgx no-cli phase1 gaps
2026-06-01 11:59:27 +02:00
Dom
eb2df539f1
docs(poc): revise dgx spark dsi prerequisites docx
2026-06-01 11:04:16 +02:00
Dom
c9f848273b
docs(poc): add minimal dgx spark dsi prerequisites
2026-06-01 10:45:46 +02:00
Dom
45ec5fe969
docs(coordination): answer c gamma clarifications
2026-06-01 10:40:53 +02:00
Dom
8b6c397531
docs(poc): share dgx spark readiness context
2026-06-01 10:37:00 +02:00
Dom
6a300a4298
docs(coordination): add dgx spark multi-poste poc focus
2026-06-01 10:14:27 +02:00
Dom
0587036c17
docs(coordination): dispatch dgx spark poc readiness
2026-06-01 10:05:12 +02:00
Dom
f2a9e40502
docs(coordination): report c gamma dashboard promotion
2026-05-29 21:49:36 +02:00
Dom
34527b5cc5
feat(lea): add dashboard competence promotion dry run
2026-05-29 21:48:00 +02:00
Dom
bd3aaf7d64
docs(coordination): dispatch c gamma dashboard work
2026-05-29 19:04:58 +02:00
Dom
05a30f2d1d
docs(coordination): propose c gamma writeback decisions
2026-05-29 18:58:12 +02:00
Dom
47377226f2
feat(vwb): harden supervised verdict evidence
2026-05-29 18:54:54 +02:00
Dom
d515b22d1b
docs(coordination): report c beta supervision
2026-05-29 18:40:03 +02:00
Dom
aba849324a
feat(vwb): log supervised competence verdicts
2026-05-29 18:36:06 +02:00
Dom
7ad260d02f
docs(coordination): report c alpha preview
2026-05-29 18:15:30 +02:00
Dom
794a248dae
feat(vwb): preview lea competence workflows
2026-05-29 18:13:36 +02:00
Dom
8332b2cd37
docs(coordination): delegate yaml vwb supervision patch
2026-05-29 17:54:10 +02:00
Dom
9a45e61e2a
docs(coordination): report wait for state runtime
2026-05-29 17:26:35 +02:00
Dom
e66bc6d452
feat(vwb): execute wait for state
2026-05-29 17:22:35 +02:00
Dom
7b1f30af1a
fix(vwb): preserve static palette tools
2026-05-29 17:16:24 +02:00
Dom
488d14240a
docs(coordination): report vwb catalog patch
2026-05-29 17:11:02 +02:00
Dom
45b6da5e3f
feat(vwb): load palette from catalog
2026-05-29 17:09:47 +02:00
Dom
02211fddf2
docs(coordination): answer lea vwb mapping questions
2026-05-29 16:30:11 +02:00
Dom
ed36bc2b37
docs(coordination): share reflex vwb supervision findings
2026-05-29 14:33:57 +02:00
Dom
9677738f32
docs(coordination): request global review after vwb feedback
2026-05-29 14:05:40 +02:00
Dom
d422aa119c
docs(coordination): require claude qwen vision guardrails
2026-05-29 13:59:39 +02:00
Dom
7b943926db
docs(coordination): clarify vwb learning bridge
2026-05-29 13:46:22 +02:00
Dom
99f89317cb
feat(lea): substitute save menu gesture
2026-05-29 13:45:44 +02:00
Dom
6b8114eb97
docs(coordination): recadre lea direct competence flow
2026-05-29 13:41:18 +02:00
Dom
7ef98d8089
feat(lea): expose competence replay api
2026-05-29 13:40:15 +02:00
Dom
8ea4ed0ad2
docs(coordination): record supervised competence replay plan
2026-05-29 11:38:51 +02:00
Dom
a49f59b4d6
feat(competences): plan supervised replay tests
2026-05-29 11:38:12 +02:00
Dom
762e75a077
docs(coordination): record competence catalog integration
2026-05-29 11:29:18 +02:00
Dom
c1a144c673
feat(vwb): expose competence yaml catalog
2026-05-29 11:28:25 +02:00