
# RPA Vision V3 Documentation

## 📚 Organization

### `reference/` - Architecture Reference Documents

Technical documents describing the complete system architecture:

- **`ARCHITECTURE_VISION_COMPLETE.md`** - Complete 5-layer architecture (ESSENTIAL)
- **`ARCHITECTURE_ENRICHISSEMENTS.md`** - 8 production-ready enhancements
- **`ARCHITECTURE_INDEX.md`** - Navigation index
- **`ENRICHISSEMENTS_RESUME.md`** - Summary of the enhancements
- **`START_HERE.md`** - Starting point for understanding the architecture
- **`QUICK_SUMMARY.txt`** - Quick summary
- **`SESSION_COMPLETE_22_NOV.md`** - Complete work session
- **`RESUME_CREATION_DOCS.md`** - Creation history
- **`CHANGELOG_MVP.md`** - MVP changelog

### `specs/` - Implementation Specification

Complete specification for the implementation:

- **`requirements.md`** - 15 requirements with 89 acceptance criteria
- **`design.md`** - Detailed design with 20 correctness properties
- **`tasks.md`** - Implementation plan in 13 phases with 60+ tasks

### Other Documents

- **`ROADMAP_RPA_100_VISION.md`** - Project vision and roadmap

## 🎯 Where to Start?

### To Understand the Architecture

1. **Read**: `reference/START_HERE.md`
2. **Go deeper**: `reference/ARCHITECTURE_VISION_COMPLETE.md`
3. **Enhancements**: `reference/ARCHITECTURE_ENRICHISSEMENTS.md`

### To Implement

1. **Requirements**: `specs/requirements.md`
2. **Design**: `specs/design.md`
3. **Tasks**: `specs/tasks.md`

### For the Big Picture

- **Roadmap**: `ROADMAP_RPA_100_VISION.md`

## 🏗️ 5-Layer Architecture

```
Layer 0: RawSession
    ↓
Layer 1: ScreenState (4 levels)
    ↓
Layer 2: UIElement Detection
    ↓
Layer 3: State Embedding
    ↓
Layer 4: Workflow Graph
```

## 📖 Key Concepts

### 100% Vision-Based RPA

- No fixed (x, y) coordinates
- Semantic roles (`primary_action`, `form_input`, etc.)
- Matching by visual and textual similarity
- Robust to UI changes
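
To make the idea of matching by role and similarity instead of coordinates more concrete, here is a minimal sketch. It only covers the textual half of the matching, using the standard library's `difflib.SequenceMatcher` as a stand-in similarity measure. The names `DetectedElement`, `text_similarity`, `find_target` and the `threshold` value are hypothetical and are not taken from the project code.

```python
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass
class DetectedElement:
    """Hypothetical detected UI element: a semantic role plus its visible text."""
    role: str
    text: str


def text_similarity(a: str, b: str) -> float:
    """Case-insensitive similarity in [0, 1] (stand-in for the real measure)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def find_target(elements, role, text, threshold=0.6):
    """Return the element with the requested role whose text is most similar
    to `text`, or None if no candidate reaches the (assumed) threshold.
    No screen coordinates are involved in the lookup."""
    candidates = [e for e in elements if e.role == role]
    best = max(candidates, key=lambda e: text_similarity(e.text, text), default=None)
    if best is None or text_similarity(best.text, text) < threshold:
        return None
    return best


elements = [
    DetectedElement("primary_action", "Submit order"),
    DetectedElement("form_input", "Email"),
    DetectedElement("primary_action", "Cancel"),
]
target = find_target(elements, "primary_action", "Submit Order")
```

Because the lookup relies on role and text rather than position, the same call would still find the button if it moved elsewhere on the screen. A visual similarity term would be combined with the textual score in the same way.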

### Workflow Graph

- **WorkflowNode**: Screen-state template
- **WorkflowEdge**: Transition (action) between nodes
- **Learning States**: OBSERVATION → COACHING → AUTO_CANDIDATE → AUTO_CONFIRMÉ
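
The structure above could be sketched roughly as follows. Only the names `WorkflowNode`, `WorkflowEdge` and the four learning states come from this document. The fields, the `WorkflowGraph` container, the `promote` method and the choice to attach the learning state to the whole graph are assumptions made for illustration.

```python
from dataclasses import dataclass, field

# Order taken from the progression listed above.
LEARNING_STATES = ["OBSERVATION", "COACHING", "AUTO_CANDIDATE", "AUTO_CONFIRMÉ"]


@dataclass
class WorkflowNode:
    """Screen-state template (fields are hypothetical)."""
    node_id: str
    description: str = ""


@dataclass
class WorkflowEdge:
    """Transition between two nodes, triggered by an action."""
    source: str
    target: str
    action: str


@dataclass
class WorkflowGraph:
    nodes: dict = field(default_factory=dict)
    edges: list = field(default_factory=list)
    # Assumption: the learning state is tracked per workflow.
    learning_state: str = LEARNING_STATES[0]

    def add_node(self, node: WorkflowNode) -> None:
        self.nodes[node.node_id] = node

    def add_edge(self, edge: WorkflowEdge) -> None:
        # Reject edges that reference nodes the graph does not know about.
        if edge.source not in self.nodes or edge.target not in self.nodes:
            raise KeyError("both endpoints must be added before the edge")
        self.edges.append(edge)

    def outgoing(self, node_id: str) -> list:
        """Edges leaving the given node."""
        return [e for e in self.edges if e.source == node_id]

    def promote(self) -> str:
        """Move to the next learning state; stay put at the last one."""
        idx = LEARNING_STATES.index(self.learning_state)
        if idx < len(LEARNING_STATES) - 1:
            self.learning_state = LEARNING_STATES[idx + 1]
        return self.learning_state


graph = WorkflowGraph()
graph.add_node(WorkflowNode("login", "Login screen"))
graph.add_node(WorkflowNode("home", "Home screen"))
graph.add_edge(WorkflowEdge("login", "home", "click primary_action"))
```

In this sketch, the conditions that trigger a promotion (for example a number of successful observed runs) are deliberately left out, since this document does not specify them.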

### State Embedding

Multi-modal fusion:
- 50% Image (full screenshot)
- 30% Text (detected text)
- 10% Title (window title)
- 10% UI (detected elements)
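
One plausible reading of these weights is a weighted sum of per-modality vectors. The sketch below assumes that each modality has already been embedded into a vector of the same dimension, and that vectors are L2-normalized before and after fusion. Those assumptions, along with the names `FUSION_WEIGHTS`, `l2_normalize` and `fuse_embeddings`, are illustrative and may differ from the actual implementation (which could, for instance, concatenate instead of sum).

```python
import math

# Weights taken from the list above; the keys are hypothetical names.
FUSION_WEIGHTS = {"image": 0.5, "text": 0.3, "title": 0.1, "ui": 0.1}


def l2_normalize(vec):
    """Scale a vector to unit length; zero vectors are returned unchanged."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)


def fuse_embeddings(parts, weights=FUSION_WEIGHTS):
    """Weighted sum of L2-normalized modality vectors, re-normalized.

    `parts` maps each modality name to a vector; all vectors are assumed
    to share the same dimension.
    """
    if set(parts) != set(weights):
        raise ValueError("expected exactly these modalities: %s" % sorted(weights))
    dims = {len(v) for v in parts.values()}
    if len(dims) != 1:
        raise ValueError("all modality vectors must have the same dimension")
    fused = [0.0] * dims.pop()
    for name, vec in parts.items():
        for i, x in enumerate(l2_normalize(vec)):
            fused[i] += weights[name] * x
    return l2_normalize(fused)


state_vector = fuse_embeddings({
    "image": [1.0, 0.0],
    "text": [0.0, 1.0],
    "title": [0.0, 1.0],
    "ui": [0.0, 1.0],
})
```

In this toy example the image vector alone carries as much weight as the three other modalities combined, so the fused vector sits halfway between the two axes.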

## 🔗 Useful Links

- Source code: `../core/`
- Tests: `../tests/`
- Data: `../data/`
- Main README: `../README.md`