Files
rpa_vision_v3/.kiro/steering/tech.md
Dom a7de6a488b feat: replay E2E fonctionnel — 25/25 actions, 0 retries, SomEngine via serveur
Validé sur PC Windows (DESKTOP-58D5CAC, 2560x1600) :
- 8 clics résolus visuellement (1 anchor_template, 1 som_text_match, 6 som_vlm)
- Score moyen 0.75, temps moyen 1.6s
- Texte tapé correctement (bonjour, test word, date, email)
- 0 retries, 2 actions non vérifiées (OK)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-31 14:04:41 +02:00

74 lines
2.4 KiB
Markdown

# Technology Stack
## Core Technologies
- **Python 3.8+**: Primary development language
- **PyTorch**: Deep learning framework for embeddings and models
- **FAISS**: Vector similarity search and indexing
- **OpenCV**: Computer vision and image processing
- **Flask**: Web API framework for server components
- **React + TypeScript**: Frontend for Visual Workflow Builder
## Key Libraries
- **open_clip_torch**: CLIP embeddings for visual-semantic understanding
- **transformers**: Hugging Face models (OWL-ViT for object detection)
- **Pillow**: Image processing and manipulation
- **PyQt5**: Desktop GUI framework
- **scikit-learn**: Machine learning utilities
- **pytest**: Testing framework with property-based testing (Hypothesis)
## External Dependencies
- **Ollama**: Local VLM inference server (qwen3-vl:8b model)
- **NVIDIA GPU**: Optional but recommended for performance
- **System capture libraries**: mss, pygetwindow, pyautogui
## Build System
### Environment Setup
```bash
# Create virtual environment
python3 -m venv venv_v3
source venv_v3/bin/activate # Linux/macOS
# or venv_v3\Scripts\activate # Windows
# Install dependencies
pip install -r requirements.txt
```
### Common Commands
```bash
# Complete setup and launch
./run.sh # Launch GUI (default)
./run.sh --server # API server only (port 8000)
./run.sh --dashboard # Web dashboard (port 5001)
./run.sh --all # All services
./run.sh --agent # Capture agent
./run.sh --check # Environment check only
# Testing
./test_quick.sh # Quick system test
pytest tests/ # Full test suite
pytest tests/unit/ # Unit tests only
pytest tests/integration/ # Integration tests
pytest tests/property/ # Property-based tests
# Development
./install_dependencies.sh # Manual dependency install
./status.sh # System status check
```
### Configuration
- **Environment variables**: Use `.env` file (see `.env.example`)
- **Central config**: `core/config.py` with dataclass-based configuration
- **Production**: Set `ENVIRONMENT=production` for security validation
### Project Structure
- **Modular architecture**: Each layer in separate `core/` subdirectories
- **Test-driven**: Comprehensive test coverage with property-based testing
- **Multi-component**: GUI, server, dashboard, agent as separate runnable components
- **Cross-platform**: Linux, macOS, Windows support with platform-specific handling