Files
rpa_vision_v3/.kiro/steering/tech.md
Dom a7de6a488b feat: replay E2E fonctionnel — 25/25 actions, 0 retries, SomEngine via serveur
Validé sur PC Windows (DESKTOP-58D5CAC, 2560x1600) :
- 8 clics résolus visuellement (1 anchor_template, 1 som_text_match, 6 som_vlm)
- Score moyen 0.75, temps moyen 1.6s
- Texte tapé correctement (bonjour, test word, date, email)
- 0 retries, 2 actions non vérifiées (OK)

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-31 14:04:41 +02:00

2.4 KiB

Technology Stack

Core Technologies

  • Python 3.8+: Primary development language
  • PyTorch: Deep learning framework for embeddings and models
  • FAISS: Vector similarity search and indexing
  • OpenCV: Computer vision and image processing
  • Flask: Web API framework for server components
  • React + TypeScript: Frontend for Visual Workflow Builder

Key Libraries

  • open_clip_torch: CLIP embeddings for visual-semantic understanding
  • transformers: Hugging Face models (OWL-ViT for object detection)
  • Pillow: Image processing and manipulation
  • PyQt5: Desktop GUI framework
  • scikit-learn: Machine learning utilities
  • pytest: Testing framework with property-based testing (Hypothesis)

External Dependencies

  • Ollama: Local VLM inference server (qwen3-vl:8b model)
  • NVIDIA GPU: Optional but recommended for performance
  • System capture libraries: mss, pygetwindow, pyautogui

Build System

Environment Setup

# Create virtual environment
python3 -m venv venv_v3
source venv_v3/bin/activate  # Linux/macOS
# or venv_v3\Scripts\activate  # Windows

# Install dependencies
pip install -r requirements.txt

Common Commands

# Complete setup and launch
./run.sh                    # Launch GUI (default)
./run.sh --server          # API server only (port 8000)
./run.sh --dashboard       # Web dashboard (port 5001)
./run.sh --all             # All services
./run.sh --agent           # Capture agent
./run.sh --check           # Environment check only

# Testing
./test_quick.sh            # Quick system test
pytest tests/              # Full test suite
pytest tests/unit/         # Unit tests only
pytest tests/integration/  # Integration tests
pytest tests/property/     # Property-based tests

# Development
./install_dependencies.sh  # Manual dependency install
./status.sh               # System status check

Configuration

  • Environment variables: Use .env file (see .env.example)
  • Central config: core/config.py with dataclass-based configuration
  • Production: Set ENVIRONMENT=production for security validation

Project Structure

  • Modular architecture: Each layer in separate core/ subdirectories
  • Test-driven: Comprehensive test coverage with property-based testing
  • Multi-component: GUI, server, dashboard, agent as separate runnable components
  • Cross-platform: Linux, macOS, Windows support with platform-specific handling