This commit is contained in:
yadonglu
2024-10-01 17:25:16 +00:00
parent 6cd06a7a86
commit 80572b823a
10 changed files with 423 additions and 54 deletions

View File

@@ -6,6 +6,12 @@
**OmniParser** is a comprehensive method for parsing user interface screenshots into structured and easy-to-understand elements, which significantly enhances the ability of GPT-4V to generate actions that can be accurately grounded in the corresponding regions of the interface.
## Install
```python
conda create -n "omni" python==3.12
pip install -r requirements.txt
```
## Examples:
We put together a few simple examples in the demo.ipynb.