Sign in
← Back to search

microsoft/OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Stars
24,805
Forks
2,172
Commits
154
Language
Jupyter Notebook
Awesome lists
1

Similar repositories

adithya-s-k/omniparse

Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks

6820 stars
Python 1 awesome list

OmniSVG/OmniSVG

[NeurIPS 2025] OmniSVG is the first family of end-to-end multimodal SVG generators that leverage pre-trained Vision-Language Models (VLMs), capable of generating complex and detailed SVGs, from simple icons to intricate anime characters.

2504 stars
Python 1 awesome list

bytedance/UI-TARS-desktop

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

35233 stars
TypeScript 3 awesome lists

allenai/olmocr

Toolkit for linearizing PDFs for LLM datasets/training

17353 stars
Python 1 awesome list

omnirexflora-labs/omnicoreagent

Open Python agent harness for production AI apps: tools, MCP, memory, workspace, telemetry, subagents, background tasks, and OmniServe APIs.

241 stars
Python 1 awesome list

Tracked growth

1 capture since 2026-05-25

Latest capture 2026-05-25 21:07

Stars history

Total stars

Commits history

Default branch commits

Metadata

  • Created: 2024-09-20
  • First commit: —
  • Last pushed: 2026-04-13
  • Archived: no
  • Stack detected: —
  • License: CC-BY-4.0

AI development signals

No AI development config files detected.