Upsonic is a reliability-focused agent framework with dockerized, server-client architecture and MCP
-
Updated
Feb 15, 2025 - Python
Upsonic is a reliability-focused agent framework with dockerized, server-client architecture and MCP
Let AI be your browser operator.
A GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.
Ui.Vision Open-Source RPA Software with Computer Vision, OCR, Anthropic Computer Use. Selenium IDE import/export.
Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.
A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.
A fork of Anthropic Computer Use that you can run on Mac computers to give Claude and other AI models autonomous access to your computer.
An open-sourced end-to-end VLM-based GUI Agent
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
A framework to enable autonomous android and computer use using any LLM (local or remote)
Desktop app powered by Claude’s computer use capability to control your computer
This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for General Computing Devices Use".
A general AI agent framework that can be adapted to various tasks and environments.
A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating Systems (LLM-OS).
✨ Use natural language to control your browser, powered by LLM and playwright
Meet WebHive, the AI-powered browser that takes care of tasks for you. No more endless clicks, tell it what you need, and it gets it done.
Mark web pages for use with vision-language models
Code repo for the paper: Attacking Vision-Language Computer Agents via Pop-ups
Add a description, image, and links to the computer-use topic page so that developers can more easily learn about it.
To associate your repository with the computer-use topic, visit your repo's landing page and select "manage topics."