CSTA Philly
less than 1 minute read
Published:
Drexel is an institutional member of CSTA Philly, a group active in developing curriculum, sharing ideas, and broadening the reach of computing across educational disciplines.

11 minute read
Published:
Most large language model interfaces are designed for developers or for a general consumer audience. Faculty who want to use an AI assistant to help with grading, research, or course preparation either accept the limitations of a consumer chat interface or invest significant time learning to run and configure a developer-grade setup. BetterWebUI is an attempt to close that gap. It is a local Python/FastAPI server with a pure-HTML front end that connects to an existing OpenWebUI instance and layers on the features that make an agentic assistant genuinely useful in a higher-education context: workspaces, skills, MCP server management, CLI shortcuts, math rendering, and a suite of integrations with sibling agentic services.
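As a rough sketch of that architecture, the core pattern is a FastAPI app that forwards chat requests to the OpenWebUI instance's OpenAI-compatible endpoint and layers its own features on top. The endpoint path, environment variables, and request shape below are assumptions for illustration, not BetterWebUI's actual code:

```python
# Illustrative proxy pattern only; OPENWEBUI_URL, OPENWEBUI_KEY, and the
# /api/chat/completions path are assumptions, not BetterWebUI internals.
import os

import httpx
from fastapi import FastAPI
from pydantic import BaseModel

OPENWEBUI_URL = os.environ.get("OPENWEBUI_URL", "http://localhost:3000")
OPENWEBUI_KEY = os.environ.get("OPENWEBUI_KEY", "")

app = FastAPI()

class ChatRequest(BaseModel):
    model: str
    prompt: str

@app.post("/chat")
async def chat(req: ChatRequest) -> dict:
    # Forward the prompt to OpenWebUI's OpenAI-compatible chat endpoint
    # and return the raw JSON response to the pure-HTML front end.
    async with httpx.AsyncClient(timeout=120) as client:
        resp = await client.post(
            f"{OPENWEBUI_URL}/api/chat/completions",
            headers={"Authorization": f"Bearer {OPENWEBUI_KEY}"},
            json={
                "model": req.model,
                "messages": [{"role": "user", "content": req.prompt}],
            },
        )
        resp.raise_for_status()
        return resp.json()
```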
9 minute read
Published:
Most LLM agents can read files, call APIs, and run shell commands, but they have no reliable way to operate a graphical desktop. They cannot click a button in a running application, verify that a dialog appeared, fill a form field, or observe what is currently on screen. AutoGUI is a research prototype that fills that gap. It connects any OpenAI-compatible LLM — including models served locally through OpenWebUI or directly through Ollama — to a full suite of OS-level desktop controls via a ReAct-style agentic loop.
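To make the loop concrete, here is a minimal ReAct-style sketch, assuming the OpenAI Python client pointed at a local Ollama endpoint and a single illustrative click tool; AutoGUI's actual tool suite and prompting are richer than this:

```python
# Illustrative ReAct loop; the click tool, model name, and Ollama base_url
# are assumptions, not AutoGUI's actual implementation.
import json

import pyautogui
from openai import OpenAI

client = OpenAI(base_url="http://localhost:11434/v1", api_key="unused")

TOOLS = [{
    "type": "function",
    "function": {
        "name": "click",
        "description": "Click the screen at pixel coordinates (x, y).",
        "parameters": {
            "type": "object",
            "properties": {"x": {"type": "integer"}, "y": {"type": "integer"}},
            "required": ["x", "y"],
        },
    },
}]

def run(goal: str, max_steps: int = 10) -> None:
    messages = [{"role": "user", "content": goal}]
    for _ in range(max_steps):
        # Reason: ask the model what to do next given the history so far.
        reply = client.chat.completions.create(
            model="llama3.1", messages=messages, tools=TOOLS
        ).choices[0].message
        messages.append(reply)
        if not reply.tool_calls:  # no action requested: the task is done
            print(reply.content)
            return
        # Act and observe: execute each requested tool call and feed the
        # result back so the next iteration can reason over it.
        for call in reply.tool_calls:
            args = json.loads(call.function.arguments)
            pyautogui.click(args["x"], args["y"])
            messages.append({
                "role": "tool",
                "tool_call_id": call.id,
                "content": f"clicked ({args['x']}, {args['y']})",
            })
```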
15 minute read
Published:
Most AI agents, whether a large language model assistant running locally or a cloud-hosted agentic framework, have no reliable way to see or interact with the desktop applications running on the machine they are supposed to be helping with. They can read files, call APIs, and run shell commands, but they cannot observe that a dialog box appeared, that a form field is waiting for input, or that an application is in a specific state. OSScreenObserver is a prototype that changes that. It exposes the operating system’s UI accessibility tree, textual descriptions from multiple sources, and ASCII spatial sketches of the current screen layout through two simultaneous interfaces: a browser-based web inspector for humans and an MCP server for AI agents, so that what humans and agents see is always consistent.
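On the agent-facing side, the MCP interface can be as small as one tool. The sketch below uses the reference Python MCP SDK's FastMCP helper with an illustrative tool name and placeholder output; it is not OSScreenObserver's actual interface:

```python
# Illustrative MCP server; the tool name and placeholder description are
# assumptions, not OSScreenObserver's real interface.
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("screen-observer")

@mcp.tool()
def describe_screen() -> str:
    """Return a textual description of the current screen layout."""
    # A real observer would walk the OS accessibility tree here; a fixed
    # string keeps the sketch self-contained.
    return "desktop: 1 window (title='Untitled - Editor', focused=True)"

if __name__ == "__main__":
    mcp.run()  # serves the tool over stdio for any MCP-capable agent
```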
38 minute read
Published:
For the past year I have been building a knowledge management system with a specific design constraint in mind: every AI system I work with, whether a cloud-hosted assistant, a local agentic coding tool, or an automated GitHub Action, should be able to read the same authoritative description of who I am, what I am working on, and how I want to interact. More importantly, those systems should be able to write back into the knowledge base and have their work appear seamlessly in Obsidian on my local machine the next time I open the app. The proliferation of capable AI tools in 2025-2026 made both sides of this problem, reading and writing, tractable in a way they had not been before. This post documents the architecture I settled on: an Obsidian vault hosted on GitHub, synchronized via the Gitless Sync plugin, structured around three canonical files that any AI system can read and act on, and organized into a curated wiki that agents can query, extend, and maintain across platforms.
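As a hypothetical sketch of the reading side of that design, an agent can pull the canonical files straight from the GitHub-hosted vault over raw URLs; the repository path and file names below are placeholders, not the vault's actual layout:

```python
# Hypothetical reader for a GitHub-hosted Obsidian vault; the repository
# path and the three canonical file names are placeholders.
import urllib.request

VAULT_RAW = "https://raw.githubusercontent.com/USER/vault/main"  # placeholder
CANONICAL = ["About.md", "Now.md", "Interaction.md"]  # placeholder names

def load_context() -> str:
    """Concatenate the canonical files so an agent can load them as context."""
    parts = []
    for name in CANONICAL:
        with urllib.request.urlopen(f"{VAULT_RAW}/{name}") as resp:
            parts.append(resp.read().decode("utf-8"))
    return "\n\n".join(parts)
```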