What makes an agent different from a chatbot
A regular chatbot takes a prompt and returns text. An OpenSwarm agent takes a prompt and acts on it: it plans, uses tools, reads results, adapts, and keeps going until the job is done or it needs your input. Agents can:
- Read, create, and modify files across your entire filesystem
- Browse the web via a real browser — navigating, clicking, filling forms
- Use integrations — Gmail, Google Calendar, Drive, Sheets, X/Twitter, Reddit, and any custom integrations you connect
- Spawn sub-agents to parallelize work or delegate specialized tasks
- Generate interactive apps that render live on the dashboard
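The plan, act, read, adapt cycle described above can be sketched as a small loop. This is a minimal illustration only: `Step`, `AgentRun`, `run_agent`, the `"done"` and `"ask_user"` sentinels, and the `read_file` tool are all hypothetical names invented for this example, not part of OpenSwarm, and the scripted planner stands in for a real model call.

```python
from dataclasses import dataclass, field
from typing import Callable

# All names below are hypothetical; they illustrate the loop, not OpenSwarm's API.


@dataclass
class Step:
    tool: str           # a tool name, or the sentinels "done" / "ask_user"
    argument: str = ""


@dataclass
class AgentRun:
    task: str
    history: list[str] = field(default_factory=list)


def run_agent(
    task: str,
    plan_next: Callable[[AgentRun], Step],
    tools: dict[str, Callable[[str], str]],
    max_steps: int = 10,
) -> AgentRun:
    """Loop until the planner reports it is done, needs input, or runs out of steps."""
    run = AgentRun(task)
    for _ in range(max_steps):
        step = plan_next(run)                            # plan the next action
        if step.tool == "done":
            run.history.append("done")
            break
        if step.tool == "ask_user":
            run.history.append(f"needs input: {step.argument}")
            break
        result = tools[step.tool](step.argument)         # use a tool
        run.history.append(f"{step.tool} -> {result}")   # record the result for the next plan
    return run


def scripted_planner(run: AgentRun) -> Step:
    # Stand-in for a model call: read one file, then finish.
    if not run.history:
        return Step("read_file", "notes.txt")
    return Step("done")


demo = run_agent(
    "summarize notes",
    scripted_planner,
    {"read_file": lambda path: f"contents of {path}"},
)
print(demo.history)
```

The `max_steps` cap is a design choice of this sketch so that a planner that never finishes cannot loop forever.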
Agents in the context of OpenSwarm
OpenSwarm isn’t about running one agent; it’s about running many at once. Each agent lives as a card on a workspace canvas. You can have as many agents as you like working simultaneously, and the workspace gives you a bird’s-eye view of everything that’s happening.
The swarm model
- You launch an agent from the toolbar and give it a task.
- That agent can spawn sub-agents, open browsers, and execute actions as needed.
- Permissions are set globally, so you control exactly what an agent can or can’t do.
- When human approval or insight is needed, a popup appears within that agent’s chat.
In the center of the OpenSwarm application’s header, you can also find the floating island bubble. It gives you a quick overview of all agent states. If human input is needed, the floating island morphs into an input field that you can interact with.
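One way to picture the overview the floating island provides is as a roll-up of per-agent states. The sketch below is an assumption-laden illustration: the `AgentState` values, the `summarize` function, and the agent names are hypothetical and are not taken from OpenSwarm.

```python
from collections import Counter
from enum import Enum


class AgentState(Enum):
    # Hypothetical states; the real set of agent states is not documented here.
    RUNNING = "running"
    WAITING_FOR_INPUT = "waiting_for_input"
    DONE = "done"


def summarize(states: dict[str, AgentState]) -> dict:
    """Count agents per state and list the ones waiting on a human."""
    counts = Counter(state.value for state in states.values())
    waiting = sorted(
        name for name, state in states.items()
        if state is AgentState.WAITING_FOR_INPUT
    )
    return {"counts": dict(counts), "needs_input": waiting}


overview = summarize({
    "research": AgentState.RUNNING,
    "inbox-triage": AgentState.WAITING_FOR_INPUT,
    "report": AgentState.DONE,
    "scraper": AgentState.RUNNING,
})
print(overview)
```

In this sketch, a non-empty `needs_input` list corresponds to the moment the island would switch to an input field.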
Modes
Agents run in a mode that controls their behavior: which tools are available, what system prompt they use, and how they approach tasks. OpenSwarm ships with built-in modes:
- Agent: Full access to all tools. The default general-purpose mode.
- Ask: Conversational only, no tool use. For quick questions.
- Plan: Thinks through a plan before acting. Read-only tools only.
- App Builder: Specialized for generating interactive Views.
- Skill Builder: Specialized for creating reusable skills.
Most of the time, the default “Agent” mode is all you need. Modes are helpful when you want holistic, reusable presets for how agents behave.
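Conceptually, a mode can be thought of as a preset that bundles a system prompt with a set of allowed tools. The sketch below models three of the built-in modes that way. The `Mode` class, the tool names, and the prompt strings are hypothetical placeholders chosen for illustration; OpenSwarm's actual mode format may look quite different.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Mode:
    # Hypothetical preset shape: a prompt plus the tools the agent may call.
    name: str
    system_prompt: str
    allowed_tools: frozenset[str]


# Placeholder tool names, invented for this example.
ALL_TOOLS = frozenset({"read_file", "write_file", "browser", "spawn_agent"})
READ_ONLY_TOOLS = frozenset({"read_file"})

MODES = {
    "agent": Mode("Agent", "Complete the task using any tool.", ALL_TOOLS),
    "ask": Mode("Ask", "Answer conversationally.", frozenset()),
    "plan": Mode("Plan", "Produce a plan before acting.", READ_ONLY_TOOLS),
}


def can_use(mode: Mode, tool: str) -> bool:
    """Return True if the given mode permits the given tool."""
    return tool in mode.allowed_tools


print(can_use(MODES["plan"], "write_file"))
```

Treating a mode as immutable data (`frozen=True`) reflects the idea of a reusable preset: every agent started in that mode gets the same prompt and tool set.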
Models
Each agent is powered by an AI model that you can select. Here is a brief overview of the models:

| Model | Best for |
|---|---|
| Sonnet 4.6 | Fast, capable, good default for most tasks |
| Opus 4.6 | Most capable, best for complex reasoning and long-horizon tasks |
| Haiku 3.5 | Fastest and cheapest, good for simple tasks |
Integration approvals
When an agent wants to use an integration, it may need your permission depending on how that integration is configured:
- Allow: runs automatically, no approval needed
- Ask: pauses and shows you exactly what it’s about to do (the command, the file, the API call) so you can approve or deny
- Deny: blocked entirely
You can set these permissions on the Integrations page.
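The three permission levels behave like a simple gate in front of each integration call. The following is a minimal sketch under stated assumptions: `Permission`, `gate`, and the `approve` callback are hypothetical names, and the callback stands in for the approval popup rather than reproducing how OpenSwarm implements it.

```python
from enum import Enum
from typing import Callable


class Permission(Enum):
    ALLOW = "allow"
    ASK = "ask"
    DENY = "deny"


def gate(
    permission: Permission,
    action_description: str,
    approve: Callable[[str], bool],
) -> bool:
    """Decide whether an action may run under the given permission level."""
    if permission is Permission.ALLOW:
        return True                     # runs automatically
    if permission is Permission.DENY:
        return False                    # blocked entirely
    # ASK: show the user what is about to happen and wait for a decision.
    return approve(action_description)


# Hypothetical usage: the lambda plays the role of the user clicking "approve".
allowed = gate(Permission.ASK, "send email to team@example.com", lambda text: True)
print(allowed)
```

Only the Ask level ever invokes the callback, which mirrors the behavior described above: Allow and Deny never interrupt you.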

