What Is Hermes Agent? Self-Hosted AI Assistant Explained

The Short Version

Hermes is an open-source AI assistant you install on a server you own. You chat with it exactly like you'd chat with ChatGPT — except it's running on your machine, it never resets between sessions, and it can take actions on a schedule without you being present.

Three things make it structurally different from a browser-based chatbot:

It's always on. Your VPS runs 24/7. So does Hermes. It fires scheduled jobs, monitors things, and delivers results whether your laptop is open or not.
It learns your workflow. After you give it a task, you can tell it to save the approach as a reusable skill. Future similar requests use that recipe — it doesn't rebuild from scratch.
You control the data. Your conversation history, memory store, and API keys live on your server. Nothing is shared with a third-party app layer.

A Chatbot vs. an Agent — The Real Difference

A chatbot (ChatGPT, Claude in your browser, Gemini) has one job: respond to what you type. It's reactive, stateless, and limited to the chat window. The moment you close the tab, it's gone.

An agent like Hermes is proactive, stateful, and autonomous:

💬

What a chatbot does

Responds when you ask. Forgets the last conversation. Can't do anything while you're offline. Doesn't know your preferences unless you re-explain them every session.

🪽

What Hermes does

Runs tasks on a schedule while you sleep. Remembers who you are, what you've asked before, and how you like things. Delivers results to your phone unprompted.

Skills: Recipes It Writes Itself

A skill is a saved workflow — a step-by-step recipe Hermes can reuse for future similar tasks. The key point: you don't write the skills, Hermes does.

Here's how it works in practice:

You give Hermes a real task: "Research the top 3 trending AI tools right now and summarize them for a YouTube video script."
It completes the task.
You say: "Save that approach as a skill called youtube-video-research."
Done. Any future request like "use the youtube-video-research skill" reuses that exact workflow — same sources, same format, same output structure.

Hermes ships with ~75 pre-installed skills covering web research, file work, code, scheduling, and more. You add to that library every time you run a new kind of task.

💡

Skills are stored as files on your VPS. You can read them, edit them, back them up, or transfer them to a different server. They're not locked to any cloud account.

Memory That Persists Across Every Conversation

Every time you interact with a chatbot, you're starting from zero. Hermes doesn't work that way. Its built-in memory store tracks:

Your name, role, and how you like to be addressed
Your preferences (response length, tone, format)
Ongoing projects and their current status
Corrections you've made ("don't do it that way, do it this way")
Facts about your business or workflow you've shared

This memory survives app restarts, VPS reboots, and session endings. You can inspect it anytime: "Show me what you remember about me." You can update it: "Add to memory that I prefer bullet-point summaries over paragraphs." And you can delete individual entries if something is wrong.

Honcho: The Optional Upgrade

The built-in memory stores explicit facts you tell it. Honcho is a paid add-on that builds a richer behavioral model — it notices patterns in how you work and adjusts proactively, rather than waiting for you to tell it things explicitly. Most people start without it; it's an option once you've been using Hermes for a few weeks and want deeper personalization.

Scheduled Jobs

This is the feature that changes how people use Hermes long-term. You write a job description in plain English, set a schedule, and Hermes handles the rest — no cron syntax, no code.

Example jobs people actually run:

"Every weekday at 8am, send me a digest of the top AI news from the last 24 hours."
"Every Monday, research the top competitor content in my niche and summarize what they published last week."
"The first of every month, check my project list and remind me which ones haven't had activity in 30+ days."

Results go to your home channel (your Telegram DM with the bot, by default). You can manage every job with plain chat commands: pause, resume, run now, delete, or change the schedule.

Messaging Platforms

Hermes connects to messaging apps through a component called the Gateway. Telegram is the easiest first connection — no developer portal, just two quick bot conversations in the Telegram app itself. Beyond that, the Gateway supports:

Discord — message it in a server channel or DM
Slack — works as a Slack app in your workspace
WhatsApp — via the WhatsApp Business API
Signal — self-hosted, privacy-first option
Plus 10+ more platforms

The same agent, the same memory, and the same skills are available on every platform you connect. If you ask Hermes something on Telegram in the morning and follow up on Slack in the afternoon, it knows the full context.

How It's Built (For the Curious)

You don't need to know this to use Hermes, but it helps to understand what's running under the hood:

VPS: A virtual server (e.g., $10/month from Hostinger, DigitalOcean, or Hetzner) that runs 24/7. This is the physical home of your agent.
Docker container: Hermes runs inside a containerized environment that keeps it isolated from the host OS. Commands run inside the container using the Hermes CLI.
OpenRouter / AI provider: Hermes doesn't have its own AI model — it connects to a model provider via API (OpenRouter, Anthropic, OpenAI). You pick the model, you pay for usage.
Gateway: The always-on process that keeps your messaging app connection alive. Without it, the bot goes offline when you close the terminal.
Skills store: A directory of skill files on the VPS. Each skill is a structured recipe with steps, tools, and conditions.
Memory store: A local database that persists facts and preferences. Optional Honcho integration adds a cloud-backed behavioral layer.

What You Actually Pay For

Hermes itself is free and open source. Two things cost money:

🖥️

VPS hosting

Typically $5–$15/month for a starter plan (2 vCPU, 8 GB RAM). You pay the VPS provider directly — not Hermes.

🤖

AI usage

Per-message cost via your AI provider (OpenRouter, Anthropic, OpenAI). A few dollars a month for moderate use on a mid-range model. Set a hard spending cap to prevent surprises.

⚠️

The most common bill surprise: using a premium model (Claude Opus, GPT-5 class) as the default for all conversations. These are expensive per token. Use them for specific heavy tasks; set a cheaper model as the everyday default. Cost management guide →

Who Hermes Is (and Isn't) For

It's a good fit if:

You find yourself repeating the same kind of research or reporting tasks weekly
You want an AI assistant that works while you're offline
You care about data privacy and don't want your work conversations stored in a third-party cloud
You're comfortable renting a $10/month VPS (or willing to learn)

It might not be the right fit if:

You need something that just answers one-off questions — a plain ChatGPT subscription is cheaper and simpler for that
You're not comfortable with any server management (copying a command, running a setup wizard)
You want a fully managed, no-maintenance solution — Hermes requires occasional upkeep (model updates, gateway restarts after a VPS reboot)

Next Steps

Was this guide helpful?