How to Set Up Hermes Agent on a Mini PC Home Server

I've been running Hermes Agent on a BMAX Pro 8 mini PC in my home lab for a few months now. Instead of relying on ChatGPT's web interface or paying for a cloud agent service, I have my own autonomous AI running 24/7 on a box smaller than a paperback — and the whole thing cost less than a mid-range smartphone.

Here's what this guide covers:

- What Hermes Agent is and why run it on a home server - The hardware you need (verified: BMAX Pro 8) - Step-by-step installation from scratch - Configuring AI providers — local or cloud - Running your agent 24/7 with Discord gateway access - Common pitfalls and fixes

BMAX Pro 8 mini PC with Hermes Agent terminal

What Is Hermes Agent?

Hermes Agent is an open-source AI assistant built by Nous Research. It's MIT-licensed, CLI-first, and designed to be self-hosted. Here's what sets it apart:

Tool use built in. Hermes can run bash commands, read and write files, search the web, and manage a Kanban board. It's not just a chatbot — it actually does things on your server.

Multi-provider support. Use OpenAI, Anthropic, DeepSeek, Ollama, or any OpenAI-compatible provider. You're not locked into one model. I keep Ollama for quick local tasks and Claude for complex reasoning.

Skill system. Skills are markdown files that teach your agent new abilities — YouTube summaries, GitHub PR reviews, PDF extraction. You can write your own.

Gateway system. Your agent connects to Discord or Telegram so you can message it from your phone. It responds wherever you are.

If you're comparing AI coding tools, I wrote about my experience with Replit Agent — but for persistent autonomous operation on your own hardware, Hermes is in a different league.

What You'll Need: BMAX Pro 8

The good news: Hermes Agent itself uses about 200 MB of RAM. You don't need a GPU unless you want local LLM inference.

ComponentSpec
Mini PCBMAX Pro 8
CPUIntel i7 with Iris Xe graphics
RAM24 GB
Storage1 TB NVMe SSD
NetworkGigabit Ethernet (use wired, not WiFi)
OSUbuntu Server 24.04 LTS

RAM is what matters most. For cloud API usage, 24 GB is plenty. If you want to run a local LLM via Ollama, 24 GB with the i7 Iris Xe can handle a quantized 7B model at usable speeds.

Don't use a Raspberry Pi for this. A Pi can technically run the agent, but the CPU and RAM become bottlenecks fast. The BMAX Pro 8 is under $300 used and runs circles around any single-board computer.

If you're new to mini PC servers, my Proxmox Homelab Guide covers hardware selection and Linux basics that pair well with this setup.

BMAX Pro 8 ports and connections

Step 1 — Install Ubuntu Server 24.04

Download Ubuntu Server 24.04 LTS and create a bootable USB:

# On Linux
sudo dd if=ubuntu-24.04-live-server-amd64.iso of=/dev/sdX bs=4M status=progress

On Windows use Rufus or Balena Etcher. Boot from the USB (mash F2/F7/F10/Del during startup — it varies by manufacturer). A few things to check:

- Filesystem: ext4 is fine. ZFS is overkill. - SSH server: check the box to install OpenSSH during setup. - Username: I use hermes as mine.

The install takes about 10 minutes. When it's done, SSH in from another machine:

ssh [email protected]
sudo apt update && sudo apt upgrade -y
sudo ufw allow OpenSSH
sudo ufw enable
sudo apt install -y git curl wget htop

Your mini PC is ready.

Step 2 — Install Hermes Agent

Ubuntu 24.04 ships with Python 3.12, which works perfectly with Hermes.

python3 -m venv ~/hermes-env
source ~/hermes-env/bin/activate
echo 'source ~/hermes-env/bin/activate' >> ~/.bashrc
pip install hermes-agent
hermes --version

Then initialize:

hermes init

This creates ~/.hermes/ with config files, profiles, and the skill directory.

Step 3 — Configure Your AI Provider

You have two paths.

Option A: Local LLM with Ollama

Everything runs on your BMAX Pro 8 with no external API calls.

curl -fsSL https://ollama.ai/install.sh | sh
ollama pull llama3.1:8b

Configure ~/.hermes/config.yaml:

default_provider: ollama
providers:
  ollama:
    type: openai-compatible
    base_url: http://localhost:11434/v1
    api_key: ollama
    model: llama3.1:8b

The Iris Xe iGPU handles a quantized 7B model at about 10–15 tokens per second. Fast enough for automation tasks.

Option B: Cloud API

For the best models (Claude, GPT-4o, DeepSeek):

default_provider: openai
providers:
  openai:
    api_key: ***
    model: gpt-4o
  anthropic:
    api_key: sk-ant...
    model: claude-sonnet-4-20250514

Cost note: Cloud API costs add up — I spend about $20–30/month for a moderately active agent.

Step 4 — Run 24/7 with systemd and Gateway

Create a systemd service so your agent runs in the background and restarts on failure:

sudo nano /etc/systemd/system/hermes.service
[Unit]
Description=Hermes Agent
After=network.target

[Service]
Type=simple
User=hermes
ExecStart=/home/hermes/hermes-env/bin/hermes daemon
WorkingDirectory=/home/hermes
Restart=on-failure
RestartSec=5

[Install]
WantedBy=multi-user.target

Then enable it:

sudo systemctl enable hermes
sudo systemctl start hermes
sudo systemctl status hermes

Discord Gateway

This is my favorite feature — I message my agent from my phone and it responds in real-time.

1. Go to the Discord Developer Portal, create an application, get a bot token. 2. Invite the bot to your server. 3. Add to ~/.hermes/config.yaml:

gateway:
  enabled: true
  platform: discord
  token: your-bot-token-here
  channel_id: your-channel-id

4. Restart: sudo systemctl restart hermes

Now message your agent from your phone. Ask it to check disk space, summarize a website, or run a script.

Discord chat with Hermes Agent showing system resources

Cron Jobs

Hermes has a built-in cron scheduler:

cron:
  - name: "daily-summary"
    schedule: "0 8 * * *"
    prompt: "Check system resources and summarize any issues."
    deliver_to: discord

I use this for daily health checks, weekly backup reports, and service monitoring.

Common Pitfalls

YAML indentation. config.yaml uses spaces, not tabs. One wrong space and Hermes silently falls back to defaults. Run hermes config validate to check.

Missing dependencies. Skills sometimes need system packages:

sudo apt install -y ffmpeg pandoc poppler-utils

OOM with local models. If Ollama crashes, the model is too big for your RAM. Drop to a smaller quantized version: ollama pull llama3.1:8b-q4_K_M uses about 5 GB instead of 8 GB.

Wrap-up

Setting up Hermes Agent on a BMAX Pro 8 is straightforward. You're not installing a static web app — you're installing an AI that can act on your behalf.

StepWhat you get
Ubuntu Server 24.04Stable Linux foundation
Hermes Agent installYour AI ready to go
Provider configLocal or cloud AI brains
systemd + gateway24/7 access from anywhere
Cron jobsAutomated tasks on schedule

The first time your agent sends you a Discord message with a system health report it generated on its own, you'll realize how different this is from talking to a chatbot. It's not just answering questions — it's doing work.

Give it a try, and let me know how it goes.