Golem

A clay body, animated by words.

Golem is a minimal AI agent harness built in Rust. It has no model of its own — it borrows its intelligence from whatever you plug in: a human at a keyboard, or a cloud API (Anthropic, Google Gemini).

Built as a learning project to explore ReAct, tool calling, and memory from scratch.

What it does

You give a task → Thinker reasons → Tools execute → Observations fed back → Repeat

This is the ReAct pattern: Reason + Act in a loop until the task is done.

Providers

Provider	Auth	Default model	Status
Anthropic	OAuth (Claude Pro/Max) or `ANTHROPIC_API_KEY`	`claude-sonnet-4-20250514`	✅
Google Gemini	OAuth or `GEMINI_API_KEY`	`gemini-3-flash-preview`	✅
Human	None	—	✅

Adding a new provider: implement the ProviderConfig trait (5 methods) + Thinker trait. See AGENTS.md.

Install

AUR (Arch Linux)

yay -S golem-bin

GitHub Releases

Pre-built binaries for x86_64 and aarch64 (Linux + macOS + Windows):

# Download latest release (example for x86_64 Linux)
curl -LO https://github.com/assapir/golem/releases/latest/download/golem-x86_64-linux
chmod +x golem-x86_64-linux
sudo mv golem-x86_64-linux /usr/local/bin/golem

# Windows (PowerShell)
Invoke-WebRequest -Uri https://github.com/assapir/golem/releases/latest/download/golem-x86_64-windows.exe -OutFile golem.exe

Build from source

git clone https://github.com/assapir/golem.git
cd golem
cargo build --release

Quick start

# Log in to Anthropic (opens browser for OAuth)
golem login

# Log in to Google Gemini (opens browser, auto-captures callback)
golem login google

# Interactive mode (default: Anthropic)
golem

# Use Gemini
golem --provider google

# Single task
golem -r "list files in the current directory"

# Debug mode — see raw LLM requests and responses
golem --debug

CLI

Usage: golem [OPTIONS] [COMMAND]

Commands:
  login   Log in to an LLM provider via OAuth
  logout  Log out from an LLM provider
  help    Print this message or the help of the given subcommand(s)

Options:
  -p, --provider <PROVIDER>    LLM provider [default: anthropic] [possible values: human, anthropic, google]
      --model <MODEL>          Model name (provider-specific, ignored for human)
  -d, --db <DB>                SQLite database path [default: ~/.golem/golem.db]
  -m, --max-iterations <N>     Max ReAct loop iterations [default: 20]
  -t, --timeout <SECONDS>      Tool execution timeout [default: 30]
      --allow-write            Allow write operations in shell (default: read-only)
  -w, --work-dir <PATH>        Working directory for shell commands
      --no-confirm             Skip confirmation prompts before executing commands
  -r, --run <TASK>             Run a single task and exit
      --debug                  Show raw LLM request/response data
  -h, --help                   Print help
  -V, --version                Print version

REPL commands

Type /help at the prompt to see all available commands:

Command	Aliases	Description
`/help`	`/h`, `/?`	Show available commands
`/whoami`		Show provider, model, and auth status
`/tools`		List registered tools
`/tokens`		Show session token usage
`/model`		List and switch the active model
`/new`		Start a new session (clear conversation history)
`/debug`		Toggle debug mode (show raw LLM request/response data)
`/login`		Log in to the current provider
`/logout`		Log out from the current provider
`/quit`	`quit`, `exit`, `/exit`	Exit the REPL

Commands are trait-based (Command trait + CommandRegistry) — plugins can register additional commands at runtime. The REPL prompt supports readline-style editing and up/down history navigation; command history is saved to ~/.golem/history.txt.

Session memory

Golem remembers prior tasks within a session. Each completed task's question and answer are stored in SQLite, so follow-up tasks can reference earlier context:

golem> list files in /tmp
=> file1.txt (10KB), file2.txt (50KB), file3.txt (1KB)

golem> delete the biggest one
=> (LLM knows file2.txt is 50KB from the prior task)

Session history persists across restarts. Use /new to clear it and start fresh. By default, only the last 50 task summaries are loaded into context.

Design

Everything is a trait. Everything is swappable.

ProviderConfig — defines a provider's identity, auth flow, and thinker construction
Engine — the outermost boundary (fn run(task) -> answer)
Thinker — the brain (Anthropic, Gemini, human, mock — picked via --provider)
Tool — something the agent can do (shell commands, exit, more coming)
Command — built-in REPL commands (/help, /model, /new, etc.)
Memory — what the agent remembers (task iterations + session history, SQLite-backed)
Config — persistent key-value settings (model preference, etc.)
EventBus — decoupled broadcast channel for cross-component communication

See AGENTS.md for full architecture and contributing instructions.

License

GPL-2.0

Name		Name	Last commit message	Last commit date
Latest commit History 69 Commits
.github/workflows		.github/workflows
assets		assets
aur		aur
src		src
tests		tests
.gitignore		.gitignore
AGENTS.md		AGENTS.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
PRIVACY.md		PRIVACY.md
README.md		README.md
TERMS.md		TERMS.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Golem

What it does

Providers

Install

AUR (Arch Linux)

GitHub Releases

Build from source

Quick start

CLI

REPL commands

Session memory

Design

License

About

Uh oh!

Releases 27

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Golem

What it does

Providers

Install

AUR (Arch Linux)

GitHub Releases

Build from source

Quick start

CLI

REPL commands

Session memory

Design

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 27

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages