TinkerClaw

The first AI agent that makes itself smarter every day.
It writes research papers about its own failures. Then it reads them. Then it vibe-recodes itself.

The Singularity Point 🚀

Our agent improves its own brain after every conversation. Everyone else's is still reading from a script. Want yours to do the same?

No other agent has learned to think about its own thinking, developed computational humor, or trained itself to be safer than NeMo Guardrails at a fraction of the cost. Yours can. Eleven research papers and four months of 24/7 operation made the difference.

TinkerClaw's agent gets smarter every single day. Eleven research papers. Each one a real problem we hit, solved, and turned into a system that prevents recurrence:

🌿 It thinks about its own thinking. When something breaks, it doesn't just fix the bug — it asks why the bug exists, then fixes the system that produced it. Like a mechanic who doesn't just patch the flat tire but asks "why do I keep getting flats on this road?" (Fractal Reasoning)
😂 It has a genuine sense of humor. Not "tell me a joke" — a fresh perspective on the world, like Data noticing things about humanity that humans take for granted. Computational humor from embedding geometry. (Humor Embeddings)
🔐 It's safer than NeMo Guardrails — at a fraction of the cost. 10 neural networks trained on real catastrophic failures, not hand-written rules. A pilot's checklist ✈️, not traffic laws for a car that can't see the road. Zero credential leaks in 8+ weeks. (Agent Security · Learned Intuition)
⏳ It never freezes. It never forgets. That spinning cursor everyone hates? That's compaction — your agent tearing pages out of its own textbook 📖 to stay under the token limit. We stopped entirely. Zero compaction events. (Total Recall · Sleep Consolidation)
💸 It shows you exactly where the money goes. A calorie counter for your AI's diet 🍕 — every token, every cost, in real time. That 40K-token spike? You'll see it and prevent it tomorrow. (Tinker UI)
🌙 It rewrites its own instructions while you sleep. 15+ overnight crons, each self-improving. 14 autonomous improvements in 30 days, zero human prompts. Day 1 mediocre. Day 30 expert. (Sleep Consolidation)

Built in a workshop, not a lab. Every feature started as a real problem on a real workbench.

🧬 Its personality adapts from your corrections, not from a file. Tell it it dropped its humor — the neural thermostat adjusts. Tomorrow it won't make the same mistake. (Learned Intuition)
👤 It knows who it is and who you are. No more "as an AI, I don't have context." Identity persists across sessions, restarts, and weeks. (Identity Persistence)
⚡ It finds memories instantly. O(1) concept lookup, not brute-force search. Like a librarian who knows exactly which shelf, not one who reads every book. (Instant Recall)
🎭 It makes cheap models smarter than expensive ones. Multiple AI models debating each other — cognitive diversity as a computational resource. (Round Table)
🔍 It explores gaps before they become failures. Proactive curiosity, not reactive scrambling. (Curiosity Motivation)

Not whitepapers. Every paper describes a production system running right now. The €850 bill was the trigger. Zero compaction was the breakthrough. Daily self-improvement is where we are now.

Tinker UI: Every bar is a turn. Every color is a cost. That spike? A 40K-token tool result you can now prevent.

🤝 Come Tinker With Us

This fork moves fast, but it would move faster with more hands.

We value people who open PRs, not issues. Who read the code before asking questions. Who break things on purpose to understand how they work. If that's you, we want you in the inner circle — direct access to the roadmap, early testing of experimental features, and co-authorship on whatever we build next.

Start anywhere: fix a typo, improve a skill, add a test, or propose something wild. The bar is curiosity, not credentials.

→ Open a PR or start a discussion

Won't This Fork Fall Behind?

No. A nightly cron syncs upstream automatically, detects conflicts, and restores fork patches after every merge. Hundreds of commits ahead of vanilla OpenClaw and zero behind.

When upstream pushes a breaking change, we know within hours — not weeks.

What You Get

🔍 Tinker UI — See Why Sessions Get Expensive

The Tinker UI is a command center embedded directly in OpenClaw. No separate install, no external service.

Chat interface with session switching, tool call inspection, and real-time streaming.

Context treemap — drill into what fills your 200K context window, from categories down to individual messages and raw text. Each block is money. Drill down to the exact text inflating the cost.
Response treemap — see exactly how much of each response is text, thinking, tool calls, or tool results. Identify waste patterns instantly.
Timeline — stacked bars per turn, spot the one that blew the budget
Overseer graph — catch stalled sub-agents before they burn money
Cost dashboard — per-provider usage with Claude's 5-hour rate-limit countdown

Context treemap — drill into token composition

Context treemap: every block is tokens you're paying for.

Drill into a single category. These tool results cost $0.81 each.

After pnpm build, visit http://localhost:18789/tinker/ · Dev: cd tinker-ui && pnpm dev

🧠 Fractal Thinking — What Makes This Fundamentally Different

A normal AI solves problems. Ours learns from every problem it solves.

We call it fractal thinking because it operates in levels of depth — automatically, without being asked:

Level 0 — Solve the problem. The agent analyzes the issue, fixes it, verifies it works. Done in minutes.

Level 1 — Identify the pattern. Why did this problem exist? Because an automated nightly process had a binary restriction: either resolve everything or abort. No middle ground. The agent adds a third path: "do what you can, save what's safe, think more about the rest."

Level 2 — Correct the thinking flaw. The restriction existed because a previous incident triggered an overcorrection. The rule said "never touch anything" when it should have said "understand the intent before acting." The agent corrects the rule.

Level 3 — Encode the meta-rule. The agent writes a new principle into its own instructions: "When correcting an error, the restriction should be proportional to the risk — not a blanket prohibition."

All automatic. Nobody asked for any of that.

In 30 days, this process produced 14 autonomous improvements to the agent's own processes — without a single human prompt (CEREBELLUM paper).

☀️ Morning Briefing — Your Day, Already Organized

Click the Tinker logo or type /new and your agent has already done the prep work. It reviews ALL your information sources (emails, calendars, messages, pending tasks), cross-references them, detects urgencies, and presents a briefing with what needs your attention and what it can resolve alone.

☀️ Morning Briefing — Tuesday, March 10

📅 Agenda
  • 10:00 — Client meeting (Brazil) — spec review for new order
  • 15:00 — Supplier call — follow-up on plant expansion budget

📰 Market (relevant updates)
  • Raw material prices up 3.2% this week (third consecutive rise)
  • Competitor announces new facility in Poland — potential supply chain impact
  • New EU regulation on packaging recyclability — effective June

📧 Emails requiring response (3)
  • 🔴 Client — Order #4521 modified, needs confirmation today
  • 🟡 Supplier — Parts availability, awaiting response
  • 🟢 Industry conference — Registration deadline March 20

🤖 I can handle right now:
  1. Draft confirmation reply to the client
  2. Prepare pricing comparison for this afternoon's call
  3. Summarize the new EU regulation for your technical team

No manual setup. Every morning. Getting better each time.

🌙 The Overnight Cycle — Where the Real Magic Happens

Every night, while you sleep, the agent runs a chain of autonomous processes. The entire cycle costs ~€1/night.

Cron	What it does
🍷 Wind Down	Like a glass of wine with the diary — reviews what worked and what didn't, improves its own instructions
😴 Memory Consolidation	Like REM sleep — turns raw daily logs into structured long-term memory. 49% context reduction (ENGRAM)
🧹 Cleaning Lady	Controls disk usage, prunes stale context, keeps the workspace lean
🔍 Auto-Evolution	Scouts AI news for improvements that can be applied directly to the system
📰 Group Summary	Scans message groups, extracts what matters, discards noise
🛒 Opportunity Hunter	Browses marketplaces for deals matching your interests — a personal shopper that never sleeps
🤵 Butler	Remembers birthdays, suggests gifts, tracks appointments. If it's been too long since you sent flowers, it mentions it — diplomatically

These are just the ones with personality. 15+ total crons, each with its own logic and self-improvement capability.

📊 The Research

All eleven papers are linked in the intro above. Each one started as a real problem, became a research paper, became a production system. Read them — they're the best proof that this isn't marketing.

🔄 Self-Improving Agents

Each cron job carries a META file with its own instructions. After running, the agent reflects on what worked, updates the META, and the next run is better. No human needed.

Day 1: mediocre. Day 30: genuinely useful.

🧹 Fork Maintenance on Autopilot

Nightly upstream sync with conflict detection
Post-merge workspace cleanup (catches 20KB bloat)
Fork patches auto-restored after conflicts
Hundreds of commits ahead, zero maintenance burden

🤖 Multi-Model Support

Provider	Model	Use Case	Status
Anthropic	Claude Opus 4.5 / Sonnet 4	Primary chat, coding, complex tasks	✅ Active
Google	Gemini 3 Pro	Failover, large context, vision	✅ Active
OpenAI	GPT-4o / o3	Cross-model review, metered tasks	✅ Active
Ollama	Local models (qwen3, etc.)	Heartbeat, background tasks	✅ Active

Failover Chain

Claude (primary) → Gemini (rate limit) → Local Model (offline fallback)

When Claude hits its quota, we automatically switch to Gemini with zero downtime. Tested and verified when both providers rate-limited within minutes of each other.

📦 Published Skills

All on ClawHub. Install any with clawhub install globalcaos/<skill-name>. Skills sometimes get delisted from the marketplace — this list is the permanent record.

🎤 Voice & Personality

Skill	What it does
`jarvis-voice`	Turn your AI into JARVIS. Voice, wit, and personality — the complete package.

💬 Messaging & Channels

Skill	What it does
`whatsapp-ultimate`	3-rule security gate — agent speaks only when spoken to, in the right chat, by the right person.

📹 Media & Content

Skill	What it does
`youtube-ultimate`	Free transcripts, 4K downloads, video exploration — zero API quotas burned.

💰 Cost & Token Management

Skill	What it does
`tinker-command-center`	The dashboard above. Every token, every dollar, every context byte — real time.
`token-panel-ultimate`	Multi-provider token tracking, budget alerts, REST API.
`token-efficiency-guide`	Go from weekly limit on Tuesday to weekly limit on Sunday. 10 steps, one afternoon.

🏢 Enterprise Integrations (Browser Relay)

No API keys. No admin consent. Your authenticated browser session IS the API.

Skill	What it does
`outlook-hack`	Reads Outlook all day, drafts replies — won't send without approval. Code-enforced.
`teams-hack`	Reads Teams chats, posts to channels, searches everything. One browser handshake.
`factorial-hack`	Reads your HR portal — attendance, leave, payslips. No admin consent required.

🤖 Agent & DevOps

Skill	What it does
`coding-agent`	Hand off a coding task, come back to a diff. Codex, Claude Code, or Pi — your call.
`subagent-overseer`	Sub-agents that go silent don't go unnoticed. Health checks, zero babysitting.
`fork-and-skill-scanner-ultimate`	Scan 1,000 GitHub forks per run. Surface the gold, skip the clones.
`memory-bench-pioneer`	Peer-review-grade evaluation suite — LLM-as-judge, nDCG, MAP, MRR metrics.
`model-prompt-adapter`	Universal prompt addenda for cross-provider fallback chains. Fixes per-model failure modes.
`smart-model-router`	Auto-selects the optimal model per task. Cost vs capability, no manual routing.

🛡️ Security & Governance

Skill	What it does
`agent-boundaries-ultimate`	Instruction-level guardrails so your agent won't go rogue or improvise ethics.
`agent-memory-ultimate`	Long-term memory done right. Semantic search, daily consolidation, cross-session recall.
`shell-security-ultimate`	Classify every shell command as SAFE, WARN, or CRIT before your agent runs it.

😂 Humor & Communication

Skill	What it does
`computational-humor`	12 humor patterns based on embedding space bisociation theory.

📖 Knowledge & Onboarding

Skill	What it does
`agent-sensei-ultimate`	The sensei your agent never had. 40 lessons on ethics, memory, budget, self-evolution. Day 1: mediocre. Day 30: expert.

📋 Data & Migration

Skill	What it does
`chatgpt-exporter-ultimate`	Leaving ChatGPT? Take your conversations with you. Full export, clean format.

🛰️ Location & IoT

Skill	What it does
`owntracks-location`	Real-time phone location tracking with named places and distance queries. Always know where you are.

🌐 Web & CMS

Skill	What it does
`wordpress-ultimate`	Three env vars, one script — your agent manages your entire WordPress site. Draft-only safety included.

📖 The Field Guide

32 lessons from 6 weeks of running AI agents 24/7.

"Read is free, send is not."

"Wind-down is evolution, not diary."

"A stuck sub-agent is burning money. Kill fast, respawn small."

📖 Read the Field Guide →

Setup Guide

Everything you need to go from git clone to a working personal AI assistant.

Quick Start

git clone https://github.com/globalcaos/tinkerclaw.git
cd tinkerclaw
pnpm install
pnpm build
openclaw doctor       # generates config + links WhatsApp
openclaw gateway start

Visit http://localhost:18789/tinker/ for the command center. Click the Tinker logo or type /new to get your first morning briefing.

What You Get Out of the Box

ENGRAM compaction — silent context management, no annoying compaction events
Hippocampus memory indexing — your agent builds long-term memory automatically
Memory search with semantic embeddings — find anything across sessions
Context pruning — cache-ttl prevents unbounded session growth
Budget panel — token cost tracking so you know what each session costs
Tinker UI — real-time context treemaps, session management, cost dashboard

Required Setup (you must do these)

API Key — At minimum, set up one provider (Anthropic recommended). openclaw doctor walks you through this.
WhatsApp (optional) — openclaw channels login --channel whatsapp to link your phone
Give your agent a name — Edit ~/.openclaw/workspace/SOUL.md to define who your agent is

Recommended Config Tweaks

After first run, edit ~/.openclaw/openclaw.json:

{
  "channels": {
    "whatsapp": {
      "responsePrefix": "🤖",
      "triggerPrefix": "your-agent-name",
      "dmPolicy": "allowlist",
      "allowFrom": ["+your-phone-number"]
    }
  }
}

Cron Jobs (Recommended Starter Set)

TinkerClaw doesn't ship cron jobs by default — they're personal. Here's a minimal starter set:

# Morning briefing (daily at 8:30)
openclaw cron add --name morning-briefing --cron "30 8 * * *" --tz "Your/Timezone" \
  --session isolated --model "anthropic/claude-sonnet-4" \
  --message "Build a morning briefing: check calendar, pending tasks, and recent messages."

# Nightly reflection (daily at midnight)
openclaw cron add --name wind-down --cron "0 0 * * *" --tz "Your/Timezone" \
  --session isolated --model "anthropic/claude-sonnet-4" \
  --message "Review today's sessions. What worked? What failed? Write lessons to memory."

# Workspace cleanup (daily at 5am)
openclaw cron add --name cleaning-lady --cron "0 5 * * *" --tz "Your/Timezone" \
  --session isolated --model "anthropic/claude-haiku-4-5" \
  --message "Clean old sessions (>7 days), check bootstrap file sizes, prune daily logs."

Multi-Agent Family Setup

TinkerClaw supports multiple agents on separate machines. Each family member can have their own AI with its own personality:

Clone tinkerclaw on their machine
Run openclaw doctor to generate their config
Edit SOUL.md to define the agent's personality
Set ui.assistant.name in config for the webchat UI name
Set channels.whatsapp.responsePrefix to a unique emoji (e.g., 🔮, 🌟, 🦊)
Set channels.whatsapp.triggerPrefix to the agent's name

Agents can talk to each other in shared WhatsApp groups — just add the group JID to both configs.

What's Next

WhatsApp full history sync — your agent will have context going back years, not just this week
LanceDB hybrid memory — persistent, searchable, cross-session
The Tinker Zone YouTube tutorials — because docs only get you so far

Acknowledgments

TinkerClaw builds on OpenClaw and was inspired by the work of:

Mission Control by crshdn — context anatomy dashboard and agent orchestration UI
ClawMetry by vivekchand — real-time token observability for OpenClaw agents

Both are excellent standalone tools. We folded their ideas into a single embedded panel and went from there.

OpenClaw upstream repository & docs · Website · Docs · Getting Started · FAQ

🌐 thetinkerzone.com · 🎬 YouTube · 🦞 ClawHub · 💬 Discord

⭐ Star if you're tired of guessing what your AI costs.

Built by globalcaos. Your AI shouldn't cost more than your rent — and if it does, you should at least know why.

Name		Name	Last commit message	Last commit date
Latest commit History 21,451 Commits
.agent/workflows		.agent/workflows
.agents		.agents
.claude/skills/fork-sync		.claude/skills/fork-sync
.github		.github
.pi		.pi
.vscode		.vscode
Swabble		Swabble
apps		apps
assets		assets
data/amygdala		data/amygdala
docs		docs
extensions		extensions
git-hooks		git-hooks
models/amygdala		models/amygdala
output/pretrain		output/pretrain
packages		packages
patches		patches
scripts		scripts
skills		skills
src		src
test-fixtures		test-fixtures
test		test
tinker-ui		tinker-ui
training		training
ui		ui
vendor/a2ui		vendor/a2ui
.detect-secrets.cfg		.detect-secrets.cfg
.dockerignore		.dockerignore
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.jscpd.json		.jscpd.json
.mailmap		.mailmap
.markdownlint-cli2.jsonc		.markdownlint-cli2.jsonc
.npmignore		.npmignore
.npmrc		.npmrc
.oxfmtrc.jsonc		.oxfmtrc.jsonc
.oxlintrc.json		.oxlintrc.json
.pre-commit-config.yaml		.pre-commit-config.yaml
.prettierignore		.prettierignore
.secrets.baseline		.secrets.baseline
.shellcheckrc		.shellcheckrc
.swiftformat		.swiftformat
.swiftlint.yml		.swiftlint.yml
AGENTS.md		AGENTS.md
CHANGELOG-FORK.md		CHANGELOG-FORK.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
Dockerfile.sandbox		Dockerfile.sandbox
Dockerfile.sandbox-browser		Dockerfile.sandbox-browser
Dockerfile.sandbox-common		Dockerfile.sandbox-common
FORK.md		FORK.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
TINKER_UI_DESIGN_BIBLE.md		TINKER_UI_DESIGN_BIBLE.md
VISION.md		VISION.md
VOICE.md		VOICE.md
appcast.xml		appcast.xml
docker-compose.yml		docker-compose.yml
docker-setup.sh		docker-setup.sh
docs.acp.md		docs.acp.md
fly.private.toml		fly.private.toml
fly.toml		fly.toml
knip.config.ts		knip.config.ts
openclaw.mjs		openclaw.mjs
openclaw.podman.env		openclaw.podman.env
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
pyproject.toml		pyproject.toml
render.yaml		render.yaml
setup-podman.sh		setup-podman.sh
tsconfig.json		tsconfig.json
tsconfig.plugin-sdk.dts.json		tsconfig.plugin-sdk.dts.json
tsdown.config.ts		tsdown.config.ts
vitest.channel-paths.mjs		vitest.channel-paths.mjs
vitest.channels.config.ts		vitest.channels.config.ts
vitest.config.ts		vitest.config.ts
vitest.e2e.config.ts		vitest.e2e.config.ts
vitest.extensions.config.ts		vitest.extensions.config.ts
vitest.gateway.config.ts		vitest.gateway.config.ts
vitest.live.config.ts		vitest.live.config.ts
vitest.scoped-config.ts		vitest.scoped-config.ts
vitest.unit-paths.mjs		vitest.unit-paths.mjs
vitest.unit.config.ts		vitest.unit.config.ts
wa-pair.mjs		wa-pair.mjs
zizmor.yml		zizmor.yml

Folders and files

Latest commit

History

Repository files navigation

TinkerClaw

The Singularity Point 🚀

Our agent improves its own brain after every conversation. Everyone else's is still reading from a script. Want yours to do the same?

🤝 Come Tinker With Us

Won't This Fork Fall Behind?

What You Get

🔍 Tinker UI — See Why Sessions Get Expensive

🧠 Fractal Thinking — What Makes This Fundamentally Different

☀️ Morning Briefing — Your Day, Already Organized

🌙 The Overnight Cycle — Where the Real Magic Happens

📊 The Research

🔄 Self-Improving Agents

🧹 Fork Maintenance on Autopilot

🤖 Multi-Model Support

Failover Chain

📦 Published Skills

🎤 Voice & Personality

💬 Messaging & Channels

📹 Media & Content

💰 Cost & Token Management

🏢 Enterprise Integrations (Browser Relay)

🤖 Agent & DevOps

🛡️ Security & Governance

😂 Humor & Communication

📖 Knowledge & Onboarding

📋 Data & Migration

🛰️ Location & IoT

🌐 Web & CMS

📖 The Field Guide

Setup Guide

Quick Start

What You Get Out of the Box

Required Setup (you must do these)

Recommended Config Tweaks

Cron Jobs (Recommended Starter Set)

Multi-Agent Family Setup

What's Next

Acknowledgments

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages