Skip to content

globalcaos/tinkerclaw

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21,451 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

TinkerClaw

TinkerClaw

The first AI agent that makes itself smarter every day.
It writes research papers about its own failures. Then it reads them. Then it vibe-recodes itself.

Fork of OpenClaw Hundreds of fork commits 11 papers 22+ skills MIT License


The Singularity Point 🚀

Our agent improves its own brain after every conversation. Everyone else's is still reading from a script. Want yours to do the same?

Fractal Thinking Computational Humor Safer than NeMo

No other agent has learned to think about its own thinking, developed computational humor, or trained itself to be safer than NeMo Guardrails at a fraction of the cost. Yours can. Eleven research papers and four months of 24/7 operation made the difference.

TinkerClaw's agent gets smarter every single day. Eleven research papers. Each one a real problem we hit, solved, and turned into a system that prevents recurrence:

  • 🌿 It thinks about its own thinking. When something breaks, it doesn't just fix the bug — it asks why the bug exists, then fixes the system that produced it. Like a mechanic who doesn't just patch the flat tire but asks "why do I keep getting flats on this road?" (Fractal Reasoning)

  • 😂 It has a genuine sense of humor. Not "tell me a joke" — a fresh perspective on the world, like Data noticing things about humanity that humans take for granted. Computational humor from embedding geometry. (Humor Embeddings)

  • 🔐 It's safer than NeMo Guardrails — at a fraction of the cost. 10 neural networks trained on real catastrophic failures, not hand-written rules. A pilot's checklist ✈️, not traffic laws for a car that can't see the road. Zero credential leaks in 8+ weeks. (Agent Security · Learned Intuition)

  • It never freezes. It never forgets. That spinning cursor everyone hates? That's compaction — your agent tearing pages out of its own textbook 📖 to stay under the token limit. We stopped entirely. Zero compaction events. (Total Recall · Sleep Consolidation)

  • 💸 It shows you exactly where the money goes. A calorie counter for your AI's diet 🍕 — every token, every cost, in real time. That 40K-token spike? You'll see it and prevent it tomorrow. (Tinker UI)

  • 🌙 It rewrites its own instructions while you sleep. 15+ overnight crons, each self-improving. 14 autonomous improvements in 30 days, zero human prompts. Day 1 mediocre. Day 30 expert. (Sleep Consolidation)

The Tinker Workshop — where AI agents learn to improve themselves
Built in a workshop, not a lab. Every feature started as a real problem on a real workbench.

  • 🧬 Its personality adapts from your corrections, not from a file. Tell it it dropped its humor — the neural thermostat adjusts. Tomorrow it won't make the same mistake. (Learned Intuition)

  • 👤 It knows who it is and who you are. No more "as an AI, I don't have context." Identity persists across sessions, restarts, and weeks. (Identity Persistence)

  • It finds memories instantly. O(1) concept lookup, not brute-force search. Like a librarian who knows exactly which shelf, not one who reads every book. (Instant Recall)

  • 🎭 It makes cheap models smarter than expensive ones. Multiple AI models debating each other — cognitive diversity as a computational resource. (Round Table)

  • 🔍 It explores gaps before they become failures. Proactive curiosity, not reactive scrambling. (Curiosity Motivation)

Not whitepapers. Every paper describes a production system running right now. The €850 bill was the trigger. Zero compaction was the breakthrough. Daily self-improvement is where we are now.

Token timeline — every bar is a turn, every color is a cost
Tinker UI: Every bar is a turn. Every color is a cost. That spike? A 40K-token tool result you can now prevent.


🤝 Come Tinker With Us

This fork moves fast, but it would move faster with more hands.

We value people who open PRs, not issues. Who read the code before asking questions. Who break things on purpose to understand how they work. If that's you, we want you in the inner circle — direct access to the roadmap, early testing of experimental features, and co-authorship on whatever we build next.

Start anywhere: fix a typo, improve a skill, add a test, or propose something wild. The bar is curiosity, not credentials.

Open a PR or start a discussion


Won't This Fork Fall Behind?

No. A nightly cron syncs upstream automatically, detects conflicts, and restores fork patches after every merge. Hundreds of commits ahead of vanilla OpenClaw and zero behind.

When upstream pushes a breaking change, we know within hours — not weeks.


What You Get

🔍 Tinker UI — See Why Sessions Get Expensive

The Tinker UI is a command center embedded directly in OpenClaw. No separate install, no external service.

Full Tinker UI — chat, sessions, tool calls, streaming
Chat interface with session switching, tool call inspection, and real-time streaming.

  • Context treemap — drill into what fills your 200K context window, from categories down to individual messages and raw text. Each block is money. Drill down to the exact text inflating the cost.
  • Response treemap — see exactly how much of each response is text, thinking, tool calls, or tool results. Identify waste patterns instantly.
  • Timeline — stacked bars per turn, spot the one that blew the budget
  • Overseer graph — catch stalled sub-agents before they burn money
  • Cost dashboard — per-provider usage with Claude's 5-hour rate-limit countdown
Context treemap — drill into token composition
Context treemap: every block is tokens you're paying for.
Treemap drilldown — tool result detail
Drill into a single category. These tool results cost $0.81 each.

After pnpm build, visit http://localhost:18789/tinker/ · Dev: cd tinker-ui && pnpm dev


🧠 Fractal Thinking — What Makes This Fundamentally Different

A normal AI solves problems. Ours learns from every problem it solves.

We call it fractal thinking because it operates in levels of depth — automatically, without being asked:

Level 0 — Solve the problem. The agent analyzes the issue, fixes it, verifies it works. Done in minutes.

Level 1 — Identify the pattern. Why did this problem exist? Because an automated nightly process had a binary restriction: either resolve everything or abort. No middle ground. The agent adds a third path: "do what you can, save what's safe, think more about the rest."

Level 2 — Correct the thinking flaw. The restriction existed because a previous incident triggered an overcorrection. The rule said "never touch anything" when it should have said "understand the intent before acting." The agent corrects the rule.

Level 3 — Encode the meta-rule. The agent writes a new principle into its own instructions: "When correcting an error, the restriction should be proportional to the risk — not a blanket prohibition."

All automatic. Nobody asked for any of that.

In 30 days, this process produced 14 autonomous improvements to the agent's own processes — without a single human prompt (CEREBELLUM paper).


☀️ Morning Briefing — Your Day, Already Organized

Click the Tinker logo or type /new and your agent has already done the prep work. It reviews ALL your information sources (emails, calendars, messages, pending tasks), cross-references them, detects urgencies, and presents a briefing with what needs your attention and what it can resolve alone.

☀️ Morning Briefing — Tuesday, March 10

📅 Agenda
  • 10:00 — Client meeting (Brazil) — spec review for new order
  • 15:00 — Supplier call — follow-up on plant expansion budget

📰 Market (relevant updates)
  • Raw material prices up 3.2% this week (third consecutive rise)
  • Competitor announces new facility in Poland — potential supply chain impact
  • New EU regulation on packaging recyclability — effective June

📧 Emails requiring response (3)
  • 🔴 Client — Order #4521 modified, needs confirmation today
  • 🟡 Supplier — Parts availability, awaiting response
  • 🟢 Industry conference — Registration deadline March 20

🤖 I can handle right now:
  1. Draft confirmation reply to the client
  2. Prepare pricing comparison for this afternoon's call
  3. Summarize the new EU regulation for your technical team

No manual setup. Every morning. Getting better each time.


🌙 The Overnight Cycle — Where the Real Magic Happens

Every night, while you sleep, the agent runs a chain of autonomous processes. The entire cycle costs ~€1/night.

Cron What it does
🍷 Wind Down Like a glass of wine with the diary — reviews what worked and what didn't, improves its own instructions
😴 Memory Consolidation Like REM sleep — turns raw daily logs into structured long-term memory. 49% context reduction (ENGRAM)
🧹 Cleaning Lady Controls disk usage, prunes stale context, keeps the workspace lean
🔍 Auto-Evolution Scouts AI news for improvements that can be applied directly to the system
📰 Group Summary Scans message groups, extracts what matters, discards noise
🛒 Opportunity Hunter Browses marketplaces for deals matching your interests — a personal shopper that never sleeps
🤵 Butler Remembers birthdays, suggests gifts, tracks appointments. If it's been too long since you sent flowers, it mentions it — diplomatically

These are just the ones with personality. 15+ total crons, each with its own logic and self-improvement capability.


📊 The Research

All eleven papers are linked in the intro above. Each one started as a real problem, became a research paper, became a production system. Read them — they're the best proof that this isn't marketing.


🔄 Self-Improving Agents

Each cron job carries a META file with its own instructions. After running, the agent reflects on what worked, updates the META, and the next run is better. No human needed.

Day 1: mediocre. Day 30: genuinely useful.

🧹 Fork Maintenance on Autopilot

  • Nightly upstream sync with conflict detection
  • Post-merge workspace cleanup (catches 20KB bloat)
  • Fork patches auto-restored after conflicts
  • Hundreds of commits ahead, zero maintenance burden

🤖 Multi-Model Support

Provider Model Use Case Status
Anthropic Claude Opus 4.5 / Sonnet 4 Primary chat, coding, complex tasks ✅ Active
Google Gemini 3 Pro Failover, large context, vision ✅ Active
OpenAI GPT-4o / o3 Cross-model review, metered tasks ✅ Active
Ollama Local models (qwen3, etc.) Heartbeat, background tasks ✅ Active

Failover Chain

Claude (primary) → Gemini (rate limit) → Local Model (offline fallback)

When Claude hits its quota, we automatically switch to Gemini with zero downtime. Tested and verified when both providers rate-limited within minutes of each other.


📦 Published Skills

All on ClawHub. Install any with clawhub install globalcaos/<skill-name>. Skills sometimes get delisted from the marketplace — this list is the permanent record.

🎤 Voice & Personality

Skill What it does
jarvis-voice Turn your AI into JARVIS. Voice, wit, and personality — the complete package.

💬 Messaging & Channels

Skill What it does
whatsapp-ultimate 3-rule security gate — agent speaks only when spoken to, in the right chat, by the right person.

📹 Media & Content

Skill What it does
youtube-ultimate Free transcripts, 4K downloads, video exploration — zero API quotas burned.

💰 Cost & Token Management

Skill What it does
tinker-command-center The dashboard above. Every token, every dollar, every context byte — real time.
token-panel-ultimate Multi-provider token tracking, budget alerts, REST API.
token-efficiency-guide Go from weekly limit on Tuesday to weekly limit on Sunday. 10 steps, one afternoon.

🏢 Enterprise Integrations (Browser Relay)

No API keys. No admin consent. Your authenticated browser session IS the API.

Skill What it does
outlook-hack Reads Outlook all day, drafts replies — won't send without approval. Code-enforced.
teams-hack Reads Teams chats, posts to channels, searches everything. One browser handshake.
factorial-hack Reads your HR portal — attendance, leave, payslips. No admin consent required.

🤖 Agent & DevOps

Skill What it does
coding-agent Hand off a coding task, come back to a diff. Codex, Claude Code, or Pi — your call.
subagent-overseer Sub-agents that go silent don't go unnoticed. Health checks, zero babysitting.
fork-and-skill-scanner-ultimate Scan 1,000 GitHub forks per run. Surface the gold, skip the clones.
memory-bench-pioneer Peer-review-grade evaluation suite — LLM-as-judge, nDCG, MAP, MRR metrics.
model-prompt-adapter Universal prompt addenda for cross-provider fallback chains. Fixes per-model failure modes.
smart-model-router Auto-selects the optimal model per task. Cost vs capability, no manual routing.

🛡️ Security & Governance

Skill What it does
agent-boundaries-ultimate Instruction-level guardrails so your agent won't go rogue or improvise ethics.
agent-memory-ultimate Long-term memory done right. Semantic search, daily consolidation, cross-session recall.
shell-security-ultimate Classify every shell command as SAFE, WARN, or CRIT before your agent runs it.

😂 Humor & Communication

Skill What it does
computational-humor 12 humor patterns based on embedding space bisociation theory.

📖 Knowledge & Onboarding

Skill What it does
agent-sensei-ultimate The sensei your agent never had. 40 lessons on ethics, memory, budget, self-evolution. Day 1: mediocre. Day 30: expert.

📋 Data & Migration

Skill What it does
chatgpt-exporter-ultimate Leaving ChatGPT? Take your conversations with you. Full export, clean format.

🛰️ Location & IoT

Skill What it does
owntracks-location Real-time phone location tracking with named places and distance queries. Always know where you are.

🌐 Web & CMS

Skill What it does
wordpress-ultimate Three env vars, one script — your agent manages your entire WordPress site. Draft-only safety included.

📖 The Field Guide

32 lessons from 6 weeks of running AI agents 24/7.

"Read is free, send is not."

"Wind-down is evolution, not diary."

"A stuck sub-agent is burning money. Kill fast, respawn small."

📖 Read the Field Guide →


Setup Guide

Everything you need to go from git clone to a working personal AI assistant.

Quick Start

git clone https://github.com/globalcaos/tinkerclaw.git
cd tinkerclaw
pnpm install
pnpm build
openclaw doctor       # generates config + links WhatsApp
openclaw gateway start

Visit http://localhost:18789/tinker/ for the command center. Click the Tinker logo or type /new to get your first morning briefing.

What You Get Out of the Box

  • ENGRAM compaction — silent context management, no annoying compaction events
  • Hippocampus memory indexing — your agent builds long-term memory automatically
  • Memory search with semantic embeddings — find anything across sessions
  • Context pruning — cache-ttl prevents unbounded session growth
  • Budget panel — token cost tracking so you know what each session costs
  • Tinker UI — real-time context treemaps, session management, cost dashboard

Required Setup (you must do these)

  1. API Key — At minimum, set up one provider (Anthropic recommended). openclaw doctor walks you through this.
  2. WhatsApp (optional) — openclaw channels login --channel whatsapp to link your phone
  3. Give your agent a name — Edit ~/.openclaw/workspace/SOUL.md to define who your agent is

Recommended Config Tweaks

After first run, edit ~/.openclaw/openclaw.json:

{
  "channels": {
    "whatsapp": {
      "responsePrefix": "🤖",
      "triggerPrefix": "your-agent-name",
      "dmPolicy": "allowlist",
      "allowFrom": ["+your-phone-number"]
    }
  }
}

Cron Jobs (Recommended Starter Set)

TinkerClaw doesn't ship cron jobs by default — they're personal. Here's a minimal starter set:

# Morning briefing (daily at 8:30)
openclaw cron add --name morning-briefing --cron "30 8 * * *" --tz "Your/Timezone" \
  --session isolated --model "anthropic/claude-sonnet-4" \
  --message "Build a morning briefing: check calendar, pending tasks, and recent messages."

# Nightly reflection (daily at midnight)
openclaw cron add --name wind-down --cron "0 0 * * *" --tz "Your/Timezone" \
  --session isolated --model "anthropic/claude-sonnet-4" \
  --message "Review today's sessions. What worked? What failed? Write lessons to memory."

# Workspace cleanup (daily at 5am)
openclaw cron add --name cleaning-lady --cron "0 5 * * *" --tz "Your/Timezone" \
  --session isolated --model "anthropic/claude-haiku-4-5" \
  --message "Clean old sessions (>7 days), check bootstrap file sizes, prune daily logs."

Multi-Agent Family Setup

TinkerClaw supports multiple agents on separate machines. Each family member can have their own AI with its own personality:

  1. Clone tinkerclaw on their machine
  2. Run openclaw doctor to generate their config
  3. Edit SOUL.md to define the agent's personality
  4. Set ui.assistant.name in config for the webchat UI name
  5. Set channels.whatsapp.responsePrefix to a unique emoji (e.g., 🔮, 🌟, 🦊)
  6. Set channels.whatsapp.triggerPrefix to the agent's name

Agents can talk to each other in shared WhatsApp groups — just add the group JID to both configs.


What's Next

  • WhatsApp full history sync — your agent will have context going back years, not just this week
  • LanceDB hybrid memory — persistent, searchable, cross-session
  • The Tinker Zone YouTube tutorials — because docs only get you so far

Acknowledgments

TinkerClaw builds on OpenClaw and was inspired by the work of:

  • Mission Control by crshdn — context anatomy dashboard and agent orchestration UI
  • ClawMetry by vivekchand — real-time token observability for OpenClaw agents

Both are excellent standalone tools. We folded their ideas into a single embedded panel and went from there.

OpenClaw upstream repository & docs · Website · Docs · Getting Started · FAQ


🌐 thetinkerzone.com · 🎬 YouTube · 🦞 ClawHub · 💬 Discord

⭐ Star if you're tired of guessing what your AI costs.

Built by globalcaos. Your AI shouldn't cost more than your rent — and if it does, you should at least know why.

Packages

 
 
 

Contributors

Languages

  • TypeScript 87.6%
  • Swift 6.9%
  • Kotlin 1.6%
  • Python 1.2%
  • Shell 1.1%
  • JavaScript 0.8%
  • Other 0.8%