Author: xiyu
Want to use Claude Opus 4.6 without the bill exploding at the end of the month? This guide shows how to cut costs by 60-85%.
## 1. Where do tokens go?
You think tokens are just "what you say + what the AI replies"? Actually, it's far more than that.
Hidden costs of each conversation:
- **System prompt** (~3,000-5,000 tokens): OpenClaw's core instructions; cannot be changed
- **Context file injection** (~3,000-14,000 tokens): AGENTS.md, SOUL.md, MEMORY.md, etc., included in every conversation. This is the biggest hidden cost
- **Message history**: grows the longer you chat
- **Your input + the AI's output**: the part you thought was the whole bill
A simple "How's the weather today?" actually consumes 8,000-15,000 input tokens. At Opus pricing, the context alone costs $0.12-0.22.
Cron is even worse: each trigger is a brand-new conversation that re-injects all of that context. A cron job running every 15 minutes fires 96 times a day and costs $10-20 per day on Opus.
Heartbeat works on the same principle: each beat is essentially a conversation call, so the shorter the interval, the faster it burns money.
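The arithmetic behind these figures can be sketched in a few lines; the $15-per-million-input-tokens price is an assumption for Opus-class models, so check your provider's current rates:

```python
# Input-side cost model for the hidden context overhead described above.
OPUS_INPUT_PER_M = 15.0  # USD per 1M input tokens (assumed Opus-class price)

def call_cost(context_tokens: int, price_per_m: float = OPUS_INPUT_PER_M) -> float:
    """Input-side cost of one conversation call."""
    return context_tokens / 1_000_000 * price_per_m

# Even "How's the weather today?" pays for the full injected context:
print(f"${call_cost(8_000):.2f} to ${call_cost(15_000):.2f} per call")  # ~$0.12 to ~$0.22

# A cron job every 15 minutes = 96 fresh conversations per day:
print(f"~${96 * call_cost(10_000):.2f} per day")  # ~$14.40, inside the $10-20 range
```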
## 2. Model Tiering: Sonnet for Daily, Opus for Critical
The first money-saving trick, and the one with the most dramatic effect. Sonnet is priced at roughly 1/5 of Opus and is fully sufficient for 80% of daily tasks.
```markdown
Prompt:
Please help me change OpenClaw's default model to Claude Sonnet,
and only use Opus when deep analysis or creation is needed.
Specific needs:
1) Set the default model to Sonnet
2) Default cron tasks to Sonnet
3) Only specify Opus for writing and deep-analysis tasks
Opus scenarios: long-form writing, complex code, multi-step reasoning, creative tasks
Sonnet scenarios: daily chat, simple Q&A, cron checks, heartbeat, file operations, translation
```
Real-world result: after switching, monthly cost dropped 65%, with almost no noticeable difference in quality.
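A quick sanity check of that ~65% figure, under assumed list prices (Opus at ~$15 and Sonnet at ~$3 per million input tokens) and an 80/20 Sonnet/Opus split:

```python
# Back-of-envelope check of the tiering savings, with assumed list prices.
OPUS, SONNET = 15.0, 3.0  # USD per 1M input tokens (assumptions)

tokens = 30_000_000                                  # example monthly workload
all_opus = tokens / 1e6 * OPUS
tiered = tokens / 1e6 * (0.8 * SONNET + 0.2 * OPUS)  # 80% of calls on Sonnet

saving = 1 - tiered / all_opus
print(f"{saving:.0%}")  # 64% under these assumptions, in line with the ~65% observed
```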
## 3. Context Slimming: Cut the Hidden Token Hogs
The per-call "background noise" can be 3,000-14,000 tokens. Streamlining the injected files is the optimization with the best effort-to-savings ratio.
```markdown
Prompt:
Help me streamline OpenClaw's context files to save tokens. Specifically:
1) Delete unnecessary parts of AGENTS.md (group-chat rules, TTS, unused features); compress to within 800 tokens
2) Simplify SOUL.md to concise key points, 300-500 tokens
3) Clean expired information out of MEMORY.md; keep it within 2,000 tokens
4) Check the workspaceFiles configuration and remove unnecessary injected files
```
Rule of thumb: every 1,000 tokens trimmed from injection, at 100 Opus calls per day, saves about $45 per month.
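That figure checks out arithmetically (again assuming ~$15 per million Opus input tokens):

```python
# 1,000 fewer injected tokens x 100 calls/day x 30 days, at assumed Opus pricing.
OPUS_INPUT_PER_M = 15.0  # USD per 1M input tokens (assumed)
monthly_saving = 1_000 * 100 * 30 / 1_000_000 * OPUS_INPUT_PER_M
print(f"${monthly_saving:.0f}/month")  # $45/month
```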
## 4. Cron Optimization: The Most Hidden Cost Killer
```markdown
Prompt:
Help me optimize OpenClaw's cron tasks to save tokens. Please:
1) List all cron tasks with their frequency and model
2) Downgrade all non-creative tasks to Sonnet
3) Merge tasks that run in the same time window (e.g., combine multiple checks into one)
4) Reduce unnecessarily high frequencies (system check from every 10 minutes to every 30; version check from 3 times/day to once/day)
5) Configure delivery to notify only when needed; no message when everything is normal
```
Core principle: more frequent is not better; most "real-time" requirements are imagined. Merging 5 independent checks into 1 call cuts context-injection cost by about 75%.
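A rough way to see where the ~75% comes from, assuming a 10k-token injection per call and a somewhat longer prompt for the merged call:

```python
# Five separate checks each pay the context injection; one merged call pays it once.
injection = 10_000                 # assumed injected tokens per call
before = 5 * injection             # 50,000 injected tokens across 5 calls
after = injection + 2_000          # one call with a somewhat longer merged prompt
print(f"{1 - after / before:.0%} saved")  # 76% saved
```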
## 5. Heartbeat Optimization
```markdown
Prompt:
Help me optimize OpenClaw's heartbeat configuration:
1) Set the working-hours interval to 45-60 minutes
2) Make 23:00-08:00 a nighttime silent period
3) Trim HEARTBEAT.md to the minimum number of lines
4) Merge scattered check tasks into the heartbeat for batch execution
```
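These settings mostly pay off by reducing call count. A quick count, comparing against an assumed 15-minute baseline interval:

```python
# Calls per day before and after the heartbeat settings above.
baseline = 24 * 60 // 15          # 15-minute interval (assumed baseline): 96/day
active_hours = 24 - 9             # 23:00-08:00 silent leaves 15 active hours
tuned = active_hours * 60 // 60   # 60-minute interval: 15/day
print(f"{baseline} -> {tuned} heartbeats/day ({1 - tuned / baseline:.0%} fewer)")
```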
## 6. Precise Retrieval: Use qmd to Save 90% of Input Tokens
When the agent looks something up, it defaults to reading the full file: a 500-line file is 3,000-5,000 tokens, yet the agent may only need 10 lines of it. 90% of those input tokens are wasted.
qmd is a local semantic-retrieval tool that builds a full-text + vector index, letting the agent pinpoint relevant paragraphs instead of reading the whole file. Everything is computed locally, so there is zero API cost.
Use it with mq (Mini Query) to preview directory structure, extract specific paragraphs, and search by keyword, reading only the 10-30 lines actually needed each time.
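As a toy illustration of the principle (not qmd's actual API, which uses proper full-text and vector indexes), even a naive keyword scorer shows why returning chunks instead of whole files slashes input tokens:

```python
# Toy retrieval: score paragraphs by query-term frequency, return only the top k.
def top_chunks(text: str, query: str, k: int = 2) -> list[str]:
    chunks = [c.strip() for c in text.split("\n\n") if c.strip()]
    terms = query.lower().split()
    return sorted(chunks, key=lambda c: -sum(c.lower().count(t) for t in terms))[:k]

doc = "intro notes\n\ncron runs every 15 minutes\n\nheartbeat config details"
print(top_chunks(doc, "cron frequency")[0])  # the cron paragraph ranks first
```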
```markdown
Prompt:
Help me configure qmd knowledge-base retrieval to save tokens.
GitHub address: https://github.com/tobi/qmd
Needs:
1) Install qmd
2) Build an index for the working directory
3) Add retrieval rules to AGENTS.md forcing the agent to prefer qmd/mq search over reading full files
4) Set up scheduled index updates
```
Measured effect: each lookup dropped from ~15,000 tokens to ~1,500, a 90% reduction.
Difference from memorySearch: memorySearch manages "memories" (MEMORY.md), while qmd handles "looking things up" (a custom knowledge base); the two do not affect each other.
## 7. Memory Search Choice
```markdown
Prompt:
Help me configure OpenClaw's memorySearch.
If I don't have many memory files (a few dozen .md files),
should I use local embeddings or Voyage AI?
Please explain the cost and retrieval-quality differences of each.
```
Short conclusion: use local embeddings when you have few memory files (zero cost); use Voyage AI if you need strong multilingual retrieval or have many files (200 million free tokens per account).
## 8. Ultimate Configuration Checklist
```markdown
Prompt:
Please optimize my OpenClaw configuration in one pass for maximum token savings, following this checklist:
- Change the default model to Sonnet; reserve Opus for creative/analysis tasks
- Streamline AGENTS.md / SOUL.md / MEMORY.md
- Downgrade all cron tasks to Sonnet, merge them, and reduce their frequency
- Heartbeat interval of 45 minutes + nighttime silence
- Configure qmd precise retrieval to replace full-text reads
- Keep only necessary files in workspaceFiles
- Regularly trim memory files; keep MEMORY.md within 2,000 tokens
```
Configure once, benefit long-term:
1. Model Tiering — Sonnet daily, Opus critical, save 60-80%
2. Context Slimming — streamlined files + qmd precise retrieval, saving 30-90% of input tokens
3. Reduce Calls — Merge cron, extend heartbeat, enable silent period
Sonnet 4 is already very strong; in daily use you won't feel the difference. Just switch to Opus when you really need it.
Based on hands-on experience with multi-agent systems; figures are anonymized estimates.